Since its first public release nine months ago, the John Snow Labs NLP Library for Apache Spark has experienced widespread adoption as it set a new bar for production-grade natural language understanding. In terms of performance, detailed benchmarks published by O’Reilly Media show the library to be 38x to 80x times faster than spaCy (the top performing library to date) on a single machine. Spark NLP is also the only open source library which can natively scale on a distributed cluster, often delivering near-linear scalability.

In terms of accuracy, Spark NLP is the first and only library to productize several of the most recent and best performing deep learning algorithms for NLP. Among others, the library provides scalable, production-grade code for named entity recognition, assertion status detection, and entity resolution, based on academic papers that were published less than a year ago. The library’s API enables users to either leverage pre-trained models or train their own models for tackling domain-specific texts.

As a result, three of the most prominent and selective technical conferences in the AI space have recently chosen to highlight Spark NLP as the leading technology choice for data scientists:

“We are super excited to see how quickly the industry’s most selective thought leaders have picked up Spark NLP and increasingly recommend it as the default choice to data scientists worldwide. We are committed as ever to keep pushing the state of the art and provide the community with the best performing and most accurate NLP software ever built”, said Saif Addin Ellafi, lead NLP engineer at John Snow Labs.

John Snow Labs maintains a full-time development team to keep improving the open source library, which delivered 10 new releases during the first six months of 2018. The library is monetized by licensing pre-trained models and data sets for the healthcare vertical. The models and data sets are continuously updated and optimized for accuracy on top of the high-performing Spark NLP core. The global market for natural language pro.

Sam Brake Guia

Sam is an energetic and passionate writer/blogger, always looking for the next adventure. In August 2016 he donated all of his possessions to charity, quit his job, and left the UK. Since then he has been on the road travelling through North, Central and South America searching for new adventures and amazing stories.

StartUp Beat has relaunched to take a new and improved look at startups from around the world. We will continue to profile the world’s most innovative early-stage startups through company pitches, interviews and guest columns from entrepreneurial experts. StartUp Beat is now based in Medellín, Colombia, one of Latin America’s burgeoning tech hubs.