SpaCy.io : spaCy is a library for industrial-strength natural language processing in Python and Cython. It features state-of-the-art speed and accuracy, a concis...
Cliquet : Cliquet is a toolkit to ease the implementation of HTTP microservices, such as data-driven REST APIs.
NLTK : The Natural Language Toolkit (NLTK) is a Python package for natural language processing.
It provides easy-to-use interfaces to over 50 corpora and lex...
pydantic : Data validation and settings management using python type annotations.
Kartograph : The Kartograph map generator has just one method, which is there to generate SVG maps (surprise).
In few words, Given a html document, it pulls out the main body text and cleans it up. It also can clean up title based on latest readability.js code.