souvenir
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
readability-lxml http://pypi.python.org/pypi/readability-lxml
11/07/2012 cluster icon
  • SpaCy.io : spaCy is a library for industrial-strength natural language processing in Python and Cython. It features state-of-the-art speed and accuracy, a concis...
  • Cliquet : Cliquet is a toolkit to ease the implementation of HTTP microservices, such as data-driven REST APIs.
  • NLTK : The Natural Language Toolkit (NLTK) is a Python package for natural language processing. It provides easy-to-use interfaces to over 50 corpora and lex...
  • pydantic : Data validation and settings management using python type annotations.
  • Kartograph : The Kartograph map generator has just one method, which is there to generate SVG maps (surprise).
thumbnail

In few words, Given a html document, it pulls out the main body text and cleans it up. It also can clean up title based on latest readability.js code.

python html readable library
1645 links
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn