souvenir
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
readability-lxml http://pypi.python.org/pypi/readability-lxml
11/07/2012 cluster icon
  • Polymer : Web Components usher in a new era of web development based on encapsulated and interoperable custom elements that extend HTML itself. Built atop these...
  • Bespoke.js : Bespoke.js is a super minimal, modular presentation library for modern browsers, designed to foster a rich plugin ecosystem.
  • cliff : Command Line Interface Formulation Framework. cliff is a framework for building command line programs. It uses plugins to define sub-commands, output ...
  • SpaCy.io : spaCy is a library for industrial-strength natural language processing in Python and Cython. It features state-of-the-art speed and accuracy, a concis...
  • scikit-learn : Machine Learning in Python Simple and efficient tools for data mining and data analysis Accessible to everybody, and reusable in various contexts Buil...
thumbnail

In few words, Given a html document, it pulls out the main body text and cleans it up. It also can clean up title based on latest readability.js code.

python html readable library
1638 links
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn