souvenir
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
readability-lxml http://pypi.python.org/pypi/readability-lxml
11/07/2012 cluster icon
  • docopt : Command-line interface description language. docopt helps you define interface for your command-line app, and automatically generate parser for it.
  • Polymer : Web Components usher in a new era of web development based on encapsulated and interoperable custom elements that extend HTML itself. Built atop these...
  • HTML Purifier : HTML Purifier is a standards-compliant HTML filter library written in PHP. HTML Purifier will not only remove all malicious code (better known as XSS)...
  • Clint : Clint is a python module filled with a set of awesome tools for developing commandline applications.
  • cssQuery() : cssQuery() is a powerful cross-browser JavaScript function that enables querying of a DOM document using CSS selectors. All CSS1 and CSS2 selectors ar...
thumbnail

In few words, Given a html document, it pulls out the main body text and cleans it up. It also can clean up title based on latest readability.js code.

python html readable library
1650 links
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn