Scrapy : Scrapy is a fast, high-level screen-scraping and web-crawling framework used to crawl websites and extract structured data from their pages. It can be...
HTML_QuickForm2 : This PHP package provides methods to create, validate and render HTML forms.
Bespoke.js : Bespoke.js is a super minimal, modular presentation library for modern browsers, designed to foster a rich plugin ecosystem.
impress.js : impress.js is a presentation framework based on the power of CSS3 transforms and transitions in modern browsers, inspired by the idea behind prezi.com.
docopt : A command-line interface description language. docopt helps you define the interface for your command-line app and automatically generates a parser for it.
In a few words: given an HTML document, it pulls out the main body text and cleans it up. It can also clean up the title, based on the latest readability.js code.