cssQuery() : cssQuery() is a powerful cross-browser JavaScript function that enables querying of a DOM document using CSS selectors. All CSS1 and CSS2 selectors ar...
HTML Purifier : HTML Purifier is a standards-compliant HTML filter library written in PHP. HTML Purifier will not only remove all malicious code (better known as XSS)...
Essentia : Open-source library and tools for audio and music analysis, description and synthesis
Photon : UI toolkit for building desktop apps with Electron.
Clint : Clint is a python module filled with a set of awesome tools for developing commandline applications.
In few words, Given a html document, it pulls out the main body text and cleans it up. It also can clean up title based on latest readability.js code.