Piwik : Piwik is the leading open source web analytics platform that gives you valuable insights on your website's visitors, your marketing campaigns and much...
ProjectPier.org : ProjectPier is a Free, Open-Source, PHP application for managing tasks, projects and teams through an intuitive web interface. It must be downloaded a...
scikit-learn : Machine Learning in Python
Simple and efficient tools for data mining and data analysis
Accessible to everybody, and reusable in various contexts
Buil...
HTML Purifier : HTML Purifier is a standards-compliant HTML filter library written in PHP. HTML Purifier will not only remove all malicious code (better known as XSS)...
SpaCy.io : spaCy is a library for industrial-strength natural language processing in Python and Cython. It features state-of-the-art speed and accuracy, a concis...
Bespoke.js : Bespoke.js is a super minimal, modular presentation library for modern browsers, designed to foster a rich plugin ecosystem.
Photon : UI toolkit for building desktop apps with Electron.
In few words, Given a html document, it pulls out the main body text and cleans it up. It also can clean up title based on latest readability.js code.