Maily Herald : Maily Herald is a Ruby on Rails engine that helps you send and manage your mailings.
Think of Maily as a self-hosted Mailchimp® you can easily integra...
Ampache : Ampache is a web based audio/video streaming application and file manager allowing you to access your music & videos from anywhere, using almost any i...
Cachet : Beautiful & simple service statuses. The open source status page system, for everyone.
arkOS : A project to help users self-host their websites, email, files and more. Decentralize your web and reclaim your privacy rights while keeping the conve...
poche is a self hosted read-it-later app. unlike to other services, poche is free and open source.
Scrapy : Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be...
cssQuery() : cssQuery() is a powerful cross-browser JavaScript function that enables querying of a DOM document using CSS selectors. All CSS1 and CSS2 selectors ar...
HTML Purifier : HTML Purifier is a standards-compliant HTML filter library written in PHP. HTML Purifier will not only remove all malicious code (better known as XSS)...
pydantic : Data validation and settings management using python type annotations.
SpaCy.io : spaCy is a library for industrial-strength natural language processing in Python and Cython. It features state-of-the-art speed and accuracy, a concis...
In few words, Given a html document, it pulls out the main body text and cleans it up. It also can clean up title based on latest readability.js code.