Scrapy : Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be...
NLTK : The Natural Language Toolkit (NLTK) is a Python package for natural language processing.
It provides easy-to-use interfaces to over 50 corpora and lex...
scikit-learn : Machine Learning in Python
Simple and efficient tools for data mining and data analysis
Accessible to everybody, and reusable in various contexts
Buil...
cliff : Command Line Interface Formulation Framework. cliff is a framework for building command line programs. It uses plugins to define sub-commands, output ...
Clint : Clint is a python module filled with a set of awesome tools for developing commandline applications.
ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.).