Search: [crawler]

24/05/2021

Colly is a Go framework that provides a clean interface for writing any kind of crawler, scraper, or spider.

With Colly you can easily extract structured data from websites, which can be used for a wide range of applications, like data mining, data processing or archiving.

Chromeless https://github.com/graphcool/chromeless

31/07/2017

Chrome automation made simple. Runs locally or headless on AWS Lambda.

Chromeless can be used to...

Run 1000s of browser integration tests in parallel ⚡️
Crawl the web & automate screenshots
Write bots that require a real browser
Do pretty much everything you've used PhantomJS, NightmareJS or Selenium for before

Scrapy http://scrapy.org

27/09/2013

Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.
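
To make "extract structured data from their pages" concrete, here is a minimal sketch of the link-extraction step that crawlers perform. It uses only the Python standard library and is not Scrapy's API; the names `LinkExtractor`, `extract_links`, and `SAMPLE_HTML`, as well as the example URLs, are hypothetical and chosen for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects absolute URLs from <a href="..."> tags (illustrative helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Only anchor tags carry the links a crawler would follow.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative paths against the page's own URL.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return every link found in `html`, resolved against `base_url`."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


# Hypothetical page content; a real crawler would fetch this over HTTP.
SAMPLE_HTML = '<p><a href="/docs">Docs</a> <a href="https://example.org/x">X</a></p>'

if __name__ == "__main__":
    for link in extract_links(SAMPLE_HTML, "https://example.com/"):
        print(link)
```

A full framework adds what this sketch leaves out: fetching pages, scheduling the extracted links, deduplicating visited URLs, and exporting the results.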

Goutte https://github.com/fabpot/Goutte

23/11/2011

Goutte is a screen scraping and web crawling library for PHP.

Goutte provides a nice API to crawl websites and extract data from the HTML/XML responses.
