Topic: scraping

scrapy

scrapy

20635 0,17,20,36,26,0,21,0

Scrapy, a fast high-level web crawling & scraping framework for Python.

webmagic

webmagic

4510 0,7,11,0,14,8,17,0

A scalable web crawler framework for Java.

newcrawler

newcrawler

474

Free Web Scraping Tool

django-dynamic-scraper

django-dynamic-scraper

456 0,1

Creating Scrapy scrapers via the Django admin interface

scrapple

scrapple

371 0

A framework for creating semi-automatic web content extractors

ImageScraper

ImageScraper

297 0,0

:scissors: High performance, multi-threaded image scraper

scrapy-cluster

scrapy-cluster

201

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.

arachnid

arachnid

153

Crawl all unique internal links found on a given website

jekyll

jekyll

100 0,0

static site version of Programming Historian

lambda-soup

lambda-soup

79 0,0

Functional HTML scraping and rewriting with CSS in OCaml.

geeks-pdf

geeks-pdf

78

PDF versions of Geeks for Geeks articles. Compiled into book form.

Katastrophe

Katastrophe

78 0

Command Line Tool to download torrents from kat.ph https://pythonhosted.org/katastrophe/