For Python you could go with Frontera by Scrapinghub
https://github.com/scrapinghub/frontera
They’re the same guys that make Scrapy.
There’s also Apache Nutch which is a much older project.
http://nutch.apache.org/
1
solved How to build a powerful crawler like google’s? [closed]