pydoll—WebDriver-Free Browser Automation Python LibraryThis is a Python library for automating Chromium-based browsers. It controls browsers directly throuautoscrape-labs·Python·16 days ago1k
FlareSolverr—Proxy Server Bypassing CF ProtectionThis project can help developers bypass Cloudflare and DDoS-GUARD protection. It sets up a proxy serFlareSolverr·Python·5 months ago763
crawl4ai—LLM-Friendly Asynchronous Web Crawler Framework2This is an asynchronous web crawler framework developed with Python, capable of transforming websiteunclecode·Python·a year ago4.7k
SeleniumBase—Comprehensive Browser Automation Framework1This project is a Python automation testing framework based on Selenium, integrating multiple functiseleniumbase·Python·a year ago4.6k
helium—A Python Library for Simplifying Browser AutomationThis project is a lightweight Python library based on Selenium, which makes writing browser automatimherrmann·Python·a year ago2.6k
crawlee—Crawler Framework that Mimics Human BehaviorThis is a web scraping and browser automation library characterized by its ability to write crawlersapify·TypeScript·a year ago2.3k
Scrapegraph-ai—AI-based Python Web Scraper2This is a Python web scraping library powered by AI. Leveraging the capabilities of Large Language MScrapeGraphAI·Python·a year ago4.9k
katana—Out-of-the-box Spider Tool and FrameworkThis project is a web scraping framework written in Go, which can be used as a command-line tool or projectdiscovery·Go·a year ago1.8k
undetected-chromedriver—Python Library to Bypass Anti-Scraping Detection1This is an optimized Selenium WebDriver patch specifically designed to prevent triggering anti-robotultrafunkamsterdam·Python·a year ago4k
DrissionPage—A Web Automation Tool Similar to Selenium12This is a Python-based web automation tool that supports Chromium-based browsers. It combines the fug1879·Python·2 years ago3.7w
URLFinder—A Rapid Tool for Extracting Webpage InformationThis project is capable of quickly crawling information such as URLs and API interfaces from JavaScrpingc0y·Go·2 years ago4.3k
EasySpider—A Visual Web Crawler Tool6This project allows users to perform automatic data collection/scraping without writing any code thrNaiboWang·JavaScript·2 years ago2.2w
rod—Go 语言的网页自动化和爬虫库1该项目是 Go 语言封装的 DevTools 协议库,实现用 Go 语言操作浏览器,自动化之前需要手动完成的操作,比如:爬取客户端渲染的页面、端到端测试、自动填go-rod·Go·3 years ago1.2w
colly—可能是最知名的 Go 爬虫框架它拥有友好的 API 和丰富代码示例,短时间内即可上手。性能方面单核能达到 1K 请求/秒,还可以轻松管理请求方式、间隔和最大并发数,功能强大且优雅。gocolly·Go·3 years ago1.9k
MechanicalSoup—自动与网站交互的轻量级 Python 库我们写爬虫一般是请求+解析两步走,该项目将 Requests(请求) 和 BeautifulSoup(解析) 两大 Python 爬虫常用库,封装成一个浏览器对MechanicalSoup·Python·3 years ago2.1k