Repository Details
Shared by


HelloGitHub Rating
10.0
3 ratings
Free•Apache-2.0
Claim
Discuss
Collect
Share
51.8k
Stars
No
Chinese
Python
Language
Yes
Active
51
Contributors
255
Issues
No
Organization
0.7.4
Latest
5k
Forks
Apache-2.0
License
More
This is an asynchronous web crawler framework developed with Python, capable of transforming website data into LLM-friendly output formats such as Markdown, JSON, etc. It is fully open-source and free, greatly simplifying the writing of asynchronous crawlers. Compared to the paid Firecrawl, it has faster crawling speed, supports simultaneous capture of multiple URLs, page screenshot, keyword optimized extraction (based on LLM), and complex multi-page session management.
Comments
Rating:
No comments yet