Repository Details
Shared by
HelloGitHub Rating
10.0
2 ratings
Free•Apache-2.0
Claim
Discuss
Collect
Share
14.9k
Stars
No
Chinese
Python
Language
Yes
Active
11
Contributors
46
Issues
No
Organization
None
Latest
1k
Forks
Apache-2.0
License
More
This is an asynchronous web crawler framework developed with Python, capable of transforming website data into LLM-friendly output formats such as Markdown, JSON, etc. It is fully open-source and free, greatly simplifying the writing of asynchronous crawlers. Compared to the paid Firecrawl, it has faster crawling speed, supports simultaneous capture of multiple URLs, page screenshot, keyword optimized extraction (based on LLM), and complex multi-page session management.
Comments
Rating:
No comments yet