Repository Details
HelloGitHub Rating: 10.0 (1 rating)
Python Library for Easily Extracting PDF Text and Tables
Received 54 stars in the past 6 days ✨
Free•MIT
Stars: 8.9k
Chinese: Yes
Language: Python
Active: Yes
Contributors: 38
Issues: 74
Organization: No
Latest: 0.11.7
Forks: 805
License: MIT

This project is a Python library for parsing PDFs and extracting data from them, including text and tables. It reports the exact position, size, and font information of each character, line, rectangle, and other object in a PDF document, and it can render a snapshot of a page in a single step for visual debugging.
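
To illustrate why per-character position data is useful, here is a minimal sketch that rebuilds text lines from character records. It does not call the library itself: the record fields (`text`, `x0`, `top`, `size`), the function name `group_chars_into_lines`, and the `tolerance` parameter are all hypothetical, chosen only for this example, and the sample data is made up.

```python
def group_chars_into_lines(chars, tolerance=2.0):
    """Group character records into text lines by vertical position.

    Each record is a dict with hypothetical keys:
      "text" - the character itself
      "x0"   - distance of its left edge from the left of the page
      "top"  - distance of its top edge from the top of the page
    Characters whose "top" lies within `tolerance` of the first
    character of the current line are treated as part of that line.
    """
    lines = []  # list of (anchor_top, [records]) pairs
    for ch in sorted(chars, key=lambda c: (c["top"], c["x0"])):
        if lines and abs(ch["top"] - lines[-1][0]) <= tolerance:
            lines[-1][1].append(ch)
        else:
            lines.append((ch["top"], [ch]))
    # Within each line, order characters left to right before joining.
    return [
        "".join(c["text"] for c in sorted(row, key=lambda c: c["x0"]))
        for _, row in lines
    ]


# Made-up sample data: two short lines with slightly jittered baselines.
SAMPLE_CHARS = [
    {"text": "i", "x0": 20.0, "top": 100.0, "size": 12.0},
    {"text": "H", "x0": 10.0, "top": 100.5, "size": 12.0},
    {"text": "O", "x0": 10.0, "top": 120.0, "size": 12.0},
    {"text": "K", "x0": 18.0, "top": 120.2, "size": 12.0},
]

if __name__ == "__main__":
    for line in group_chars_into_lines(SAMPLE_CHARS):
        print(line)
```

The tolerance absorbs small vertical jitter between characters on the same baseline. A value that is too large would merge adjacent lines, so in practice it would likely be tied to the font size.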