Repository Details
HelloGitHub Rating: 10.0 (1 rating)
Python Library for Easily Extracting PDF Text and Tables
Received 34 stars in the past 7 days ✨
Free•MIT
Stars: 9k
Chinese: Yes
Language: Python
Active: No
Contributors: 38
Issues: 79
Organization: No
Latest: 0.11.7
Forks: 821
License: MIT

This project is a Python library for parsing PDFs and extracting data, with straightforward text and table extraction. It accurately reports the position, size, and font information of each character, line, rectangle, and other element in a PDF document, and can generate page snapshots in one step for convenient debugging.
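As a rough illustration of the workflow described above, here is a minimal sketch. The listing text does not name the package, so this assumes the library is pdfplumber, whose description matches; the helper names `summarize_fonts` and `inspect_first_page`, the file paths, and the 150 dpi resolution are hypothetical choices made for the example.

```python
from collections import Counter


def summarize_fonts(chars):
    """Count characters per (fontname, size) pair.

    `chars` is a list of per-character dicts with at least the keys
    "fontname" and "size", as exposed by `page.chars` in pdfplumber
    (assumed library).
    """
    return Counter((c["fontname"], round(c["size"], 1)) for c in chars)


def inspect_first_page(pdf_path, snapshot_path=None):
    """Extract text, tables, and font statistics from the first page.

    Hypothetical helper: optionally saves a page snapshot for debugging.
    """
    import pdfplumber  # third-party; assumed to be the library described

    with pdfplumber.open(pdf_path) as pdf:
        page = pdf.pages[0]
        text = page.extract_text()        # plain text of the page
        tables = page.extract_tables()    # list of tables, each a list of rows
        fonts = summarize_fonts(page.chars)
        if snapshot_path is not None:
            # Render the page to an image file for visual inspection.
            page.to_image(resolution=150).save(snapshot_path)
    return text, tables, fonts
```

Calling `inspect_first_page("report.pdf", "page1.png")` would return the page text, any detected tables, and a per-font character count, while writing a snapshot image next to them; both file names are placeholders.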