Repository Details
HelloGitHub Rating: 10.0 (1 rating)
Python Library for Easily Extracting PDF Text and Tables
Received 30 stars in the past 6 days ✨
Free•MIT
Stars: 10k
Chinese: Yes
Language: Python
Active: Yes
Contributors: 1
Issues: 86
Organization: No
Latest: 0.11.9
Forks: 875
License: MIT

This project is a Python library for parsing PDFs and extracting their text and tables. It reports the position, size, and font information of each character, line, rectangle, and other element in the document, and it can generate page snapshots in a single call for visual debugging.
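The listing does not name the library. The sketch below assumes it is pdfplumber, whose documented interface matches the description above; if the project is a different one, the calls will differ. The helper names `summarize_chars`, `extract_first_page`, and `save_snapshot` are hypothetical and exist only for illustration.

```python
from collections import Counter


def summarize_chars(chars):
    """Count characters per (fontname, size) pair, most frequent first.

    `chars` is a list of dicts shaped like pdfplumber's character objects,
    which carry keys such as "text", "fontname", "size", "x0" and "top".
    """
    fonts = Counter()
    for ch in chars:
        fonts[(ch["fontname"], round(ch["size"], 1))] += 1
    return fonts.most_common()


def extract_first_page(path):
    """Return the text, tables, and a font summary for the first page."""
    # Assumption: the library is pdfplumber. Imported lazily so that
    # summarize_chars stays usable without the package installed.
    import pdfplumber

    with pdfplumber.open(path) as pdf:
        page = pdf.pages[0]
        return {
            "text": page.extract_text(),      # plain text of the page
            "tables": page.extract_tables(),  # list of tables, each a list of rows
            "fonts": summarize_chars(page.chars),
        }


def save_snapshot(path, out_path="page.png"):
    """Render the first page to an image file for visual debugging."""
    import pdfplumber  # same assumption as above

    with pdfplumber.open(path) as pdf:
        pdf.pages[0].to_image(resolution=150).save(out_path)
```

Calling `extract_first_page("report.pdf")` on a local file (the filename is a placeholder) would return a dictionary with the page text, any detected tables, and the fonts in use, while `save_snapshot` writes the rendered page to disk so the extraction can be checked by eye.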