pandoc: General Markup Language Conversion Tool. This project can convert multiple document formats to each other, supporting formats such as Markdow… (jgm · Haskell · 7 days ago · 128)
pdfplumber: Python Library for Easily Extracting PDF Text and Tables. This project is a Python-based PDF parsing and data extraction library that can easily extract text… (jsvine · Python · a month ago · 990)
SpeedyNote: Handwritten Note Tool for Old Tablets. This is a handwritten note application specifically designed for classic tablet PCs and old devices. (alpha-liu-01 · C++ · 4 months ago · 2.1k)
clawPDF: Open Source Virtual Printer Tool. This is a virtual (network) printer tool specifically designed for Windows systems. It supports expo… (clawsoftware · C# · 5 months ago · 1.2k)
olmocr: Library for Intelligent PDF Document Parsing. This project leverages Vision-Language Models (VLMs) to parse and linearize complex PDF documents, c… (allenai · Python · 8 months ago · 1.6k)
linuxpdf: Linux System Running Inside a PDF File. This project embeds a Linux system into a PDF file, powered by the RISC-V simulator TinyEMU. Users ca… (ding2210 · C · 8 months ago · 1.1k)
maroto: Generate Stylish PDF Files with Go. This is a library developed in Go that is inspired by the Bootstrap framework for creating PDF files… (johnfercher · Go · a year ago · 2.5k)
PDFQFZ: Free PDF Cross-Page Seal Tool. This project is a tool for stamping cross-page seals on PDF files, which is applicable to the Windows… (flytkgl · C# · a year ago · 1.9k)
diff-pdf: Tool for Intuitive Comparison of Two PDF Files. This is a PDF file comparison tool written in C++. It supports two viewing modes, allowing the diffe… (vslavik · C++ · a year ago · 2.9k)
marker: Project for Converting PDF to Markdown Files. This is a Python project capable of converting PDF, EPUB, and MOBI formatted files into Markdown fil… (datalab-to · Python · 2 years ago · 2.4k)
PDF-Explained: PDF Explained. This project is the unofficial Chinese translation of the book 'PDF Explained'. It introduces how to… (zxyle · Other · 2 years ago · 2.3k)
QuestPDF: .NET Library for Generating PDF Files. This is a .NET library for generating PDF files, providing an easy-to-understand API for designing a… (QuestPDF · C# · 2 years ago · 1k)
Stirling-PDF: A Web Application for Various Operations on PDF Files. This is a powerful and ready-to-use PDF tool that supports splitting/merging files, adding/extractin… (Stirling-Tools · Java · 2 years ago · 9.8k)
sumatrapdf: Free and Lightweight Open Source PDF Reader. This is a free, compact, and fast Chinese Windows PDF reading tool that comes with all the necessary… (sumatrapdfreader · C · 3 years ago · 4k)
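pdf2docx: Python Library for Converting PDF to docx Files. This project extracts data from PDF files with the PyMuPDF library, then uses the python-docx library to parse the content's layout, paragraphs, images, tables, and so on, and finally generates the docx file automatically. (ArtifexSoftware · Python · 3 years ago · 2.9k)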