docling

这是一个由 IBM 开源的 Python 工具，专门用于将各类文档转化为适合生成式 AI 使用的格式。它能够将 PDF、DOCX、PPTX、图片、HTML、Markdown 等多种流行文档格式，导出为 Markdown 和 JSON 格式，支持多种 OCR 引擎（PDF）、统一的文档对象（DoclingDocument），轻松集成检索增强生成（RAG）和问答应用，适用于需要将文档作为生成式 AI 模型输入的场景。

This is a Python tool open-sourced by IBM, specifically designed to convert various documents into formats suitable for generative AI. It can export multiple popular document formats such as PDF, DOCX, PPTX, images, HTML, and Markdown into Markdown and JSON formats. It supports multiple OCR engines (for PDF) and a unified document object (DoclingDocument), and can be easily integrated into retrieval-augmented generation (RAG) and question-answering applications. It is suitable for scenarios where documents need to be used as input for generative AI models.

docling

docling

评论