Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Docling Project

Welcome to the Docling Project

This is the GitHub organization Docling open-source project. We like to get continuous feedback from the community: take the poll!

Docling

Docling is our main open-source package. It is a powerful library which simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

We support an amazing community which helps us driving forward the adoption of Docling. Give it a try and join the community!



The key repositories of Docling are:

  • docling - The home of the main docling package.
  • docling-core - The definition of types, transforms, serializers, etc. If it has to do with the DoclingDocument you will find it here.
  • docling-parse - The backend PDF parser used by Docling.
  • docling-serve - The FastAPI wrappers for running Docling as REST API and distribute large jobs.
  • docling-ibm-models - The AI models powering Docling.
  • docling-sdg - Synthetic data generation (SDG) on documents for dataset generation for RAG, finetuning, etc.
  • docling-mcp - The definition of tools with the Model Context Protocol for document conversion, manipulation and generation agents.

LF AI & Data

Docling is hosted as a project in the LF AI & Data Foundation.

IBM ❤️ Open Source AI

The project was started by the AI for knowledge team at IBM Research Zurich.

Pinned Loading

  1. docling docling Public

    Get your documents ready for gen AI

    Python 41.6k 3k

  2. docling-serve docling-serve Public

    Running Docling as an API service

    Python 825 187

  3. docling-core docling-core Public

    A python library to define and validate data types in Docling.

    Python 191 97

  4. community community Public

    6 1

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 21 repositories
  • docling-java Public
    docling-project/docling-java’s past year of commit activity
    1 MIT 1 4 1 Updated Oct 15, 2025
  • docling Public

    Get your documents ready for gen AI

    docling-project/docling’s past year of commit activity
    Python 41,623 MIT 2,963 636 (8 issues need help) 19 Updated Oct 15, 2025
  • docling-serve Public

    Running Docling as an API service

    docling-project/docling-serve’s past year of commit activity
    Python 825 MIT 186 80 8 Updated Oct 15, 2025
  • docling-project/docling-ibm-models’s past year of commit activity
    Python 155 MIT 49 24 9 Updated Oct 15, 2025
  • docling-parse Public

    Simple package to extract text with coordinates from programmatic PDFs

    docling-project/docling-parse’s past year of commit activity
    C++ 203 MIT 46 37 7 Updated Oct 15, 2025
  • docling-eval Public

    Evaluation framework for document processing models and services.

    docling-project/docling-eval’s past year of commit activity
    Python 46 MIT 10 10 8 Updated Oct 15, 2025
  • docling-core Public

    A python library to define and validate data types in Docling.

    docling-project/docling-core’s past year of commit activity
    Python 191 MIT 97 34 9 Updated Oct 15, 2025
  • website Public

    The Docling website

    docling-project/website’s past year of commit activity
    TypeScript 1 0 0 0 Updated Oct 14, 2025
  • docling-mcp Public

    Making docling agentic through MCP

    docling-project/docling-mcp’s past year of commit activity
    Python 289 MIT 62 15 4 Updated Oct 14, 2025
  • docling-project/docling-jobkit’s past year of commit activity
    Python 12 MIT 8 8 1 Updated Oct 13, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

AltStyle によって変換されたページ (->オリジナル) /