Python Specific Formats Processing PDF

10

8

6

4

2


8.8

9.5

8.5

9.7

8.5

9.7

8.3
0.0

7.7

10.0

7.2

8.2

11 PDF packages and projects

  • PyPDF2

    8.8 9.5 L2 Python
    A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
  • PyMuPDF

    8.5 9.7 Python
    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
  • Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.

    Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
    Promo getstream.io
  • WeasyPrint

    8.5 9.7 L1 Python
    The awesome document factory
  • PDFMiner

    8.3 0.0 L3 Python
    DISCONTINUED. Python PDF Parser (Not actively maintained). Check out pdfminer.six.
  • Kreuzberg

    7.7 10.0 HTML
    A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 50+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.
  • Camelot

    7.2 8.2 Python
    A Python library to extract tabular data from PDFs
  • borb

    6.8 8.9 Python
    borb is a library for reading, creating and manipulating PDF files in python.
  • pdftabextract

    6.4 0.0 L3 Python
    A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
  • plutoprint

    4.3 9.4 Python
    A Python Library for Generating PDFs and Images from HTML, powered by PlutoBook
  • ReportLab

    3.4 -
    Allowing Rapid creation of rich PDF documents.
  • Meltano Singer SDK

    2.7 9.8 Python
    Write 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

Add another 'PDF' Package

Do not miss the trending, packages, news and articles with our weekly report.

Awesome Python is part of the LibHunt network. Terms. Privacy Policy.

(CC)
BY-SA
We recommend Spin The Wheel Of Names for a cryptographically secure random name picker.

AltStyle によって変換されたページ (->オリジナル) /