Jump to content
Wikipedia The Free Encyclopedia

Apache PDFBox

From Wikipedia, the free encyclopedia
Open-source PDF library
This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these messages)
The topic of this article may not meet Wikipedia's notability guidelines for products and services . Please help to demonstrate the notability of the topic by citing reliable secondary sources that are independent of the topic and provide significant coverage of it beyond a mere trivial mention. If notability cannot be shown, the article is likely to be merged, redirected, or deleted.
Find sources: "Apache PDFBox" – news · newspapers · books · scholar · JSTOR
(June 2014) (Learn how and when to remove this message)
This article may rely excessively on sources too closely associated with the subject , potentially preventing the article from being verifiable and neutral. Please help improve it by replacing them with more appropriate citations to reliable, independent sources. (June 2014) (Learn how and when to remove this message)
(Learn how and when to remove this message)
PDFBox
Developer Apache Software Foundation
Stable release
1.8.x:1.8.17 / 15 September 2022; 3 years ago (2022年09月15日)[1]
2.0.x:2.0.32 / 24 July 2024; 16 months ago (2024年07月24日)[1]
3.0.x:3.0.3 / 8 August 2024; 15 months ago (2024年08月08日)[1]
Repository PDFBox Repository (Mirror)
Written inJava
Operating system Cross-platform
Type Portable Document Format (PDF)
License Apache License 2.0
Websitepdfbox.apache.org

Apache PDFBox is an open source pure-Java library that can be used to create, render, print, split, merge, alter, verify and extract text and meta-data of PDF files.

Open Hub reports over 11,000 commits (since the start as an Apache project) by 18 contributors representing more than 140,000 lines of code. PDFBox has a well established, mature codebase maintained by an average size development team with increasing year-over-year commits. Using the COCOMO model, it took an estimated 46 person-years of effort.[2]

Structure

[edit ]

Apache PDFBox has these components:

  • PDFBox: the main part
  • FontBox: handles font information
  • XmpBox: handles XMP metadata
  • Preflight (optional): checks PDF files for PDF/A-1b conformity.

History

[edit ]

PDFBox was started in 2002 in SourceForge by Ben Litchfield who wanted to be able to extract text of PDF files for Lucene.[3] It became an Apache Incubator project in 2008, and an Apache top level project in 2009.[4]

Preflight was originally named PaDaF and developed by Atos worldline, and donated to the project in 2011.[5]

In February 2015, Apache PDFBox was named an Open Source Partner Organization of the PDF Association.[6]

See also

[edit ]

References

[edit ]
  1. ^ a b c "Apache PDFBox - Blog". pdfbox.apache.org. Apache Software Foundation. Retrieved 2024年10月30日.
  2. ^ "The Apache PDFBox Open Source Project on Open Hub". openhub.net. 2017年03月18日. Retrieved 2017年03月18日.
  3. ^ Apache PDFBox and FontBox 1.0.0 released, The H Open, 16 February 2010
  4. ^ PDFBox Project Incubation Status
  5. ^ PaDaF Preflight Codebase Intellectual Property (IP) Clearance Status
  6. ^ ApacheTM PDFBoxTM named an Open Source Partner Organization of the PDF Association, February 3, 2015
[edit ]
Top-level
projects
Commons
Incubator
Other projects
Attic
Licenses

AltStyle によって変換されたページ (->オリジナル) /