
Apache PDFBox

Apache PDFBox is an open-source, pure-Java library that can be used to create, render, print, split, merge, alter, and verify PDF files and to extract their text and metadata.

PDFBox
Developer(s): Apache Software Foundation
Stable release:
  1.8.x: 1.8.17 / 15 September 2022[1]
  2.0.x: 2.0.29 / 1 July 2023[1]
  3.0.x: 3.0.0 / 18 August 2023[1]
Repository: PDFBox Repository (Mirror)
Written in: Java
Operating system: Cross-platform
Type: Portable Document Format (PDF)
License: Apache License 2.0
Website: pdfbox.apache.org

Open Hub reports over 11,000 commits (since the project's start at Apache) by 18 contributors, representing more than 140,000 lines of code. PDFBox has a well-established, mature codebase maintained by an average-sized development team with increasing year-over-year commits. Using the COCOMO model, the project is estimated to have taken 46 person-years of effort.[2]
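
To show how a COCOMO-style effort figure is derived from a line count, the following is a minimal sketch of the basic COCOMO formula in Java. It assumes the organic-mode coefficients (2.4 and 1.05) from Boehm's basic model; the article does not state which variant, coefficients, or exact line count Open Hub used, so the class name, constants, and the 140 KLOC input are illustrative assumptions, and the result is not expected to reproduce the 46 person-year figure.

```java
public class CocomoSketch {
    // Basic COCOMO, organic mode: effort in person-months = A * KLOC^B.
    // Assumption: these coefficients; the variant behind the cited figure is not given.
    static final double A = 2.4;
    static final double B = 1.05;

    // Estimated effort in person-months for a codebase of the given size (thousands of lines).
    static double personMonths(double kloc) {
        return A * Math.pow(kloc, B);
    }

    // Same estimate converted to person-years (12 person-months per person-year).
    static double personYears(double kloc) {
        return personMonths(kloc) / 12.0;
    }

    public static void main(String[] args) {
        // Hypothetical input: the "more than 140,000 lines" lower bound from the text.
        double kloc = 140.0;
        System.out.printf("%.0f KLOC -> %.1f person-years%n", kloc, personYears(kloc));
    }
}
```

Because the exponent is greater than 1, the estimate grows slightly faster than the line count, so a larger line count than the lower bound used here gives a disproportionately higher effort figure.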

Structure

Apache PDFBox has these components:

  • PDFBox: the main part
  • FontBox: handles font information
  • XmpBox: handles XMP metadata
  • Preflight (optional): checks PDF files for PDF/A-1b conformity.

History

PDFBox was started in 2002 on SourceForge by Ben Litchfield, who wanted to extract text from PDF files for Lucene.[3] It became an Apache Incubator project in 2008 and an Apache top-level project in 2009.[4]

Preflight was originally named PaDaF. It was developed by Atos Worldline and donated to the project in 2011.[5]

In February 2015, Apache PDFBox was named an Open Source Partner Organization of the PDF Association.[6]

See also

  • List of PDF software

References

  1. "Apache PDFBox - Blog". pdfbox.apache.org. Apache Software Foundation. Retrieved 2022-09-27.
  2. "The Apache PDFBox Open Source Project on Open Hub". openhub.net. 2017-03-18. Retrieved 2017-03-18.
  3. "Apache PDFBox and FontBox 1.0.0 released". The H Open. 16 February 2010.
  4. "PDFBox Project Incubation Status".
  5. "PaDaF Preflight Codebase Intellectual Property (IP) Clearance Status".
  6. "Apache™ PDFBox™ named an Open Source Partner Organization of the PDF Association". February 3, 2015.

External links

  • Apache PDFBox Project
