The Apache PDFBox library is an open source Java tool for working with PDF documents.
This artifact contains command-line tools that use Apache PDFBox.
Artifacts using Apache PDFBox Tools (113)
2. OpenCms (133 usages)
org.opencms » opencms-core LGPL
OpenCms is an enterprise-ready, easy to use website content management system based on Java and XML technology. Offering a complete set of features, OpenCms helps content managers worldwide to create and maintain beautiful websites fast and ...
Last Release on Apr 13, 2026
Tess4J
A Java JNA wrapper for the Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.
Last Release on Jan 17, 2026
Apache Solr Content Extraction Library integrates the Apache Tika
content extraction framework into Solr.
Last Release on Sep 25, 2024
6. GATE Core (28 usages)
uk.ac.gate » gate-core LGPL
GATE - general architecture for text engineering - is open source
software capable of solving almost any text processing problem. This
artifact enables you to embed the core GATE Embedded with its essential
dependencies.
Last Release on Mar 9, 2021
