Encoding detector backed by the juniversalchardet library (a Java port of
Mozilla's universalchardet). Not SPI-loaded by default; add explicitly to
your encoding-detector chain when needed.
Artifacts using Apache Tika JUniversalChardet Encoding Detector (8)
Apache Tika Standard Parser Package
Last Release on Mar 23, 2026
Encoding detector backed by a vendored copy of the ICU4J charset detection
engine (CharsetDetector / CharsetRecog_* classes, originally from the
unicode-org/icu project, licensed under the Unicode licence).
Not SPI-loaded by default; add ...
Last Release on May 10, 2026
7.Apache Tika ML Charset Detector — Training and Evaluation Tools
org.apache.tika » tika-ml-chardetect Apache
Training, evaluation, and diagnostic tools for the ML charset detector.
Runtime detector classes live in tika-encoding-detector-mojibuster.
Last Release on May 10, 2026
8.Apache Tika Standard Parser Integration Tests
org.apache.tika » tika-parsers-standard-integration-tests Apache
Apache Tika Standard Parser Integration Tests
Last Release on May 10, 2026
- Prev
- 1
- Next
