Encoding detector backed by the juniversalchardet library (a Java port of Mozilla's universalchardet). Not SPI-loaded by default; add explicitly to your encoding-detector chain when needed.

Artifacts using Apache Tika JUniversalChardet Encoding Detector (8)
Sort by:Popular

Apache Tika Standard Parser Package
Last Release on Mar 23, 2026
Apache Tika Text Parser Module
Last Release on Mar 23, 2026
Apache Tika Microsoft Parser Module
Last Release on Mar 23, 2026
Apache Tika HTML Parser Module
Last Release on Mar 23, 2026
Apache Tika Mail Parser Module
Last Release on Mar 23, 2026
Encoding detector backed by a vendored copy of the ICU4J charset detection engine (CharsetDetector / CharsetRecog_* classes, originally from the unicode-org/icu project, licensed under the Unicode licence). Not SPI-loaded by default; add ...
Last Release on May 10, 2026
Training, evaluation, and diagnostic tools for the ML charset detector. Runtime detector classes live in tika-encoding-detector-mojibuster.
Last Release on May 10, 2026
Apache Tika Standard Parser Integration Tests
Last Release on May 10, 2026
  • Prev
  • 1
  • Next