Text-processing algorithms such as edit distance and string similarity are outside the scope of the standard Java libraries. The Commons Text package provides these additional methods.
This proposal is to create a package of Java utility classes implementing well-known string algorithms and metrics.
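As an illustration of the kind of string metric in scope, the following is a minimal sketch of the Levenshtein edit distance using only standard JDK APIs. The class name EditDistanceSketch and the method name levenshtein are hypothetical and chosen for this example; they are not taken from the proposal and are not the Commons Text API.

```java
import java.util.Objects;

/**
 * Illustrative sketch only. The class and method names are hypothetical
 * and do not represent the Commons Text API.
 */
final class EditDistanceSketch {

    private EditDistanceSketch() {
        // utility class, not meant to be instantiated
    }

    /**
     * Computes the Levenshtein distance: the minimum number of single-character
     * insertions, deletions, and substitutions needed to turn left into right.
     */
    static int levenshtein(CharSequence left, CharSequence right) {
        Objects.requireNonNull(left, "left");
        Objects.requireNonNull(right, "right");

        int n = left.length();
        int m = right.length();

        // Two rows of the dynamic-programming table are enough.
        int[] previous = new int[m + 1];
        int[] current = new int[m + 1];

        // Distance from the empty prefix of left to each prefix of right.
        for (int j = 0; j <= m; j++) {
            previous[j] = j;
        }

        for (int i = 1; i <= n; i++) {
            current[0] = i; // delete all i characters of the left prefix
            for (int j = 1; j <= m; j++) {
                int cost = left.charAt(i - 1) == right.charAt(j - 1) ? 0 : 1;
                int insertion = current[j - 1] + 1;
                int deletion = previous[j] + 1;
                int substitution = previous[j - 1] + cost;
                current[j] = Math.min(Math.min(insertion, deletion), substitution);
            }
            // Reuse the arrays: the row just computed becomes the previous row.
            int[] tmp = previous;
            previous = current;
            current = tmp;
        }
        return previous[m];
    }

    public static void main(String[] args) {
        System.out.println(levenshtein("kitten", "sitting")); // prints 3
    }
}
```

The sketch keeps only two rows of the table, so memory use grows with the length of the second argument rather than with the product of both lengths.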
Commons Text relies only on standard JDK 7 (or later) APIs for production deployment. It uses the JUnit unit testing framework and the Hamcrest matcher library for developing and executing unit tests, but this is of interest only to developers of the component. Commons Text may be a dependency for several existing components in the open source world that implement higher-order text processing.
No external configuration files are utilized.
The initial classes came from the Commons Lang and Commons Codec subprojects.
The proposed package name for the new component is org.apache.commons.text.
commons-text.
The initial committers on the Commons Text component shall be as follows:
Copyright © 2014-2025 The Apache Software Foundation. All Rights Reserved.