All Classes
-
All Classes Interface Summary Class Summary Class Description BackwardsCompatibleNgramFilter Filters those that were not generated by the old n-gram generator.BuiltInLanguages CharNormalizer Some character normalization (and exclusion) functionality.CharNormalizerTextFilterImpl Deprecated. can't be used because it would be a big loss to not inline this code.CommandLineInterface LangDetect Command Line Interface.CommonTextObjectFactories Contains some standardTextObjectFactory
s ready to use for common use cases.DetectedLanguage Holds information about a detected language: the locale (language) and the probability.GenProfile Load Wikipedia's abstract XML as corpus and generate its language profile in JSON format.GenProfile Generate a language profile from any given text file.IOUtils Deprecated. LangProfile Deprecated. replaced by LanguageProfileLangProfileReader ReadsLangProfile
s.LangProfileWriter Writes aLangProfile
to an output stream (file).LanguageDetector Guesses the language of an input string or text.LanguageDetectorBuilder Builder forLanguageDetector
.LanguageDetectorImpl This class is immutable and thus thread-safe.LanguageProfile A language profile knows the locale (language), and contains the n-grams and some statistics.LanguageProfileBuilder Builder forLanguageProfile
.LanguageProfileImpl This class is immutable.LanguageProfileReader ReadsLanguageProfile
s.LanguageProfileWriter Writes aLanguageProfile
to an output stream or file.LdLocale A language-detector implementation of a Locale, similar to the java.util.Locale.Messages This isMessages
class generated by Eclipse automatically.MultiTextFilter Groups multipleTextFilter
s as one and runs them in the given order.NGram TODO document.NgramExtractor Class for extracting n-grams out of a text.NgramExtractors Provides easy access to commonly used NgramExtractor configs.NgramFilter Filters out some undesired n-grams.NgramFrequencyData Contains frequency information for n-grams coming from multipleLanguageProfile
s.OldLangProfileConverter Converts an oldLangProfile
to a newLanguageProfile
.OldNgramExtractor Deprecated. OldNgramExtractor.Filter RemoveMinorityScriptsTextFilter Removes text written in scripts that are not the dominant script of the text.StandardNgramFilter Filters what is generally not desired.TagExtractor TagExtractor
is a class which extracts inner texts of specified tag.TextFilter Allows to filter content from a text to be ignored for the n-gram analysis.TextObject A convenient text object implementing CharSequence and Appendable.TextObjectFactory Factory forTextObject
s.TextObjectFactoryBuilder Builder forTextObjectFactory
.UrlTextFilter Removes URLs and email addresses from the text.Util A place for sharing code.