Class LuceneSingleIndexLanguageModel

  • All Implemented Interfaces:
    AutoCloseable, LanguageModel

    public class LuceneSingleIndexLanguageModel
    extends BaseLanguageModel
    Information about ngram occurrences, taken from Lucene indexes (one index per ngram level). This is not a real language model as it only returns information about occurrence counts but has no probability calculation, especially not for the case with 0 occurrences.
    Since:
    3.2
    • Constructor Detail

      • LuceneSingleIndexLanguageModel

        public LuceneSingleIndexLanguageModel​(File topIndexDir)
        Parameters:
        topIndexDir - a directory which contains at least another sub directory called 3grams, which is a Lucene index with ngram occurrences as created by org.languagetool.dev.FrequencyIndexCreator.
      • LuceneSingleIndexLanguageModel

        @Experimental
        public LuceneSingleIndexLanguageModel​(int maxNgram)