A B C D E F G H I K L M N O P R S T U V W
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- a - Variable in class morfologik.fsa.builders.FSAUtils.IntIntHolder
- accept(int) - Method in interface morfologik.fsa.StateVisitor
- add(byte[], int, int) - Method in class morfologik.fsa.builders.FSABuilder
-
Add a single sequence of bytes to the FSA.
- ADDRESS_OFFSET - Static variable in class morfologik.fsa.FSA5
-
An offset in the arc structure, where the address and flags field begins.
- annotation - Variable in class morfologik.fsa.FSA5
-
Annotation character.
- annotationByte - Variable in class morfologik.fsa.builders.FSA5Serializer
- applyReplacements(CharSequence, LinkedHashMap<String, String>) - Static method in class morfologik.stemming.DictionaryLookup
-
Apply partial string replacements from a given map.
- arcs - Variable in class morfologik.fsa.CFSA
-
An array of bytes with the internal representation of the automaton.
- arcs - Variable in class morfologik.fsa.CFSA2
-
An array of bytes with the internal representation of the automaton.
- arcs - Variable in class morfologik.fsa.FSA5
-
An array of bytes with the internal representation of the automaton.
- arcsCount - Variable in class morfologik.fsa.builders.FSAInfo
-
Number of arcs in the automaton, excluding an arcs from the zero node (initial) and an arc from the start node to the root node.
- arcsCountTotal - Variable in class morfologik.fsa.builders.FSAInfo
-
Total number of arcs, counting arcs that physically overlap due to merging.
- ARG_OVERWRITE - Static variable in class morfologik.tools.CliTool
- ARG_VALIDATE - Static variable in class morfologik.tools.CliTool
- asShort(Set<FSAFlags>) - Static method in enum morfologik.fsa.FSAFlags
- author(String) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- AUTHOR - morfologik.stemming.DictionaryAttribute
-
Dictionary author.
- AUTOMATON_HAS_PREFIX - Static variable in class morfologik.fsa.MatchResult
-
The automaton contains a prefix of the input sequence (but the full sequence does not exist).
B
- b - Variable in class morfologik.fsa.builders.FSAUtils.IntIntHolder
- BIT_FINAL_ARC - Static variable in class morfologik.fsa.CFSA
-
Bitmask indicating that an arc corresponds to the last character of a sequence available when building the automaton.
- BIT_FINAL_ARC - Static variable in class morfologik.fsa.CFSA2
-
The arc corresponds to the last character of a sequence available when building the automaton (acceptor transition).
- BIT_FINAL_ARC - Static variable in class morfologik.fsa.FSA5
-
Bit indicating that an arc corresponds to the last character of a sequence available when building the automaton.
- BIT_LAST_ARC - Static variable in class morfologik.fsa.CFSA
-
Bitmask indicating that an arc is the last one of the node's list and the following one belongs to another node.
- BIT_LAST_ARC - Static variable in class morfologik.fsa.CFSA2
-
The arc is the last one from the current node's arcs list.
- BIT_LAST_ARC - Static variable in class morfologik.fsa.FSA5
-
Bit indicating that an arc is the last one of the node's list and the following one belongs to another node.
- BIT_TARGET_NEXT - Static variable in class morfologik.fsa.CFSA
-
Bitmask indicating that the target node of this arc follows it in the compressed automaton structure (no goto field).
- BIT_TARGET_NEXT - Static variable in class morfologik.fsa.CFSA2
-
The target node of this arc follows the last arc of the current state (no goto field).
- BIT_TARGET_NEXT - Static variable in class morfologik.fsa.FSA5
-
Bit indicating that the target node of this arc follows it in the compressed automaton structure (no goto field).
- bits - Variable in enum morfologik.fsa.FSAFlags
-
Bit mask for the corresponding flag.
- BufferUtils - Class in morfologik.stemming
- build() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- build(byte[][]) - Static method in class morfologik.fsa.builders.FSABuilder
-
Build a minimal, deterministic automaton from a sorted list of byte sequences.
- build(Iterable<byte[]>) - Static method in class morfologik.fsa.builders.FSABuilder
-
Build a minimal, deterministic automaton from an iterable list of byte sequences.
- builder() - Static method in class morfologik.stemming.DictionaryMetadata
- ByteSequenceIterator - Class in morfologik.fsa
-
An iterator that traverses the right language of a given node (all sequences reachable from a given node).
- ByteSequenceIterator(FSA) - Constructor for class morfologik.fsa.ByteSequenceIterator
-
Create an instance of the iterator iterating over all automaton sequences.
- ByteSequenceIterator(FSA, int) - Constructor for class morfologik.fsa.ByteSequenceIterator
-
Create an instance of the iterator for a given node.
- bytesToChars(CharsetDecoder, ByteBuffer, CharBuffer) - Static method in class morfologik.stemming.BufferUtils
-
Convert byte buffer's content into characters.
C
- calculateFanOuts(FSA, int) - Static method in class morfologik.fsa.builders.FSAUtils
-
Calculate fan-out ratio (how many nodes have a given number of outgoing arcs).
- call() - Method in class morfologik.tools.DictApply
- call() - Method in class morfologik.tools.DictCompile
- call() - Method in class morfologik.tools.DictDecompile
- call() - Method in class morfologik.tools.FSACompile
- call() - Method in class morfologik.tools.FSADecompile
- call() - Method in class morfologik.tools.FSAInfo
- CFSA - Class in morfologik.fsa
-
CFSA (Compact Finite State Automaton) binary format implementation.
- CFSA2 - Class in morfologik.fsa
-
CFSA (Compact Finite State Automaton) binary format implementation, version 2:
CFSA2.BIT_TARGET_NEXT
applicable on all arcs, not necessarily the last one. v-coded goto field v-coded perfect hashing numbers, if any 31 most frequent labels integrated with flags byte - CFSA2 - morfologik.tools.SerializationFormat
- CFSA2Serializer - Class in morfologik.fsa.builders
- CFSA2Serializer() - Constructor for class morfologik.fsa.builders.CFSA2Serializer
- charsToBytes(CharsetEncoder, CharBuffer, ByteBuffer) - Static method in class morfologik.stemming.BufferUtils
-
Convert chars into bytes.
- checkNotNull(T) - Static method in class morfologik.tools.CliTool
- clearAndEnsureCapacity(ByteBuffer, int) - Static method in class morfologik.stemming.BufferUtils
-
Ensure the buffer's capacity is large enough to hold a given number of elements.
- clearAndEnsureCapacity(CharBuffer, int) - Static method in class morfologik.stemming.BufferUtils
-
Ensure the buffer's capacity is large enough to hold a given number of elements.
- CliTool - Class in morfologik.tools
-
Base class for command-line applications.
- CliTool() - Constructor for class morfologik.tools.CliTool
- clone() - Method in class morfologik.stemming.WordData
-
Declare a covariant of
Object.clone()
that returns a deep copy of this object. - code - Variable in enum morfologik.tools.ExitStatus
- compareTo(Speller.CandidateData) - Method in class morfologik.speller.Speller.CandidateData
- complete() - Method in class morfologik.fsa.builders.FSABuilder
- CONSTANT_ARC_AUTOMATON_SIZE - morfologik.fsa.builders.FSABuilder.InfoEntry
- CONVERT_CASE - morfologik.stemming.DictionaryAttribute
-
if the spelling dictionary is supposed to treat upper and lower case as equivalent.
- convertCase() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- convertCase(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- convertsCase() - Method in class morfologik.speller.Speller
-
Used to determine whether the dictionary supports case conversions.
- CREATION_DATE - morfologik.stemming.DictionaryAttribute
-
Dictionary creation date.
- creationDate(String) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- cuted(int, int, int) - Method in class morfologik.speller.Speller
-
Calculates cut-off edit distance.
D
- decode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in interface morfologik.stemming.ISequenceEncoder
- decode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.NoEncoder
- decode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.TrimInfixAndSuffixEncoder
- decode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.TrimPrefixAndSuffixEncoder
- decode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.TrimSuffixEncoder
- DEFAULT_ANNOTATION - Static variable in class morfologik.fsa.FSA5
-
Default annotation byte.
- DEFAULT_FILLER - Static variable in class morfologik.fsa.FSA5
-
Default filler byte.
- DictApply - Class in morfologik.tools
-
Applies a morphological dictionary automaton to the input.
- DictApply(Path, Path, String) - Constructor for class morfologik.tools.DictApply
- DictCompile - Class in morfologik.tools
-
Decompiles morphological dictionary automaton back to source state.
- DictCompile(Path, boolean, boolean, boolean, boolean, boolean) - Constructor for class morfologik.tools.DictCompile
- DictDecompile - Class in morfologik.tools
-
Decompiles morphological dictionary automaton back to source state.
- DictDecompile(Path, Path, boolean, boolean) - Constructor for class morfologik.tools.DictDecompile
- Dictionary - Class in morfologik.stemming
-
A dictionary combines
FSA
automaton andDictionaryMetadata
describing the way terms are encoded in the automaton. - Dictionary(FSA, DictionaryMetadata) - Constructor for class morfologik.stemming.Dictionary
-
It is strongly recommended to use static methods in this class for reading dictionaries.
- DictionaryAttribute - Enum in morfologik.stemming
-
Attributes applying to
Dictionary
andDictionaryMetadata
. - DictionaryIterator - Class in morfologik.stemming
-
An iterator over
WordData
entries of aDictionary
. - DictionaryIterator(Dictionary, CharsetDecoder, boolean) - Constructor for class morfologik.stemming.DictionaryIterator
- DictionaryLookup - Class in morfologik.stemming
-
This class implements a dictionary lookup of an inflected word over a dictionary previously compiled using the
dict_compile
tool. - DictionaryLookup(Dictionary) - Constructor for class morfologik.stemming.DictionaryLookup
-
Creates a new object of this class using the given FSA for word lookups and encoding for converting characters to bytes.
- DictionaryMetadata - Class in morfologik.stemming
-
Description of attributes, their types and default values.
- DictionaryMetadata(Map<DictionaryAttribute, String>) - Constructor for class morfologik.stemming.DictionaryMetadata
-
Create an instance from an attribute map.
- DictionaryMetadataBuilder - Class in morfologik.stemming
-
Helper class to build
DictionaryMetadata
instances. - DictionaryMetadataBuilder() - Constructor for class morfologik.stemming.DictionaryMetadataBuilder
E
- ed(int, int, int, int) - Method in class morfologik.speller.Speller
-
Calculates edit distance.
- encode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in interface morfologik.stemming.ISequenceEncoder
- encode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.NoEncoder
- encode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.TrimInfixAndSuffixEncoder
- encode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.TrimPrefixAndSuffixEncoder
- encode(ByteBuffer, ByteBuffer, ByteBuffer) - Method in class morfologik.stemming.TrimSuffixEncoder
- encoder(EncoderType) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ENCODER - morfologik.stemming.DictionaryAttribute
-
Locale associated with the dictionary.
- EncoderType - Enum in morfologik.stemming
-
Known
ISequenceEncoder
s. - encoding(String) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- encoding(Charset) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ENCODING - morfologik.stemming.DictionaryAttribute
-
Character to byte encoding used for strings inside the FSA.
- equals(Object) - Method in class morfologik.stemming.WordData
- EQUIVALENT_CHARS - morfologik.stemming.DictionaryAttribute
-
Equivalent characters (treated similarly as equivalent chars with and without diacritics).
- ERROR_CONFIRMATION_REQUIRED - morfologik.tools.ExitStatus
-
A potentially destructive command requires explicit confirmation that was not present.
- ERROR_INVALID_ARGUMENTS - morfologik.tools.ExitStatus
-
Invalid input arguments or their combination.
- ERROR_OTHER - morfologik.tools.ExitStatus
-
Unknown error cause.
- ESTIMATED_MEMORY_CONSUMPTION_MB - morfologik.fsa.builders.FSABuilder.InfoEntry
- EXACT_MATCH - Static variable in class morfologik.fsa.MatchResult
-
The automaton has exactly one match for the input sequence.
- ExitStatus - Enum in morfologik.tools
F
- filler - Variable in class morfologik.fsa.FSA5
-
Filler character.
- fillerByte - Variable in class morfologik.fsa.builders.FSA5Serializer
- finalStatesCount - Variable in class morfologik.fsa.builders.FSAInfo
-
Number of final states (number of input sequences stored in the automaton).
- findReplacementCandidates(String) - Method in class morfologik.speller.Speller
-
Find and return suggestions by using K.
- findReplacements(String) - Method in class morfologik.speller.Speller
-
Find suggestions by using K.
- FLEXIBLE - morfologik.fsa.FSAFlags
-
Daciuk: flexible FSA encoding.
- FREQUENCY_INCLUDED - morfologik.stemming.DictionaryAttribute
-
If the FSA dictionary includes frequency data.
- frequencyIncluded() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- frequencyIncluded(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- fromPropertyName(String) - Static method in enum morfologik.stemming.DictionaryAttribute
- fromString(String) - Method in enum morfologik.stemming.DictionaryAttribute
-
Converts a string to the given attribute's value.
- fsa - Variable in class morfologik.stemming.Dictionary
-
FSA
automaton with the compiled dictionary data. - FSA - Class in morfologik.fsa
-
This is a top abstract class for handling finite state automata.
- FSA() - Constructor for class morfologik.fsa.FSA
- FSA5 - Class in morfologik.fsa
-
FSA binary format implementation for version 5.
- FSA5 - morfologik.tools.SerializationFormat
- FSA5Serializer - Class in morfologik.fsa.builders
-
Serializes in-memory
FSA
graphs to a binary format compatible with Jan Daciuk'sfsa
's packageFSA5
format. - FSA5Serializer() - Constructor for class morfologik.fsa.builders.FSA5Serializer
- FSABuild - Class in morfologik.tools
-
Deprecated.
- FSABuild() - Constructor for class morfologik.tools.FSABuild
-
Deprecated.
- FSABuilder - Class in morfologik.fsa.builders
-
Fast, memory-conservative finite state automaton builder, returning an in-memory
FSA
that is a tradeoff between construction speed and memory consumption. - FSABuilder() - Constructor for class morfologik.fsa.builders.FSABuilder
- FSABuilder(int) - Constructor for class morfologik.fsa.builders.FSABuilder
- FSABuilder.InfoEntry - Enum in morfologik.fsa.builders
-
Debug and information constants.
- FSACompile - Class in morfologik.tools
-
Build finite state automaton out of text input.
- FSACompile(Path, Path, SerializationFormat, boolean, boolean, boolean) - Constructor for class morfologik.tools.FSACompile
- FSADecompile - Class in morfologik.tools
-
Dump all byte sequences encoded in a finite state automaton.
- FSADecompile(Path, Path) - Constructor for class morfologik.tools.FSADecompile
- FSADump - Class in morfologik.tools
-
Deprecated.
- FSADump() - Constructor for class morfologik.tools.FSADump
-
Deprecated.
- FSAFlags - Enum in morfologik.fsa
-
FSA automaton flags.
- FSAHeader - Class in morfologik.fsa
-
Standard FSA file header, as described in
fsa
package documentation. - FSAInfo - Class in morfologik.fsa.builders
-
Compute additional information about an FSA: number of arcs, nodes, etc.
- FSAInfo - Class in morfologik.tools
-
Print extra information about a compiled automaton file.
- FSAInfo(int, int, int, int) - Constructor for class morfologik.fsa.builders.FSAInfo
- FSAInfo(Path) - Constructor for class morfologik.tools.FSAInfo
- FSAInfo(FSA) - Constructor for class morfologik.fsa.builders.FSAInfo
- FSASerializer - Interface in morfologik.fsa.builders
-
All FSA serializers (to binary formats) will implement this interface.
- FSATraversal - Class in morfologik.fsa
-
This class implements some common matching and scanning operations on a generic FSA.
- FSATraversal(FSA) - Constructor for class morfologik.fsa.FSATraversal
-
Traversals of the given FSA.
- FSAUtils - Class in morfologik.fsa.builders
-
Other FSA-related utilities not directly associated with the class hierarchy.
- FSAUtils() - Constructor for class morfologik.fsa.builders.FSAUtils
- FSAUtils.IntIntHolder - Class in morfologik.fsa.builders
G
- get() - Method in enum morfologik.stemming.EncoderType
- get(int, int) - Method in class morfologik.speller.HMatrix
-
Provide an item of hMatrix indexed by indices.
- getAllReplacements(String, int, int) - Method in class morfologik.speller.Speller
- getArc(int, byte) - Method in class morfologik.fsa.CFSA
- getArc(int, byte) - Method in class morfologik.fsa.CFSA2
- getArc(int, byte) - Method in class morfologik.fsa.FSA
- getArc(int, byte) - Method in class morfologik.fsa.FSA5
- getArcCount(int) - Method in class morfologik.fsa.FSA
- getArcLabel(int) - Method in class morfologik.fsa.CFSA
- getArcLabel(int) - Method in class morfologik.fsa.CFSA2
- getArcLabel(int) - Method in class morfologik.fsa.FSA
- getArcLabel(int) - Method in class morfologik.fsa.FSA5
- getAttributes() - Method in class morfologik.stemming.DictionaryMetadata
- getCandLen() - Method in class morfologik.speller.Speller
- getDecoder() - Method in class morfologik.stemming.DictionaryMetadata
- getDictionary() - Method in class morfologik.stemming.DictionaryLookup
- getDictionary() - Method in class morfologik.stemming.polish.PolishStemmer
- getEffectiveED() - Method in class morfologik.speller.Speller
- getEncoder() - Method in class morfologik.stemming.DictionaryMetadata
- getEncoding() - Method in class morfologik.stemming.DictionaryMetadata
- getEndNode(int) - Method in class morfologik.fsa.CFSA
- getEndNode(int) - Method in class morfologik.fsa.CFSA2
- getEndNode(int) - Method in class morfologik.fsa.FSA
- getEndNode(int) - Method in class morfologik.fsa.FSA5
- getEquivalentChars() - Method in class morfologik.stemming.DictionaryMetadata
- getExpectedMetadataFileName(String) - Static method in class morfologik.stemming.DictionaryMetadata
-
Returns the expected name of the metadata file, based on the name of the dictionary file.
- getExpectedMetadataLocation(Path) - Static method in class morfologik.stemming.DictionaryMetadata
- getFirstArc(int) - Method in class morfologik.fsa.CFSA
- getFirstArc(int) - Method in class morfologik.fsa.CFSA2
- getFirstArc(int) - Method in class morfologik.fsa.FSA
- getFirstArc(int) - Method in class morfologik.fsa.FSA5
- getFlags() - Method in class morfologik.fsa.builders.CFSA2Serializer
-
Return supported flags.
- getFlags() - Method in class morfologik.fsa.builders.FSA5Serializer
-
Return supported flags.
- getFlags() - Method in interface morfologik.fsa.builders.FSASerializer
- getFlags() - Method in class morfologik.fsa.CFSA
- getFlags() - Method in class morfologik.fsa.CFSA2
- getFlags() - Method in class morfologik.fsa.FSA
- getFlags() - Method in class morfologik.fsa.FSA5
- getFrequency(CharSequence) - Method in class morfologik.speller.Speller
-
Get the frequency value for a word form.
- getInfo() - Method in class morfologik.fsa.builders.FSABuilder
- getInputConversionPairs() - Method in class morfologik.stemming.DictionaryMetadata
- getLocale() - Method in class morfologik.stemming.DictionaryMetadata
- getNextArc(int) - Method in class morfologik.fsa.CFSA
- getNextArc(int) - Method in class morfologik.fsa.CFSA2
- getNextArc(int) - Method in class morfologik.fsa.FSA
- getNextArc(int) - Method in class morfologik.fsa.FSA5
- getOutputConversionPairs() - Method in class morfologik.stemming.DictionaryMetadata
- getReplacementPairs() - Method in class morfologik.stemming.DictionaryMetadata
- getRightLanguageCount(int) - Method in class morfologik.fsa.CFSA
- getRightLanguageCount(int) - Method in class morfologik.fsa.CFSA2
- getRightLanguageCount(int) - Method in class morfologik.fsa.FSA
- getRightLanguageCount(int) - Method in class morfologik.fsa.FSA5
-
Returns the number encoded at the given node.
- getRootNode() - Method in class morfologik.fsa.CFSA
-
Returns the start node of this automaton.
- getRootNode() - Method in class morfologik.fsa.CFSA2
- getRootNode() - Method in class morfologik.fsa.FSA
- getRootNode() - Method in class morfologik.fsa.FSA5
-
Returns the start node of this automaton.
- getSeparator() - Method in class morfologik.stemming.DictionaryMetadata
- getSeparatorAsChar() - Method in class morfologik.stemming.DictionaryMetadata
- getSeparatorChar() - Method in class morfologik.stemming.DictionaryLookup
- getSequenceEncoderType() - Method in class morfologik.stemming.DictionaryMetadata
- getSequences() - Method in class morfologik.fsa.FSA
- getSequences(int) - Method in class morfologik.fsa.FSA
-
Returns an iterator over all binary sequences starting at the given FSA state (node) and ending in final nodes.
- getStem() - Method in class morfologik.stemming.WordData
- getStemBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
-
Copy the stem's binary data (no charset decoding) to a custom byte buffer.
- getTag() - Method in class morfologik.stemming.WordData
- getTagBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
-
Copy the tag's binary data (no charset decoding) to a custom byte buffer.
- getWord() - Method in class morfologik.stemming.WordData
- getWordBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
-
Copy the inflected word's binary data (no charset decoding) to a custom byte buffer.
- getWordLen() - Method in class morfologik.speller.Speller
- gtl - Variable in class morfologik.fsa.CFSA
-
Number of bytes each address takes in full, expanded form (goto length).
- gtl - Variable in class morfologik.fsa.FSA5
-
Number of bytes each address takes in full, expanded form (goto length).
H
- hashCode() - Method in class morfologik.stemming.WordData
- hasNext() - Method in class morfologik.fsa.ByteSequenceIterator
-
Returns
true
if there are still elements in this iterator. - hasNext() - Method in class morfologik.stemming.DictionaryIterator
- HMatrix - Class in morfologik.speller
-
Keeps track of already computed values of edit distance.
- HMatrix(int, int) - Constructor for class morfologik.speller.HMatrix
-
Allocates memory and initializes matrix (constructor).
I
- IGNORE_ALL_UPPERCASE - morfologik.stemming.DictionaryAttribute
-
If the spelling dictionary is supposed to ignore ALL UPPERCASE words.
- IGNORE_CAMEL_CASE - morfologik.stemming.DictionaryAttribute
-
If the spelling dictionary is supposed to ignore CamelCase words.
- IGNORE_DIACRITICS - morfologik.stemming.DictionaryAttribute
-
If the spelling dictionary is supposed to ignore diacritics, so that 'a' would be treated as equivalent to 'Ä…'.
- IGNORE_NUMBERS - morfologik.stemming.DictionaryAttribute
-
If the spelling dictionary is supposed to ignore words containing digits
- IGNORE_PUNCTUATION - morfologik.stemming.DictionaryAttribute
-
If the spelling dictionary is supposed to ignore punctuation.
- ignoreAllUppercase() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignoreAllUppercase(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignoreCamelCase() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignoreCamelCase(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignoreDiacritics() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignoreDiacritics(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignoreNumbers() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignoreNumbers(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignorePunctuation() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- ignorePunctuation(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- index - Variable in class morfologik.fsa.MatchResult
-
Input sequence's index, interpretation depends on
MatchResult.kind
. - INFIX - morfologik.stemming.EncoderType
- INPUT_CONVERSION - morfologik.stemming.DictionaryAttribute
-
Input conversion pairs to replace non-standard characters before search in a speller dictionary.
- IntIntHolder() - Constructor for class morfologik.fsa.builders.FSAUtils.IntIntHolder
- IntIntHolder(int, int) - Constructor for class morfologik.fsa.builders.FSAUtils.IntIntHolder
- isArcFinal(int) - Method in class morfologik.fsa.CFSA
- isArcFinal(int) - Method in class morfologik.fsa.CFSA2
- isArcFinal(int) - Method in class morfologik.fsa.FSA
- isArcFinal(int) - Method in class morfologik.fsa.FSA5
- isArcLast(int) - Method in class morfologik.fsa.CFSA
-
Returns
true
if this arc hasNEXT
bit set. - isArcLast(int) - Method in class morfologik.fsa.CFSA2
-
Returns
true
if this arc hasNEXT
bit set. - isArcLast(int) - Method in class morfologik.fsa.FSA5
-
Returns
true
if this arc hasNEXT
bit set. - isArcTerminal(int) - Method in class morfologik.fsa.CFSA
- isArcTerminal(int) - Method in class morfologik.fsa.CFSA2
- isArcTerminal(int) - Method in class morfologik.fsa.FSA
- isArcTerminal(int) - Method in class morfologik.fsa.FSA5
- isCamelCase(String) - Method in class morfologik.speller.Speller
- isConvertingCase() - Method in class morfologik.stemming.DictionaryMetadata
- ISequenceEncoder - Interface in morfologik.stemming
-
The logic of encoding one sequence of bytes relative to another sequence of bytes.
- isFrequencyIncluded() - Method in class morfologik.stemming.DictionaryMetadata
- isIgnoringAllUppercase() - Method in class morfologik.stemming.DictionaryMetadata
- isIgnoringCamelCase() - Method in class morfologik.stemming.DictionaryMetadata
- isIgnoringDiacritics() - Method in class morfologik.stemming.DictionaryMetadata
- isIgnoringNumbers() - Method in class morfologik.stemming.DictionaryMetadata
- isIgnoringPunctuation() - Method in class morfologik.stemming.DictionaryMetadata
- isInDictionary(CharSequence) - Method in class morfologik.speller.Speller
-
Test whether the word is found in the dictionary.
- isLabelCompressed(int) - Method in class morfologik.fsa.CFSA
- isMisspelled(String) - Method in class morfologik.speller.Speller
-
Checks whether the word is misspelled, by performing a series of checks according to properties of the dictionary.
- isNextSet(int) - Method in class morfologik.fsa.CFSA
- isNextSet(int) - Method in class morfologik.fsa.CFSA2
- isNextSet(int) - Method in class morfologik.fsa.FSA5
- isSet(int) - Method in enum morfologik.fsa.FSAFlags
- isSupportingRunOnWords() - Method in class morfologik.stemming.DictionaryMetadata
- IStemmer - Interface in morfologik.stemming
-
A generic "stemmer" interface in Morfologik.
- iterator() - Method in class morfologik.fsa.FSA
-
Returns an iterator over all binary sequences starting from the initial FSA state (node) and ending in final nodes.
- iterator() - Method in class morfologik.stemming.DictionaryLookup
-
Return an iterator over all
WordData
entries available in the embeddedDictionary
. - iterator() - Method in class morfologik.stemming.polish.PolishStemmer
-
Iterates over all dictionary forms stored in this stemmer.
K
- kind - Variable in class morfologik.fsa.MatchResult
-
One of the match types defined in this class.
L
- LABEL_INDEX_SIZE - Static variable in class morfologik.fsa.CFSA2
-
Maximum size of the labels index.
- labelMapping - Variable in class morfologik.fsa.CFSA
-
Label mapping for arcs of type (1) (see class documentation).
- labelMapping - Variable in class morfologik.fsa.CFSA2
-
Label mapping for M-indexed labels.
- Launcher - Class in morfologik.tools
-
JAR entry point.
- LEXICAL_ORDERING - Static variable in class morfologik.fsa.builders.FSABuilder
-
A comparator comparing full byte arrays.
- license(String) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- LICENSE - morfologik.stemming.DictionaryAttribute
-
Dictionary license attribute.
- locale(String) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- locale(Locale) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- LOCALE - morfologik.stemming.DictionaryAttribute
-
Locale associated with the dictionary.
- lookup(CharSequence) - Method in class morfologik.stemming.DictionaryLookup
-
Searches the automaton for a symbol sequence equal to
word
, followed by a separator. - lookup(CharSequence) - Method in interface morfologik.stemming.IStemmer
-
Returns a list of
WordData
entries for a given word. - lookup(CharSequence) - Method in class morfologik.stemming.polish.PolishStemmer
-
Returns a list of
WordData
entries for a given word.
M
- main(String[]) - Static method in class morfologik.tools.DictApply
- main(String[]) - Static method in class morfologik.tools.DictCompile
- main(String[]) - Static method in class morfologik.tools.DictDecompile
- main(String[]) - Static method in class morfologik.tools.FSACompile
- main(String[]) - Static method in class morfologik.tools.FSADecompile
- main(String[]) - Static method in class morfologik.tools.FSAInfo
- main(String[]) - Static method in class morfologik.tools.Launcher
- main(String[], CliTool) - Static method in class morfologik.tools.CliTool
-
Parse and execute a single command.
- main(String[], CliTool...) - Static method in class morfologik.tools.CliTool
-
Parse and execute one of the commands.
- match(byte[]) - Method in class morfologik.fsa.FSATraversal
- match(byte[], int) - Method in class morfologik.fsa.FSATraversal
- match(byte[], int, int, int) - Method in class morfologik.fsa.FSATraversal
-
Finds a matching path in the dictionary for a given sequence of labels from
sequence
and starting at nodenode
. - match(MatchResult, byte[], int, int, int) - Method in class morfologik.fsa.FSATraversal
-
Same as
FSATraversal.match(byte[], int, int, int)
, but allows passing a reusableMatchResult
object so that no intermediate garbage is produced. - MatchResult - Class in morfologik.fsa
-
A matching result returned from
FSATraversal
. - MatchResult() - Constructor for class morfologik.fsa.MatchResult
- MAX_ACTIVE_PATH_LENGTH - morfologik.fsa.builders.FSABuilder.InfoEntry
- MAX_WORD_LENGTH - Static variable in class morfologik.speller.Speller
-
Maximum length of the word to be checked.
- metadata - Variable in class morfologik.stemming.Dictionary
-
Metadata associated with the dictionary.
- METADATA_FILE_EXTENSION - Static variable in class morfologik.stemming.DictionaryMetadata
-
Expected metadata file extension.
- morfologik.fsa - package morfologik.fsa
- morfologik.fsa.builders - package morfologik.fsa.builders
- morfologik.speller - package morfologik.speller
- morfologik.stemming - package morfologik.stemming
- morfologik.stemming.polish - package morfologik.stemming.polish
- morfologik.tools - package morfologik.tools
N
- next() - Method in class morfologik.fsa.ByteSequenceIterator
- next() - Method in class morfologik.stemming.DictionaryIterator
- NEXTBIT - morfologik.fsa.FSAFlags
-
Daciuk: next bit in use.
- NO_MATCH - Static variable in class morfologik.fsa.MatchResult
-
The automaton has no match for the input sequence and no sequence in the automaton is a prefix of the input.
- node - Variable in class morfologik.fsa.MatchResult
-
Automaton node, interpretation depends on the
MatchResult.kind
. - nodeCount - Variable in class morfologik.fsa.builders.FSAInfo
-
Number of nodes in the automaton.
- nodeDataLength - Variable in class morfologik.fsa.CFSA
-
The length of the node header structure (if the automaton was compiled with
NUMBERS
option). - nodeDataLength - Variable in class morfologik.fsa.FSA5
-
The length of the node header structure (if the automaton was compiled with
NUMBERS
option). - NoEncoder - Class in morfologik.stemming
-
No relative encoding at all (full target form is returned).
- NoEncoder() - Constructor for class morfologik.stemming.NoEncoder
- NONE - morfologik.stemming.EncoderType
- NUMBERS - morfologik.fsa.FSAFlags
-
The FSA contains right-language count numbers on states.
O
- OUTPUT_CONVERSION - morfologik.stemming.DictionaryAttribute
-
Output conversion pairs to replace non-standard characters before search in a speller dictionary.
P
- perfectHash(byte[]) - Method in class morfologik.fsa.FSATraversal
- perfectHash(byte[], int, int, int) - Method in class morfologik.fsa.FSATraversal
-
Calculate perfect hash for a given input sequence of bytes.
- PolishStemmer - Class in morfologik.stemming.polish
-
A dictionary-based stemmer for the Polish language.
- PolishStemmer() - Constructor for class morfologik.stemming.polish.PolishStemmer
- PREFIX - morfologik.stemming.EncoderType
- prefixBytes() - Method in interface morfologik.stemming.ISequenceEncoder
-
Deprecated.
- prefixBytes() - Method in class morfologik.stemming.NoEncoder
- prefixBytes() - Method in class morfologik.stemming.TrimInfixAndSuffixEncoder
- prefixBytes() - Method in class morfologik.stemming.TrimPrefixAndSuffixEncoder
- prefixBytes() - Method in class morfologik.stemming.TrimSuffixEncoder
- printf(String, Object...) - Static method in class morfologik.tools.CliTool
- propertyName - Variable in enum morfologik.stemming.DictionaryAttribute
-
Property name for this attribute.
R
- read(InputStream) - Static method in class morfologik.fsa.FSA
-
A factory for reading automata in any of the supported versions.
- read(InputStream) - Static method in class morfologik.fsa.FSAHeader
-
Read FSA header and version from a stream, consuming read bytes.
- read(InputStream) - Static method in class morfologik.stemming.DictionaryMetadata
-
Read dictionary metadata from a property file (stream).
- read(InputStream, InputStream) - Static method in class morfologik.stemming.Dictionary
-
Attempts to load a dictionary from opened streams of FSA dictionary data and associated metadata.
- read(InputStream, Class<? extends T>) - Static method in class morfologik.fsa.FSA
-
A factory for reading a specific FSA subclass, including proper casting.
- read(URL) - Static method in class morfologik.stemming.Dictionary
-
Attempts to load a dictionary using the URL to the FSA file and the expected metadata extension.
- read(Path) - Static method in class morfologik.stemming.Dictionary
-
Attempts to load a dictionary using the path to the FSA file and the expected metadata extension.
- readRemaining(InputStream) - Static method in class morfologik.fsa.FSA
- remove() - Method in class morfologik.fsa.ByteSequenceIterator
-
Not implemented in this iterator.
- remove() - Method in class morfologik.stemming.DictionaryIterator
- REPLACEMENT_PAIRS - morfologik.stemming.DictionaryAttribute
-
Replacement pairs for non-obvious candidate search in a speller dictionary.
- replaceRunOnWords(String) - Method in class morfologik.speller.Speller
-
Propose suggestions for misspelled run-on words.
- restartFrom(int) - Method in class morfologik.fsa.ByteSequenceIterator
-
Restart walking from
node
. - rightLanguageForAllStates(FSA) - Static method in class morfologik.fsa.builders.FSAUtils
-
Calculate the size of "right language" for each state in an FSA.
- RUN_ON_WORDS - morfologik.stemming.DictionaryAttribute
-
If the spelling dictionary is supposed to split runOnWords.
S
- separator(char) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- SEPARATOR - morfologik.stemming.DictionaryAttribute
-
Logical fields separator inside the FSA.
- SEPARATORS - morfologik.fsa.FSAFlags
-
The FSA supports legacy built-in separator and filler characters (Daciuk's FSA package compatibility).
- SEQUENCE_IS_A_PREFIX - Static variable in class morfologik.fsa.MatchResult
-
The sequence is a prefix of at least one sequence in the automaton.
- SERIALIZATION_BUFFER_REALLOCATIONS - morfologik.fsa.builders.FSABuilder.InfoEntry
- SERIALIZATION_BUFFER_SIZE - morfologik.fsa.builders.FSABuilder.InfoEntry
- SerializationFormat - Enum in morfologik.tools
-
The serialization and encoding format to use for compressing the automaton.
- serialize(FSA, T) - Method in class morfologik.fsa.builders.CFSA2Serializer
- serialize(FSA, T) - Method in class morfologik.fsa.builders.FSA5Serializer
-
Serialize root state
s
to an output stream inFSA5
format. - serialize(FSA, T) - Method in interface morfologik.fsa.builders.FSASerializer
-
Serialize a finite state automaton to an output stream.
- set(int, int, int) - Method in class morfologik.speller.HMatrix
-
Set an item in hMatrix.
- setCallSystemExit(boolean) - Method in class morfologik.tools.CliTool
-
Call
System.exit(int)
at the end of command processing. - size - Variable in class morfologik.fsa.builders.FSAInfo
-
Arcs size (in serialized form).
- Speller - Class in morfologik.speller
-
Finds spelling suggestions.
- Speller(Dictionary) - Constructor for class morfologik.speller.Speller
- Speller(Dictionary, int) - Constructor for class morfologik.speller.Speller
- Speller.CandidateData - Class in morfologik.speller
-
Used to sort candidates according to edit distance, and possibly according to their frequency in the future.
- STATE_REGISTRY_SIZE - morfologik.fsa.builders.FSABuilder.InfoEntry
- STATE_REGISTRY_TABLE_SLOTS - morfologik.fsa.builders.FSABuilder.InfoEntry
- StateVisitor - Interface in morfologik.fsa
-
State visitor.
- STOPBIT - morfologik.fsa.FSAFlags
-
Daciuk: stop bit in use.
- SUCCESS - morfologik.tools.ExitStatus
-
The command was successful.
- SUFFIX - morfologik.stemming.EncoderType
- supportRunOnWords() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- supportRunOnWords(boolean) - Method in class morfologik.stemming.DictionaryMetadataBuilder
T
- TAILS - morfologik.fsa.FSAFlags
-
Daciuk: tails compression.
- toArray(ByteBuffer) - Static method in class morfologik.stemming.BufferUtils
- toDot(Writer, FSA, int) - Static method in class morfologik.fsa.builders.FSAUtils
-
Saves the right-language reachable from a given FSA node, formatted as an input for the graphviz package (expressed in the
dot
language), to the given writer. - toDot(FSA, int) - Static method in class morfologik.fsa.builders.FSAUtils
-
Returns the right-language reachable from a given FSA node, formatted as an input for the graphviz package (expressed in the
dot
language). - toMap() - Method in class morfologik.stemming.DictionaryMetadataBuilder
- toString() - Method in enum morfologik.fsa.builders.FSABuilder.InfoEntry
- toString() - Method in class morfologik.fsa.builders.FSAInfo
- toString() - Method in class morfologik.stemming.NoEncoder
- toString() - Method in class morfologik.stemming.TrimInfixAndSuffixEncoder
- toString() - Method in class morfologik.stemming.TrimPrefixAndSuffixEncoder
- toString() - Method in class morfologik.stemming.TrimSuffixEncoder
- toString() - Method in class morfologik.stemming.WordData
- toString(ByteBuffer, Charset) - Static method in class morfologik.stemming.BufferUtils
- toString(CharBuffer) - Static method in class morfologik.stemming.BufferUtils
- TrimInfixAndSuffixEncoder - Class in morfologik.stemming
-
Encodes
dst
relative tosrc
by trimming whatever non-equal suffix and infixsrc
anddst
have. - TrimInfixAndSuffixEncoder() - Constructor for class morfologik.stemming.TrimInfixAndSuffixEncoder
- TrimPrefixAndSuffixEncoder - Class in morfologik.stemming
-
Encodes
dst
relative tosrc
by trimming whatever non-equal suffix and prefixsrc
anddst
have. - TrimPrefixAndSuffixEncoder() - Constructor for class morfologik.stemming.TrimPrefixAndSuffixEncoder
- TrimSuffixEncoder - Class in morfologik.stemming
-
Encodes
dst
relative tosrc
by trimming whatever non-equal suffixsrc
has. - TrimSuffixEncoder() - Constructor for class morfologik.stemming.TrimSuffixEncoder
U
- UnmappableInputException - Exception in morfologik.stemming
-
Thrown when some input cannot be mapped using the declared charset (bytes to characters or the other way around).
V
- validate(String, Path) - Method in class morfologik.tools.ValidateFileExists
- validate(String, Path) - Method in class morfologik.tools.ValidateParentDirExists
- ValidateFileExists - Class in morfologik.tools
- ValidateFileExists() - Constructor for class morfologik.tools.ValidateFileExists
- ValidateParentDirExists - Class in morfologik.tools
- ValidateParentDirExists() - Constructor for class morfologik.tools.ValidateParentDirExists
- valueOf(String) - Static method in enum morfologik.fsa.builders.FSABuilder.InfoEntry
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum morfologik.fsa.FSAFlags
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum morfologik.stemming.DictionaryAttribute
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum morfologik.stemming.EncoderType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum morfologik.tools.ExitStatus
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum morfologik.tools.SerializationFormat
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum morfologik.fsa.builders.FSABuilder.InfoEntry
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum morfologik.fsa.FSAFlags
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum morfologik.stemming.DictionaryAttribute
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum morfologik.stemming.EncoderType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum morfologik.tools.ExitStatus
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum morfologik.tools.SerializationFormat
-
Returns an array containing the constants of this enum type, in the order they are declared.
- VERSION - Static variable in class morfologik.fsa.CFSA
-
Automaton header version value.
- VERSION - Static variable in class morfologik.fsa.CFSA2
-
Automaton header version value.
- VERSION - Static variable in class morfologik.fsa.FSA5
-
Automaton version as in the file header.
- visitAllStates(T) - Method in class morfologik.fsa.FSA
-
Visit all states.
- visitInPostOrder(T) - Method in class morfologik.fsa.FSA
-
Same as
FSA.visitInPostOrder(StateVisitor, int)
, starting from root automaton node. - visitInPostOrder(T, int) - Method in class morfologik.fsa.FSA
-
Visits all states reachable from
node
in postorder. - visitInPreOrder(T) - Method in class morfologik.fsa.FSA
-
Same as
FSA.visitInPreOrder(StateVisitor, int)
, starting from root automaton node. - visitInPreOrder(T, int) - Method in class morfologik.fsa.FSA
-
Visits all states in preorder.
W
- withAnnotationSeparator(byte) - Method in class morfologik.fsa.builders.CFSA2Serializer
- withAnnotationSeparator(byte) - Method in class morfologik.fsa.builders.FSA5Serializer
-
Sets the annotation separator (only if
FSASerializer.getFlags()
returnsFSAFlags.SEPARATORS
). - withAnnotationSeparator(byte) - Method in interface morfologik.fsa.builders.FSASerializer
-
Sets the annotation separator (only if
FSASerializer.getFlags()
returnsFSAFlags.SEPARATORS
). - withEquivalentChars(Map<Character, List<Character>>) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- withFiller(byte) - Method in class morfologik.fsa.builders.CFSA2Serializer
- withFiller(byte) - Method in class morfologik.fsa.builders.FSA5Serializer
-
Sets the filler separator (only if
FSASerializer.getFlags()
returnsFSAFlags.SEPARATORS
). - withFiller(byte) - Method in interface morfologik.fsa.builders.FSASerializer
-
Sets the filler separator (only if
FSASerializer.getFlags()
returnsFSAFlags.SEPARATORS
). - withInputConversionPairs(Map<String, String>) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- withNumbers() - Method in class morfologik.fsa.builders.CFSA2Serializer
-
Serialize the automaton with the number of right-language sequences in each node.
- withNumbers() - Method in class morfologik.fsa.builders.FSA5Serializer
-
Serialize the automaton with the number of right-language sequences in each node.
- withNumbers() - Method in interface morfologik.fsa.builders.FSASerializer
-
Enables support for right language count on nodes, speeding up perfect hash counts (only if
FSASerializer.getFlags()
returnsFSAFlags.NUMBERS
). - withOutputConversionPairs(Map<String, String>) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- withReplacementPairs(Map<String, List<String>>) - Method in class morfologik.stemming.DictionaryMetadataBuilder
- WordData - Class in morfologik.stemming
-
Stem and tag data associated with a given word.
- write(OutputStream, byte) - Static method in class morfologik.fsa.FSAHeader
-
Writes FSA magic bytes and version information.
- write(Writer) - Method in class morfologik.stemming.DictionaryMetadata
-
Write dictionary attributes (metadata).
All Classes All Packages