Class MultiWordChunker
- java.lang.Object
-
- org.languagetool.tagging.disambiguation.AbstractDisambiguator
-
- org.languagetool.tagging.disambiguation.MultiWordChunker
-
- All Implemented Interfaces:
Disambiguator
public class MultiWordChunker extends AbstractDisambiguator
Multiword tagger-chunker.- Author:
- Marcin MiĆkowski
-
-
Constructor Summary
Constructors Constructor Description MultiWordChunker(String filename)
MultiWordChunker(String filename, boolean allowFirstCapitalized)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description AnalyzedSentence
disambiguate(AnalyzedSentence input)
Implements multiword POS tags, e.g., <ELLIPSIS> for ellipsis (...) start, and </ELLIPSIS> for ellipsis end.-
Methods inherited from class org.languagetool.tagging.disambiguation.AbstractDisambiguator
preDisambiguate
-
-
-
-
Constructor Detail
-
MultiWordChunker
public MultiWordChunker(String filename)
- Parameters:
filename
- file text with multiwords and tags
-
MultiWordChunker
public MultiWordChunker(String filename, boolean allowFirstCapitalized)
- Parameters:
filename
- file text with multiwords and tagsallowFirstCapitalized
- if set totrue
, first word of the multiword can be capitalized
-
-
Method Detail
-
disambiguate
public final AnalyzedSentence disambiguate(AnalyzedSentence input)
Implements multiword POS tags, e.g., <ELLIPSIS> for ellipsis (...) start, and </ELLIPSIS> for ellipsis end.- Parameters:
input
- The tokens to be chunked.- Returns:
- AnalyzedSentence with additional markers.
-
-