Interface Tagger

  • All Known Implementing Classes:
    BaseTagger, DemoTagger

    public interface Tagger
    The part-of-speech tagger interface, whose implementations are usually language-dependent.
    Author:
    Daniel Naber
    • Method Detail

      • tag

        List<AnalyzedTokenReadings> tag​(List<String> sentenceTokens)
                                 throws IOException
        Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).

        Note that this method takes exactly one sentence. Its implementation may implement special cases for the first word of a sentence, which is usually written with an uppercase letter.

        Parameters:
        sentenceTokens - the text as returned by a WordTokenizer
        Throws:
        IOException
      • createNullToken

        AnalyzedTokenReadings createNullToken​(String token,
                                              int startPos)
        Create the AnalyzedToken used for whitespace and other non-words. Use null as the POS tag for this token.
      • createToken

        AnalyzedToken createToken​(String token,
                                  String posTag)
        Create a token specific to the language of the implementing class.