Class DocumentTokenizer


  • public class DocumentTokenizer
    extends Object
    INTERNAL:
    • Constructor Detail

      • DocumentTokenizer

        public DocumentTokenizer​(TermDatabase tdb)
    • Method Detail

      • setTermDatabase

        public void setTermDatabase​(TermDatabase tdb)
      • setTokenizer

        public void setTokenizer​(TokenizerIF tokenizer)
      • addTermNormalizer

        public void addTermNormalizer​(TermNormalizerIF normalizer)
      • tokenize

        public void tokenize​(Document doc)
      • tokenize

        protected void tokenize​(Region region)