Class TermDatabase


  • public class TermDatabase
    extends Object
    PUBLIC: A collection of terms representing the result of classifying a piece of content. Each term has a score indicating its importance within the content, and a set of variants recording the different spellings of that term found in the content.

    Use SimpleClassifier to create TermDatabase objects.
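
The relationship between terms, scores, and variants described above can be pictured with a small stand-in model. This is only a sketch: `TermModelSketch`, its nested `Term` class, the `record` and `termForVariant` methods, and the idea that a score is a running sum of weights are all hypothetical and are not taken from the library. The sample stem and spellings are invented for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for illustration only; not the library's classes.
class TermModelSketch {

  // A term groups the spellings (variants) that share one stem.
  static final class Term {
    final String stem;
    double score; // importance within the content (assumed: summed weights)
    final List<String> variants = new ArrayList<>();

    Term(String stem) {
      this.stem = stem;
    }
  }

  private final Map<String, Term> byStem = new HashMap<>();
  private final Map<String, Term> byVariant = new HashMap<>();

  // Records one occurrence of a spelling under its stem and adds to the score.
  void record(String stem, String variant, double weight) {
    Term term = byStem.computeIfAbsent(stem, Term::new);
    term.score += weight;
    if (!term.variants.contains(variant)) {
      term.variants.add(variant);
      byVariant.put(variant, term);
    }
  }

  // Returns null when the stem is unknown.
  Term getTerm(String stem) {
    return byStem.get(stem);
  }

  // Returns the term that owns the given spelling, or null if there is none.
  Term termForVariant(String variant) {
    return byVariant.get(variant);
  }

  public static void main(String[] args) {
    TermModelSketch db = new TermModelSketch();
    db.record("classif", "classify", 1.0);
    db.record("classif", "classifying", 1.0);

    Term t = db.getTerm("classif");
    System.out.println(t.stem + " " + t.score + " " + t.variants);
  }
}
```

In the real API, instances come from SimpleClassifier rather than being populated by hand as in this sketch.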

    • Method Detail

      • getTerms

        public Collection<Term> getTerms()
        PUBLIC: Returns all terms found in the classified content.
      • getTermsByRank

        public Term[] getTermsByRank()
        PUBLIC: Returns all terms found in the classified content sorted by score.
      • getTermCount

        public int getTermCount()
        PUBLIC: Returns the number of terms in the database.
      • getTerm

        public Term getTerm(String stem)
        PUBLIC: Looks up a particular term by its stem. Returns null if no term is found.
      • getVariant

        public Variant getVariant(String variant)
        PUBLIC: Looks up a particular variant by its string representation. Returns null if no variant is found.
      • createDelimiter

        protected Token createDelimiter(String delimiter)
      • mergeTerms

        protected void mergeTerms(Term t1,
                                  Term t2)
      • createTerm

        protected Term createTerm(String stem)
      • getMaxScore

        protected double getMaxScore()
      • createVariant

        protected Variant createVariant(String variant)
      • dump

        public void dump()
        INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.
      • dump

        public void dump(int firstN)
        INTERNAL: Writes the contents of the term database out to System.out for debugging purposes, limited to firstN terms.
        Parameters:
        firstN - how many terms to output
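
The ranking and limited-dump behaviour listed above can be sketched in isolation. The following is a minimal illustration, not the library's implementation: `RankedDumpSketch`, its `Term` class, `byRank`, and `dumpLines` are hypothetical names, the tab-separated line format is invented, and it assumes that "sorted by score" means highest score first and that a limited dump walks the terms in that order.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical stand-in for illustration only; not the library's classes.
class RankedDumpSketch {

  static final class Term {
    final String stem;
    final double score;

    Term(String stem, double score) {
      this.stem = stem;
      this.score = score;
    }
  }

  // Assumption: ranking puts the highest-scoring term first.
  static Term[] byRank(List<Term> terms) {
    List<Term> ranked = new ArrayList<>(terms);
    ranked.sort(Comparator.comparingDouble((Term t) -> t.score).reversed());
    return ranked.toArray(new Term[0]);
  }

  // Formats at most firstN ranked terms, one string per term.
  static List<String> dumpLines(List<Term> terms, int firstN) {
    Term[] ranked = byRank(terms);
    int limit = Math.min(Math.max(firstN, 0), ranked.length);
    List<String> lines = new ArrayList<>();
    for (int i = 0; i < limit; i++) {
      lines.add(ranked[i].stem + "\t" + ranked[i].score);
    }
    return lines;
  }

  public static void main(String[] args) {
    List<Term> terms = new ArrayList<>();
    terms.add(new Term("topic", 0.4));
    terms.add(new Term("map", 0.9));
    terms.add(new Term("ontolog", 0.1));

    // Print only the two highest-scoring terms.
    for (String line : dumpLines(terms, 2)) {
      System.out.println(line);
    }
  }
}
```

Returning the lines instead of printing them directly keeps the sketch easy to check; the documented dump methods write to System.out.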