Class TermDatabase

java.lang.Object
net.ontopia.topicmaps.classify.TermDatabase

public class TermDatabase extends Object
PUBLIC: A collection of terms representing the result of classifying a piece of content. The terms have scores indicating their importance within the content, and variants, indicating different spellings for the same term within the content.

Use SimpleClassifier to create TermDatabase objects.

  • Field Details

  • Method Details

    • getTerms

      public Collection<Term> getTerms()
      PUBLIC: Returns all terms found in the classified content.
    • getTermsByRank

      public Term[] getTermsByRank()
      PUBLIC: Returns all terms found in the classified content sorted by score.
    • getTermCount

      public int getTermCount()
      PUBLIC: Returns the number of terms in the database.
    • getTerm

      public Term getTerm(String stem)
      PUBLIC: Looks up a particular term by its stem. Returns null if no term is found.
    • getVariant

      public Variant getVariant(String variant)
      PUBLIC: Looks up a particular variant by its string representation. Returns null if no variant is found.
    • createDelimiter

      protected Token createDelimiter(String delimiter)
    • mergeTerms

      protected void mergeTerms(Term t1, Term t2)
    • createTerm

      protected Term createTerm(String stem)
    • getMaxScore

      protected double getMaxScore()
    • createVariant

      protected Variant createVariant(String variant)
    • dump

      public void dump()
      INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.
    • dump

      public void dump(int firstN)
      INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.
      Parameters:
      firstN - how many terms to output