Package net.ontopia.topicmaps.classify
Class TermDatabase
- java.lang.Object
-
- net.ontopia.topicmaps.classify.TermDatabase
-
public class TermDatabase extends Object
PUBLIC: A collection of terms representing the result of classifying a piece of content. The terms have scores indicating their importance within the content, and variants, indicating different spellings for the same term within the content.Use SimpleClassifier to create TermDatabase objects.
-
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description protected TokencreateDelimiter(String delimiter)protected TermcreateTerm(String stem)protected VariantcreateVariant(String variant)voiddump()INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.voiddump(int firstN)INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.protected doublegetMaxScore()TermgetTerm(String stem)PUBLIC: Looks up a particular term by its stem.intgetTermCount()PUBLIC: Returns the number of terms in the database.Collection<Term>getTerms()PUBLIC: Returns all terms found in the classified content.Term[]getTermsByRank()PUBLIC: Returns all terms found in the classified content sorted by score.VariantgetVariant(String variant)PUBLIC: Looks up a particular variant by its string representation.protected voidmergeTerms(Term t1, Term t2)
-
-
-
Method Detail
-
getTerms
public Collection<Term> getTerms()
PUBLIC: Returns all terms found in the classified content.
-
getTermsByRank
public Term[] getTermsByRank()
PUBLIC: Returns all terms found in the classified content sorted by score.
-
getTermCount
public int getTermCount()
PUBLIC: Returns the number of terms in the database.
-
getTerm
public Term getTerm(String stem)
PUBLIC: Looks up a particular term by its stem. Returns null if no term is found.
-
getVariant
public Variant getVariant(String variant)
PUBLIC: Looks up a particular variant by its string representation. Returns null if no variant is found.
-
getMaxScore
protected double getMaxScore()
-
dump
public void dump()
INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.
-
dump
public void dump(int firstN)
INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.- Parameters:
firstN- how many terms to output
-
-