Package net.ontopia.topicmaps.classify
Class TermDatabase
- java.lang.Object
-
- net.ontopia.topicmaps.classify.TermDatabase
-
public class TermDatabase extends Object
PUBLIC: A collection of terms representing the result of classifying a piece of content. The terms have scores indicating their importance within the content, and variants, indicating different spellings for the same term within the content.Use SimpleClassifier to create TermDatabase objects.
-
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description protected Token
createDelimiter(String delimiter)
protected Term
createTerm(String stem)
protected Variant
createVariant(String variant)
void
dump()
INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.void
dump(int firstN)
INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.protected double
getMaxScore()
Term
getTerm(String stem)
PUBLIC: Looks up a particular term by its stem.int
getTermCount()
PUBLIC: Returns the number of terms in the database.Collection<Term>
getTerms()
PUBLIC: Returns all terms found in the classified content.Term[]
getTermsByRank()
PUBLIC: Returns all terms found in the classified content sorted by score.Variant
getVariant(String variant)
PUBLIC: Looks up a particular variant by its string representation.protected void
mergeTerms(Term t1, Term t2)
-
-
-
Method Detail
-
getTerms
public Collection<Term> getTerms()
PUBLIC: Returns all terms found in the classified content.
-
getTermsByRank
public Term[] getTermsByRank()
PUBLIC: Returns all terms found in the classified content sorted by score.
-
getTermCount
public int getTermCount()
PUBLIC: Returns the number of terms in the database.
-
getTerm
public Term getTerm(String stem)
PUBLIC: Looks up a particular term by its stem. Returns null if no term is found.
-
getVariant
public Variant getVariant(String variant)
PUBLIC: Looks up a particular variant by its string representation. Returns null if no variant is found.
-
getMaxScore
protected double getMaxScore()
-
dump
public void dump()
INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.
-
dump
public void dump(int firstN)
INTERNAL: Writes the contents of the term database out to System.out for debugging purposes.- Parameters:
firstN
- how many terms to output
-
-