Class StopWords
java.lang.Object
org.ek9lang.assist.StopWords
Stop word set and synonym group map for Q&A matching.
Stop words are removed during tokenization as they carry no matching value.
Synonym groups map related terms to a canonical form for better matching.
-
Method Summary
Modifier and TypeMethodDescriptionstatic booleanisStopWord(String word) Check if a word is a stop word.static StringresolveSynonym(String word) Map a word to its canonical synonym form.Tokenize text for document field indexing: lowercase, split, strip punctuation, remove stop words, resolve synonyms.tokenizeQuery(String input) Tokenize user query text.
-
Method Details
-
tokenize
-
tokenizeQuery
Tokenize user query text. Same astokenize(String)but also injects "migrate" when a recognized programming language name is detected, steering results toward migration-focused Q&As. -
resolveSynonym
-
isStopWord
Check if a word is a stop word.
-