Thursday, September 15, 2011

9/13/2011

Auto correction/suggestion
Learned how k-gram bag intersection against the lexicon, the query log, or both can drive auto-correction and query suggestions. One striking point: if a word in the lexicon is very rare (its document frequency falls below a threshold, which means its IDF is very high), it may itself be a typo, so we should probably leave it out of the intersection. Another interesting aspect was edit distance: counting a transposition as a single edit, and aligning the two strings well, can reduce the distance considerably. It was also refreshing to look at Bayes' rule from a different perspective, applied to correcting query errors that are not syntactic.
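To make the k-gram and edit-distance ideas concrete, here is a minimal sketch of how the two might fit together. It is my own illustration, not the method from the lecture: the function names, the `$` padding character, and the thresholds (`min_overlap`, `max_distance`) are all assumptions, and I use sets of k-grams with Jaccard overlap as a simplification of bag intersection. The distance is the optimal string alignment variant, which counts an adjacent transposition as one edit.

```python
def kgrams(word, k=2):
    """Return the set of k-grams of word, padded with '$' at both ends."""
    padded = f"${word}$"
    return {padded[i:i + k] for i in range(len(padded) - k + 1)}


def jaccard(a, b):
    """Jaccard overlap of two non-empty k-gram sets."""
    return len(a & b) / len(a | b)


def osa_distance(s, t):
    """Edit distance where an adjacent transposition costs one edit
    (optimal string alignment variant)."""
    rows, cols = len(s) + 1, len(t) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all of s[:i]
    for j in range(cols):
        d[0][j] = j  # insert all of t[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if s[i - 1] == t[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # match or substitution
            )
            # Adjacent transposition, e.g. "ra" versus "ar".
            if i > 1 and j > 1 and s[i - 1] == t[j - 2] and s[i - 2] == t[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


def suggest(query, lexicon, k=2, min_overlap=0.3, max_distance=2):
    """Filter the lexicon by k-gram overlap, then rank survivors by distance."""
    q = kgrams(query, k)
    candidates = [w for w in lexicon if jaccard(q, kgrams(w, k)) >= min_overlap]
    scored = sorted((osa_distance(query, w), w) for w in candidates)
    return [w for dist, w in scored if dist <= max_distance]
```

With this sketch, `osa_distance("serach", "search")` is 1, because swapping "ra" to "ar" is a single transposition; plain Levenshtein distance would charge 2 for the same pair. A rare-word filter on the lexicon, as described above, could be applied before calling `suggest`.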
--
Aneeth