Tuesday, September 20, 2011

09/20/2011

Why don't we reduce the dimensionality of the data set to 1- dimension?
We use Cosine similarity for computing similarity value between data points. As the dot product of two orthogonal vectors is zero, hence we do not reduce the dimensionality to 1.

LSI is an indexing and retrieval method that uses SVD to identify patterns of terms in a collection of text. It overcomes synonomy and polysemy. It will also subsume correlation analysis.

Archana