Clustering is used for structuring of results in an IR system and also its a very good method to understand user's queries.
It can be done during
Clustering can be done based on distance between the pages
- Url based distance.
- Text based distance
- Link based distance
Preethi