Thursday, September 22, 2011

9/22/2011

The traditional IR cannot be applied directly to the web pages as we pages are
  • very voluminous
  • widely distributed
  • extremely dynamic
  • the data is more structured
  • link between pages needs to be considered
We need to establish importance and trust worthiness of pages which influences the ranking.
The tag information is used to determine the importance and  the presence of links makes the content non local.


Srividya