Thursday, September 22, 2011


Web IR as opposed to Traditional IR
--> Pages on the web contain links to other pages
--> The anchor text of a link can be exploited as an indication of the content of the page the link points to
--> Processing the collection involves gathering (crawling) static pages and discovering dynamic pages, which are generated on request.
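
The anchor-text idea can be illustrated with a small sketch. This is only one possible way to do it, not a method taken from these notes: it assumes the pages are already available as HTML strings keyed by URL, and the names `AnchorCollector` and `build_anchor_index` are made up for the example. It uses only Python's standard `html.parser` and `urllib.parse` modules, and it groups the anchor texts by the page they point to, so each target page ends up described by the words other pages use when linking to it.

```python
from collections import defaultdict
from html.parser import HTMLParser
from urllib.parse import urljoin


class AnchorCollector(HTMLParser):
    """Collects (target URL, anchor text) pairs from one HTML page.

    Hypothetical helper for illustration only.
    """

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []      # finished (target URL, anchor text) pairs
        self._href = None    # resolved href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the URL of the linking page.
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open <a href=...> element.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Join the fragments and collapse runs of whitespace.
            text = " ".join("".join(self._text).split())
            if text:
                self.links.append((self._href, text))
            self._href = None


def build_anchor_index(pages):
    """Map each target URL to the anchor texts of the links pointing to it.

    `pages` is assumed to be a dict of {page URL: HTML source}.
    """
    index = defaultdict(list)
    for url, html in pages.items():
        collector = AnchorCollector(url)
        collector.feed(html)
        collector.close()
        for target, text in collector.links:
            index[target].append(text)
    return dict(index)
```

A search engine could then index these anchor texts alongside (or even instead of) the target page's own text, which helps for pages whose content is hard to index directly, such as images or dynamic pages.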
