Most Information Extraction (IE) done nowadays exploits the structure of the web rather than relying on NLP, since most data on the web has some form of structure.
IE consists of four processes: 1) segmentation, 2) classification, 3) association, 4) clustering.
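As a rough illustration of how the four processes could fit together when the structure of a page does most of the work, here is a minimal Python sketch. Everything in it is an assumption made for the example: the toy page, the field names (name, email, phone), the regex rules, and the helper names `CellParser`, `classify` and `extract` are all hypothetical, and a real system would likely use learned models rather than hand-written rules.

```python
import re
from html.parser import HTMLParser

# Hypothetical toy page: the first two rows describe the same person.
PAGE = """
<table>
<tr><td>Jane Doe</td><td>jane@example.com</td></tr>
<tr><td>jane doe</td><td>555-0100</td></tr>
<tr><td>Bob Ray</td><td>bob@example.com</td></tr>
</table>
"""


class CellParser(HTMLParser):
    """1) Segmentation: treat <tr>/<td> tags as segment boundaries."""

    def __init__(self):
        super().__init__()
        self.rows = []          # one list of cell strings per <tr>
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag == "td":
            self._in_cell = True
            self.rows[-1].append("")

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:
            self.rows[-1][-1] += data


def classify(segment):
    """2) Classification: decide which field a segment fills (toy rules)."""
    if re.fullmatch(r"[^@\s]+@[^@\s]+", segment):
        return "email"
    if re.fullmatch(r"[\d-]+", segment):
        return "phone"
    return "name"


def extract(page):
    parser = CellParser()
    parser.feed(page)

    # 3) Association: cells in the same row are assumed to form one record.
    records = [
        {classify(cell.strip()): cell.strip() for cell in row}
        for row in parser.rows
    ]

    # 4) Clustering: merge records that appear to refer to the same entity,
    # here simply by comparing lower-cased names.
    entities = {}
    for record in records:
        key = record.get("name", "").lower()
        entity = entities.setdefault(key, {})
        for field, value in record.items():
            entity.setdefault(field, value)  # keep the first value seen
    return entities


if __name__ == "__main__":
    for key, entity in extract(PAGE).items():
        print(key, entity)
```

On the toy page this yields two entities: the two "Jane Doe" rows are merged into one record with a name, an email and a phone number, and "Bob Ray" stays separate. The point of the sketch is that segmentation and association come almost for free from the markup, which is why structure-based extraction can often avoid heavier NLP.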
--Shreejay