This program gathers summary data for each domain in a list. It collects country/state geographical information by parsing pages such as "Contact" pages. It collects Google Adsense IDs, page titles, charset, and builds a word vector containing the most frequent words found on the domain s home page (disregarding stop words).