Am I What I Say I Am ?
Metadata vs. Content, Website Classification and Similarity Determination Project
CSCI 5417 - Information Retrieval (IR)
Fall 2009 Class project for Peter Elespuru and Jerel Moffatt.
This site uses a given website's metadata and content to determine two things.
- (1) Is the site consistent in its representation of itself, based on a comparison of its proclaimed metadata to the actual content ?
- (2) What other sites is it similar to, based on our classification engine (and sites we've crawled and indexed so far) ?