Query Expansion: Internet Mining vs. Pseudo Relevance Feedback

Dmitri Roussinov and Gheorghe Muresan

We report an investigation of techniques for mining the World Wide Web to identify terms (single words or phrases) that are highly related to a topic (query) described by a short interest statement (one sentence to one paragraph long). These related terms are subsequently used to improve automated document retrieval. Following a standard testing methodology, we established that our technique improves retrieval effectiveness by up to 8% over the best currently known techniques (a strong baseline).


