Back in 2006 there was data leak from AOL. Since then dataset has been lived eternally on internet, which is a boon given that you don’t easily get access to datasets anymore. Following are some interesting research on that front

  1. How Google May Reform Queries Based On Co-Occurrence In Query Sessions
  2. Analysis of a Very Large Web Search Engine Query Log
  3. Agglomerative clustering of a search engine query log
  4. Web Query Recommendation via Sequential Query Prediction
  5. Query Recommendation for Improving Search Engine Results
  6. An Optimization Framework for Query Recommendation
  7. Mining Search Engine Query Logs for Query Recommendation
  8. SlideShare: Building Recommendation Platforms with Hadoop