Monthly Archives: May 2011

How to Implement a Search Engine Part 2: Query Index

Overview: This is the second part of our project on implementing a search engine. The first part was about creating the inverted index. Now, we will use the index to answer actual search queries. Query Types: Let’s first remember the query … Continue reading

Posted in Information Retrieval, Search Engines, Web Search | 11 Comments

My Favorite Interview Question

I am working at Microsoft Bing as an intern this summer. To get an internship, I had many interviews with various tech companies this year, and this was my favorite question: In an integer array with N elements (N is … Continue reading

Posted in Programming Interview | 12 Comments

Python Optimization

Fast, efficient code is always desirable, especially for programs that operate on web-scale data. Because the input is so large, even small performance gains add up to big time savings. Here I want to share Python optimization lessons … Continue reading

Posted in Optimization, Python

How to Implement a Search Engine Part 1: Create Index

Overview: OK, let’s start! We will implement a search engine that answers queries on Wikipedia articles. The project has two main parts: first, creating the index by going through the documents, and second, answering the search queries … Continue reading

Posted in Information Retrieval, Search Engines, Web Search | 25 Comments