A recent comment here noted that the core algorithm behind how Google works hasn’t changed very much since its earliest days. I’m not sure that I agree. Many of the posts I’ve made over the past five years that involve Google patents and whitepapers describe ways that Google may be changing how it determines which … Read more We’re All Google’s Lab Rats
When you arrive at a web page, the owner of that page might start collecting information about your visit for a number of reasons. One of the most commonly collected pieces of information is an internet protocol (or IP) address. An IP address is a number that can be associated with the way and the … Read more How and Why Google Might Estimate the Number of Users Behind an IP Address
Two Microsoft papers being presented at this week’s SIGIR’10 conference in Geneva, Switzerland explore the topics of Search Trails – The pages that a searcher travels through after performing a search for a query before reaching a final destination page. The idea of delivering searchers to a final destination page, a page where previous searchers … Read more The Importance of the Journey: Search Trails and Destination Pages
A search engine might use two sets of indexes – one for query terms that tend to show up in more searches and on more web pages, and another larger index that includes queries that aren’t searched for as much by searchers and don’t appear on many web pages. By showing results for some terms … Read more Head URLs and Tail URLs and Bing’s Supplemental Index?
What are Named Entities? Named Entities are specific people, places or things, and a focus of what Google might look for when returning information about queries. They got a lot smarter in answering questions about named entities with the acquisition of MetaWeb, which had developed a way of better understanding named entitied in searches for … Read more Google Gets Smarter with Named Entities: Acquires MetaWeb