In the last installment of this series, we looked at how Google may be using phrase based indexing to use the fact that many phrases often tend to co-occur with other phrases within the content of web pages, to re-rank those pages. When we look at phrases, we also need to drill down to a special set of phrases describing named entities, or specific people, places, or things. In addition to trying to understand which phrases might tend to co-occur with those named entities, the search engines may look to other sources such as Wikipedia, Freebase from Metaweb, the Internet Movie Database (IMDB), and different map databases to attempt to understand when a phrase indicates an actual (or fictional) entity and may perform named entity detection on queries searching for those pages
Google, Bing, and Yahoo all look for named entities on web pages and in search queries, and will use their named entity detection to do things like answer questions such as “where was Barack Obama born?” This type of named entity detection is having an increased role in Search.
Continue reading “10 Most Important SEO Patents: Part 6 – Named Entity Detection in Queries”
Looks like Google and IBM are working together again to build up Google’s patent portfolio, from an update at the United States Patent and Trademark Office (USPTO) patent assignment database. Details beyond the actual patents involved aren’t known yet. The last couple of times I wrote about large patent transactions between Google and IBM this past July and September, Google ended up sending out emails a few hours after my posts to a number of large media sites such as the New York Times, Bloomberg News, the Wall Street Journal, and a number of others disclosing the acquisitions. We’ll see if they do that again.
The last week of 2011, Google acquired 188 granted patents and 29 published pending patent applications from IBM, according to the USPTO assignment database, with an execution data on the assignment of the patents on December 28, 2011, in a deal that was officially recorded at the patent office on December 30, 2011.
The patents cover a broad range of topics, such as presentation software, blade servers, data caching, server load balancing, network performance, video conferencing, email administration, and instant messaging applications. A number of the patents cover specific internet, phone, and mobile phone technologies as well.