Google finds terms and phrases to associate with entities that can be considered terms of interest for businesses, locations, and other entities. These terms can influence what shows up in search results and in knowledge panels for those entities. Consider it part of a growing knowledge base of concepts, entities, attributes for entities, and keywords that shape the new Google after Hummingbird. Semantics play a role as things that specific entities are known for are identified.
For example, the Warrenton, Virginia, Red Truck Bakery (local to me) is known for:
Is Hummingbird the key to understanding the expertise of an author for things like In-Depth articles, and a possible future Author Rank? With content from an author considered using a concept-based knowledge base, it’s quite possible.
The Google Hummingbird rewrite of Google’s search engine wasn’t just aimed at providing a way to better understand long and complex queries, like the type that someone might speak into their phone. It was also likely aimed at better understanding the concepts and topics written about and discussed on Web pages, and in social signals such as posts at Google+ and comments on those posts, in Tweets, in Status Updates, and other short text based messages where there might not be a lot of additional context to go with those messages.
There are a few different parts to this story, though I’m not sure how many there will be because I’m still in the middle of writing them. I started with a prologue, titled Are You,Your Business, or Products in a Knowledge Base?, which introduced Microsoft’s Conceptual Knowledge Base Probase.
Microsoft’s Probase Knowledge Base
Sometime between when Microsoft acquired semantic search company Powerset and now, the software company began work on one of the largest knowledge bases in the world, Probase. Why Bing doesn’t use it now is a mystery, but it doesn’t appear to. There are a few papers about Probase, including one titled, Concept-Based Web Search. Here’s a snippet from the paper, which might evoke some recent memories of Google’s Hummingbird update:
It is important to note that the lack of a concept-based search feature in all main-stream search engines has, in many situations, discouraged people from expressing their queries in a more natural way. Instead, users are forced to formulate their queries as keywords. This makes it difficult for people who are new to keyword-based search to effectively acquire information from the web.
Added 2013-11-10 – Google was granted a continuation version of this same patent (Search queries improved based on query semantic information) on November 5th, 2013, where the claims section has been completely re-written in some interesting ways. It describes using a substitute term for one of the original terms in the query, and using an inverse document frequency count to see how many times that substitute term appears in the result set for the modified version of the query and for the original version of the query. The timing of this update of the patent is interesting. The link below points to the old version of the patent, so if you want you can compare the claims sections.
Back in September, Google announced that they had started using an algorithm that rewrites queries submitted by searchers which they had given the code name “Hummingbird.” At the time, I was writing a blog post about a patent from Google that seemed like it might be very related to the update because the focus was upon re-writing long and complex queries, while paying more attention to all the words within those queries. I called the post, The Google Hummingbird Patent because the patent seemed to be such a good match.
When I talk about, or write about entities, it’s normally in the context of specific people, places, or things. Google was granted a patent recently which discusses a different type of entity, in a more narrow manner. These entities are referred to as “search entities”, and the patent uses them to predict probabilities and understand the relationship between them better. This kind of analysis might result in some pages ranking higher than they otherwise might because of their similarities to other sites, and in some sets of search results favoring fresher results as well.
Google has come under fire the last year or so from critics who claim that the search engine has been providing too many pages from some of the same domains in search results. It appears that this has had them looking at ways that they could provide more diversity within those results. A patent granted to Google earlier this year describes one approach that could have an impact on both local search rankings and Web rankings for authority pages for business entities.
The impact of this approach would be that when these authority pages ranked highly in both Web results and local search results, Google might merge listings for the two, so that the Web search result no longer appears within search results for a specific query and the local search result is potentially boosted higher in results as well.
In the past, I’ve written about How Google Universal Search and Blended Results May Work, describing how Google might decide when and where to include multiple listings within Web search results from different vertical search types, such as local results, images, news articles, videos, and others. Each of these different types of results might be ranked based upon their relevance to a query, and might be included within results based upon how meaningful those results might be to the query and the intent of a searcher.
Will Google Plus morph into a social version of eBay, where people can post things for sale, write reviews of their belongings and show them off to their circles?
A few days ago, the Google File System Blog reported upon a new application from Google, which it referred to by the name Google Mine (as in, “it’s mine,” rather than as in Google mining more information about your life). The post also mentions a version for Android on Google’s internal Play store. The post tells us that the app allows you to be pretty active about posting your belongings.
As you can see, the service lets you enter a lot of information about your objects. For example, you can change the status of an object to “lent”, “given away”, “got it back”, “lost it”, “had in the past”. You can post videos about the object, write reviews, add it to a wishlist and maybe others can buy it using Google Shopping. You can also check popular items and the items others have shared.
Last week, Google was granted a number of patents exploring different aspects of how documents on the Web might be ranked in part based upon topics identified for those documents and the expertise and/or authority of authors involved in the creation of the documents. The process also describes how Google might use different methods to determine the authority of multiple authors who may have worked to create the documents.
This sounds very similar to some statements that Google’s Matt Cutts made at the start of May in a video about What should we expect in the next few months in terms of SEO for Google?