How Google May Rank Some Results based on Categorical Quality

A New Patent on Categorical Quality

Some of the people who write patents for Google tend to stand out to me. One of those is Trystan Upstill. I noticed that he has published another one that looks really interesting, and worth reading. When I started following his patents, I read his doctoral thesis, Document ranking using web evidence which was really interesting, from the early days in his professional career. It is from before he was listed as the inventor of a number of patents, that I also found interesting. I’ve written about a number of patents he has participated in creating as well because they often focus upon Site Quality, and I learn something from reading them and trying to understand them. Here are posts from his patents which I have written about previously:

I noticed his name on a new one granted at the end of May, and I’ve been working through it now, too.

Continue reading “How Google May Rank Some Results based on Categorical Quality”

Context Clusters in Search Query Suggestions

unsplash-logoSaketh Garuda

Context Clusters and Query Suggestions at Google

A new patent application from Google tells us about how the search engine may use context to find query suggestions before a searcher has completed typing in a full query. Think of Google as a Decision Engine, focused upon bringing searchers more information about interests they may have. After seeing this patent, I’ve been thinking about previous patents I’ve seen from Google that have similarities.

Continue reading “Context Clusters in Search Query Suggestions”

Quality Scores for Queries: Structured Data, Synthetic Queries and Augmentation Queries

Quality Scores and Augmentation Queries

In general, the subject matter of this specification relates to identifying or generating augmentation queries, storing the augmentation queries, and identifying stored augmentation queries for use in augmenting user searches. An augmentation query can be a query that performs well in locating desirable documents identified in the search results. The performance of an augmentation query can be determined by user interactions. For example, if many users that enter the same query often select one or more of the search results relevant to the query, that query may be designated an augmentation query.

In addition to actual queries submitted by users, augmentation queries can also include synthetic queries that are machine generated. For example, an augmentation query can be identified by mining a corpus of documents and identifying search terms for which popular documents are relevant. These popular documents can, for example, include documents that are often selected when presented as search results. Yet another way of identifying an augmentation query is mining structured data, e.g., business telephone listings, and identifying queries that include terms of the structured data, e.g., business names.

These augmentation queries can be stored in an augmentation query data store. When a user submits a search query to a search engine, the terms of the submitted query can be evaluated and matched to terms of the stored augmentation queries to select one or more similar augmentation queries. The selected augmentation queries, in turn, can be used by the search engine to augment the search operation, thereby obtaining better search results. For example, search results obtained by a similar augmentation query can be presented to the user along with the search results obtained by the user query.

Continue reading “Quality Scores for Queries: Structured Data, Synthetic Queries and Augmentation Queries”

Search Engine Queries May be Used to Identify Entity Attributes

How Search Engine Queries to Identify Entity Attributes

What are query stream ontologies, and how might they change search?

Search Engine Queries to identify entity attributes

Search engines trained us to use keywords when we searched – to try to guess what words or phrases might be the best ones to use to try to find something we are interested in. That we might have a situational or informational need to find out more about. Keywords were an important and essential part of SEO – trying to get pages to rank highly in search results for certain keywords found in search engine queries that people would search for. SEOs still optimize pages for keywords, hoping to use a combination of information retrieval relevance scores and link-based PageRank scores, to get pages to rank highly in search results.

With Google moving towards a knowledge-based attempt to find “things” rather than “strings”, we are seeing patents that focus upon returning results that provide answers to questions in response to search engine queries. One of those from January describes how query stream ontologies might be created from search engine queries, that can be used to identify entity attributes which could be used to respond to fact-based questions using information about those entities.

There is a white paper from Google co-authored by the same people who are the inventors of this patent published around the time this patent was filed in 2014, and it is worth spending time reading through. The paper is titled, Biperpedia: An Ontology for Search Applications

Continue reading “Search Engine Queries May be Used to Identify Entity Attributes”

Citations behind the Google Brain Word Vectors Approach

Cardiff-Tidal-pools

Google’s Word Vectors Approach

In October of 2015, a new algorithm was announced by members of the Google Brain team, described in this post from Search Engine Land – Meet RankBrain: The Artificial Intelligence That’s Now Processing Google Search Results One of the Google Brain team members who gave Bloomberg News a long interview on Rankbrain, Gregory S. Corrado was a co-inventor on a word vectors patent that was granted this August along with other members of the Google Brain team.

In the SEM Post article, RankBrain: Everything We Know About Google’s AI Algorithm we are told that Rankbrain uses concepts from Geoffrey Hinton, involving Thought Vectors.

The summary in the description from the patent tells us about how a word vectors approach might be used in such a system:

Continue reading “Citations behind the Google Brain Word Vectors Approach”

How Google Might Make Better Synonym Substitutions Using Knowledge Base Categories

Shea Stadium
Leigh Miller – Yankee Stadium, francis_leigh, Some rights reserved

How Google May Use Synonym Substitutions to Rewrite Queries

A couple of months ago, I wrote about a Google patent that involved rewriting queries, titled Investigating Google RankBrain and Query Term Substitutions. There’s likely a lot more to how Google’s RankBrain approach works, but I came across a patent that seems to be related to the patent I wrote about in that post and thought it was worth sharing and starting a discussion about. The patent I wrote about in that post was Using concepts as contexts for query term substitutions. The title for this new patent was very similar to that one (Synonym identification based on categorical contexts), and the more recent patent was granted on December 1st of this year.

Continue reading “How Google Might Make Better Synonym Substitutions Using Knowledge Base Categories”