Category Archives: Search Engine Optimization (SEO)

Search Engine Optimization tips and strategies and information, from SEO by the Sea, to help make web sites easier to find.

How Google Might Index Link Behavior Information

Under a conventional approach to indexing links by a search engine, information about the targeted address that a link is pointed towards might be included in a search engine’s index, as well as the anchor text displayed within the links, and possibly even some text near the link itself. The Google Reasonable Surfer model points to the possibility of other information being collected about a link as well, which could be taken together as a whole to calculate how much value or weight might be passed along by the link to another page under a PageRank link analysis model or even in determining how much weight the anchor text used to point to a link might carry.

The question, Just How Smart are Search Engine Robots has been asked with more frequency lately, and a pending patent application published by Google shows how the search engine might be collecting a whole different type of link behavior information about links that are found on the Web. Given Google’s move towards building their own Chrome Browser and providing access to web pages via alternative screens such as those on smart phones and other handheld devices and television screens, it makes sense for the search engine to capture this kind of information as well. The image from the patent filing below shows sections of links, including target and onclick attributes that the search engine might now be indexing.

A screenshot from Google Maps showing an information box over the map that appears after clicking upon a link in the column to the left.

Continue reading How Google Might Index Link Behavior Information

Google’s Comment Patents and How Pages’ Web Rankings Might Be Influenced by Commentors’ Reputations

A rumor surfaced last week that Google would launch a third party commenting platform to rival Facebook’s. Coincidentally, Google was granted two patents this week describing comment systems, and how comments might be ranked under those systems. But the patents appear to describe comments on two different services from Google that have been discontinued. One of the patents appears to involve Google Sidewiki, which had more of a Web annotation service feel than that of a commenting system, and and the other involves comments on Google Knol.

Google Sidewiki and Google Knol and Commenting

Google Sidewiki enabled people to leave a comment on virtually any page on the Web, and could be accessed through the Google toolbar. A 1999 survey of Web annotation services showed that they have been around since the earliest days of the Web, and they differ from commenting systems in that they’ve been aimed at providing ways for people to leave private or public notes about web pages, sometimes but not necessarily with the participation of the authors of those pages. When Google announced that they were closing down Sidewiki last September, they told us that:

Continue reading Google’s Comment Patents and How Pages’ Web Rankings Might Be Influenced by Commentors’ Reputations

Most Important SEO Patents Part 10: Just the Beginning

I’ve been faced with a pretty difficult decision, choosing the last of the patents, or patent families to include in this series of posts about the most important search-related patents to people who promote sites on the Web. I find I just can’t choose one.

Synonyms

For the last few weeks, I’ve been arguing with myself over a choice of at least two sets of patents. One patent that I wanted to include involved responding to informational needs by going beyond matching keywords to expand the query terms used in search results to include synonyms and pages on related concepts. There are a number of related patents granted to Google that describe how the search engine might identify synonyms, and it’s worth spending some time with all of them.

Large Data Sets

Continue reading Most Important SEO Patents Part 10: Just the Beginning

Predicting SEO Changes in Rankings, Algorithms, and Penalties

Last Thursday, the Wall Street Journal published a couple of articles that point to a new direction in the future from Google, With Semantic Search, Google Eyes Competitors, and Google Gives Search a Refresh. On Friday, Barry Schwartz reported at Search Engine Land that Google’s Head of Spam, Matt Cutts announced that Google was working upon an “Over Optimization” penalty for websites that were stuffed with too many links and had excessive links pointed to them, in the post, Too Much SEO? Google’s Working On An “Over-Optimization” Penalty For That.

Thursday evening I visited the Philadelphia offices of Seer Interactive to give a presentation on some of the changes in Search and Social activities involving SEO in a free presentation hosted by Wil Reynolds and the Seer Interactive team. Amongst the possible changes I pointed out included more emphasis on search as a knowledge base, with more Q&A results, and a greater emphasis on information extraction around entities as described in the Wall Street Journal article.

Nuance Search-Related Patent Applications Published

Nuance Communications, which partners with Apple Computers to provide the voice recognition software behind Apple’s intelligent assistant Siri, had 4 patent applications published today at the USPTO that focus upon search and search technology. While the company has at least 274 granted patents and 104 pending patents listed as assigned to it at the US patent and trademark office, these appear to be the first that focus upon the operations of a search engine. They reference the Dragon Search application built for iPhones:

A screenshot from the patent showing the Dragon search interface from Nuance.

The topics covered in the Nuance patent portfolio primarily involve speech recognition technology, but include some areas that companies like Google have been focusing upon within a few of their patents as well, such as statistical language models and document segmentation algorithms, as well as a browser for the voice web which was filed in 1998.

Continue reading Nuance Search-Related Patent Applications Published

The New PageRank, Same as the Old PageRank?

When a judge writes a judicial opinion upon a case, he often includes more than just his ruling on the case. It usually contains an analysis of the present law, the legal atmosphere, and how the ultimate holding on the case was arrived at. Those written rulings can also include some legal opinions on issues that don’t necessarily play an essential role in the outcome of the case at hand, and those are often referred to as “dicta.”

When you read a patent, you’ll see that it’s broken into a number of parts. The most important of those is the claims section, which is what a patent examiner focuses upon when prosecuting a patent, and deciding whether or not it should be granted. There are also description sections in patents which give a richer and more detailed look at how the technology behind a patent might be implemented (with emphasis on the “might”). Often those descriptions include material that isn’t reflected within the claims section of a patent, and in many ways, those description sections could be considered as similar to the dicta that I mentioned sometimes appears within judicial opinions.

Stanford University was granted two new patents today under the name, Scoring documents in a database, both of which were filed at the United States Patent and Trademark Office on January 19, 2010. These two patents, assigned to Stanford and listing Lawrence Page as inventor, are described as continuation patents of the following patents assigned to Stanford which focus upon PageRank:

Continue reading The New PageRank, Same as the Old PageRank?

12 Google Link Analysis Methods That Might Have Changed

In the Google Inside Search blog, Google’s Amit Sighal published a post titled Search quality highlights: 40 changes for February that told us about many changes to how Google ranks pages, including the following:

Link evaluation. We often use characteristics of links to help us figure out the topic of a linked page. We have changed the way in which we evaluate links; in particular, we are turning off a method of link analysis that we used for several years. We often rearchitect or turn off parts of our scoring in order to keep our system maintainable, clean and understandable.

A diagram showing different values for links passing amongst three different web pages.

A lot of people were guessing which “method of link analysis” might have been changed, from PageRank being turned off, to anchor text being devalued, to Google ignoring rel=”nofollow” attributes in links, to others. I was asked my opinion by a few people, and mentioned that there were a number of potential approaches that Google might have changed.

Continue reading 12 Google Link Analysis Methods That Might Have Changed

Big Data at Google

According to Google’s Director of Research, Peter Norvig, if you look at Google Trends for trends related to “full moon” or “ice cream”, you’ll see that Google searches for those terms imitate actual physical trends in the world. With a very large number of queries performed for those terms, searches for “full moon” peak every 28 days. Searches for “ice cream” peak every summer, 365 days apart. Large amounts of data make interesting things possible.

If you’re interested in how search engines work, and how large amounts of data can help them do what they do more effectively, it’s highly recommended that you read the paper The Unreasonable Effectiveness of Data (pdf), written by Alon Halevy, Peter Norvig, and Fernando Pereira, from Google. Even more highly recommended is a presentation from Peter Norvig of the same name from a Distinguished Lecture Series at the University of British Columbia last fall, which sadly has less than a 1,000 views at YouTube presently:

Continue reading Big Data at Google