In 2005, Google’s John Lamping gave a presentation to a class at Berkeley on the Quality of Information, titled On the internet, nobody knows you’re a dog (pdf). In his talk he raised questions such as:
- Why is the Daily Californian advertising German pages?
- How much can the spam industry make by spamming search engines?
During his speech, he pointed out ways that people have attempted to manipulate search results, such as mad libs-like insertions of keywords into templates for pages like in his slide above, cloaking and other spam approaches to optimizing pages, and paid links and comment spamming. In addition to talking to academic audiences about search quality, he has been working on doing something to increase the quality of search results.
Continue reading “Google’s Quality Score Patent: The Birth of Panda?”
I don’t know who said that novelists read the novels of others only to figure out how they are written. I believe it’s true. We aren’t satisfied with the secrets exposed on the surface of the page: we turn the book around to find the seams.
In a way that’s impossible to explain, we break the book down to its essential parts and then put it back together after we understand the mysteries of its personal clockwork.
– Gabriel Garcia Marquez Meets Ernest Hemingway
I’m told that if you want to be a good photographer, you should look at a lot of photos. If you want to be a good painter, you should look at a lot of paintings. I believe that the same holds true with being a blogger, and seeing how other bloggers present their messages, tell their tales, and report their news.
Continue reading “Blog Lessons from Dead Tree Authors and Cereal Boxes”
Google, Yahoo, and Bing have joined forces to enable web publishers to include additional HTML that adds more structure to their pages, and possibly makes those pages easier to index and may provide them with a little more control over what may show up in search results for pages. There’s some controversy over the approach, some questions about the impact of related patents that all three search engines have been granted, and web publishers should be paying attention to the possible impacts of this initiative from the search giants.
Google’s Author Markup
Yesterday, Google announced that they were introducing a way to add HTML code to a page to indicate who the author of the page might be. This code would appear as part of a link pointing to an author’s page on the same site, so that a search engine might associate the content of that page with the author who wrote it. The announcement was made in the Google Inside Search blog, in the post Authorship markup and web search, which told us how Google would use rel=”author” and rel=”me” to learn about who may have authored what on the Web.
Continue reading “Author Markup, Schema.org and Patents, Oh My!”
Unlike Web pages, there are no links in books for Google to index and use to calculate PageRank. There’s no anchor text in links to use as if it were meta data about pages being pointed towards. Books aren’t broken down into separate pages that have a somewhat independent existence of their own the way that Web pages do, with unique title elements and meta descriptions and headings. There isn’t a structure of internal links in a book, with file and folder names between pages or sections that a search engine might used to try to understand and classify different sections of a book, like it might with a website.
A Google patent granted today describes some of the methods that Google might follow to index content found in books that people might search for. It’s probably not hard for the search engine to perform simple text based matching to find a specific passage that might be mentioned in a book. It’s probably also not hard to find all of the books that include a term or phrase in their title or text or which were written by a specific author. But how do you rank those? How do you decide which to show first, and which should follow?
Continue reading “Searching in Google’s Book Search (The SEO of Books?)”
Maybe a better question is when will TV set top boxes give people the ability to interact with others on their TV screens when watching? A patent filing published at the end of May shows one possibility that we might see in the future for making TV more social.
In addition to enabling television and social networking to be available via a real time interface, the set top box might allow people to preconfigure access to different social networks based upon channels on the television or certain time ranges.
For instance, you might associate ESPN with a Twitter Red Sox Fan Group at @Rdsxfans, as seen in the user interface screen below, or chat access to a specific person on Facebook between the hours of 7-10pm:
Continue reading “Will Social Networks invade Your TV?”
Almost 6 years ago, SEO by the Sea was born in a thought inspired by the sight of sails bobbing up and down on waves where the Susquehanna River meets the Chesapeake Bay, in Havre de Grace, Maryland.
The town is a sleepy historic place, which comes alive during weekends, when many visit the antique shops and docks and bed and breakfasts. Most of it burned down during the War of 1812, when nailmaker and militia lieutenant John O’Neill attempted to singlehandedly hold off British ships intending to sail up the River after their attack on Baltimore (the attack that inspired Francis Scott Key’s Star Spangled Banner).
O’Neill was eventually captured, and in retribution, the British set the buildings of Havre de Grace afire. Little of it remained unscathed, but the residents rebuilt the town, which originally got its name from Marquis de La Fayette, who called it Havre de Grace, or the “Harbor of Grace.”
Continue reading “Looking Back after 6 Years and 1,000 Posts”