The practice of search engine optimization doesn’t need to involve tricks and misdirection to enable searchers to find sites that fulfill their needs, and help them complete tasks on the Web. But there are folks who do attempt to trick searchers and search engines to profit from both.
When it comes to pages created and used to spam search engines, Microsoft came up with some interesting statistics in study conducted over the past six months.
Last October, the Live.com research team noticed three doorway pages showing up in the top ten search results for the search phrase “Cheap Ticket” and decided to investigate further.
When someone clicked on one of those pages, they were redirected to other pages recognized as being involved in spamming search engines. They also saw advertisements on the pages to which traffic was being redirected, from a company that they believed to be legitimate and not involved in misleading and misdirecting search engines.
Telling themselves that a reputable company wouldn’t buy advertisement services directly from spammers, they wondered if they could locate the middlemen who were responsible for placing those ads on those pages. The Microsoft paper, Spam Double-Funnel: Connecting Web Spammers with Advertisers , details the methods that they used to locate spam, and the domains that were being used to redirect traffic.
Some interesting observations from the study:
- “Drugs” and â€œringtoneâ€ were the two most-spammed categories (amongst those studied) with an average search-result spam density as high as 30.8% and 27.5%, respectively.
- When sites from non-commercial top-level domains, such as .gov and .edu, show up in a prominent manner in the search results of spammer-targeted commercial search terms, it often indicates that the site has been spammed.
- At least three in every four unique blogspot URLs that appeared in top-50 results for commercial queries (that they selected for this study) were spam (77% and 75%).
If you were able to read only one search engine related research paper this week, I would recommend this one.