Search Engine Classification and Assignment of Categories

search engine classification

Search Engine Classification into Categories If you have a website that classifies products or services or pages into different areas, and your products might be offered in a shopping search engine or other services that draw information from multiple websites, how you classify what you offer may play a role in how that shopping search … Read more Search Engine Classification and Assignment of Categories

Transient Links to Avoid Crawling Advertisement and Session Tracking Links

transient links

A newly published Yahoo patent application describes a couple of ways to filter out some of the URLs that it might crawl, to keep those pages from being indexed and presented to searchers. Those URLs are referred to in the patent filing as transient links because they change from visit to visit, often because they … Read more Transient Links to Avoid Crawling Advertisement and Session Tracking Links

An Expansion of Importance Scores for Web Page Rankings?

When a judge looks at evidence entered into court, he weighs a number of factors. One of them is whether the evidence offered is relevant to the case at hand. The other is how important that evidence might be. Now, a piece of evidence by itself doesn’t have to be groundbreaking to important, but for … Read more An Expansion of Importance Scores for Web Page Rankings?

How Does Congress Use Google? Mentions on the Congressional Record

I had read that a hearing regarding the proposed Google-Doubleclick merger was going in front of the Senate Judiciary Committee’s Subcommittee on Antitrust, Competition Policy and Consumer Rights on Thursday. In looking for information about the hearing, I also decided to take a look at when and how Google was mentioned by Congress on the … Read more How Does Congress Use Google? Mentions on the Congressional Record

Microsoft on Javascript Redirection Spam

A paper prepared by Microsoft researchers at the AIRWeb’07 conference this past May explores some methods that a few people use to try to trick search engines. The paper, A Taxonomy of JavaScript Redirection Spam (pdf), provides a nice overview of those methods. In this paper, we study common JavaScript redirection spam techniques on the … Read more Microsoft on Javascript Redirection Spam