Improving Text Segmentation for Displaying Advertisements and Filtering Search Results

When you type a domain name into your browser address bar and the domain isn’t found, sometimes you’ll be served a search results page that has advertisements and links related to a “subject” for that domain name.

For example, you might type “” into the address bar, and there may not be a website at the domain name “”. You may be redirected to a third-party website, with advertisements and/or links relevant to that domain name. Ads might be shown for the phrase “used rugs” on that web page, if it is determined to be the most likely segmented version of the string of text from the domain name.

Some sites might be filtered from appearing in search results because the domain names may seem to potentially indicate adult related material.

For example, a domain name, such as “”, could be filtered out of search results by an adult filter, because the word “sex” appears in the string of characters.

Continue reading “Improving Text Segmentation for Displaying Advertisements and Filtering Search Results”

Google’s Green Border Technologies Patent Filings

On May 11th, Google purchased security company Green Border Technologies, Inc.

Green Border has a handful of patent applications pending, and a granted patent for their security software. One of the names that appears on most of the documents is Ulfar Erlingsson, who left the company in 2003 to join Microsoft Research.

The software from the Mountain View, California based company works to isolate internet sessions from the rest of a user’s PC. I’ve seen speculation that this software might be offered to Google users as part of the free Google Pack software download. The software might also be used by the search engine to crawl sites using client software to locate malware.


Continue reading “Google’s Green Border Technologies Patent Filings”

Big Maps, Big Data: Google’s Keyhole Flatfile Patent

Google was granted a patent today on the way that they store, retrieve, and draw geospatially organized data in systems like Google Earth.

Server for geospatially organized flat file data
Invented by Chikai J. Ohazama, Phillip C. Keslin, and Mark A. Aubin
Assigned to Google
US Patent 7,225,207
Granted May 29, 2007
Filed October 10, 2002


A flat file data organization technique is used for storing and retrieving geospatially organized data. The invention reduces transfer time by transferring a few large files in lieu of a large number of small files. It also moves the process of locating a given data file away from the file system to a proprietary code base. Additionally, the invention simplifies database management by having quadtree packets generated on demand.

Continue reading “Big Maps, Big Data: Google’s Keyhole Flatfile Patent”

User Intent and Characteristics of Search Queries

User Intent behind Search Queries

One of the short posters at the recent WWW 2007 Conference in Banff, Alberta, Canada, provides an in-depth look at classifications of search queries after sampling more than 5 million queries, taken from transaction logs from three different search engines.

They use that data to come up with a classification algorithm, which was then used on a “separate Web search engine transaction log of over a million queries submitted by several hundred thousand users.” The results are interesting.

The article is Determining the User Intent of Web Search Engine Queries, from Bernard J. Jansen and Danielle L. Booth of Pennsylvania State University, and Amanda Spink of the Queensland University of Technology.

Their findings indicated that approximately 80 percent of the queries classified were informational in nature, with the remaining queries being split almost equally between navigational and transactional queries.

Continue reading “User Intent and Characteristics of Search Queries”

Catching Up With Memes, Part 1 – How Nerdy Am I?

I’ve seemed to have been tagged with a lot of memes recently, and haven’t posted any responses to them. The three day weekend seemed to be a good time to try to do that, but I seem to be gathering tags faster than I can reply to them.

My Friend Sophie tagged me with one that asks the question, “How nerdy are you.” I took this test a few days ago without being tagged. Seems my nerd quotient has slipped a little since the first time I took it.

I am nerdier than 65% of all people. Are you a nerd? Click here to find out!

The second part about a meme is that you tag others after taking it. I’m tagging the following folks:

Continue reading “Catching Up With Memes, Part 1 – How Nerdy Am I?”

Google Plus Box Patent Application

When you perform searches in Google, sometimes you will see within the listed search results one which has a plus sign next to it. If you click upon the plus sign, you are shown some more information. A new patent application published at the USPTO website provides some information about how expanded and collapsed data in search results might work.

It also raises some questions about how much information search engine results should actually show.

These types of results may have first started appearing displaying maps and local business information for specific businesses. Google referred to this feature in their Web Master Help pages as a plus box (though the link is no longer live):

The address link shown below some sites in our search results (in an expandable area called a Plus Box) is meant to help searchers locate businesses and compare search results. We show the address link for results that are local in nature and for which we have an associated address. If we don’t have an address for your business, or we don’t think that an address is relevant to your site we won’t show it.

Continue reading “Google Plus Box Patent Application”