We can make your web site easier to find, and easier to use.

Using Anchor Text to Find Documents in Other Languages

Google was granted a new patent this morning, which looks like an attempt to make it easier to find relevant documents in other languages for a query which relies considerably upon anchor text. It was originally filed in 2001, so it really isn’t that new.

Worth a look if you are concerned about how pages in one language might show up as results of a query in another language (or just interested in how search engines might approach something like this. Of course, it’s just a patent, so that doesn’t necessarily mean that it was ever implemented.

Systems and methods for using anchor text as parallel corpora for cross-language information retrieval
Invented by Luis Gravano and Monika H. Henzinger
Assigned to Google
US Patent 7,146,358
Granted December 5, 2006
Filed August 28, 2001

Abstract

A system performs cross-language query translations. The system receives a search query that includes terms in a first language and determines possible translations of the terms of the search query into a second language. The system also locates documents for use as parallel corpora to aid in the translation by:

(1) locating documents in the first language that contain references that match the terms of the search query and identify documents in the second language;

(2) locating documents in the first language that contain references that match the terms of the query and refer to other documents in the first language and identify documents in the second language that contain references to the other documents; or

(3) locating documents in the first language that match the terms of the query and identify documents in the second language that contain references to the documents in the first language.

The system may use the second language documents as parallel corpora to disambiguate among the possible translations of the terms of the search query and identify one of the possible translations as a likely translation of the search query into the second language.

LinkedInPinterestStumbleUponShare

6 comments to Using Anchor Text to Find Documents in Other Languages

Comments Policies

  • Relevant comments on the topic of a post are very much appreciated.
  • Please use your personal name rather your business name or keywords in the name field.
  • Comments filling the name field with anchor text to spam this site and search engines (in English or any other language) may be edited, have URLs removed, or deleted entirely.
  • If you include a link in the website field, please choose one about you rather than some product or service or site or blogpost that you are promoting.
  • No signature links in comments, please.