Are large news agencies, with a wide scope of international coverage on multiple topics, with large numbers of reporters, and finely edited articles better sources of news than smaller and more local papers, or narrow niche blogs?
A patent on ranking articles in Google News was granted this week that was originally filed in 2003, and it discusses a number of ranking factors that it might use to present news article based upon the “quality” of the news sources involved.
What is very interesting about it is that it provides some insight into the assumptions behind those ranking factors. I suspect that Google may have changed their stance on some of the assumptions behind those factors since then.
The patent doesn’t include a full range of signals that Google probably considers in ranking news stories, such as the freshness of the news (as noted in Google’s patent filing on Universal Search), or whether or not a certain source is the original.
Just which words show up most frequently on the Web? I’m not sure that question can be answered, but it’s something I’ve wondered for a while.
With a beta version of Google’s future update, code named Caffeine recently released to allow people to experiment with, I thought I would do a few comparisons.
I found a few lists of the most common words in the English language, and came up with a top 50 to see how frequently those were estimated to show up in Google, Yahoo, Bing, Ask, and Google Caffeine. Those are shown in a table and a chart below.
I’m not sure how informative this might be, even after looking at it. It’s not a very scientific test as well. There are a few reasons for that:
With billions of pages on the Web, trying to find the right words to use when you want to search for something can often be hard, especially when you’re looking for information on a topic that you don’t know too much about.
As a designer or site owner, coming up with the words on your pages that searchers expect to see, and may use to search for what you have to offer can also be difficult.
Search engines often act as a middle man between searcher and site owner, helping to bring people to pages that may help them satisfy some kind of informational need, or accomplish some task. Again, that can be difficult, trying to get some idea of what people are looking for with two or three words, and showing a list of web pages that might be helpful to those searchers.
Search engines will often offer search suggestions based upon the words that a searcher types into a search box, to try to make it easier for those searchers. Knowing more about how search engines find those suggestions may be helpful to searchers and to site owners.
You might see these query suggestions above or below the search results that you are shown when you search, or appearing in a dropdown under the search box as you type. They often are displayed with terms like the following showing up in front of them:
If you look up when the last five movies from Jim Carrey were released, and were able to sneak a peek at Google’s query logs, you’d see that searches for Jim Carrey spiked on those dates.
Same with Ben Stiller, Edward Norton, Leonardo Dicaprio, and Tom Hanks.
We know this from a footnote in a recently published paper from researchers at Google.
The authors of Gazpacho and summer rash: lexical relationships from temporal patterns of web search queries checked to see if there was some kind of time-based relationship between searches for those movies’ names (and release dates) and the names of those actors.
It sounds obvious that there would be, but it’s interesting to see actual data from Google that explores relationships like that.
Relationships between Queries Based upon Time
Post your URL in an SEO forum, and get labeled as a Web Spammer? Maybe.
There are some site owners and internet marketers who attempt to increase how well their web sites rank in search engines by buying links to their sites, or exchanging links with others. Those kinds of activities are frowned upon by the major search engines because that kind of manipulation can impact which pages show up in search results. As Google notes on one of their help pages on link schemes:
Your site’s ranking in Google search results is partly based on analysis of those sites that link to you. The quantity, quality, and relevance of links count towards your rating. The sites that link to you can provide context about the subject matter of your site, and can indicate its quality and popularity. However, some webmasters engage in link exchange schemes and build partner pages exclusively for the sake of cross-linking, disregarding the quality of the links, the sources, and the long-term impact it will have on their sites. This is in violation of Google’s webmaster guidelines and can negatively impact your site’s ranking in search results.
Likewise, there are forums where people publicly discuss the exchange of links to manipulate search results.
Google announced on the Official Google Blog this morning that they have signed a deal to acquire On2Technologies:
Because we spend a lot of time working to make the overall web experience better for users, we think that video compression technology should be a part of the web platform.
To that end, we’re happy to announce today that we’ve signed a deal to acquire On2 Technologies, a leading creator of high-quality video compression technology.
A Google Press Release, Google to Acquire On2 Technologies provides more details of the deal.
You buy a new phone, and it doesn’t work as advertised, and customer service is even worse. Many people in your shoes would go online, and write a negative review somewhere.
You go on a vacation, and stay at an inexpensive and charming bed and breakfast. You have a wonderful time, in no small part to the thoughtfulness and suggestions of your hosts, and their incredible hospitality. Chances are, you write a glowing review about the experience on the Web.
The number of reviews and review sites on the Web has been growing over the past few years. Google’s recent “review” search option is one attempt to help people find both positive and negative reviews.
Google also presents reviews in Google Maps results. If you search for businesses and organizations in Google Maps, you’ll see under each listing a link to “write a review” for each business listed. If you click upon the “more info” link for a listed business, you’ll see a “review” tab in the box that appears in the middle of the map for that business. The results that show reviews are summaries, which often contain some level of sentiment about the businesses listed.
Does the amount of time it takes for a page to load in a browser influence search engine rankings for pages? Should it?
If it did, might sites that were all text show up higher in search results than sites that included pictures and different applications? Or, might a search engine find a way to account for different types of sites, and the amount of time it might take them to render in a browser, based upon actual user data in addition to the amount of time it takes a site to render in a browser?
A recent patent application from Yahoo explores ways that a search engine might consider the amount of time it takes different types of pages to render and other issues involving how quickly pages respond to a visits in ranking, classifying and crawling those pages.