SEO Resources: Books, Blog Posts, Maps, Articles, and Papers
To succeed at SEO; start by understanding the audience you have built a site for, what their informational needs are & the pain points they may be experiencing, the words they use to describe the goods & services you offer that they will search for & expect to see on your pages. And then learn about SEO and the Search Engines…
If you are new to SEO, you may want to learn some more than what is on this blog. I found some SEO Resources in the shape of papers, books, and articles online that I wanted to share.
A helpful introduction to SEO from Google. If you are going to do SEO or have SEO done to your Website, you should learn as much about it as you can. Seeing what Google says about it is a good place to start.
The Beginner’s Guide to SEO
Another SEO Resource, produced by SEO toolmaker Moz; with another helpful look at how Search Engine Optimization works.
A Standard for Robot Exclusion
by robots mailing list
In the earliest days of the Web, a mailing list launched to help figure out how to deal with robots crawling all over each other’s servers. The members of the list came up with some solutions, now known as a robots.txt file.
By the W3C
If you are going to have a presence on the Web, you should learn something about the language that it is written in. This introduction to HTML will tell you some of that and give you a little insight into the history of the technology behind the Web.
Dive into HTML5
by Mark Pigrim
Another guide to using HTML, by Mark Pilgrim, who had worked at Google for a number of years, and has written other helpful howto guide about subjects such as accessibility, and Python. More than just a look at the rules of HTML; a guide to how those work.
As We May Think
by Vannevar Bush
In July of 1945, Vannevar Bush speculated what scientists who had worked on the war effort should turn their hands to next, to make the world a better place. His article urged scientists to focus upon making knowledge more accessible to everyone and came up with an idea that in many ways foreshadowed the emergence of the internet.
Published in Science on July 15, 1955, Eugene Garfield proposes a citation index to scientific articles, in many ways like the legal Shepard’s Citation, which helps lawyers and legal scholars in US State and Federal Courts find publications and court cases that refer to other cases. Eugene Garfield’s work on citation analysis had an influence on how links are considered as citations in algorithms such as PageRank.
Improved Text Searching in Hypertext Systems (pdf)
by Lawrence Page
One of the essential SEO Resources – the first PageRank patent, filed by Lawrence Page with the USPTO on January 10, 1997. A plain language description of PageRank and the Backrub search engine in a provisional patent filing that was never actually assigned or published in the patents database, and which provides a comparison of Backrub with other search engines of the time.
Hypersearching the Web
by Soumen Chakrabarti, Byron Dom, S. Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew Tomkins, Jon M. Kleinberg, and David Gibson
IBM’s CLEVER Project explored how analyzing links between pages could be useful in indexing the Web, around the same time that Google was developing its PageRank approach. While the team never publicly released a search engine, many of the concepts they developed were used by Teoma/Ask Jeeves. This paper describes the concepts of “Authorities” and “Hubs” within a collection of pages for a query on a specific topic, which is used to refer to how some pages are linked to by many other pages, and other pages link out to many other pages.
Claude Hopkins published this classic book on advertising in 1923, and it’s still very relevant for today’s world of online marketing and advertising.
Hubs, Authorities, and Communities
By Jon Kleinberg
Kleinberg was someone who looked at the importance of links, and what they might be able to tell about the value of pages on the Web. Understanding the difference between pages perceived as hubs and pages perceived as authorities had value, and likely still does. Knowing what the difference may still help today.
A Taxonomy of Web Search
by Andrei Broder
One of the most important papers about the intent behind searches; knowing the difference between informational, transactional, and navigational searches influences many algorithms that play a role in what search results people may see from search engines. The author wrote this when he was at Alta Vista, one of the earliest leading search engines, but it is good to know the difference between those types of searches today.
The Intention Behind Web Queries
by Ricardo Baeza-Yates, Liliana CalderÂ´on-Benavides and Cristina GonzÂ´alez-Caro
Another look at how the intent behind web searches may be categorized, and why they might be. It spends some time looking at the Broder Paper above this one on the taxonomy of Web Search.
Introduction to Information Retrieval
by Christopher D. Manning, Prabhakar Raghavan and Hinrich SchÃ¼tze
A thoughtful look at how search works from a computer science perspective. Highly recommended for those who like to delve into the science behind search.
Search User Interfaces
by Marti Hearst
A very readable and very informative book that approaches how search engines work not from the algorithms behind the scenes, but rather the interfaces that you see when you search. If you want to learn a lot about how search engines work quickly, this is a great place to start.
The Anatomy of a Large-Scale Hypertextual Web Search Engine (pdf)
by Sergey Brin and Lawrence Page
One of the very first white papers that provided a glimpse into how a commercial search engine works. The search engine in question in Google, and even though this paper was written more than a decade ago and provides some great historical perspective on Google and search, there are hints in it of things to come from the search engine.
The PageRank Citation Ranking: Bringing Order to the Web
by Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd
If you’ve heard of Google, chances are you’ve also heard about PageRank, which is a method that the search engine used to rank how important pages are on the Web, and which has been combined with other ranking signals to determine the order of pages you see when you search. It’s very likely that the PageRank of 1998, as described in this paper, has evolved over the last decade, but it’s worth reading about how it was intended to work in the early days.
Shaping the Web: Why the politics of search engines matters (pdf)
by Lucas D. Introna and Helen Nissenbaum
Published in 2000, this paper looks at the potential biases in which search engines may engage, arising not so much from technical issues, but rather political ones. Why might some types of sites be excluded from search results while others might be favored? A thoughtful criticism of popularity-based search algorithms and the purchasing of prominence in search results.
Published in 1989, this paper discusses a different kind of search interface than what often gets discussed in Information Retrieval circles, where a single search is often part of multiple pages and multiple query inquiry for information. A thoughtful paper that might have you thinking about designing web pages a little differently.
In addition to SEO Resoources, it can be helpful to look at some related concepts such as usability. This set of usability guidelines from the Department of Health and Human Services are helpful, creative, and smart. If you design websites, and you haven’t seen them, you should take a look. You might get some ideas on how to make your sites more usable for visitors.
Introduction to Information Retrieval
by Christopher D. Manning, Prabhakar Raghavan and Hinrich SchÃ¼tze.
Published in 2009, this online version of the book provides a great first look at the computer science behind how search engines work.
Helpful Government Sites
SEO Resources from the Search Engines
- Google Webmaster Guidelines– Google has shared quality guidelines for being included in the Google search engine for site owners. These are worth spending time with, if you would like to have a site listed at the Mountain View Search engine, and want to avoid being penalized by the Search Engine.
- Google Webmaster Central – Google has provided a number of informational and support pages, where you can learn more about the search engine, and some of the features that it offers, and connect to some of the other features that it makes available.
- Google Webmaster Central Help Forums – Google provides this forum where webmasters can bring questions about websites that they may own, and contributors to the help forums and Google Employees will often answer questions brought to these forums. If something happens to your site that you may have a question about, this may be a place to get answers that may be helpful.
- Google Developers – These are pages from Google for Web Designers and Developers, and are often a good place to learn about features that Google offers, including information about Structured Data.
- Google Trends – Where you can learn about Searcher’s interest over time in different terms and phrases
- Google My Business – Google’s Business Directory, and where you submit your site to have it listed in Google Maps.
- Yahoo Search Help – If you have any questions about the Search Engine at Yahoo, this is a place where you might be able to find answers.
- Webmaster Central – Bing – Bing’s Answer to Googles Webmaster Tools (which become Google Search Console), and these do seem similar to the older Webmaster Tools from Google.
- Bing Places for Business – Bings version of Google My Business; a business directory
- Apple Maps Connect-Where you submit your business for it to be listed in Apple Maps
- sitemaps.org – Information about XML sitemaps from Google, Yahoo, and Microsoft, which describes a machine readable approach that can be used to help pages of sites get indexed by search engines as an alternative approach to a search engine crawler going through a site link-by-link. A whitepaper from Google, titled Sitemaps: Above and Beyond the Crawl of Duty discusses some of the benefits of using an XML sitemap, and some different strategies for setting an XML sitemap up on larger sites that use XML sitemap indexes, and is worth reading if you would like to have a good sense of how XML sitemaps can be helpful.
SEO Resources – Publications and Blogs from Google
The Keyword Google’s blog where they focus upon new features at the search engine.
The Official Google Blog – this one seems dedicated to search at Google.
Google AI Blog – This seems to have been the Google Inside Search Blog, and the Google Research Publications.
Deepmind Publications – The AI Subsidary owned by, and controlled by Google.
Google Calico Publications – Google’s Health research branch publications.
SEO Resources – Publications and Blogs from Bing
Last Updated November 30, 2018