I came across a short paper, presented at SIGMOD/PODS’06 in Chicago in June, along with some other resources on major data projects at Google that I wanted to share. Data Management Projects at Google (pdf) covers Google’s Bigtable, Google Base, and Sawzall. It doesn’t go into much depth on any of them, but it’s a nice short overview of three of the data projects underway at Google these days.
A Google Techtalk presentation from May 31, 2006, also provides a nice look at Building Large Systems at Google:
(Link to this at Google Video: Building Large Systems at Google)
A little more on Bigtable:
- Google’s paper, Bigtable: A Distributed Storage System for Structured Data, to be presented in Seattle, Washington, at OSDI’06: Seventh Symposium on Operating Systems Design and Implementation
- A presentation about Bigtable by Jeff Dean at the University of Washington on Google Video, from October 18, 2005.
- Andrew Hitchcock writes about that presentation on Bigtable
If you enjoy the Google Techtalk presentations, a more recent one (published Sept. 20th) comes from the London Test Automation Conference and covers aspects of how Google tests its technologies.
The conference serves as Google’s introduction to the London and Zurich audiences, and aims to share some of what the company is doing in its offices in those locations.