Sanjay Ghemawat Explained
Sanjay Ghemawat |
Birth Place: | West Lafayette, Indiana, U.S. |
Workplaces: | |
Education: | |
Thesis Title: | The Modified Object Buffer: A Storage Management Technique for Object-Oriented Databases |
Thesis Url: | http://publications.csail.mit.edu/lcs/pubs/pdf/MIT-LCS-TR-666.pdf |
Thesis Year: | 1995 |
Doctoral Advisors: | |
Awards: | |
Sanjay Ghemawat (born 1966 in West Lafayette, Indiana)[1] is an Indian American[2] computer scientist and software engineer. He is currently a Senior Fellow at Google in the Systems Infrastructure Group.[3] [4] Ghemawat's work at Google, much of it in close collaboration with Jeff Dean,[5] has included big data processing model MapReduce, the Google File System, and databases Bigtable and Spanner. Wired have described him as one of the "most important software engineers of the internet age".
Ghemawat was elected as a member into the National Academy of Engineering in 2009 for contributions to the science and engineering of large-scale distributed computer systems.
Education and early career
Sanjay grew up in Kota, Rajasthan, India[6] . He studied at Cornell University and the Massachusetts Institute of Technology (MIT). He obtained a PhD from MIT in 1995, with a dissertation titled, The Modified Object Buffer: A Storage Management Technique for Object-Oriented Databases. His advisors were Barbara Liskov and Frans Kaashoek.[7]
Before joining Google, Ghemawat worked at the DEC Systems Research Center. There he began his long-time collaboration with Jeff Dean, who worked at another DEC research lab nearby. Their work at DEC included a Java compiler and a system profiling tool.
Career at Google
After DEC was acquired by Compaq, many of its researchers left the company. Dean took a position at the newly founded search-engine company Google, and was joined by Ghemawat in 1999. The two began working on Google's core infrastructure, making improvements to cope with the search engine's rapid growth in users in the early 2000s.
Ghemawat's work at Google includes:
- Original design of Protocol Buffers, an open-source data interchange format.
- MapReduce, a system for large-scale data processing applications.
- Google File System, is a proprietary distributed file system developed to provide efficient, reliable access to data using large clusters of commodity hardware.
- Spanner, a scalable, multi-version, globally distributed, and synchronously replicated database.
- Bigtable, a large-scale semi-structured storage system.
- LevelDB, an open-source on-disk key-value store.
- TensorFlow, an open-source machine-learning software library.
- Service Weaver, an open-source framework for writing distributed applications.
Awards and honors
Ghemawat was elected to the National Academy of Engineering in 2009, and to the American Academy of Arts and Sciences in 2016.[8] In 2012, he and Dean received the ACM Prize in Computing for their work on internet infrastructure, and the ACM SIGOPS Mark Weiser Award.[9]
Selected publications
- Book: Ghemawat. Sanjay. Gobioff. Howard. Leung. Shun-Tak. Proceedings of the nineteenth ACM symposium on Operating systems principles . The Google file system . 2003. SOSP '03. New York, NY, USA. ACM. 29–43. 10.1145/945445.945450. 1581137575. 221261373.
- Dean. Jeffrey. Ghemawat. Sanjay. January 2008. MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM. 51. 1. 107–113. 10.1145/1327452.1327492. 221021701. 0001-0782. free.
- Chang. Fay. Dean. Jeffrey. Ghemawat. Sanjay. Hsieh. Wilson C.. Wallach. Deborah A.. Burrows. Mike. Chandra. Tushar. Fikes. Andrew. Gruber. Robert E.. June 2008. Bigtable: A Distributed Storage System for Structured Data. ACM Trans. Comput. Syst.. 26. 2. 4:1–4:26. 10.1145/1365815.1365816. 214769387. 0734-2071.
- Dean. Jeffrey. Ghemawat. Sanjay. January 2010. MapReduce: A Flexible Data Processing Tool. Commun. ACM. 53. 1. 72–77. 10.1145/1629175.1629198. 0001-0782. free.
- Corbett. James C.. Dean. Jeffrey. Epstein. Michael. Fikes. Andrew. Frost. Christopher. Furman. J. J.. Ghemawat. Sanjay. Gubarev. Andrey. Heiser. Christopher. August 2013. Spanner: Google's Globally Distributed Database. ACM Trans. Comput. Syst.. 31. 3. 8:1–8:22. 10.1145/2518037.2491245. 0734-2071. free.
- Abadi. Martín. Agarwal. Ashish. Barham. Paul. Brevdo. Eugene. Chen. Zhifeng. Citro. Craig. Corrado. Greg S.. Davis. Andy. Dean. Jeffrey. 2016-03-14. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. 1603.04467. cs.DC.
Notes and References
- News: The Friendship That Made Google Huge. 2018-12-10. New Yorker. 2018-12-10.
- News: Google's Sanjay Ghemawat Co-Winner of Computer Award. 2013-04-09. India West. 2017-12-16. en.
- Web site: Sanjay Ghemawat – ACM Prize in Computing. Award Winners. Association for Computing Machinery. en. 2017-12-16.
- Web site: ACM And Infosys Foundation Honor Google Developers For Innovations That Transformed Internet-Scale Computing. Infosys. en-us. 2017-12-16.
- If Xerox PARC Invented the PC, Google Invented the Internet. Metz. Cade. 2012-08-08. WIRED. 2017-12-16. https://web.archive.org/web/20171207142049/https://www.wired.com/2012/08/google-as-xerox-parc/. 2017-12-07. live. en-US.
- News: Somers . James . 2018-12-03 . The Friendship That Made Google Huge . 2024-08-21 . The New Yorker . en-US . 0028-792X.
- Web site: Sanjay Ghemawat. The Mathematics Genealogy Project. Department of Mathematics, North Dakota State University. 2017-12-16.
- Web site: Membership. American Academy of Arts and Sciences. 2017-12-16.
- Web site: The Mark Weiser Award. ACM SIGOPS. 5 July 2019.