This course investigates the architecture, design, and implementation of massive-scale data systems. The course discusses foundational concepts of distributed database theory including design and architecture, security, integrity, query processing and optimization, transaction management, concurrency control, and fault tolerance. It then applies these concepts to both large-scale data warehouse and cloud computing systems. Cloud computing topics include MapReduce, massive-scale cloud databases, and cloud analytics.
605.441 Principles of Data Base Systems or equivalent. Familiarity with "big-O" concepts and notation is recommended.
Formerly 605.741 Distributed Database Systems: Cloud Computing and Data Warehouses.