This course investigates the architecture, design, and implementation of massive-scale data systems. The course discusses foundational concepts of distributed database theory including design and architecture, security, integrity, query processing and optimization, transaction management, concurrency control, and fault tolerance. It then applies these concepts to both large-scale data warehouse and cloud computing systems. Cloud computing topics include MapReduce, massive-scale cloud databases, and cloud analytics.

Course prerequisites: 

605.441 Principles of Data Base Systems or equivalent. Familiarity with "big-O" concepts and notation is recommended.

Course notes: 

Formerly 605.741 Distributed Database Systems: Cloud Computing and Data Warehouses.

Course instructor: 

View Course Homepage(s) for this course.