Course Number
605.741
Next Offered
Summer 2024
Primary Program
Location
Online
Course Format
Online

This course investigates the theory and practice of modern large-scale database systems. Large-scale approaches include distributed relational databases; data warehouses; and non-relational databases including HDFS, Hadoop, Accumulo for query and graph algorithms, and Mahout bound to Spark for machine learning algorithms. Topics discussed include data design and architecture; database security, integrity, query processing, query optimization, transaction management, concurrency control, and fault tolerance; and query formulation, graph algorithms, and machine learning algorithms on large-scale distributed data systems. At the end of the course, students will understand the principles of several common large-scale data systems including their architectures, performance, and costs. Students will also gain a sense of which approach is recommended for different requirements and circumstances.

Course Prerequisite(s)

EN.605.202 Data Structures; EN.605.641 Principles of Database Systems or equivalent. Familiarity with “big-O” concepts and notation is recommended.

Course Offerings

Waitlist Only

Large-Scale Database Systems

605.741.81
05/28/2024 - 08/22/2024
Semester
Summer 2024
Course Format
Online
Location
Online
Cost
$5,270.00
Course Materials
Open

Large-Scale Database Systems

605.741.82
05/28/2024 - 08/22/2024
Semester
Summer 2024
Course Format
Online
Location
Online
Cost
$5,270.00
Course Materials
Open

Large-Scale Database Systems

605.741.83
05/28/2024 - 08/22/2024
Semester
Summer 2024
Course Format
Online
Location
Online
Cost
$5,270.00
Course Materials