Large Language Models: Theory and Practice

Course Number

705.651

Next Offered

Fall 2025

Primary Program

Artificial Intelligence

Location

Online

Course Format

Online - Synchronous

An apparently new breed of neural network — the large language model (LLM) — figures increasingly in today’s news: ChatpGPT and Microsoft’s new chatbot-like Bing Chat interface seem to garner headlines on the daily. This course constitutes a thorough introduction to this technology, tracing the historical threads in computational linguistics and language modeling that led to it, and exploring the design patterns that underpin its application in modern AI systems. In between, students will learn about language modeling, the attention mechanism, prompt and instruction tuning, composability, quantization, low-rank adaptation, and the wealth of software and hardware optimizations that enable LLMs to be used at scale and with acceptable latencies.

Course Offerings

Open