An apparently new breed of neural network, the large language model (LLM), figures increasingly in today’s news: ChatGPT and Microsoft’s new chatbot-like Bing Chat interface seem to garner headlines daily. This course offers a thorough introduction to this technology, tracing the historical threads in computational linguistics and language modeling that led to it, and exploring the design patterns that underpin its application in modern AI systems. Along the way, students will learn about language modeling, the attention mechanism, prompt and instruction tuning, composability, quantization, low-rank adaptation, and the wealth of software and hardware optimizations that enable LLMs to be used at scale and with acceptable latencies.
Course Offerings
No sections are currently offered; however, you can view a sample syllabus from a prior section of this course.