Course Number
705.743
Next Offered
Summer 2024
Location
Online
Course Format
Synchronous Online

Large language models (LLMs) like ChatGPT have ushered in a new wave of virtual assistants, chatbots, and text generators. Many see them as a paradigm shift in how humans interact with machines. Large development ecosystems have arisen around LLMs, often abstracting away how they work to make them accessible to more people. While democratizing this technology is important, LLMs cannot be fully harnessed and improved without a detailed understanding of their inner workings. In this course, students will build a small version of a text generation model like GPT-3 over several weeks. They will learn the details of the GPT architecture from the bottom up, how the architecture came about, and how it is used today in applications like ChatGPT. Once these fundamentals are established, students will build their own research experiment on top of their home-grown language models. Completing this course will prepare students to build and modify language models for further LLM research or novel applications.

Course Offerings

New
Waitlist Only
First Time Online

ChatGPT from Scratch: Building and Training Large Language Models

705.743.8VL
05/29/2024 - 08/21/2024
Wed 4:30 p.m. - 7:35 p.m.
Notes: This course uses the Virtual Live format: all students participate online through live web-conferencing at the scheduled day and time. Students attend live weekly lectures and discussions and can interact extensively with the instructors. All classes are recorded for download and review.
Semester
Summer 2024
Course Format
Synchronous Online
Location
Online
Cost
$5,270.00
Course Materials