Foundations of Large Language Models
Harvard Extension School
CSCI E-222
Section 1
CRN 27040
Students are introduced to foundational concepts and advanced techniques in large language models (LLMs) and to the pivotal role LLMs play in natural language processing (NLP). The course covers a range of topics, including the transformer architecture, and examines key models such as the Generative Pre-trained Transformer (GPT), Bidirectional Encoder Representations from Transformers (BERT), and the Text-to-Text Transfer Transformer (T5). Students develop practical skills in applying these models to tasks such as text generation, language translation, sentiment analysis, and building chatbots and conversational agents. Methods for working with LLMs include tokenization, dependency parsing, embedding generation, and fine-tuning of pre-trained models for specialized applications, as well as prompt engineering, data augmentation, and model evaluation to enhance performance. The curriculum incorporates machine learning and deep learning techniques relevant to LLMs, including attention mechanisms and neural network optimization. Students gain hands-on experience conducting experiments on platforms like Google Colab and implementing projects with libraries such as Hugging Face Transformers in Python.
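For illustration, the following is a minimal sketch of the kind of hands-on exercise the description refers to, using the Hugging Face Transformers library in Python. The model name ("gpt2"), the prompt, and the generation settings are illustrative assumptions, not course materials.

# A minimal sketch of tokenization and text generation with Hugging Face
# Transformers; the model ("gpt2") and prompt are illustrative assumptions.
from transformers import AutoTokenizer, pipeline

# Tokenization: map raw text to the integer token IDs the model consumes.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
print(tokenizer("Large language models are").input_ids)

# Text generation: a pre-trained causal language model continues a prompt.
generator = pipeline("text-generation", model="gpt2")
output = generator("Large language models are", max_new_tokens=30)
print(output[0]["generated_text"])

The same pipeline interface extends to other tasks named in the description, such as sentiment analysis ("sentiment-analysis") and translation, by swapping the task string and model.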
Credits: 4
Term
Spring Term 2026
Part of Term
Full Term
Format
Flexible Attendance Web Conference
Credit Status
Graduate, Noncredit, Undergraduate
Section Status
Open