Advanced Deep Learning

Harvard Extension School

CSCI E-104

Section 1

CRN 26435

Deep learning artificial intelligence (AI) models and applications have proliferated and are profoundly affecting almost every aspect of economic, social, and scientific activity. This course equips students with the skills needed to engage in advanced research and development in AI and deep learning. We cover the details of several classes of transformers, which are the basis of large language models (LLMs). We study deep probabilistic models as the foundation of generative techniques (stable diffusion, text-to-speech, and flow models). We study Bayesian models and apply them to the optimization of neural networks and to problems with small datasets. Students learn how to exploit the overlap between dynamical systems, ordinary/partial/stochastic differential equations, and physics-based neural networks. For important classes of neural networks, we explore the fundamental mechanisms behind their operation and provide practical illustrations of their uses. For example, we review the structure of transformer-based pre-trained LLMs, the principles of attention, and their use in applications such as ChatGPT, with a focus on understanding prompt programming and the structure of agentic applications. For generative networks, we examine the generation of realistic representations of people, speech, paintings, and music. For graph neural networks (GNNs), we dive into the analysis of chemical molecules, proteins, and drugs, and into quantitative structure-property relationships in physical systems. We learn how to impose constraints that reflect the physical or geometric laws governing physical systems. Concepts introduced in every lecture are illustrated by practical examples. Code samples in lectures and homework assignments are written in PyTorch and occasionally in Keras 3. Students learn how to scale the training of deep learning models to clusters of two or more graphics processing units (GPUs).

Instructor Info

Zoran B. Djordjević, PhD

Senior Enterprise Architect, Nishava, Inc.


Blagoje Djordjević, PhD

Staff Scientist, Lawrence Livermore National Laboratory


Meeting Info

F 6:30pm - 8:30pm (1/25 - 5/15)

Participation Option: Online Asynchronous or Online Synchronous

In online asynchronous courses, you are not required to attend class at a particular time. Instead, you can complete the coursework on your own schedule each week.

Deadlines

Last day to register:

Additional Time Commitments

Optional sections on Saturdays, 1-2 pm.

Prerequisites

CSCI E-89 or an equivalent introductory deep learning course. Proficiency with Python.

Notes

This course meets via web conference. Students may attend at the scheduled meeting time or watch recorded sessions asynchronously. Recorded sessions are typically available within a few hours of the end of class and no later than the following business day. See minimum technology requirements.

All Sections of this Course

CRN | Section # | Participation Option(s) | Instructor | Section Status | Meets | Term Dates
26435 | 1 | Online Asynchronous, Online Synchronous | Team Taught | Open | F 6:30pm - 8:30pm | Jan 25 to May 15