Fundamentals of Data Science

Harvard Extension School

CSCI E-83

Section 1

CRN 16768

View Course Details
This course builds on CSCI E-101, giving students a solid foundation for advanced data modeling, machine learning, and artificial intelligence (AI). The course focuses on the modern computational statistical methods underpinning advanced data science. In the twenty-first century, these powerful, computationally intensive models are both practical and widely used. Such models enable us to explore and model the complex datasets commonly encountered in the real world. The course employs a combination of theory and hands-on experience using Python programming tools. The focus is on foundational computational statistical algorithms, statistical inference methods, and effective visualization methods. The hands-on component of the course uses the Python packages, NumPy, Pandas, Seaborn, Statsmodels, and PyMC3, along with selected other open-source packages. The focus of this course is on methods to address the exploration, inference, and modeling changes arising from the analysis of increasingly complex datasets. Three approaches to large scale computational statistical inference are addressed: maximum likelihood, modern resampling methods, and Bayesian models. The properties and behavior of the rich family of linear models and Bayesian models, foundational to many statistical, machine learning and AI algorithms are surveyed. Additionally, time series models are explored.

Instructor Info

Stephen Elston, PhD

Principal Consultant, Quantia Analytics LLC


Meeting Info

W 6:00pm - 8:00pm (9/3 - 12/21)

Participation Option: Online Asynchronous or Online Synchronous

In online asynchronous courses, you are not required to attend class at a particular time. Instead you can complete the course work on your own schedule each week.

Deadlines

Last day to register: August 29, 2024

Additional Time Commitments

Optional sections Thursdays, 6-9 pm.

Prerequisites

Some exposure to basic machine learning and data science methods, equivalent to CSCI E-101. Experience programming using the Python language, equivalent to CSCI E-7 or CSCI E-29. For people with limited Python programming experience, some experience programming, in any language, such as R, Matlab, or C++, is essential. Knowledge of linear algebra, including eigenvalue-eigenvector decomposition and a bit of differential and integral calculus is essential.

Notes

This course meets via web conference. Students may attend at the scheduled meeting time or watch recorded sessions asynchronously. Recorded sessions are typically available within a few hours of the end of class and no later than the following business day.

All Sections of this Course

CRN Section # Participation Option(s) Instructor Section Status Meets Term Dates
16768 1 Online Asynchronous, Online Synchronous Stephen Elston Open W 6:00pm - 8:00pm
Sep 3 to Dec 21