Foundations of Data Science and Engineering

Harvard Summer School

CSCI S-101

Section 1

CRN 35160

Begin Registration
Most data scientists spend 20 percent of their time building data models and analyzing model results. What do they do with the remaining 80 percent of their time? The answer is data engineering. Data engineering is a subdiscipline of software engineering that focuses on the transportation, transformation, and management of data. This course takes a comprehensive approach to explore data science, which includes data engineering concepts and techniques. Key topics include data management and transformation, exploratory data analysis and visualization, statistical thinking and machine learning, natural language processing, and storytelling with data, emphasizing the integration of Python, MySQL, Tableau, development, and big data analytics platforms. Students cannot earn Harvard Extension School degree credit for CSCI S-101 if it is taken after CSCI E-29.

Instructor Info

Bruce Huang, EdD

Director of Master's Degree Program in Information Technology, Harvard Extension School


Meeting Info

6/22 to 8/7

Participation Option: Online Asynchronous

In online asynchronous courses, you are not required to attend class at a particular time. Instead you can complete the course work on your own schedule each week.

Deadlines

Last day to register: June 16, 2025

Prerequisites

CSCI S-7, CSCI S-50, or the equivalent.

Notes

Not open to Secondary School Program students.

All Sections of this Course

CRN Section # Participation Option(s) Instructor Section Status Meets Term Dates
35160 1 Online Asynchronous Bruce Huang Open Jun 22 to Aug 7
26190 1 Online Asynchronous Bruce Huang Open Jan 26 to May 16