
Introduction to Data Science II
Course Code
DATA 11900 20
Cross Listed Course Code(s)
STAT 11900
Course Description
This course is the second of a two-quarter systematic introduction to the foundations of data science, as well as to practical considerations in data analysis.
A broad background on probability and statistical methodology will be provided. More advanced topics on data privacy and ethics, reproducibility in science, data encryption, and basic machine learning will be introduced. We will explore these concepts with real-world problems from different domains.
Course Criteria
Prerequisites: DATA 11800 or consent of instructor
Instructor(s)
Amy Nussbaum
Other Courses to Consider
These courses might also be of interest.
Introduction to Data Science IData science provides tools for gaining insight into specific problems using data, through computation, statistics and visualization. This course introduces students to all aspects of a data analysis process: from posing questions, designing data collection strategies, management+storing and processing of data, exploratory tools to visualization, statistical inference, prediction, interpretation and communication of results. Simple techniques for data analysis are used to illustrate both effective and fallacious uses of data science tools. Although this course is designed to be at the level of mathematical sciences courses in the Core, with little background required, you will develop computational skills that will allow you to analyze data. Computation will be done using Python and Jupyter Notebook.
Remote