College of Chemistry Course Guide

STAT 140 - Probability for Data Science (4 Units)

(Taken from the UC Berkeley Course Guide)

Course Overview

Summary

An introduction to probability, emphasizing the combined use of mathematics and programming to solve problems. Random variables, discrete and continuous families of distributions. Bounds and approximations. Dependence, conditioning, Bayes methods. Convergence, Markov chains. Least squares prediction. Random permutations, symmetry, order statistics. Use of numerical computation, graphics, simulation, and computer algebra.

Prerequisites

STAT/CS/Information C8, or STAT/CS C100, or both STAT 20 and CS 61A; one year of calculs at the level of MATH 1A-MATH 1B or higher. Corequisite: MATH 54, EE 16A, STAT 89A, MATH 110 or equivalent linear algebra.

Students who have earned credit for STAT 134 will not receive credit for STAT 140.

Topics Covered

The emphasis on simulation and the bootstrap in Data 8 gives students a concrete sense of randomness and sampling variability. Stat 140 will capitalize on this, abstraction and computation complementing each other throughout.

The syllabus has been designed to maintain a mathematical level at least equal to that in Stat 134. So Stat 140 will start faster than Stat 134 (due to the Data 8 prerequisite), avoid approximations that are unnecessary when SciPy is at hand, and replace some of the routine calculus by symbolic math done in SymPy. This will create time for a unit on the convergence and reversibility of Markov Chains as well as added focus on conditioning and Bayes methods.

With about a thousand students a year taking Foundations of Data Science (Stat/CS/Info C8, a.k.a. Data 8), there is considerable demand for follow-on courses that build on the skills acquired in that class. Stat 140 is a probability course for Data 8 graduates who have also had a year of calculus and wish to go deeper into data science.

Understand the difference between math and simulation, and appreciate the power of both; Use a variety of approaches to problem solving; Work with probability concepts algebraically, numerically, and graphically

Workload

Time Commitment

3 hours of lecture, 2 hours of discussion, and 1 hour of supplement per week.




UC Berkeley Course Guide