Prob 140 (formally Statistics 140 or STAT 140) is a probability course for Data 8 graduates who have a math background and wish to go deeper into the theory of data science.
The emphasis on simulation and the bootstrap in Data 8 gives students a concrete sense of randomness and sampling variability. Prob 140 will capitalize on this. Because of the students’ backgrounds, Prob 140 will move swiftly over basics, avoid approximations that are unnecessary when SciPy is at hand, and replace some of the routine calculus by symbolic math done in SymPy. This will create time to focus on the more demanding concepts that are part of the theoretical foundations of data science.
Prob 140 contents have been selected based on consultation with faculty who regularly teach Stat and CS courses in advanced statistical topics including machine learning. Specifically, Prob 140 is intended to provide good preparation for CS 189 and Stat 154. The class requires more mathematics than Stat 134, the Statistics department’s standard course in classical probability theory; in addition to calculus, Prob 140 will require linear algebra (see details below) starting in Spring 2018. Computational power in Prob 140 allows students to solve problems that are intractable by other methods. Students also explore the standard mathematical theory graphically and by simulation, and thus develop a more firm grasp of the concepts than they might by using math alone.
Prob 140 satisfies the probability requirement for the proposed new Data Science major. The FAQ page of the Data Science Education Program website has this to say:
Should I be interested in STAT 140?
For the Statistics major and minor, and for Statistics courses numbered 135 and above, Prob 140 satisfies the same requirements as Stat 134 does. If a Statistics course currently requires Stat 134, then Prob 140 will fulfill that requirement too. A letter grade of B- or better in Prob 140 will satisfy the corresponding Stat 134 grade requirement for entry into the Statistics major.
Some students have successfully petitioned to have Prob 140 satisfy requirements in other departments. I expect that over the next year, Prob 140 will appear formally on lists of courses that satisfy upper division requirements in fields other than Statistics and Data Science.
Prob 140 can be taken by undergraduates who:
These alternatives are consistent with the Data 8 “grandfathering” alternatives proposed for the new Data Science major during the upcoming transition period. Students who satisfy the Data 8 requirement in these ways will be responsible for studying the necessary Data 8 material on their own; it will not be retaught in Prob 140.
All of these requirements will be enforced. I realize that many students have taken more advanced computing or prob/stat courses than required, but not the ones required. However, they are not the intended audience of Prob 140.
Prob 140 is an upper division course that is primarily mathematical. Students are expected to have a corresponding degree of independence, discipline, and mathematical maturity.
In every class I teach, I aim to make learning as efficient and enjoyable as possible. In particular, I try to make the lectures a great way to learn the material, so show up (there will be no videos) and ask lots of questions. A study guide is provided each week; it lists the reading and practice that students should do before attempting the week’s assignments. These lists aren’t just decoration. They are designed to be helpful. Homework will go faster if you’ve done the reading and some practice first.
But rather than reading more advice from me, listen to your fellow students instead. Here are quotes from student evaluations of the pilot course. They are from answers to the question, “What advice would you give to another student who is considering taking this course?”
I hope so.