Time: Tuesdays, 09:00 AM - 11:50 AM
Location: 46 Grad Sch of Lib & Info Science
[Canvas] [Syllabus]
Main Quad, the image on the right is generated by GPT5.
A dramatic increase in computing power has enabled new areas of data science to develop in statistical modeling and analysis. These areas cover predictive and descriptive learning and bridge between ideas and theory in statistics, computer science, and artificial intelligence. We will cover methods including predictive learning: estimating models from data to predict future outcomes. Regression topics include linear regression with recent advances using large numbers of variables, smoothing techniques, additive models, and local regression. Classification topics include linear regression, regularization, logistics regression, discriminant analysis, splines, support vector machines, generalized additive models, naive Bayes, mixture models and nearest neighbor methods as time permits. We situate the course components in the “data science lifecycle” as part of the larger set of practices in the discovery and communication of scientific findings.
This course will move rapidly. The course will include computer exercises using Python and other relevant computing languages.
This is a group project. Ideally, every group should have two students. If this is not possible, please discuss with the instructor.
The project proposal will describe the proposed dataset(s), the original research question(s), and the proposed method of solution. This will likely be the novel application of a regression or classification technique from class. It is at most one page in length.
The final project will carry out the research in the proposal. There will be a project proposal presentation describing the research question(s), dataset(s), method, and possibly the expected results. There will be a final project presentation describing the research question(s), dataset(s), method, and comparing the expected and actual results.
Yaoyao Liu
Office location: Room 5125, 614 East Daniel St
Email: lyy+IS517@g.illinois.edu
Office Hours: By appointment
Week | Date | Topic | Readings | Materials |
---|---|---|---|---|
Week 1 | Aug 26 | Syllabus + Data Science Intro | [Slides] [Code] [Assignment] | |
Week 2 | Sep 2 | Classification | ISL Ch. 4, PML Ch. 3 | [Slides] [Code] [Assignment] |
Week 3 | Sep 9 | Linear Regression | ISL Ch. 3, PML Ch. 10 | [Slides] [Code] [Assignment] |
Week 4 | Sep 16 | Resampling | ISL Ch. 5, PML Chs. 4 and 6 | |
Week 5 | Sep 23 | Linear Model Selection & Regularization | ISL Ch. 6, PML Chs. 4 and 5 | |
Week 6 | Sep 30 | Splines / Generalized Additive Models | ISL Ch. 7, PML Ch. 3 | |
Week 7 | Oct 7 | Tree-Based Methods | ISL Ch. 8, PML Ch. 3 | |
Week 8 | Oct 14 | Project Proposal + Slides Due / Project Proposal Presentations | ||
Week 10 | Oct 28 | Support Vector Machines | ISL Ch. 9, PML Ch. 3 | |
Week 11 | Nov 4 | Unsupervised Learning | ISL Ch. 10, PML Ch. 5 | |
Week 12 | Nov 11 | Deep Learning | PML Chs. 12 and 13 | |
Week 13 | Nov 18 | Final Presentation Slides Due / Final Project Presentations | ||
Week 14 | Dec 2 | Final Project Presentations | ||
Week 15 | Dec 9 | Final Project Report Due |
614 E. Daniel St. MC-314
Champaign, IL 61820-7999
Phone: (217) 300-0910
Email: vision@ischool.illinois.edu