School of Information Sciences Computer Vision and Machine Learning Group

IS517: Methods of Data Science

Fall 2025

Time: Tuesdays, 09:00 AM - 11:50 AM
Location: 46 Grad Sch of Lib & Info Science

[Canvas] [Syllabus]

Main Quad Main Quad, the image on the right is generated by GPT5.

Course Description

A dramatic increase in computing power has enabled new areas of data science to develop in statistical modeling and analysis. These areas cover predictive and descriptive learning and bridge between ideas and theory in statistics, computer science, and artificial intelligence. We will cover methods including predictive learning: estimating models from data to predict future outcomes. Regression topics include linear regression with recent advances using large numbers of variables, smoothing techniques, additive models, and local regression. Classification topics include linear regression, regularization, logistics regression, discriminant analysis, splines, support vector machines, generalized additive models, naive Bayes, mixture models and nearest neighbor methods as time permits. We situate the course components in the “data science lifecycle” as part of the larger set of practices in the discovery and communication of scientific findings.

This course will move rapidly. The course will include computer exercises using Python and other relevant computing languages.

Pre- and Co-requisites

Course Materials

Assignments and Methods of Assessment

Class Project

This is a group project. Ideally, every group should have two students. If this is not possible, please discuss with the instructor.

The project proposal will describe the proposed dataset(s), the original research question(s), and the proposed method of solution. This will likely be the novel application of a regression or classification technique from class. It is at most one page in length.

The final project will carry out the research in the proposal. There will be a project proposal presentation describing the research question(s), dataset(s), method, and possibly the expected results. There will be a final project presentation describing the research question(s), dataset(s), method, and comparing the expected and actual results.

Instructor

Yaoyao Liu
Office location: Room 5125, 614 East Daniel St
Email: lyy+IS517@g.illinois.edu
Office Hours: By appointment

Schedule (subject to revision)

Week Date Topic Readings Materials
Week 1 Aug 26 Syllabus + Data Science Intro [Slides] [Code] [Assignment]
Week 2 Sep 2 Classification ISL Ch. 4, PML Ch. 3 [Slides] [Code] [Assignment]
Week 3 Sep 9 Linear Regression ISL Ch. 3, PML Ch. 10 [Slides] [Code] [Assignment]
Week 4 Sep 16 Resampling ISL Ch. 5, PML Chs. 4 and 6
Week 5 Sep 23 Linear Model Selection & Regularization ISL Ch. 6, PML Chs. 4 and 5
Week 6 Sep 30 Splines / Generalized Additive Models ISL Ch. 7, PML Ch. 3
Week 7 Oct 7 Tree-Based Methods ISL Ch. 8, PML Ch. 3
Week 8 Oct 14 Project Proposal + Slides Due / Project Proposal Presentations
Week 10 Oct 28 Support Vector Machines ISL Ch. 9, PML Ch. 3
Week 11 Nov 4 Unsupervised Learning ISL Ch. 10, PML Ch. 5
Week 12 Nov 11 Deep Learning PML Chs. 12 and 13
Week 13 Nov 18 Final Presentation Slides Due / Final Project Presentations
Week 14 Dec 2 Final Project Presentations
Week 15 Dec 9 Final Project Report Due


School of Information Sciences Computer Vision and Machine Learning Group

614 E. Daniel St. MC-314
Champaign, IL 61820-7999

Phone: (217) 300-0910

Email: vision@ischool.illinois.edu