This course covers the fundamentals of statistical exploration and visualization of data. It is a second course in statistics and will demand greater depth of understanding than 95-796 or 90-711. In this course, we will fit models and produce specialized graphs to explore data in a detailed and statistics-oriented manner. This course also serves as an introduction to R, an open source programming language for statistics and graphics that is widely used within corporations and academia.
“Visualization is critical to data analysis. It provides a front line of attack, revealing intricate structure in data that cannot be absorbed in any other way. We discover unimagined effects, and we challenge imagined ones.” – William Cleveland
Either 90-711 or 95-796 must be taken as a prerequisite; otherwise, consent of the instructor is required.
- Learn and use R to perform basic data manipulation such as filtering, aggregating, and organizing data sets
- Produce graphs in R
- Understand how transformations, model fits, and residuals can be used to explore and check statistical assumptions about data