KOUT 2095 - Data Science with R

This workshop is about accessing, manipulating and visualizing data with R and RStudio. You will also learn about a larger, looser set of skills and tools to work with data in general. You will be using your computer a lot throughout the course, in order to learn the R programming language and its RStudio interface. We will review some essential statistical concepts along the way, building up from exploratory data analysis to statistical modeling

Learning Outcomes
1. Proficiency in exploratory data analysis
2. Knowledge of statistical inference and modeling
3. Knowledge of the R programming language
4. Knowledge of the RStudio software
5. Exposition to current data science trends

Professional Skills
Quantitative methods, R and RStudio software, familiarity with data science.
François BRIATTE
English
- Attendance: 2 hours a week / 24 hours a semester
- Online learning activities: 12 hours a week / 24 hours a semester
- Reading and Preparation for Class: 1 hour a week / 12 hours a semester
- Research and Preparation for Group Work: 2 hours a week / 24 hours a semester
- Research and Writing for Individual Assessments: 1 hour a week / 12 hours a semester
(1) A laptop running a recent version of Windows, MacOS or Linux, with full admin privileges, in order to support the installation of the latest versions of R (r-project.org) and RStudio Desktop (posit.co/products/open-source/rstudio). (2) Minimal computing skills, e.g. unzipping files, using keyboard shortcuts, connecting to the Internet. (3) Some prior exposure to introductory statistics, e.g. descriptive statistics and association tests.
Autumn and Spring 2024-2025
Individual and group exercises, to be completed in between workshop sessions.
Feedback will be provided in class as answers to the exercises.
Irizarry, R. 2023. Introduction to Data Science. https://rafalab.dfci.harvard.edu/dsbook-part-1/
Healy, K. 2019. Data Visualization. A Practical Introduction. https://socviz.co/
Rodrigues, B. 2022. Modern R with the tidyverse. https://modern-rstats.eu/
Wickham, H. and Grolemund, G. 2022. R for Data Science. 2nd ed. https://r4ds.hadley.nz/
Imai, K. 2018. Quantitative Social Science. An Introduction. https://press.princeton.edu/books/hardcover/9780691167039/quantitative-social-science
Li, Q. 2021. Using R for Data Analysis in Social Sciences. A Research Project-oriented Approach. https://academic.oup.com/book/27134
Llaudet, E. and Imai, K. 2022. Data Analysis for Social Science: A Friendly and Practical Introduction. https://press.princeton.edu/books/paperback/9780691199436/data-analysis-for-social-science
Kennedy, P.D. and Waggoner, R. 2021. Introduction to R for Social Scientists. A Tidy Programming Approach. https://i2rss.weebly.com