SoCal Data Science Bootcamp

Summer 2023 Schedule

Before the Bootcamp

  • Sign up for GitHub Sign up for a free GitHub account if you don’t already have one. Make sure to pick a username that you feel comfortable sharing on your CV. You will most likely use your GitHub username for many years in your data science career. Fill out this survey to let us know what your username is. Many of you have already done this.

  • Install R Yes, you do need to download and install R even if you have downloaded before. There is a newer version.

  • Install RStudio Yes, you do need to download and install RStudio even if you have downloaded before. There is a newer version. Download the free Desktop version.

  • Install and setup git

  • Test your installations

Day 1 - Introduction to the Toolkit

Topic Materials
Meet & greet
Introduction to Toolkit
Exploratory Data Analysis (EDA) Activity day01-eda-activity-username repo on GitHub

Day 2 - Linear Regression and Model Evaluation

Topic Materials
Simple Linear Regression
Regression Activity day02-regression-activity-username repo on GitHub

Day 3 - Categorical Data Analysis

Topic Materials
Categorical Data Analysis
Categorical Data Analysis Activity day03-cda-activity-username repo on GitHub

Day 4 - Generalized Linear Models

Topic Material
Logistics Regression
Logistics Regression Activity day04-logistic-activity-username repo on GitHub
Good Workflow Practices
Learn to Collaborate on GitHub Activity day04-collaborate-teamnumber repo on GitHub

Day 5 - Hierarchical Linear Models

Topic Material
Hierarchical Linear Models without Predictors
Hierarchical Linear Models Notes day05-hlm-username repo on GitHub
Hierarchical Linear Models with Predictors