Data Analytics with Statistics
Course Overview
Welcome to our course Data Analytics with Statistics! 👋
Note
Note that this schedule will be updated as the seminar progresses.
| Nr. | Topic | Literature | Slides | Links | Code | Activity |
|---|---|---|---|---|---|---|
| 1 | Introduction | |||||
| 2 | Data driven decision making | 📑 | Lecture | |||
| 4 | Data Science Lifecycle Overview | 📑 | Lecture | |||
| 5 | Use case identification | 📑 | Self-study | |||
| 6 | Frame the Problem | 📑 | Self-study | |||
| 7 | Identify Variables | 📑 | Self-study | |||
| 8 | Define Metrics | 📑 | Self-study | |||
| 9 | Data | |||||
| 10 | First Data Analysis | 📚 | 📑 | 💻 | Lecture | |
| 11 | Data basics | 📚 | 📑 | ☑️ | Lecture | |
| 12 | How to obtain data | 📑 | Optional | |||
| 13 | Data wrangling: Pandas lab | 📚 | 💻 | Application Exercise | ||
| 14 | Data analysis: Survey lab | 💻 | Optional | |||
| 15 | Study Design | |||||
| 16 | Population and sample | 📚 | 📑 | ☑️ | Self-study | |
| 17 | Sampling methods | 📚 | 📑 | ☑️ | Self-study | |
| 18 | Experiments | 📚 | 📑 | ☑️ | Self-study | |
| 19 | Observations | 📚 | 📑 | ☑️ | Self-study | |
| 20 | EDA with categorical data | |||||
| 21 | Loans data | 📚 | 📑 | 💻 | Self-study | |
| 22 | Contingency tables | 📚 | 📑 | 💻 | Lecture | |
| 23 | Contingency tables with proportions | 📚 | 📑 | 💻 | Self-study | |
| 24 | Simple bar chart | 📚 | 📑 | 💻 | Lecture | |
| 25 | Stacked bar plot | 📚 | 📑 | 💻 | Lecture | |
| 26 | Standardized bar plot | 📚 | 📑 | 💻 | Lecture | |
| 27 | Pie chart | 📚 | 📑 | 💻 | Self-study | |
| 28 | EDA with numerical data | |||||
| 29 | Scatterplot | 📚 | 📑 | 💻 | Self-study | |
| 30 | Dot plot mean median and mode | 📚 | 📑 | 💻 | Lecture | |
| 31 | Histogram | 📚 | 📑 | 💻 | Lecture | |
| 32 | Box Plot | 📚 | 📑 | 💻 | Lecture | |
| 33 | Comparing numerical data across groups | 📚 | 📑 | 💻 | Lecture | |
| 34 | Variance and standard deviation | 📚 | 📑 | Lecture | ||
| 35 | Kernel density plot | 📚 | 💻 | Self-study | ||
| 36 | Robust statistics and transformations | 📚 | 📑 | Self-study | ||
| 37 | Models | |||||
| 38 | Statistical Learning, Machine Learning | 📑 | Lecture | |||
| 39 | Types of Models | 📑 | Lecture | |||
| 40 | Linear Regression models | |||||
| 41 | Correlation | 📚 | 📑 | ☑️ | 💻 | Lecture |
| 42 | Sales and ads | 📑 | 💻 | Lecture | ||
| 43 | Mean squared error | 📑 | Application exercise | |||
| 44 | Fitting a line and residuals | 📚 | 📑 | ☑️ | 💻 | Self-study |
| 45 | Least squares regression | 📚 | 📑 | 💻 | Self-study | |
| 46 | R squared | 📚 | 📑 | 💻 | Lecture | |
| 47 | Categorical predictors with two levels | 📚 | 📑 | 💻 | Lecture | |
| 48 | Outliers | 📚 | 📑 | Self-study | ||
| 49 | Multiple predictors regression 1 | 📚 | 📑 | ☑️ | 💻 | Lecture |
| 50 | Multiple predictors regression 2 | 💻 | Lecture | |||
| 51 | Multiple predictors regression 3 | 💻 | Lecture | |||
| 52 | Linear Regression with Data Splitting | |||||
| 53 | Regression example happier | 📑 | 💻 | Self-study | ||
| 54 | Main model challenges | 📑 | Self-study | |||
| 55 | Data splitting | 📑 | 💻 | Self-study | ||
| 56 | Sales prediction | 📑 | 💻 | Self-study | ||
| 57 | Sales prediction with data splitting | 💻 | Lecture | |||
| 58 | Advanced Linear Regression models | |||||
| 59 | Regression splines | 📚 | 💻 | Optional | ||
| 60 | Generalized additive models | 📚 | 💻 | Optional | ||
| 61 | Adjusted R squared | 📚 | 📑 | Optional | ||
| 62 | Regression diagnostics | 💻 | Optional | |||
| 63 | Model Selection Methods | |||||
| 64 | Model selection methods | 📚 | 📑 | Optional | ||
| 65 | Implicit model selection | 📚 | 💻 | Optional | ||
| 66 | Lasso regression | 📚 | 💻 | Optional | ||
| 67 | Filter model selection | 📚 | 💻 | Optional | ||
| 68 | Wrapper model selection | 📚 | 💻 | Optional | ||
| 69 | Classification models | |||||
| 70 | Classification | 📑 | Lecture | |||
| 71 | Precision recall and F1 score | 📚 | 📑 | Lecture | ||
| 72 | ROC Curve and AUC | 📚 | 📑 | Lecture | ||
| 73 | Probability of an event | 📚 | 📑 | 💻 | Self-study | |
| 74 | Logistic regression in Python | 💻 | Lecture | |||
| 75 | Statistical Inference | |||||
| 76 | introduction | 📚 | 📑 | 💻 | Optional | |
| 77 | Cases and confidence | 📚 | 📑 | Optional | ||
| 78 | Decision errors | 📚 | 📑 | Optional | ||
| 79 | Single proportion | 📚 | 📑 | Optional | ||
| 80 | Single proportion hypothesis testing | 📚 | 📑 | Optional | ||
| 81 | Two proportions | 📚 | 📑 | Optional | ||
| 82 | Two proportions hypothesis test | 📚 | 📑 | Optional | ||
| 83 | Two proportions mammogram | 📚 | 📑 | Optional | ||
| 84 | Two way tables | 📚 | 📑 | Optional | ||
| 85 | Two means | 📚 | 📑 | Optional | ||
| 86 | Inference regression | 📚 | 📑 | Optional | ||
| 87 | Probability and Bayes | |||||
| 88 | Probability and Bayes | 📺 | 📑 | Self-study | ||
| 92 | Optional content | |||||
| 93 | Data storytelling | Optional | ||||
| 94 | Generative AI | 💻 | Optional |