Applying the Mathematical MASS Model with R

This course focuses on capabilities, models, and datasets available within the MASS package in R. Along the way, you will gain knowledge of estimating survival probabilities and working with hazard rates in multi-state model.
Course info
Level
Intermediate
Updated
Mar 10, 2020
Duration
2h 20m
Table of contents
Course Overview
Performing Qualitative and Quantitative Analysis Using MASS Datasets
Performing Predictive Analytics Using MASS Models
Implementing Multi-state Models in R
Description
Course info
Level
Intermediate
Updated
Mar 10, 2020
Duration
2h 20m
Your 10-day individual free trial includes:

Expert-led courses

Keep up with the pace of change with thousands of expert-led, in-depth courses.
Description

Before machine learning and Python made statistics a subject of MASS popular appeal, an entire generation of applied statisticians learned their craft from the famous textbook named “Modern Applied Statistics with S” by Venables and Ripley. The “S” referred to in the book’s title is the precursor of the R statistical software, which is so popular and effective for statistical analysis. The influence of this seminal work is so strong, that R actually contains a package named MASS, an acronym for the book’s title.

In this course, Applying the Mathematical MASS Model with R, you will gain the ability to use the datasets, predictive models, and specialized functions available in the MASS package in R. 

First, you will learn how the classic t-test can be used in a variety of common scenarios around estimating means and also learn about using ANOVA, a powerful statistical technique used to measure statistical properties across different categories of data. This exploration will involve variants of the t-test such as one-sample and two-sample t-tests, as well as one-way ANOVA, which is used to compare means of a target variable across different groups, based on the value of a single categorical variable.

Next, you will discover about three powerful techniques in data analysis, namely linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and robust regression. LDA and QDA are classification techniques that both seek to re-orient the original data using new, optimized axes such that points belonging to different classes lie as far apart as possible. QDA is preferable to LDA when the x-variables that correspond to different y-variable values have differing covariances. MASS includes support for three powerful robust regression techniques, Huber, Bisquare, and Hampel; each of these is a useful way to fit a regression model even when data is heavily contaminated by outliers.

Finally, you will explore how to model complex systems using multi-state models, which represent the result of a stochastic process as a succession of states. You will understand the differences - and similarities - between transition probabilities and transition intensities, and then apply all of that knowledge to a special class of multi-state models: survival models. Such models find wide applications in medical domains such as modeling outcomes of different treatment regimens, and you will learn how to do so, and also how to model hazard rates and survival probabilities. When you’re finished with this course, you will have the skills and knowledge of several specialized statistical techniques that are featured in the MASS library in R.

About the author
About the author

A problem solver at heart, Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework.

More from the author
Summarizing Data and Deducing Probabilities
Intermediate
2h 50m
Jul 8, 2021
More courses by Janani Ravi
Section Introduction Transcripts
Section Introduction Transcripts

Course Overview
Hi. My name is Janani Ravi, and welcome to this course on applying the mathematical MASS model with R. A little about myself. I have a masters in electrical engineering from Stanford and have worked at companies such as Microsoft, Google, and Flipkart. At Google, I was one of the first engineers working on real‑time collaborative editing in Google Docs, and I hold four patents for its underlying technologies. I currently work on my own startup, Loonycorn, a studio for high‑quality video content. In this course, you will gain the ability to use the datasets, predictive models, and specialized functions available in the MASS package in R. First, you will learn how the classic t‑test can be used in a variety of common scenarios around estimating means and also learn about using ANOVA, a powerful statistical technique used to measure statistical properties across different categories of data. Next, you will discover three powerful techniques in data analysis, namely linear discriminate analysis, LDA, quadratic discriminate analysis, QDA, and robust regression. LDA and QDA are classification techniques that both seek to reorient the original data using new optimized axes such that points belonging to different classes lie as far apart as possible. MASS includes support for robust regression using three different techniques, Huber, Bisquare, and Hampel. Each of these is a useful way to fit a regression model even when data is heavily contaminated by outliers. Finally, you will explore how to model complex systems using multistate models. You'll understand the differences and similarities between transition probabilities and transition intensities and then apply all of that knowledge to a special class of multistate models, survival models. When you're finished with this course, you will have the skills and knowledge of several specialists statistical techniques that are featured in the MASS library in R.