What you'll learn

  • Organizing high throughput data

  • Multiple comparison problem

  • Family Wide Error Rates

  • False Discovery Rate

  • Error Rate Control procedures

  • Bonferroni Correction

Course description

In this course, you’ll learn various statistics topics including multiple testing problems, error rates, error rate controlling procedures, false discovery rates, q-values, and exploratory data analysis. We then introduce statistical modeling and how it is applied to high-throughput data. In particular, we will discuss parametric distributions, including binomial, exponential, and gamma, and describe maximum likelihood estimation. We provide several examples of how these concepts are applied in next-generation sequencing and microarray data. Finally, we will discuss hierarchical models and empirical Bayes along with some examples of how these are used in practice. We provide R programming examples in a way that will help make the connection between concepts and implementation.

This class was supported in part by NIH grant R25GM114818.

 

Instructors

Assistant Professor, Departments of Biostatistics and Genetics, UNC Gillings School of Global Public Health

You may also like

Online

A focus on several techniques that are widely used in the analysis of high-dimensional data.

Price
Free*
Duration
4 weeks long
Registration Deadline
Available now
Online

The structure, annotation, normalization, and interpretation of genome scale assays.

Price
Free*
Duration
4 weeks long
Registration Deadline
Available now
Online

Learn probability theory — essential for a data scientist — using a case study on the financial crisis of 2007–2008.

Price
Free*
Duration
8 weeks long
Registration Deadline
Available now