4. Populations and Samples

In this session we will begin thinking about statistical inference. Loosely, this describes the process of using a sample of data to make statements about a wider population. There is more than one framework for statistical inference. The traditional and most widely used approach is termed the “classical” or frequentist, and this is the one pursued in this and the next few sessions. An important alternative, the Bayesian approach, is increasingly influential. You will meet Bayesian inference later in this module.

Intended learning outcomes

By the end of this session you will be able to:

  • describe the process of frequentist statistical inference

  • define the terms population, sample, estimator and estimate

  • explain what a sampling distribution is and how it relates to the idea of repeated sampling

  • understand that sample estimates will vary as defined by the standard error

  • appreciate that the uncertainty in estimates can be described using central limit theorem and resampling (bootstrapping)

The five sub-sections in this session explore the concept of sampling from a population and define parameters and estimators. They formally define the concept of a statistical model. Sampling distributions are described, and the standard error defined, along with key ways of obtaining the approximate sampling distribution.