Introduction to Statistics

Level: Foundation

This hands-on introduction to statistics for data science gives you the tools you need to make sense of data and draw *valid* conclusions. The course focuses on statistical thinking: concepts are introduced intuitively before being developed formally. You will learn to think in terms of distributions rather than single point estimates. Statistical tools are introduced in the context of how to use them to gain insight and solve problems. You will also learn to use the powerful, industry-standard R environment to do the number-crunching.

Key Features of Introduction to Statistics:

  • Learning Tree end-of-course exam included
  • After-course computing sandbox included
  • After-course instructor coaching included

You Will Learn How To:

  • Visualise data
  • Draw conclusions about the features and quality of data sets
  • Summarise your data
  • Determine correlation
  • Think of numbers as distributions
  • Understand sampling and its importance in statistical inference
  • Use the power of computers to generate distributions for any problem
  • Calculate confidence intervals and p-values
  • Make valid statistical inferences using a range of hypothesis tests
  • Critique statistical analyses
  • Design and execute your own statistical projects

Choose the Training Solution That Best Fits Your Individual Needs or Organisational Goals


In Class & Live, Online Training

  • 2-day instructor-led training course
  • After-course computing sandbox included
  • After-course instructor coaching included
  • Tuition fee can be paid later by invoice or at the time of checkout by credit card

Standard £995




Team Training

  • Bring this or any training to your organisation
  • Full-scale program development
  • Delivered when, where, and how you want it
  • Blended learning models
  • Tailored content
  • Expert team coaching



Save More on Training with Learning Tree Training Vouchers!

Our flexible, easy-to-redeem training vouchers are available to any employee within your organisation. For details, please call 0800 282 353 or chat live.

In Class & Live, Online Training

Note: This course runs for 2 Days *

*Events with the Partial Day Event clock icon run longer than normal but provide the convenience of half-day sessions.

  • 24 - 25 Nov 9:00 AM - 4:30 PM GMT Online (AnyWare)

  • 10 - 11 Mar 9:00 AM - 4:30 PM GMT Online (AnyWare)

  • 2 - 3 Jun 9:00 AM - 4:30 PM BST Online (AnyWare)

  • 18 - 19 Aug 9:00 AM - 4:30 PM BST Online (AnyWare)

  • 14 - 15 Dec 2:00 PM - 9:30 PM GMT Online (AnyWare)

  • 11 - 12 Jan 2:00 PM - 9:30 PM GMT New York / Online (AnyWare)

  • 15 - 16 Mar 1:00 PM - 8:30 PM GMT Herndon, VA / Online (AnyWare)

  • 12 - 13 Apr 2:00 PM - 9:30 PM BST New York / Online (AnyWare)

  • 14 - 15 Jun 2:00 PM - 9:30 PM BST Herndon, VA / Online (AnyWare)

  • 12 - 13 Jul 2:00 PM - 9:30 PM BST New York / Online (AnyWare)

  • 13 - 14 Sep 2:00 PM - 9:30 PM BST Herndon, VA / Online (AnyWare)

Guaranteed to Run

When you see the "Guaranteed to Run" icon next to a course event, you can rest assured that the event will run on the stated date and at the stated time. Guaranteed.

Partial Day Event

Learning Tree offers a flexible schedule program. If you cannot attend full-day sessions, this option offers four-hour sessions each day instead.

Important Statistics Course Information

  • Requirements

    There are no formal prerequisites for attending this course.

  • Who Should Attend This Course

    Anyone who is required to draw conclusions from numbers.

    No prior knowledge of statistics or software packages is required. An inquisitive nature and an interest in using numbers to solve problems are essential.

Statistics Course Outline

  • Introduction and Overview

    • Course philosophy
    • Software
    • Contents
  • What is Statistics?

    • Definition
    • Types of statistician
    • Variability
    • Probability
    • Let the die roll!
    • Die roll outcomes
    • Why is knowledge of statistics important?
    • Descriptive vs inferential statistics
    • Inferring population parameters
    • Quantitative data
    • Qualitative data
    • R statistical software
    • RStudio
    • Interactive exercise manual demo
  • Exploratory Data Analysis

    • What is exploratory data analysis (EDA)?
    • Histograms and bar charts
    • Bar chart vs histogram
    • Central tendency and spread
    • Bin width is crucial
    • Right-skewed data
    • Outliers
    • Left-skewed data
    • Bimodal data
    • Separate subpopulations for analysis
    • Individual value plot
    • Subpopulation individual value plots
    • Benefits of boxplots
    • Boxplot
    • Boxplot vs histogram
    • Left-skewed boxplot
    • Compare subpopulations using boxplots
    • Swedish salaries by level of education
    • Measures of central tendency
    • Mean vs median
    • Mean vs median for skewed data
    • Mode
    • Measures of spread
    • Range and IQR
    • Standard deviation
    • Six figure summary
    • Central tendency and spread equations
    • Quantiles
    • Benefits of scatterplots
    • Scatterplot
    • Highlighting subgroups on scatterplot
    • What is correlation?
    • Correlation examples
    • Random data correlation
    • Literacy rate correlation
    • Number of children per woman correlation
    • Interpreting correlation coefficients
    • Correlation doesn’t imply causation
    • Causation doesn’t imply (linear) correlation
  • Probability Distributions

    • Numbers are mostly reckless estimates
    • Random variables
    • Male life expectancy in UK distribution
    • What’s the probability that a US man is 6’ or more?
    • What is a probability distribution?
    • Populations vs samples
    • Sampling the heights of 10 random American men
    • Sampling the heights of 100,000 random American men
    • Discrete probability distributions
    • Roll two dice and histogram the results
    • Poisson distribution
    • Binary probability distributions
    • Probability distribution for cars/household in the UK
    • Binomial distribution
    • Geometric distribution
    • Negative Binomial distribution
    • Continuous probability distributions
    • Uniform distribution
    • Triangular distribution
    • Normal distribution
    • Properties of the normal distribution
    • Distribution of IQ scores
    • Different means (same standard deviation)
    • Different standard deviations (same mean)
    • z-distribution
    • 68–95–99.7 (empirical) rule
    • Quantile-Quantile (Q-Q) plot
    • Q-Q plot of non-normal data
    • Common probability distributions “family tree”
  • Sampling

    • Samples are proxies for the population of interest
    • Unfortunately, samples vary
    • Larger samples exhibit less variation
    • Statistics vs parameters
    • Distributions involved in statistical inference
    • Sampling distribution of mean IQ
    • Collecting more IQ samples
    • Sampling distribution of mean die roll
    • Sampling distribution of mean project duration
    • Create a sampling distribution
    • Central limit theorem
    • Implications of the central limit theorem
    • Standard error of the mean (SEM)
    • Impact of sample size on SEM
    • What is a confidence interval?
    • 95% confidence interval
    • Bigger samples give greater precision
    • Smaller confidence levels result in tighter intervals
    • How should we interpret the confidence interval?
    • Random sampling
    • Simple random sampling
    • Stratified sampling
    • Cluster sampling
    • What is bootstrapping?
    • Estimating median life expectancy
  • Statistical Inference

    • What is statistical inference?
    • Why must we use samples?
    • Why do we need to conduct hypothesis tests?
    • What is hypothesis testing?
    • Null hypothesis
    • Alternative hypothesis
    • Rejecting the null hypothesis
    • One- vs two-tailed hypothesis tests
    • Choosing between one- and two-tailed tests
    • What are p-values?
    • Significance level (α)
    • Types of errors
    • Confidence levels vs significance levels
    • Performing hypothesis tests
    • p-value controversy
    • When to use a t-test
    • t-value
    • t-distribution
    • t-distributions
    • Slot machine observed “Return to Player”
    • Are slot machine payouts within tolerance?
    • Perform a t-test on RTP data using R
    • Two-sample t-test
    • When to use a z-test
    • Conducting hypothesis tests using z-scores
    • When to use a χ² test
    • Education and Brexit vote
    • Brexit vote breakdown
    • χ² value
    • χ² distributions
    • Are education and Brexit vote related?
    • When to use an F-test
    • Conducting hypothesis tests using F-values
    • F-distributions
    • Height distribution by sex
    • Does height variation differ by sex?
    • When to use analysis of variance (ANOVA)
    • Determining the F-value
    • Are all diets the same?
    • All diets are apparently not the same
    • Normality hypothesis tests
    • Statistically significant treatments?
    • What is statistical power?
    • Calculating statistical power
    • Statistical power curve
    • Improving statistical power of hypothesis tests

Statistics FAQs

  • What is statistics?

    Statistics is the science of analysing data, particularly in large quantities, and using it to draw more general conclusions.

  • Why is a knowledge of statistics valuable in business?

    Without statistics, conclusions drawn from data can be fatally flawed. As data becomes ubiquitous, the ability to analyse it responsibly is essential.

  • What background do I need for this Statistics for Data Science Course?

    Participants should be able to understand basic mathematical concepts (high-school level) and be comfortable using computer software such as Excel.

  • Does this include any practical, hands-on learning?

    Yes. There are opportunities to conduct analyses throughout the course.

  • Can I take this training course online?

    Yes! We know your busy work schedule may prevent you from getting to one of our classrooms, which is why we offer convenient online training wherever you are. This course is available online, in person, or as Private Team Training.
