AI Basics with AK

Season 03 - Introduction to Statistics

Arun Koundinya Parasa

Episode 04

Data Visualization for Statistics — Histograms, Box Plots and Scatter Plots

Agenda

  • Why Visualize Data?
  • Histograms to Understand Distribution
  • Box Plots for Spread & Outliers
  • Scatter Plots to Explore Relationships

In last class we learned about mean, median, mode and etc;

Why Visualize Data?

Visualizations are the windows into data — they help us:

  • See patterns, trends, and outliers.
  • Understand distributions beyond numbers.
  • Communicate insights clearly.
  • Build intuition for statistical concepts.

    A picture is worth a thousand data points.

Understanding Histogram to Learn Distributions

Histograms group data into bins to show frequency distribution.

  • Helps identify shape: symmetric, skewed, uniform, bimodal.
  • Reveals where data clusters or thins out.

Box Plots: Visualizing Spread and Outliers

Box plots summarize data using:

  • Median (center line)
  • Interquartile Range (box)
  • Whiskers (range without outliers)
  • Outliers (dots outside whiskers)

    Great for comparing groups!

Box Plot Example

Scatter Plot Example

Visualizing Skewness and Outliers

Summary

  • Visualizations reveal shape, spread, and relationships in data.
  • Histograms and box plots help us understand distribution and outliers.
  • Scatter plots show how variables relate.
  • Box Plot compares the two categories.

Thank You

Thank you for joining this episode on Data Visualization.

Next up:

  • Episode 05 - Probability Fundamentals
  • Episode 06: Introduction to Probability Distributions