Statistics for Data Science: P-value in Hypothesis Testing

3 min read · Oct 24, 2021

I had been looking for a simple, intuitive explanation of the p-value for a long time, and found one after listening to a Super Data Science podcast by Kirill Eremenko: https://www.superdatascience.com/podcast.

The ideas here are a reproduction of what I learnt from that podcast, and I hope I am able to make them as clear and simple as the podcast itself.

The p-value is a term in statistics for the probability of observing, purely by random chance, a result at least as extreme as the one we actually observed, given that our null hypothesis is true.

This is best illustrated with an example:

Example: We have a coin and we want to find out whether it is unbiased or biased. If it is unbiased, then P(H) = P(T) = 1/2, where P(H) and P(T) denote the probability of getting a head or a tail, respectively, on a random toss.

We formulate the experiment by proposing our null hypothesis

H0 : the coin is unbiased, i.e. P(H) = P(T)

and alternative hypothesis

H1 : the coin is biased, i.e. P(H) != P(T).

We start by assuming that our null hypothesis is true, and then carry out the following experiment to check whether there is any evidence to reject it:

Suppose we toss the coin 5 times in succession and here is what we observe.

I Toss : Heads.

This is perfectly normal, as the probability of obtaining heads (or tails) in 1 coin toss is 1/2 or 50%.

II Toss : Heads.

The probability of 2 heads in 2 consecutive tosses is 1/4 or 25%, as HH is one outcome out of 4 equally likely possibilities (HH, TT, HT, TH).
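The counting argument above can be checked by listing the outcomes directly. This is only a small sketch, assuming a fair coin with independent tosses; the variable names are my own and not from the podcast.

```python
from itertools import product

# Enumerate every equally likely outcome of 2 tosses of a fair coin.
outcomes = ["".join(seq) for seq in product("HT", repeat=2)]

# Keep only the outcomes where both tosses are heads.
all_heads = [o for o in outcomes if o == "HH"]

# With equally likely outcomes, probability = favourable / total.
prob_two_heads = len(all_heads) / len(outcomes)

print(outcomes)        # ['HH', 'HT', 'TH', 'TT']
print(prob_two_heads)  # 0.25
```

Changing `repeat=2` to a larger number lists the outcomes for more tosses in the same way.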

III Toss : Heads.

The probability of 3 heads in 3 consecutive tosses is 1/8 or 12.5% (the number of possible outcomes here is 2³ = 8). Now we might start to have a slight doubt as to whether our coin is biased towards heads, but 12.5% is still a fairly large probability, so this outcome is not very surprising for a fair coin.

IV Toss : Heads.

The probability of 4 heads in 4 consecutive tosses is 1/2⁴ or 1/16, i.e. 6.25%. This is still above the commonly used 5% threshold, so we cannot yet say that we have enough evidence to reject our null hypothesis.

V Toss : Heads.

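The probability of 5 heads in 5 consecutive tosses is 1/2⁵ or 1/32, i.e. about 3.1%. This is quite low, below the commonly used 5% threshold, and so provides evidence against our null hypothesis. (Strictly speaking, 1/32 is the one-sided probability, which tests for a bias towards heads. For the two-sided H1 stated above, 5 tails would be just as extreme as 5 heads, so the two-sided p-value would be 2/32, or 6.25%.)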
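The running probabilities from the five tosses can be reproduced in a few lines. This is a sketch under the same assumption as the walk-through (a fair coin and independent tosses); `prob_all_heads` is a helper name I made up for illustration.

```python
from fractions import Fraction

def prob_all_heads(n_tosses, p_heads=Fraction(1, 2)):
    """Probability of getting heads on every one of n_tosses independent tosses."""
    return p_heads ** n_tosses

# Probability of an unbroken run of heads after 1, 2, ..., 5 tosses.
running = {n: prob_all_heads(n) for n in range(1, 6)}

for n, p in running.items():
    print(f"{n} heads in {n} tosses: {p} = {float(p):.3%}")
```

Using `Fraction` keeps the values exact (1/2, 1/4, 1/8, 1/16, 1/32), so they can be compared directly with the fractions quoted in the text.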

So our p-value is the probability of observing a result at least as extreme as ours, given that our null hypothesis is true. If the p-value is below a chosen significance level (most commonly 5%), we can say that we have enough evidence to reject our null hypothesis in favour of our alternative hypothesis.
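As a rough sketch of this decision rule, the code below computes an exact binomial p-value for the coin example and compares it with a 5% significance level. The function names and the `two_sided` switch are my own assumptions for illustration. The two-sided version uses one common convention: add up the probabilities of all outcomes that are no more likely than the observed one.

```python
from math import comb

def binom_pmf(k, n, p=0.5):
    """Probability of exactly k heads in n independent tosses with P(H) = p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def p_value_heads(k, n, p=0.5, two_sided=False):
    """Exact p-value for observing k heads in n tosses under H0: P(H) = p."""
    if two_sided:
        # Sum every outcome that is no more likely than the observed one.
        observed = binom_pmf(k, n, p)
        probs = (binom_pmf(i, n, p) for i in range(n + 1))
        total = sum(q for q in probs if q <= observed * (1 + 1e-9))
    else:
        # One-sided: k or more heads (testing for a bias towards heads).
        total = sum(binom_pmf(i, n, p) for i in range(k, n + 1))
    return min(total, 1.0)

ALPHA = 0.05  # significance level, the 5% mentioned in the text

p_one = p_value_heads(5, 5)                  # 5 heads in 5 tosses, one-sided
p_two = p_value_heads(5, 5, two_sided=True)  # also counts 5 tails as extreme

reject_one = p_one < ALPHA
reject_two = p_two < ALPHA

print(p_one, reject_one)  # 0.03125 True
print(p_two, reject_two)  # 0.0625 False
```

With only 5 tosses, the one-sided p-value falls below 5% while the two-sided one does not, so the conclusion depends on which alternative hypothesis is being tested.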

Written by Prakhar S

Data Engineer, Sprighub

Responses (2)

Kadamhari

Apr 25, 2022

Really good explanation, Prakhar.
Looking forward to see more from you.

Majfeiz

Jan 19, 2022

Using 5% criteria for finding if the coin is fair or not is a wrong assumption!!
