P Value is the probability for the “Null Hypothesis” to be true, where Null Hypothesis means which treats everything same or equal. The P stands for probability and Measures how likely it is that any observed difference between groups is due to chance. Being a probability, P can take any value between 0 and 1. Values close to 0 indicate that the observed difference is unlikely to be due to chance, whereas a P value close to 1 suggests no difference between the groups other than due to chance.
For example:
Let’s say We want to perform an experiment to see if a new type of weight loss drug (Drug X) causes people to lose body weight. So, we randomly sample a collection of volunteers and randomly assign them into two groups: Group A and Group B. We give Group A a placebo. In other words, this contains no active ingredients. Group A are therefore the control group. And, you give Group B the new drug (Drug X).
The participants are weighed at the start of the study and at the end of the study. This way, we can work out the body weight difference. At the end of the study you work out that Group A’s average body weight difference was 0 kg, in other words, they did not gain or lose any body weight. Group B’s weight difference was -1 kg, so on average they lost 1 kg of their body weight. So, does this mean that the drug worked?
To determine this, we first ask ourselves: What would happen in a world where the weight difference in volunteers who receive Drug X (Group B) is the same as the weight difference who received the placebo (Group A)?
This is where the null hypothesis comes in. Usually, the null hypothesis states that there are no differences between groups, for example. So, our null hypothesis is: The weight difference in those who receive Drug X, is the same as the weight difference in those who receive the placebo.
Now, we ask ourselves: If this null hypothesis were true, what is the chance (or probability) of discovering a 1 kg reduction (or more) in body weight in those treated with Drug X from our sample?
This probability, or p-value, measures the strength of evidence against the null hypothesis. We can think of this as a court trial where the defendant is innocent until proven guilty; in this case, the defendant is the null hypothesis. The smaller the p-value, the stronger the evidence against the null hypothesis.
To determine the p-value, scientists use what are known as statistical hypothesis tests. Common examples of statistical hypothesis tests include the Student T-test and a one-way ANOVA. Since this is a top line overview, We will not bombard with statistical jargon, but instead, pretend we have performed a statistical test using our data.
So, after inputting our experimental data into a statistical test, we get a p-value in return. Let’s say for our example, the p-value is 0.02. It’s worth mentioning that the p-value is a fraction, however, it may be easier to convert this to a percentage to understand the concept better. So, a value of 0.02, would be 2% ( simply multiplied the fraction by 100).
But what does this p-value result of 0.02 (or 2%) actually represent?
Essentially a p-value of 2% means that if the null hypothesis were true (that the two population means are identical), then there is a 2% chance of observing a difference as large (or larger) than what we observed in our sample. To put that into perspective, a 2% chance corresponds to 1 in every 50 experiments of this size. But, how can this be? What is accounting for this 2%? Simply, this 2% can be accounted for by random noise.
Let’s elaborate a little more on Random noise.
There are quite a few things that can impact a p-value, and some of these factors are collectively known as random noise or random chance. One type of factor that can contribute to random noise, especially in human studies, is the coincidence random sampling. For example, humans can exhibit a large amount of variation between one another due to genetic and environmental influences.
If we relate back to our example, some humans may contain an unknown gene that speeds up their metabolism and causes them to lose weight more than those without the gene. When recruiting volunteers for our experiment, we did not perform any DNA analysis before randomly assigning the volunteers to either Group A, the control group, or Group B, the Drug X group; so, there was no way in knowing who was a carrier or not.
Imagine a situation where, just by pure coincidence, more volunteers with the high metabolism gene, are placed in Group B, than Group A; so, it makes sense that this group lost more weight. So, here you can see that by a pure coincidence of random sampling, this can have a knock-on effect on the p-value.
So, to sum up, a p-value refers to a probability value. This p-value is a value between 0 and 1.This number represents the probability of obtaining the observed difference (or a larger one) in the outcome measure of the sample, given that no difference exists between treatments in the population (the null-hypothesis is true).
Random noise can affect the p-value. An example of random noise is the coincidence of random sampling.