Two Sample Proportion with SigmaXL
What is the Two Sample Proportion Test?
The two sample proportion test is a hypothesis test to compare the proportions of one certain event occurring in two populations following the binomial distribution.
[unordered_list style=”star”]
- Null Hypothesis(H0): p1 = p2
- Alternative Hypothesis(Ha): p1 ≠ p2
[/unordered_list]
Two Sample Proportion Test Assumptions
[unordered_list style=”star”]
- The sample data drawn from the populations of interest are unbiased and representative.
- There are only two possible outcomes in each trial for both populations: success/failure, yes/no, and defective/non-defective etc.
- The underlying distributions of both populations are binomial distribution.
- When np ≥ 5 and np(1 – p) ≥ 5, the binomial distribution can be approximated by the normal distribution.
[/unordered_list]
How the Two Sample Proportion Test Works
When np ≥ 5 and np(1 – p) ≥ 5, we use normal distribution to approximate the underlying binomial distributions of the populations.
Test Statistic
Where:
and where:
[unordered_list style=”star”]
- and are the observed proportions of events in the two samples
- and is the number of trials in the two samples respectively
- and is the number of events in the two samples respectively
[/unordered_list]
When |Zcalc| is smaller than Zcrit, we fail to reject the null hypothesis.
Use SigmaXL to Run a Two Sample Proportion Test
Case study: We are interested in comparing the exam pass rates of a high school in March and April using a nonparametric (i.e. distribution-free) hypothesis test: two sample proportion test.
Data File: “Two Sample Proportion” tab in “Sample Data.xlsx”
[unordered_list style=”star”]
- Null Hypothesis (H0): pMarch = pApril
- Alternative Hypothesis (Ha): pMarch ≠ pApril
[/unordered_list]
Steps to run a two sample proportion test in SigmaXL:
- Click SigmaXL -> Statistical Tools -> 2 Proportions Test & Confidence Interval
- A new tab named “2 Proportions Test and CI” appears automatically.
- Enter “89” in the yellow box of “Number of Events” in Sample #1 column
- Enter “112” in the yellow box of “Sample Size in Sample #1 column.
- Enter “102” in the yellow box of “Number of Events” in Sample #2 column.
- Enter “130” in the yellow box of “Sample Size” in Sample #2 column.
- Select “Fisher’s Exact” as the testing method.
Model summary: The p-value of the two-sample proportion test is 0.8756, greater than the alpha level (0.05), and we fail to reject the null hypothesis. We conclude that the exam pass rates of the high school in March and April are not statistically different.