Two Sample Proportion with SigmaXL

By Michael Parker | March 8, 2016

What is the Two Sample Proportion Test?

The two sample proportion test is a hypothesis test that compares the proportions of a given event occurring in two populations, each following the binomial distribution.

[unordered_list style=”star”]

Null Hypothesis(H₀): p₁ = p₂
Alternative Hypothesis(H_a): p₁ ≠ p₂

[/unordered_list]

Two Sample Proportion Test Assumptions

[unordered_list style=”star”]

The sample data drawn from the populations of interest are unbiased and representative.
There are only two possible outcomes in each trial for both populations, such as success/failure, yes/no, or defective/non-defective.
The underlying distributions of both populations are binomial.
When np ≥ 5 and n(1 – p) ≥ 5, the binomial distribution can be approximated by the normal distribution.

[/unordered_list]
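The last assumption can be checked with a few lines of code. The following is a minimal sketch in Python; the function name `normal_approx_ok` and the example numbers are illustrative assumptions, not part of SigmaXL or the sample data.

```python
def normal_approx_ok(n: int, p: float, threshold: float = 5.0) -> bool:
    """Return True when both n*p and n*(1 - p) reach the threshold.

    n is the number of trials and p is the proportion of events.
    The threshold of 5 follows the rule of thumb stated above.
    """
    return n * p >= threshold and n * (1 - p) >= threshold


# Illustrative values only (not taken from the case study):
balanced = normal_approx_ok(100, 0.5)     # 50 expected events, 50 expected non-events
rare_event = normal_approx_ok(100, 0.03)  # only 3 expected events

print("n=100, p=0.50:", balanced)
print("n=100, p=0.03:", rare_event)
```

When the check fails for either sample, the normal approximation is doubtful and an exact method such as Fisher's exact test is usually the safer choice.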

How the Two Sample Proportion Test Works

When np ≥ 5 and n(1 – p) ≥ 5 for each sample, we use the normal distribution to approximate the underlying binomial distributions of the populations.
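Test Statistic

$$Z_{calc} = \frac{\hat{p}_1 - \hat{p}_2}{\sqrt{\hat{p}_0 (1 - \hat{p}_0) \left( \frac{1}{n_1} + \frac{1}{n_2} \right)}}$$

Where:

$$\hat{p}_1 = \frac{x_1}{n_1}, \qquad \hat{p}_2 = \frac{x_2}{n_2}$$

and where:

$$\hat{p}_0 = \frac{x_1 + x_2}{n_1 + n_2}$$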

[unordered_list style=”star”]

p̂₁ and p̂₂ are the observed proportions of events in the two samples
n₁ and n₂ are the numbers of trials in the two samples, respectively
x₁ and x₂ are the numbers of events in the two samples, respectively

[/unordered_list]

When |Z_calc| is smaller than Z_crit, we fail to reject the null hypothesis.
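To make the calculation concrete, here is a minimal Python sketch of the normal-approximation test. It assumes the usual pooled-proportion form of the Z statistic and a two-sided alpha of 0.05, and it uses the event counts from the case study later in this article. The function name `two_proportion_z_test` is an illustrative choice, not a SigmaXL function.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(x1, n1, x2, n2, alpha=0.05):
    """Two-sided pooled Z test for the difference of two proportions.

    x1, x2 are the numbers of events; n1, n2 are the numbers of trials.
    Returns (z_calc, z_crit, p_value).
    """
    p1_hat = x1 / n1                   # observed proportion, sample 1
    p2_hat = x2 / n2                   # observed proportion, sample 2
    p0_hat = (x1 + x2) / (n1 + n2)     # pooled proportion under H0
    std_err = sqrt(p0_hat * (1 - p0_hat) * (1 / n1 + 1 / n2))
    z_calc = (p1_hat - p2_hat) / std_err

    std_normal = NormalDist()          # standard normal: mean 0, sd 1
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    p_value = 2 * (1 - std_normal.cdf(abs(z_calc)))
    return z_calc, z_crit, p_value


# Counts from the case study: 89 of 112 passed in March, 102 of 130 in April.
z_calc, z_crit, p_value = two_proportion_z_test(89, 112, 102, 130)
reject_h0 = abs(z_calc) >= z_crit

print(f"Z_calc = {z_calc:.4f}, Z_crit = {z_crit:.4f}, p-value = {p_value:.4f}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

Because this sketch relies on the normal approximation, its p-value will not match an exact method such as Fisher's exact test digit for digit, although both should lead to the same decision when the samples are reasonably large.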

Use SigmaXL to Run a Two Sample Proportion Test

Case study: We are interested in comparing the exam pass rates of a high school in March and April using a two sample proportion test.
Data File: “Two Sample Proportion” tab in “Sample Data.xlsx”

[unordered_list style=”star”]

Null Hypothesis (H₀): p_March = p_April
Alternative Hypothesis (H_a): p_March ≠ p_April

[/unordered_list]

Steps to run a two sample proportion test in SigmaXL:

Click SigmaXL -> Statistical Tools -> 2 Proportions Test & Confidence Interval
A new tab named “2 Proportions Test and CI” appears automatically.
Enter “89” in the yellow box of “Number of Events” in Sample #1 column.
Enter “112” in the yellow box of “Sample Size” in Sample #1 column.
Enter “102” in the yellow box of “Number of Events” in Sample #2 column.
Enter “130” in the yellow box of “Sample Size” in Sample #2 column.
Select “Fisher’s Exact” as the testing method.

Model summary: The p-value of the two sample proportion test is 0.8756, which is greater than the alpha level (0.05), so we fail to reject the null hypothesis. We conclude that the exam pass rates of the high school in March and April are not significantly different.
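For readers who want to cross-check the result outside SigmaXL, the following Python sketch computes a two-sided Fisher's exact p-value using only the standard library. It assumes the common convention of summing the probabilities of all tables that are no more likely than the observed one; SigmaXL's internal method is not documented here, so small differences from the reported value are possible. The function name `fisher_exact_two_sided` is an illustrative choice.

```python
from math import comb


def fisher_exact_two_sided(x1, n1, x2, n2):
    """Two-sided Fisher's exact p-value for two samples of events/trials.

    Row and column totals are held fixed, so the number of events in
    sample 1 follows a hypergeometric distribution under H0.
    """
    total_events = x1 + x2
    denom = comb(n1 + n2, total_events)

    def prob(k):
        # Probability that sample 1 contains exactly k of the events
        return comb(n1, k) * comb(n2, total_events - k) / denom

    k_min = max(0, total_events - n2)
    k_max = min(n1, total_events)
    p_observed = prob(x1)

    # Sum every table that is no more likely than the observed table;
    # the small tolerance guards against floating-point ties.
    p_value = sum(
        p for p in (prob(k) for k in range(k_min, k_max + 1))
        if p <= p_observed * (1 + 1e-7)
    )
    return min(1.0, p_value)


# Counts from the case study: 89 of 112 passed in March, 102 of 130 in April.
p_value = fisher_exact_two_sided(89, 112, 102, 130)
print(f"Fisher's exact two-sided p-value = {p_value:.4f}")
print("Reject H0" if p_value < 0.05 else "Fail to reject H0")
```

An exact test like this does not depend on the normal approximation, which makes it a reasonable default when either sample is small.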
