Table 3.
Tests of Significance Among Flaws and Item Indices of the SpX Entry Qualifying Exam (N = 120 items)
Item Cross-Tabulation | P-value | Interpretation |
---|---|---|
DIF categories * MCQ Testwiseness Flaws | 0.569 | Not significant |
DIF categories * MCQ Irrelevant Flaws | 0.084 | Not significant |
DIF categories * Editing Errors | 0.653 | Not significant |
DIF categories * All MCQ Flaws | 0.047 | Significant |
DIF categories * Mean Functional Distractibility | 0.0035 | Significant |
PBS categories * MCQ Testwiseness Flaws | 0.074 | Not significant |
PBS categories * MCQ Irrelevant Flaws | 0.063 | Not significant |
PBS categories * Editing Errors | 0.044 | Significant |
PBS categories * All MCQ Flaws | 0.043 | Significant |
PBS categories * Mean Functional Distractibility | 0.418 | Not significant |
PBS categories * Horst Index | <0.00001 | Significant |
Editing Errors * Horst Index | 0.253 | Not significant |
Notes: Mean Functional Distractibility = average number of functioning distractors (those selected by ≥5% of examinees); Testwiseness Flaws = structural flaws that aid guessing rather than knowledge; Irrelevant Flaws = unnecessary difficulty not related to content; Editing Errors = language, formatting, or structural issues; Horst Index = item-level metric used to detect miskeyed or ambiguous questions.
Abbreviations: MCQs, Multiple Choice Questions; DIF, Difficulty Index; PBS, Point-Biserial Correlation.
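The ≥5% rule in the notes can be illustrated with a minimal Python sketch. It assumes four-option items and uses invented response data; the function name `functioning_distractors`, the option labels, and the two example items are hypothetical and are not taken from the study.

```python
from collections import Counter
from statistics import mean


def functioning_distractors(responses, key, options="ABCD", threshold=0.05):
    """Count distractors selected by at least `threshold` of examinees."""
    if not responses:
        raise ValueError("responses must not be empty")
    counts = Counter(responses)
    n = len(responses)
    # A distractor is any option other than the keyed answer.
    return sum(
        1
        for option in options
        if option != key and counts[option] / n >= threshold
    )


# Hypothetical response data for two items, 40 examinees each.
item_1 = list("B" * 26 + "A" * 9 + "C" * 4 + "D" * 1)  # keyed answer: B
item_2 = list("D" * 30 + "A" * 8 + "B" * 1 + "C" * 1)  # keyed answer: D

per_item = [
    functioning_distractors(item_1, key="B"),
    functioning_distractors(item_2, key="D"),
]

# Averaging the per-item counts gives one possible reading of
# "Mean Functional Distractibility" (an assumption, not the paper's code).
mean_functional_distractibility = mean(per_item)
```

In this invented example, an option chosen by 1 of 40 examinees (2.5%) falls below the threshold and is not counted, so the first item has two functioning distractors and the second has one.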