2025 Aug 6;16:1381–1397. doi: 10.2147/AMEP.S525828

Table 3.

Test of Significance Among Flaws and Item Indices of the SpX Entry Qualifying Exam (Item N=120)

Items/Questions Cross Tabulation | P-value | Interpretation
DIF categories * MCQs Testwiseness flaws | 0.569 | Insignificant
DIF categories * MCQs Irrelevant flaws | 0.084 | Insignificant
DIF categories * Editing Errors | 0.653 | Insignificant
DIF categories * All MCQs flaws | 0.047 | Significant
DIF categories * Mean Functional Distractibility | 0.0035 | Significant
PBS categories * MCQs Testwiseness flaws | 0.074 | Insignificant
PBS categories * MCQs Irrelevant flaws | 0.063 | Insignificant
PBS categories * Editing Errors | 0.044 | Significant
PBS categories * All MCQs flaws | 0.043 | Significant
PBS categories * Mean Functional Distractibility | 0.418 | Insignificant
PBS categories * Horst Index | <0.00001 | Significant
Editing Errors * Horst Index | 0.253 | Insignificant

Notes: Mean Functional Distractibility = Average number of functioning distractors (those selected by ≥5% of examinees); Testwiseness Flaws = Structural flaws that aid guessing rather than knowledge; Irrelevant Flaws = Unnecessary difficulty not related to content; Editing Errors = Language, formatting, or structural issues; Horst Index = Item-level metric used to detect miskeyed or ambiguous questions.

Abbreviations: MCQs, Multiple Choice Questions; DIF, Difficulty Index; PBS, Point-Biserial Correlation.
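
The definition of a functioning distractor given in the Notes (an incorrect option selected by ≥5% of examinees) can be illustrated with a minimal Python sketch. The function names, the four-option format, and the response data below are hypothetical and used only for illustration; they are not taken from the study's data or analysis code.

```python
from collections import Counter


def functioning_distractors(responses, key, options="ABCD", threshold=0.05):
    """Count incorrect options chosen by at least `threshold` of examinees.

    `responses` holds one selected option per examinee for a single item;
    `key` is the correct option. The 5% threshold follows the table Notes.
    """
    n = len(responses)
    if n == 0:
        return 0
    counts = Counter(responses)
    return sum(
        1 for opt in options
        if opt != key and counts[opt] / n >= threshold
    )


def mean_functional_distractibility(items):
    """Average the functioning-distractor count over (responses, key) pairs."""
    per_item = [functioning_distractors(resp, key) for resp, key in items]
    return sum(per_item) / len(per_item)


# Hypothetical data: two items, 40 examinees each.
item_1 = (list("A" * 30 + "B" * 6 + "C" * 3 + "D" * 1), "A")  # D chosen by 2.5%
item_2 = (list("C" * 20 + "A" * 8 + "B" * 8 + "D" * 4), "C")  # all distractors >= 5%

print(functioning_distractors(*item_1))
print(functioning_distractors(*item_2))
print(mean_functional_distractibility([item_1, item_2]))
```

In this illustrative data, option D of the first item falls below the 5% threshold and is therefore not counted as functioning, while all three distractors of the second item are counted.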