Tuesday, January 31, 2006

 

Online PDF Paper Robot

arXiv.org Search

Sunday, January 29, 2006

 

Mean-square Significance and Sample Size, P Halkitis

Mean-square Significance and Sample Size, P Halkitis: "Mean-square Significance and Sample Size, P Halkitis

An investigation of item fit is central to the selection of items. Item misfit to the Rasch model can be summarized in mean-square statistics. Associated with each mean-square is a significance level based on a test of the hypothesis: 'Responses to this item fit the model perfectly.' All significance tests, however, are sensitive to sample size. While power in significance testing can be worthwhile, 'too much power' due to exceedingly large samples may lead to faulty conclusions about item fits.

Experimental versions of an anatomy examination were administered to 4998 individuals. Due to the resulting power of the hypothesis test, 29 of 47 items showed significant statistical misfit (p<.05), creating the impression that many items were unusable. Though each item will ultimately be examined for other aspects of quality, such as discrimination and guessing, it is useful to have a more realistic indication of fit and misfit than the conventional significance levels.

A first step is to examine how the mean-square values distribute. This distribution provides insight into whether or not the items are functioning in a similar way. In a histogram of the mean-square statistics we expect a pattern in which many statistics are clustered around a central location (near 1.0) while others are 'outliers'. Items in the cluster are exhibiting similar fit. Outliers are divergent. When the mean-square fit statistics for the anatomy items are plotted, the expected cluster emerges near 1. This cluster includes all the 'good' fit items, based on their statistical significance, and half of the 'bad' items. Since the mean-squares are ratio scale statistics, the 'cluster distribution' drawn into the Figure was obtained by fitting a smooth symmetric curve to the logarithms of the mean-squares, and exponentiating back to the ratio scaling.

This result does not mean that all items in the cluster should be immediately accepted into the final version of the examination. Nor does it mean that those outside the cluster be rejected. Rather this information provides an orderly basis for choosing final item sets."

This page is powered by Blogger. Isn't yours?