Efficiency of multiple-choice tests as a function of spread of item difficultiesCronbach, Lee J.;Warrington, Willard G.
doi: 10.1007/bf02288778pmid: N/A
Abstract The validity of a univocal multiple-choice test is determined for varying distributions of item difficulty and varying degrees of item precision. Validity is a function ofσ 2d +σ 2v , whereσ d measures item unreliability andσ v measures the spread of item difficulties. When this variance is very small, validity is high for one optimum cutting score, but the test gives relatively little valid information for other cutting scores. As this variance increases, eta increases up to a certain point, and then begins to decrease. Screening validity at the optimum cutting score declines as this variance increases, but the test becomes much more flexible, maintaining the same validity for a wide range of cutting scores. For items of the type ordinarily used in psychological tests, the test with uniform item difficulty gives greater over-all validity, and superior validity for most cutting scores, compared to a test with a range of item difficulties. When a multiple-choice test is intended to reject the poorestF per cent of the men tested, items should on the average be located at or above the threshold for men whose true ability is at theFth percentile.
Finite markov processes in psychologyMiller, George A.
doi: 10.1007/bf02288779pmid: N/A
Abstract Finite Markov processes are reviewed and considered for their usefulness in the description of behavioral data. The various alternative responses in an experimental situation define a vector space, and changes in the probabilities of these alternatives are represented by movements in this space. Methods of fitting the theory to experimental data are considered. The simplest process, with a constant matrix of transitional probabilities that is applied repeatedly to represent the effect of successive trials, seems inadequate for most learning data. A matrix function that may be useful for learning theory is presented.
An internal consistency check for scale values determined by the method of successive intervalsEdwards, Allen L.;Thurstone, L. L.
doi: 10.1007/bf02288780pmid: N/A
Abstract The method of successive intervals is a psychological scaling procedure in which stimuli are classified into successive intervals according to the degree of some defined attribute which they are judged to possess. A psychological continuum is defined and the scale values are then taken as the medians of the distributions of judgments on the psychological continuum. It is assumed that the distributions of judgments for each stimulus are normal on the psychological continuum as defined. An internal consistency check indicates that the cumulative distributions of empirical judgments for the various stimuli can be reproduced by means of a limited number of parameters with an average error that compares favorably with that usually reported for paired comparison data. Furthermore, the scale values obtained by successive interval scaling, for the data reported, are shown to be linearly related to those obtained by the method of paired comparisons.
The relation of the reliability of multiple-choice tests to the distribution of item difficultiesLord, Frederic M.
doi: 10.1007/bf02288781pmid: N/A
Abstract Under certain assumptions an expression, in terms of item difficulties and intercorrelations, is derived for the curvilinear correlation of test score on the “ability underlying the test,” this ability being defined as the common factor of the item tetrachoric intercorrelations corrected for guessing. It is shown that this curvilinear correlation is equal to the square root of the test reliability. Numerical values for these curvilinear correlations are presented for a number of hypothetical tests, defined in terms of their item parameters. These numerical results indicate that the reliability and the curvilinear correlation will be maximized by (1) minimizing the variability of item difficulty and (2) making the level of item difficulty somewhat easier than the halfway point between a chance percentage of correct answers and 100 per cent correct answers.
On the determination of redundancies in sociometric chainsRoss, Ian C.;Harary, Frank
doi: 10.1007/bf02288782pmid: N/A
Abstract The use of a matrix to represent a relationship between the members of a group is well known in sociometry. If this matrix is raised to a certain power, the elements appearing give the total number of connecting paths between each pair of members. In general, some of these paths will be redundant. Methods of finding the number of such redundant paths have been developed for three- and four-step chains by Luce and Perry (3) and Katz (2), respectively. We have derived formulas for the number of redundant paths of five and six steps; and in addition, an algorithm for determining the number of redundant paths of any given length.
Multiple group methods for common-factor analysis: Their basis, computation, and interpretationGuttman, Louis
doi: 10.1007/bf02288783pmid: N/A
Abstract In a previous paper (1) were developed three basic theorems which were shown to provide numerical routines, as well as algebraic proof, for existing common-factor methods. New “multiple” routines were also indicated. The first theorem showed how to extract as many common factors as one wished from the correlation matrix in one operation. The second theorem showed how to do the same from the score matrix. The third proved that factoring the correlation matrix was equivalent to factoring the score matrix. A particular application of these theorems is the multiple group factoring method, which the writer first used in practice on some Army attitude scores during World War II. The present paper explains the basic theorems in more detail with special reference to group factoring. Computations are outlined as consisting of five simple matric operations. The meaning of commonfactor analysis is given in terms of the basic theorems, as well as the relationship to “inverted” factor theory.
A technique for facilitating the rotation of factor axes, based on an equivalence between persons and testsSandler, Joseph
doi: 10.1007/bf02288784pmid: N/A
Abstract A technique is outlined which may facilitate the rotation of factor axes to a meaningful position. It is based on certain relationships between the results of test and person factor analysis, and consists essentially of supplementing the test factor space with tests which are thetest-equivalents of persons or groups of persons. These persons may be, for instance, well-known “types” in the domain being investigated, or even “freaks.” The ways in which these persons may be selected and used to determine the final rotated position of the factor axes is discussed.