心理科学
Psychological Science
2014
Issue 1
pp. 212-216
CD-CAT (computerized adaptive cognitive diagnostic testing), item selection methods, item bank usage, halving algorithm
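Existing item selection strategies in CD-CAT emphasize test efficiency and pay too little attention to item bank usage. To address this problem, two new item selection strategies, KLED and RHA, are introduced under the DINA model, and HA is also examined in a simulation study. The results show that PWKL and KLED have an advantage only in test efficiency; that stratifying KLED by attribute vector improves item bank usage somewhat, and that KLED generalizes more readily than ED to other diagnostic models that have an explicit expression; and that HA, RHA, and RP-PWKL balance test validity and item bank usage fairly well, although RP-PWKL requires setting a threshold for the maximum item exposure rate. Both new item selection methods have some practical value in fixed-length and variable-length CD-CAT.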
Cognitive diagnostic computerized adaptive testing (CD-CAT) is a popular mode of online testing for cognitive diagnostic assessment (CDA). The key to a CD-CAT program is its item selection method. Three of the most popular methods select items based on Kullback-Leibler (KL) information, Shannon entropy (SHE), and the expected discrimination (ED) method. These methods achieve much better test efficiency, but they often lead to unbalanced item usage within a pool. A diagnostic test is usually not a high-stakes test, so item overexposure may not be a major concern. Item underexposure, however, wastes the time and money invested in developing each item, and a high test overlap rate produces the effects of intensive practice. Although the restrictive progressive method (RP-PWKL) and the restrictive threshold method (RT-PWKL) were proposed to balance item exposure control with measurement accuracy, both suppress overexposure by adding a restriction that keeps the maximum exposure rate under a predetermined value, and the rationale for that maximum exposure rate deserves further consideration. In view of the above, this article proposes two item selection methods for CD-CAT based on the "Deterministic Input, Noisy And Gate" (DINA) model. First, using KL information as the discrimination function of ED, KLED is proposed so that the method can handle cognitive diagnostic models other than the DINA model. Second, following the idea of randomization strategies, in which the item is always selected at random from among the most informative items, the randomization halving algorithm (RHA) is proposed. For RHA, all items within the specified range are available for selection, rather than an arbitrary number of items or only one.
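The selection ideas named in this abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything here is an assumption made for illustration: the function names, the `tol` parameter, and the toy Q matrix are hypothetical; the halving index is taken to be m(1 - m), where m is the posterior mass of knowledge states whose DINA ideal response to the item is 1; and the randomized variant simply draws among items whose index is within `tol` of the best, which may differ from the paper's definition of RHA.

```python
import itertools
import random

def ideal_response(alpha, q):
    """DINA ideal response: 1 only if every attribute the item requires is mastered."""
    return int(all(a >= r for a, r in zip(alpha, q)))

def dina_prob(alpha, q, slip, guess):
    """P(correct) under DINA: 1 - slip when the ideal response is 1, else guess."""
    return 1.0 - slip if ideal_response(alpha, q) else guess

def halving_index(posterior, states, q):
    """Assumed index: m * (1 - m), with m the posterior mass of states that master the item."""
    m = sum(p for p, alpha in zip(posterior, states) if ideal_response(alpha, q))
    return m * (1.0 - m)

def select_halving(posterior, states, q_matrix, used):
    """Deterministic sketch: the unused item that splits the posterior closest to half."""
    candidates = [j for j in range(len(q_matrix)) if j not in used]
    return max(candidates, key=lambda j: halving_index(posterior, states, q_matrix[j]))

def select_randomized_halving(posterior, states, q_matrix, used, tol=0.02, rng=random):
    """Assumed randomized variant: draw uniformly among items within tol of the best index."""
    candidates = [j for j in range(len(q_matrix)) if j not in used]
    scores = {j: halving_index(posterior, states, q_matrix[j]) for j in candidates}
    best = max(scores.values())
    near_best = [j for j in candidates if best - scores[j] <= tol]
    return rng.choice(near_best)

# Toy example: two attributes give four knowledge states with a uniform posterior.
states = list(itertools.product([0, 1], repeat=2))
posterior = [0.25] * 4
q_matrix = [(1, 0), (0, 1), (1, 1), (1, 0)]  # hypothetical four-item bank

first_item = select_halving(posterior, states, q_matrix, used=set())
random_item = select_randomized_halving(posterior, states, q_matrix, used=set())
```

In this toy bank the deterministic rule always returns the first of the tied single-attribute items, while the randomized rule spreads selection across all three of them, which is the mechanism by which randomization can even out item usage.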
Moreover, we show the connection between KLED, HA, and RHA: KLED can be regarded as a weighted HA method, weighted by the corresponding item parameters, and HA can be regarded as RHA without the random component between different item attribute vectors in the Q matrix of the item pool. Two simulation studies are then carried out, one using a simulated item bank and the other based on items calibrated from real data. Eight item selection strategies are considered in these studies: random, posterior-weighted KL (PWKL), RP-PWKL, RT-PWKL, ED, the halving algorithm (HA), KLED, and RHA. In addition, VRP-PWKL and VRT-PWKL are proposed for variable-length CD-CAT as extended versions of RP-PWKL and RT-PWKL. Simulation studies for fixed-length and variable-length CD-CAT are conducted with these methods, and the results are compared in terms of the pattern or attribute correct classification rate, error classification rate, item exposure rate, and test overlap rate. The simulation results show that RHA, HA, RP-PWKL, VRP-PWKL, and VRT-PWKL use the item bank in a more balanced way, with a slight decrease in the correct classification rate of knowledge states, and that RHA, HA, VRP-PWKL, and VRT-PWKL can be used for variable-length CD-CAT. Although the simulation results are encouraging, further studies of CD-CAT are proposed, such as studies with different cognitive diagnostic models.