心理学报
心理學報
심이학보
Acta Psychologica Sinica
2013年
6期
694~703
,共null页
认知诊断计算机化自适应测验 选题方法 测量精度 项目曝光率
認知診斷計算機化自適應測驗 選題方法 測量精度 項目曝光率
인지진단계산궤화자괄응측험 선제방법 측량정도 항목폭광솔
cognitive diagnostic computerized adaptive testing; measurement accuracy; item exposure control;item selection method
项目曝光率关系到题库建设和测验安全,是计算机化自适应测验(ComputerizedAdaptiveTesting.CAT)需要考虑的重要问题。在认知诊断CAT情形下,首先基于传统CAT中α-分层方法的思想提出按项目信息量对题库分层的分层多阶段(StratifiedMultistage,SM)选题方法;然后将SM方法与项目合格(lternEligibility,IE)方法相结合得到SMIE方法。在此基础上,开展模拟研究比较SM、IE、SMIE、最大修正优先指标(MaximumModifiedPrioritylndex,MMPI)方法、限制阈值(RestrictiveThreshold,RT)方法和限制进度(RestrictiveProgressive,RPG)方法的选题表现。总体上,它们的测量精度从高到低依次为IE、SM、SMIE、RT、RPG和MMPI方法;项目曝光分布均匀性的优劣次序为MMPI、RPG、SMIE、RT、SM和IE方法;SMIE和RT方法能较好地平衡测量精度和项目曝光均匀性要求。
項目曝光率關繫到題庫建設和測驗安全,是計算機化自適應測驗(ComputerizedAdaptiveTesting.CAT)需要攷慮的重要問題。在認知診斷CAT情形下,首先基于傳統CAT中α-分層方法的思想提齣按項目信息量對題庫分層的分層多階段(StratifiedMultistage,SM)選題方法;然後將SM方法與項目閤格(lternEligibility,IE)方法相結閤得到SMIE方法。在此基礎上,開展模擬研究比較SM、IE、SMIE、最大脩正優先指標(MaximumModifiedPrioritylndex,MMPI)方法、限製閾值(RestrictiveThreshold,RT)方法和限製進度(RestrictiveProgressive,RPG)方法的選題錶現。總體上,它們的測量精度從高到低依次為IE、SM、SMIE、RT、RPG和MMPI方法;項目曝光分佈均勻性的優劣次序為MMPI、RPG、SMIE、RT、SM和IE方法;SMIE和RT方法能較好地平衡測量精度和項目曝光均勻性要求。
항목폭광솔관계도제고건설화측험안전,시계산궤화자괄응측험(ComputerizedAdaptiveTesting.CAT)수요고필적중요문제。재인지진단CAT정형하,수선기우전통CAT중α-분층방법적사상제출안항목신식량대제고분층적분층다계단(StratifiedMultistage,SM)선제방법;연후장SM방법여항목합격(lternEligibility,IE)방법상결합득도SMIE방법。재차기출상,개전모의연구비교SM、IE、SMIE、최대수정우선지표(MaximumModifiedPrioritylndex,MMPI)방법、한제역치(RestrictiveThreshold,RT)방법화한제진도(RestrictiveProgressive,RPG)방법적선제표현。총체상,타문적측량정도종고도저의차위IE、SM、SMIE、RT、RPG화MMPI방법;항목폭광분포균균성적우렬차서위MMPI、RPG、SMIE、RT、SM화IE방법;SMIE화RT방법능교호지평형측량정도화항목폭광균균성요구。
Item exposure rate is the utilization frequency of an item. When the exposure rate is high, examinees will likely share item content. If there are too many over-exposed items, test security and hence the validity of the assessment will certainly be compromised. Furthermore, with a lot of under-exposed items having low or zero item-exposure rates, the manpower and financial resources spent on item construction will be wasted and the item pool construction will become more challenging. Item exposure control is, therefore, an important issue in computerized adaptive testing (CAT). Cognitive diagnostic CAT (CD-CAT) combines and makes use of the strengths of cognitive diagnosis theory and CAT. The system will be able to provide information on the knowledge competence of the examinees by administering fewer items than traditional assessment. Based on the a-stratified method and the item eligibility method in regular CAT, the present study proposed and compared the performance of six techniques, namely, (a) the item eligibility (IE) method, (b) the stratified multistage (SM) approach, (c) the stratified multistage-item eligibility (SMIE) method, (d) the restrictive threshold (RT) method, (e) the maximum modified priority index (MMPI) method, and (f) the restrictive progress (RPG) method. With noting it that the SM approach is similar to the a-stratified method in item selection steps. The SM approach, however, different with the a-stratified method firstly in that it stratifies the remaining item pool based on the values of item information at the estimated attributed mastery pattern while the a-stratified method is based on the values of item discrimination parameter a. Secondly, in the SM method, the remainder item bank are stratified into a number of levels before the selection of each item, whereas in the a-stratified method, the item pool is stratified only once before the test and all the examinees have the same item strata. The SMIE method combines the SM and the IE method. MATLAB (R2010a) was used in the simulation experiments to write the CD-CAT code and the deterministic inputs, noisy "and" gate (DINA) model was applied in this study. Results showed that: (a) the SM method used in CD-CAT produced widely distributed item exposure by increasing the exposure rates of most items and fully utilizing the item pool but without greatly diminishing the maximum exposure rate and measurement accuracy; (b) other than a few items, the exposure rates of the IE method were lower than the setting maximum exposure rate, but most items still had extremely low exposure rates and hence resulting in a narrow distribution of item exposure and the highest measurement precision; (c) SMIE and RT methods behaved similarly in that not only could they increase the utilization frequency of the under-exposed items but they could also decrease the maximum exposure rate to a certain extent; (d) the MMPI and the RPG methods performed similarly with almost evenly distributed item exposure but at the great sacrifice of the measurement precision. As a whole, the performances of different methods in the order of their measurement accuracy are IE, SM, SMIE, RT, RPG and MMPI. The order in terms of their performances in exposure control is: MMPI, RPG, SMIE, RT, SM and IE. All in all, the SMIE and RT methods are able to balance measurement accuracy and item exposure well.