光谱学与光谱分析
光譜學與光譜分析
광보학여광보분석
SPECTROSCOPY AND SPECTRAL ANALYSIS
2014年
8期
2094-2097
,共4页
刘国海%夏荣盛%江辉%梅从立%黄永红
劉國海%夏榮盛%江輝%梅從立%黃永紅
류국해%하영성%강휘%매종립%황영홍
光谱分析%近红外%波长优选%SCARS
光譜分析%近紅外%波長優選%SCARS
광보분석%근홍외%파장우선%SCARS
Spectral analysis%Near infrared spectroscopy%Wavelength selection%SCARS
针对近红外光谱数据的内在特点,提出了一种基于稳定性竞争自适应重加权采样(stability compet-itive adaptive reweighted sampling,SCARS)策略的近红外特征波长优选方法。该方法以PLS模型回归系数的稳定性作为变量选择的依据,其过程包含多次循环迭代,每次循环均首先计算相应变量的稳定性,而后通过强制变量筛选以及自适应重加权采样技术(ARS)进行变量筛选;最后对每次循环后所得变量子集建立PLS模型并计算交互验证均方根误差(RMSECV),将RMSECV值最小的集合作为最优变量子集。利用饲料蛋白固态发酵过程近红外光谱数据集对所提方法进行了验证,并与基于PLS的蒙特卡罗无信息变量消除法(MC-UVE)和竞争自适应重加权采样(CARS)方法所得结果进行了比较。试验结果显示:建立在SCARS方法优选的21个特征波长变量基础上的 PLS 模型预测效果更好,其预测均方根误差(RMSEP)和相关系数(Rp )分别为0.0543和0.9908;该优选策略能有效地增强固态发酵光谱数据特征波长变量选择的准确性和稳定性,提高了模型的预测精度,具有一定的应用价值。
針對近紅外光譜數據的內在特點,提齣瞭一種基于穩定性競爭自適應重加權採樣(stability compet-itive adaptive reweighted sampling,SCARS)策略的近紅外特徵波長優選方法。該方法以PLS模型迴歸繫數的穩定性作為變量選擇的依據,其過程包含多次循環迭代,每次循環均首先計算相應變量的穩定性,而後通過彊製變量篩選以及自適應重加權採樣技術(ARS)進行變量篩選;最後對每次循環後所得變量子集建立PLS模型併計算交互驗證均方根誤差(RMSECV),將RMSECV值最小的集閤作為最優變量子集。利用飼料蛋白固態髮酵過程近紅外光譜數據集對所提方法進行瞭驗證,併與基于PLS的矇特卡囉無信息變量消除法(MC-UVE)和競爭自適應重加權採樣(CARS)方法所得結果進行瞭比較。試驗結果顯示:建立在SCARS方法優選的21箇特徵波長變量基礎上的 PLS 模型預測效果更好,其預測均方根誤差(RMSEP)和相關繫數(Rp )分彆為0.0543和0.9908;該優選策略能有效地增彊固態髮酵光譜數據特徵波長變量選擇的準確性和穩定性,提高瞭模型的預測精度,具有一定的應用價值。
침대근홍외광보수거적내재특점,제출료일충기우은정성경쟁자괄응중가권채양(stability compet-itive adaptive reweighted sampling,SCARS)책략적근홍외특정파장우선방법。해방법이PLS모형회귀계수적은정성작위변량선택적의거,기과정포함다차순배질대,매차순배균수선계산상응변량적은정성,이후통과강제변량사선이급자괄응중가권채양기술(ARS)진행변량사선;최후대매차순배후소득변양자집건립PLS모형병계산교호험증균방근오차(RMSECV),장RMSECV치최소적집합작위최우변양자집。이용사료단백고태발효과정근홍외광보수거집대소제방법진행료험증,병여기우PLS적몽특잡라무신식변량소제법(MC-UVE)화경쟁자괄응중가권채양(CARS)방법소득결과진행료비교。시험결과현시:건립재SCARS방법우선적21개특정파장변량기출상적 PLS 모형예측효과경호,기예측균방근오차(RMSEP)화상관계수(Rp )분별위0.0543화0.9908;해우선책략능유효지증강고태발효광보수거특정파장변량선택적준학성화은정성,제고료모형적예측정도,구유일정적응용개치。
According to the characteristics of near infrared spectral(NIR)data,a new tactic called stability competitive adaptive reweighted sampling (SCARS)is employed to select characteristic wavelength variables of NIR data to build PLS model.This method is based on the stability of variables in PLS model.SCARS algorithm consists of a number of loops.In each loop,the stability of each corresponding variable is computed at first.Then enforced wavelength selection and adaptive reweighted sam-pling (ARS)is used to select important variables according to the stability of variables.The selected variables are kept as a vari-able subset and further used in the next loop.After the running of all loops,a number of subsets of variables are obtained and root mean squared error of cross validation (RMSECV)of PLS models is computed.The subset of variables with the lowest RMSECV is considered as the optimal variable subset.Validated by NIR data set of protein fodder solid-state fermentation process,the SCARS-PLS prediction model is better than PLS models based on wavelengths selected by competitive adaptive re-weighted sampling (CARS)and Monte Carlo uninformative variable elimination (MC-UVE)methods.As a result,twenty one wavelength variables are selected by SCARS method to build the PLS prediction model with the predicted root mean square error (RMSEP)valued at 0. 054 3 and correlation coefficient (Rp )0. 990 8.The results show that SCARS tactic can efficiently im-prove the accuracy and stability of NIR wavelength variables selection and optimize the precision of prediction model in solid-state fermentation process.The SCARS method has a certain application value.