中华流行病学杂志
中華流行病學雜誌
중화류행병학잡지
Chinese Journal of Epidemiology
2015年
10期
1047-1052
,共6页
朱猛%程阳%戴俊程%谢兰%靳光付%马红霞%胡志斌%师咏勇%林东昕
硃猛%程暘%戴俊程%謝蘭%靳光付%馬紅霞%鬍誌斌%師詠勇%林東昕
주맹%정양%대준정%사란%근광부%마홍하%호지빈%사영용%림동흔
肺癌%全基因组关联研究%风险预测模型
肺癌%全基因組關聯研究%風險預測模型
폐암%전기인조관련연구%풍험예측모형
Lung cancer%Genome-wide association study%Risk prediction model
目的 联合使用遗传因素和吸烟信息构建中国汉族人群的肺癌风险预测模型.方法 基于中国汉族人群全基因组关联研究(GWAS)数据,根据样本地区来源将样本分为训练集(南京与上海:1 473名病例vs.1 962名对照)和测试集(北京与武汉:858名病例vs.1 115名对照).系统整理已报道肺癌易感位点,在训练集中用逐步后退法筛选具有独立效应的位点,并通过加权法估算个体遗传得分用于建模.在训练集中分别构建基于吸烟信息、遗传得分和联合使用吸烟与遗传信息的3种风险预测模型(吸烟模型、遗传效应模型和联合模型),并根据受试者工作特征(ROC)曲线、曲线下面积(AUC)、净分类指数(NRI)和整体鉴别指数(IDI)评价模型对肺癌风险预测的效能.对于构建的模型,进一步在测试集中进行验证.结果 在训练集中,联合模型、吸烟模型和遗传效应模型AUC分别为0.69(0.67 ~ 0.71)、0.65(0.63 ~ 0.66)和0.60(0.59 ~ 0.62).在训练集和测试集中联合模型的风险预测效能高于吸烟模型或遗传模型,差异有统计学意义(P<0.001).重分类结果显示,联合模型与吸烟模型相比,在训练集中NRI增加4.57% (2.23% ~6.91%),IDI增加3.11%(2.52% ~ 3.69%).在测试集中,NRI和IDI分别增加2.77%和3.16%.结论 遗传得分可以显著提高肺癌传统风险模型的预测效能.联合使用遗传因素和吸烟信息构建的中国汉族人群肺癌风险预测模型可用于筛选中国汉族人群中肺癌发病的高危人群.
目的 聯閤使用遺傳因素和吸煙信息構建中國漢族人群的肺癌風險預測模型.方法 基于中國漢族人群全基因組關聯研究(GWAS)數據,根據樣本地區來源將樣本分為訓練集(南京與上海:1 473名病例vs.1 962名對照)和測試集(北京與武漢:858名病例vs.1 115名對照).繫統整理已報道肺癌易感位點,在訓練集中用逐步後退法篩選具有獨立效應的位點,併通過加權法估算箇體遺傳得分用于建模.在訓練集中分彆構建基于吸煙信息、遺傳得分和聯閤使用吸煙與遺傳信息的3種風險預測模型(吸煙模型、遺傳效應模型和聯閤模型),併根據受試者工作特徵(ROC)麯線、麯線下麵積(AUC)、淨分類指數(NRI)和整體鑒彆指數(IDI)評價模型對肺癌風險預測的效能.對于構建的模型,進一步在測試集中進行驗證.結果 在訓練集中,聯閤模型、吸煙模型和遺傳效應模型AUC分彆為0.69(0.67 ~ 0.71)、0.65(0.63 ~ 0.66)和0.60(0.59 ~ 0.62).在訓練集和測試集中聯閤模型的風險預測效能高于吸煙模型或遺傳模型,差異有統計學意義(P<0.001).重分類結果顯示,聯閤模型與吸煙模型相比,在訓練集中NRI增加4.57% (2.23% ~6.91%),IDI增加3.11%(2.52% ~ 3.69%).在測試集中,NRI和IDI分彆增加2.77%和3.16%.結論 遺傳得分可以顯著提高肺癌傳統風險模型的預測效能.聯閤使用遺傳因素和吸煙信息構建的中國漢族人群肺癌風險預測模型可用于篩選中國漢族人群中肺癌髮病的高危人群.
목적 연합사용유전인소화흡연신식구건중국한족인군적폐암풍험예측모형.방법 기우중국한족인군전기인조관련연구(GWAS)수거,근거양본지구래원장양본분위훈련집(남경여상해:1 473명병례vs.1 962명대조)화측시집(북경여무한:858명병례vs.1 115명대조).계통정리이보도폐암역감위점,재훈련집중용축보후퇴법사선구유독립효응적위점,병통과가권법고산개체유전득분용우건모.재훈련집중분별구건기우흡연신식、유전득분화연합사용흡연여유전신식적3충풍험예측모형(흡연모형、유전효응모형화연합모형),병근거수시자공작특정(ROC)곡선、곡선하면적(AUC)、정분류지수(NRI)화정체감별지수(IDI)평개모형대폐암풍험예측적효능.대우구건적모형,진일보재측시집중진행험증.결과 재훈련집중,연합모형、흡연모형화유전효응모형AUC분별위0.69(0.67 ~ 0.71)、0.65(0.63 ~ 0.66)화0.60(0.59 ~ 0.62).재훈련집화측시집중연합모형적풍험예측효능고우흡연모형혹유전모형,차이유통계학의의(P<0.001).중분류결과현시,연합모형여흡연모형상비,재훈련집중NRI증가4.57% (2.23% ~6.91%),IDI증가3.11%(2.52% ~ 3.69%).재측시집중,NRI화IDI분별증가2.77%화3.16%.결론 유전득분가이현저제고폐암전통풍험모형적예측효능.연합사용유전인소화흡연신식구건적중국한족인군폐암풍험예측모형가용우사선중국한족인군중폐암발병적고위인군.
Objective To evaluate the predictive power of risk model by combining traditional epidemiological factors and genetic factors.Methods Our previous GWAS data of lung cancer in Chinese were used in training set (Nanjing and Shanghai:1 473 cases vs.1 962 control) and testing set (Beijing and Wuhan:858 cases vs.1 115 control).All the single nucleotide polymorphisms (SNPs) associated with lung cancer risk were systematically selected and stepwise logistic regression analysis was used to select independent factors in the training set.The wGRS (weighted genetic score) was further used to calculate genetic risk score.To evaluate the contribution of the genetic factors,3 risk models were established by using the training set,i.e.smoking model (based on smoking status),genetic risk model (based on genetic risk score) and combined model (based on smoke and genetic risk score).The predictability of the models were evaluated by the areas under the receiver operating characteristic (ROC) curves,area under curve (AUC),net reclassification improvement (NRI) and integrated discrimination index (IDI).Besides,the results were further verified in the testing set.Results In the training set,it was found that the AUC of the smoking,genetic risk and combined models were 0.65 (0.63-0.66),0.60 (0.59-0.62) and 0.69 (0.67-0.71),respectively.Compared with combined model,the predictive power of other two models significantly declined,the difference was statistically significant (P<0.001).Furthermore,compared with the smoking model,the NRI of the combined model increased by 4.57% (2.23%-6.91%) and IDI increased by 3.11% (2.52%-3.69%) in the training set,the difference was statistically significant (P< 0.001).Similarly,in the testing set NRI increased by 2.77%,the difference was not statistically significant (P=0.069),and IDI increased by 3.16%,the difference was statistically significant (P<0.001).Conclusion This study showed that combining 14 genetic variants with traditional epidemiological factors could improve the predictive power of risk model for lung cancer.The model could be used in the screening of high-risk population of lung cancer in Chinese and provide evidence for the early diagnosis and treatment of lung cancer.