心理科学进展
心理科學進展
심이과학진전
Advances In Psychological Science
2013年
10期
1874~1882
,共null页
黄杨岳 孔祥祯 甄宗雷 刘嘉
黃楊嶽 孔祥禎 甄宗雷 劉嘉
황양악 공상정 견종뢰 류가
全基因组关联研究 多重校正 FWER FDR 仿真
全基因組關聯研究 多重校正 FWER FDR 倣真
전기인조관련연구 다중교정 FWER FDR 방진
Genome-Wide Association Studies; multiple testing corrections; FWER; FDR; simulation
全基因组关联研究(Genome.WideAssociation Studies,GWAS)可以直接研究人类行为能力和基因型间的关联,为心理学研究者从全基因组层次探索人类行为能力的遗传基础提供了新的手段。GWAS中涉及大量住点和行为的关联检验,所以必须采用多重校正来控制整体虚报。尽管存在多种校正方法可供选择,但GWAS研究中不同校正方法的适用性,目前尚缺少系统研究,使得GWAS中多重校正方法的选择缺少理论和经验依据。GWAS中常用的校正方法有基于族错误率(Family—Wise Error Rate,FWER)标准的Bonferroni校正法,Holm递减调整法,排列检验法和基于错误发现率(False Discovery Rate,FDR)标准的BH法。对这4种多重校正方法的原理和流程进行了详细阐述;提出了一种GWAS数据仿真方法,并基于仿真数据对不同多重校正方法进行了定量比较。结果显示,前3种基于FWER的方法差别很小,它们对虚报的控制最为严格,但是检测出的真实关联的位点数却显著低于基于FDR的BH法。独立数据上,BH法所报告的SNPs对行为具有最高的解释率,即相对于其它方法,BH方法更好的平衡了虚报和击中。未来研究中可以考虑用BH法来对结果进行校正。
全基因組關聯研究(Genome.WideAssociation Studies,GWAS)可以直接研究人類行為能力和基因型間的關聯,為心理學研究者從全基因組層次探索人類行為能力的遺傳基礎提供瞭新的手段。GWAS中涉及大量住點和行為的關聯檢驗,所以必鬚採用多重校正來控製整體虛報。儘管存在多種校正方法可供選擇,但GWAS研究中不同校正方法的適用性,目前尚缺少繫統研究,使得GWAS中多重校正方法的選擇缺少理論和經驗依據。GWAS中常用的校正方法有基于族錯誤率(Family—Wise Error Rate,FWER)標準的Bonferroni校正法,Holm遞減調整法,排列檢驗法和基于錯誤髮現率(False Discovery Rate,FDR)標準的BH法。對這4種多重校正方法的原理和流程進行瞭詳細闡述;提齣瞭一種GWAS數據倣真方法,併基于倣真數據對不同多重校正方法進行瞭定量比較。結果顯示,前3種基于FWER的方法差彆很小,它們對虛報的控製最為嚴格,但是檢測齣的真實關聯的位點數卻顯著低于基于FDR的BH法。獨立數據上,BH法所報告的SNPs對行為具有最高的解釋率,即相對于其它方法,BH方法更好的平衡瞭虛報和擊中。未來研究中可以攷慮用BH法來對結果進行校正。
전기인조관련연구(Genome.WideAssociation Studies,GWAS)가이직접연구인류행위능력화기인형간적관련,위심이학연구자종전기인조층차탐색인류행위능력적유전기출제공료신적수단。GWAS중섭급대량주점화행위적관련검험,소이필수채용다중교정래공제정체허보。진관존재다충교정방법가공선택,단GWAS연구중불동교정방법적괄용성,목전상결소계통연구,사득GWAS중다중교정방법적선택결소이론화경험의거。GWAS중상용적교정방법유기우족착오솔(Family—Wise Error Rate,FWER)표준적Bonferroni교정법,Holm체감조정법,배렬검험법화기우착오발현솔(False Discovery Rate,FDR)표준적BH법。대저4충다중교정방법적원리화류정진행료상세천술;제출료일충GWAS수거방진방법,병기우방진수거대불동다중교정방법진행료정량비교。결과현시,전3충기우FWER적방법차별흔소,타문대허보적공제최위엄격,단시검측출적진실관련적위점수각현저저우기우FDR적BH법。독립수거상,BH법소보고적SNPs대행위구유최고적해석솔,즉상대우기타방법,BH방법경호적평형료허보화격중。미래연구중가이고필용BH법래대결과진행교정。
Genome-Wide Association Studies (GWAS) can reveal the genetic basis of the behavior. However, the association analysis embodies a massive multiple testing problem, where millions of SNPs (Single Nucleotide Polymorphisms) are tested. It is vital to reduce the risk of false positive in multiple testing with an appropriate corrections method. Firstly, Family-Wise Error Rate (FWER) and False Discovery Rate (FDR), the two standard measures of Type I errors in multiple testing were introduced. Secondly, three FWER (i.e., Bonferroni, Holm Step-Down and Permutation) and one FDR (i.e., BH) multiple testing corrections method were discussed from the concept to implementation. Finally, a method to simulate GWAS data was proposed, and the four multiple testing corrections methods were evaluated on the simulated GWAS data. Results showed that SNPs reported without multiple testing corrections had both the highest average hit and the average false alarm. FWER methods reported fewer false alarms, but their average hits were also fewer than that from uncorrected or BH method. In contrast, BH method did well in balance between the false alarm and hit. Furthermore, a comprehensive index, called explained rate, was introduced to evaluate the different methods quantitatively. Results showed BH method had the highest explained rate. In the future GWAS study, researchers would better do multiple testing corrections with BH method.