军事医学
BULLETIN OF THE ACADEMY OF MILITARY MEDICAL SCIENCES
2014
Issue 5
pp. 377-380
4 pages in total
下一代测序%高通量测序%数据处理%质量控制
next-generation sequencing%high-throughput sequencing%data processing%quality control
目的:对下一代测序数据质量控制的几个主要问题进行分析,设计数据清理和质量控制软件,为下游的数据分析提供保障。方法:基于Bioconductor软件包,开发了一个数据清理软件(Fastq_clean)。结果:该软件在一组已发表的菊花转录组数据集上测试,得到了清理后的测序数据,同时输出了质量控制信息。结论:该软件在充分利用Illumina序列特征的基础上,可以非常精确地去除低质量残基、接头、rRNA和病毒等污染,最大限度保留了高质量数据。
Objective: To analyze several major problems in the quality control of next-generation sequencing data and to design data-cleaning and quality-control software that provides high-quality data for downstream analysis. Methods: A data-cleaning program, Fastq_clean, was developed on the basis of the Bioconductor package. Results: The software was tested on a published Chrysanthemum transcriptome dataset; it produced cleaned sequencing data and reported quality-control information. Conclusion: By making full use of the characteristics of Illumina sequences, Fastq_clean can accurately remove low-quality nucleotides and Ns, adapter contamination, and possible rRNA and virus contamination, while retaining as much high-quality data as possible.
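The read-level cleaning steps named in the abstract (adapter removal, low-quality trimming, and filtering of reads containing Ns) can be illustrated with a minimal Python sketch. This is not the Fastq_clean implementation: the function names, the thresholds, the Phred+33 quality encoding, and the example adapter prefix are all assumptions chosen for illustration. Screening for rRNA and virus contamination requires alignment against reference databases and is left out.

```python
from typing import Optional, Tuple

PHRED_OFFSET = 33  # assumption: Phred+33 (Sanger / Illumina 1.8+) quality encoding


def trim_low_quality_3prime(seq: str, qual: str, min_q: int = 20) -> Tuple[str, str]:
    """Drop trailing bases whose Phred score is below min_q."""
    end = len(seq)
    while end > 0 and ord(qual[end - 1]) - PHRED_OFFSET < min_q:
        end -= 1
    return seq[:end], qual[:end]


def clip_adapter(seq: str, qual: str, adapter: str,
                 min_overlap: int = 5) -> Tuple[str, str]:
    """Cut the read at an exact adapter match.

    A full-length match is searched anywhere in the read; otherwise a
    prefix of the adapter (at least min_overlap bases) is searched at
    the 3' end. Mismatches are not tolerated in this simplified version.
    """
    idx = seq.find(adapter)
    if idx == -1:
        longest = min(len(adapter) - 1, len(seq))
        for k in range(longest, min_overlap - 1, -1):
            if seq.endswith(adapter[:k]):
                idx = len(seq) - k
                break
    if idx != -1:
        return seq[:idx], qual[:idx]
    return seq, qual


def clean_read(seq: str, qual: str, adapter: str, min_q: int = 20,
               min_len: int = 30, max_n: int = 0) -> Optional[Tuple[str, str]]:
    """Clip the adapter, trim the 3' end, then discard short or N-rich reads."""
    seq, qual = clip_adapter(seq, qual, adapter)
    seq, qual = trim_low_quality_3prime(seq, qual, min_q)
    if len(seq) < min_len or seq.count("N") > max_n:
        return None  # read rejected
    return seq, qual


if __name__ == "__main__":
    adapter = "AGATCGGAAGAGC"  # example adapter prefix, assumed for illustration
    # 'I' encodes Q40 and '#' encodes Q2 under Phred+33
    print(clean_read("ACGTACGTAGATCGG", "IIIIII##IIIIIII", adapter, min_len=5))
```

Clipping the adapter before quality trimming keeps the adapter match intact even when the read end is of low quality; a production tool would also allow mismatches in the adapter alignment and handle paired-end reads.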