分子植物育种
分子植物育種
분자식물육충
MOLECULAR PLANT BREEDING
2013年
3期
385-392
,共8页
王晓锋%何卫龙%蔡卫佳%阮倩倩%潘婷%季孔庶
王曉鋒%何衛龍%蔡衛佳%阮倩倩%潘婷%季孔庶
왕효봉%하위룡%채위가%원천천%반정%계공서
马尾松%转录组测序%Illumina高通量测序技术%SSR
馬尾鬆%轉錄組測序%Illumina高通量測序技術%SSR
마미송%전록조측서%Illumina고통량측서기술%SSR
Pinus massoniana%Transcriptome sequence%Illumina%SSR
本研究首次构建了马尾松均一化cDNA文库,采用Illumina高通量测序技术对转录组进行了测序,利用生物信息学方法开展基因表达谱的研究、功能基因的预测。 EST序列拼接获得83680个contig,其中33772个contig被注释为相应的331669对生物学功能,10647个contig被注释具有酶功能。根据KEGG pathway数据库,对马尾松转录组的contig进行Pathway生物学通路的注释和预测,共识别出10647个contig具有对应的1029种酶功能,并关联到135条生物学通路。 SSR查找发现,从83680个contig中找到889个SSR位点,占contig总数的比例为1.06%。其中,三核苷酸重复所占比例最高,达到48.37%,其次是六核苷酸重复,为19.12%,比例最低的是四核苷酸重复,仅为4.72%,二核苷酸重复和五核苷酸重复基本相同,分别为14.62%和13.16%。 SSR不同重复基元类型中,出现频率最高的为AT/AT,其次是AGC/CTG和AAG/CTT。
本研究首次構建瞭馬尾鬆均一化cDNA文庫,採用Illumina高通量測序技術對轉錄組進行瞭測序,利用生物信息學方法開展基因錶達譜的研究、功能基因的預測。 EST序列拼接穫得83680箇contig,其中33772箇contig被註釋為相應的331669對生物學功能,10647箇contig被註釋具有酶功能。根據KEGG pathway數據庫,對馬尾鬆轉錄組的contig進行Pathway生物學通路的註釋和預測,共識彆齣10647箇contig具有對應的1029種酶功能,併關聯到135條生物學通路。 SSR查找髮現,從83680箇contig中找到889箇SSR位點,佔contig總數的比例為1.06%。其中,三覈苷痠重複所佔比例最高,達到48.37%,其次是六覈苷痠重複,為19.12%,比例最低的是四覈苷痠重複,僅為4.72%,二覈苷痠重複和五覈苷痠重複基本相同,分彆為14.62%和13.16%。 SSR不同重複基元類型中,齣現頻率最高的為AT/AT,其次是AGC/CTG和AAG/CTT。
본연구수차구건료마미송균일화cDNA문고,채용Illumina고통량측서기술대전록조진행료측서,이용생물신식학방법개전기인표체보적연구、공능기인적예측。 EST서렬병접획득83680개contig,기중33772개contig피주석위상응적331669대생물학공능,10647개contig피주석구유매공능。근거KEGG pathway수거고,대마미송전록조적contig진행Pathway생물학통로적주석화예측,공식별출10647개contig구유대응적1029충매공능,병관련도135조생물학통로。 SSR사조발현,종83680개contig중조도889개SSR위점,점contig총수적비례위1.06%。기중,삼핵감산중복소점비례최고,체도48.37%,기차시륙핵감산중복,위19.12%,비례최저적시사핵감산중복,부위4.72%,이핵감산중복화오핵감산중복기본상동,분별위14.62%화13.16%。 SSR불동중복기원류형중,출현빈솔최고적위AT/AT,기차시AGC/CTG화AAG/CTT。
The transcriptome of the shoots of a seven-year-old Pinus massoniana was sequenced by Illumina that is a new generation of high-throughput sequencing technology to study the expression profiling and predict the functional gene. 83 680 contigs were obtained through sequence assembly, for which 33 772 contigs were annotated for 331 669 pairs in biological functions, 10 647 contigs were annotated for enzyme function. A total of 10 647 contigs were identified to correspond with 1 029 enzyme functions and associated with 135 biological pathways by annotating and forecasting the biological pathways for the transcriptome of P. massoniana. There were 889 SSR in 83 680 contigs were found, which accounting for 1.06% proportion of the total number of contigs. The characteristic of EST-SSR distribution showed that tri-nucleotide repeat was the highest reaching 48.37%, following by hexa-nucleotide repeat, which was 19.12%, and the least was tetra-nucleotide repeat which was only 4.72%, the proportion of dinucleotide repeat was as the same as penta-nucleotide repeat, they were 14.62% and 13.16%, respectively. The types of EST-SSR were analyzed that AT/AT was the highest repeat, following by AGC/CTG and AAG/CTT.