基因组学与应用生物学
Genomics and Applied Biology
2011
Issue 1
pp. 103-109
7 pages in total
阎毛毛%戴晓港%李淑娴%尹佟明
微卫星%表达序列标签%重复单元%基因组进化
Microsatellites%Expressed sequence tag (EST)%Repeat motif%Genome evolution
微卫星是生物基因组中变异频率最快的序列,结构基因中微卫星重复数的变化会引起基因的框移突变,导致基因表达完全不同或截短的蛋白。因此在进化过程中,基因区微卫星会受到强烈选择的影响。为研究基因区微卫星在不同树种中的变化情况,在本研究中,利用SPUTNIK程序分析了NCBI数据库中松树(Pinus spp.)、杨树(Populus spp.)及桉树(Eucalyptus spp.)的表达序列标签(express sequence tag,EST)序列各3万条。结果显示,桉树和杨树EST序列含有微卫星的比例比较接近,分别为18.7%和15.3%,而在松树中则发生了较大分化,只有8.2%。研究发现,三碱基重复单元是这3个树种编码序列中微卫星的主要重复类型。除三碱基重复微卫星外,桉树和杨树EST序列中其它类型微卫星的丰度随着重复单元长度的增加而减少,而在松树中则呈相反现象。同时值得注意的是松树EST序列中变异频率快的微卫星(〉20bp)数量明显比桉树及杨树少。研究还发现,3个树种中微卫星获得或丢失重复单元的速率都随着重复单元的增加而降低。本研究首次报道了不同树种基因区微卫星比较研究,发现了一些松树与杨树、桉树相比较EST序列中所含微卫星在丰度及变异频率方面存在的异同。基因所含微卫星序列对基因的功能有重要影响,本研究的结果将为了解不同树种中基因区微卫星的特征提供重要参数,同时也将为利用所研究树种的EST序列开发多态性高的微卫星标记提供有益的生物信息学参考。
Microsatellites (simple sequence repeats, SSRs) are the most variable sequences in the genome. In structural genes, a change in the number of repeat units can cause a frameshift mutation, leading to the expression of a completely different or truncated protein. Over evolutionary time, microsatellites in genic regions have therefore been subject to strong selection. To explore how genic SSRs vary among tree species, this study used the SPUTNIK program to analyze 30,000 expressed sequence tags (ESTs) from the NCBI database for each of pine (Pinus spp.), poplar (Populus spp.), and eucalyptus (Eucalyptus spp.). The percentage of ESTs containing SSRs was similar in eucalyptus and poplar, at 18.71% and 15.33% respectively, but markedly lower in pine, at only 8.22%. In all three tree species, triplet repeats were the dominant microsatellite type in the EST sequences examined. Apart from triplet SSRs, the abundance of the other SSR types decreased with increasing repeat motif length in eucalyptus and poplar, whereas the opposite trend was observed in pine. Notably, fast-mutating microsatellites (>20 bp) were clearly less numerous in pine ESTs than in those of eucalyptus and poplar. In all three species, the rate at which microsatellites gained or lost repeat units decreased as repeat motif length increased. This is the first reported comparison of genic SSRs across tree species, and it reveals similarities and differences between pine and both poplar and eucalyptus in the abundance and variability of EST microsatellites. Because genic SSRs strongly affect gene function, the results provide important parameters for characterizing genic SSRs in different tree species, as well as useful bioinformatics guidance for developing highly polymorphic EST-SSR markers in the species studied.
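The headline statistic in the abstract, the share of ESTs that contain at least one microsatellite, can be illustrated with a minimal sketch. This is not the SPUTNIK algorithm, whose scoring rules are not described in the abstract; it is a simple regular-expression search for perfect repeats. The function names and the minimum repeat counts in MIN_REPEATS are assumptions chosen for illustration only.

```python
import re

# Hypothetical thresholds (not taken from the study): the minimum number
# of perfect tandem repeats required for each motif length of 2 to 5 bp.
MIN_REPEATS = {2: 6, 3: 4, 4: 3, 5: 3}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each perfect SSR in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-bp motif followed by at least n - 1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AAA",
            # "ATAT"), so each locus is counted under its shortest motif.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits


def fraction_with_ssr(seqs):
    """Fraction of sequences that contain at least one SSR."""
    seqs = list(seqs)
    if not seqs:
        return 0.0
    return sum(1 for s in seqs if find_ssrs(s)) / len(seqs)
```

Applied to the EST sequences of one species, fraction_with_ssr would give a figure comparable in kind to the percentages reported above, although the exact values depend on the thresholds and on how imperfect repeats are treated.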