国际医学寄生虫病杂志
國際醫學寄生蟲病雜誌
국제의학기생충병잡지
INTERNATIONAL JOURNAL OF MEDICAL PARASITIC DISEASES
2014年
6期
328-333
,共6页
张颋%陈英%贾利芳%沈海默%胡薇%刘杰%奥·乌力吉
張颋%陳英%賈利芳%瀋海默%鬍薇%劉傑%奧·烏力吉
장정%진영%가리방%침해묵%호미%류걸%오·오력길
多房棘球绦虫%分泌蛋白组%信号肽%生物信息学
多房棘毬縚蟲%分泌蛋白組%信號肽%生物信息學
다방극구조충%분비단백조%신호태%생물신식학
Echinococcus multilocularis%Secretome%Signal peptide%Bioinformatics
目的 预测多房棘球绦虫的分泌蛋白组,分析分泌蛋白及其信号肽特征,为发展多房棘球蚴病的诊断和治疗方法奠定基础. 方法 采用SignalP4.1对多房棘球绦虫全基因组蛋白序列信号肽进行预测,并依次采用生物信息学软件TMHMM v2.0、Phobius、Big-PIpredictor及TargetP1.1筛选去除假阳性的预测蛋白.随后采用Excel统计信号肽和分泌蛋白的氨基酸序列特征,SPSS19.0对分泌蛋白及非分泌蛋白的氨基酸含量进行科尔莫哥罗夫-斯米尔诺夫检验(kolmogorov-smirnov test,K-S test)和t检验.最后利用京都基因和基因组百科全书自动注释工具(KEGG Automatic Annotation Sever,KAAS)进行分泌蛋白的注释和分析. 结果 在多房棘球绦虫10~780条全基因组序列所编码的蛋白中,共发现875条含有信号肽的蛋白序列,其中307条属于膜结合蛋白;38条含糖基磷脂酰肌醇(glycosylphosphatidylinositol,GPI)锚定位点;12条定位于线粒体.最终获得了含518条序列的多房棘球绦虫分泌蛋白组.SPSS统计分泌蛋白的信号肽长度集中于11~53个氨基酸,其中疏水性氨基酸含量占61%;且分泌蛋白的氨基酸含量(38~7 809个)显著小于(t=0.203,P<0.01)非分泌蛋白氨基酸的含量(11~11 194个);KAAS分析显示获得注释的分泌蛋白主要出现在人类疾病、新陈代谢过程、环境信息处理、有机系统、细胞过程和遗传信息处理通路上,其中有6条序列已知和寄生虫感染直接相关. 结论 本研究获得了518条蛋白序列多房棘球绦虫分泌蛋白组,为后续的诊断、疫苗和药物靶点等相关研究提供了生物信息学数据.
目的 預測多房棘毬縚蟲的分泌蛋白組,分析分泌蛋白及其信號肽特徵,為髮展多房棘毬蚴病的診斷和治療方法奠定基礎. 方法 採用SignalP4.1對多房棘毬縚蟲全基因組蛋白序列信號肽進行預測,併依次採用生物信息學軟件TMHMM v2.0、Phobius、Big-PIpredictor及TargetP1.1篩選去除假暘性的預測蛋白.隨後採用Excel統計信號肽和分泌蛋白的氨基痠序列特徵,SPSS19.0對分泌蛋白及非分泌蛋白的氨基痠含量進行科爾莫哥囉伕-斯米爾諾伕檢驗(kolmogorov-smirnov test,K-S test)和t檢驗.最後利用京都基因和基因組百科全書自動註釋工具(KEGG Automatic Annotation Sever,KAAS)進行分泌蛋白的註釋和分析. 結果 在多房棘毬縚蟲10~780條全基因組序列所編碼的蛋白中,共髮現875條含有信號肽的蛋白序列,其中307條屬于膜結閤蛋白;38條含糖基燐脂酰肌醇(glycosylphosphatidylinositol,GPI)錨定位點;12條定位于線粒體.最終穫得瞭含518條序列的多房棘毬縚蟲分泌蛋白組.SPSS統計分泌蛋白的信號肽長度集中于11~53箇氨基痠,其中疏水性氨基痠含量佔61%;且分泌蛋白的氨基痠含量(38~7 809箇)顯著小于(t=0.203,P<0.01)非分泌蛋白氨基痠的含量(11~11 194箇);KAAS分析顯示穫得註釋的分泌蛋白主要齣現在人類疾病、新陳代謝過程、環境信息處理、有機繫統、細胞過程和遺傳信息處理通路上,其中有6條序列已知和寄生蟲感染直接相關. 結論 本研究穫得瞭518條蛋白序列多房棘毬縚蟲分泌蛋白組,為後續的診斷、疫苗和藥物靶點等相關研究提供瞭生物信息學數據.
목적 예측다방극구조충적분비단백조,분석분비단백급기신호태특정,위발전다방극구유병적진단화치료방법전정기출. 방법 채용SignalP4.1대다방극구조충전기인조단백서렬신호태진행예측,병의차채용생물신식학연건TMHMM v2.0、Phobius、Big-PIpredictor급TargetP1.1사선거제가양성적예측단백.수후채용Excel통계신호태화분비단백적안기산서렬특정,SPSS19.0대분비단백급비분비단백적안기산함량진행과이막가라부-사미이낙부검험(kolmogorov-smirnov test,K-S test)화t검험.최후이용경도기인화기인조백과전서자동주석공구(KEGG Automatic Annotation Sever,KAAS)진행분비단백적주석화분석. 결과 재다방극구조충10~780조전기인조서렬소편마적단백중,공발현875조함유신호태적단백서렬,기중307조속우막결합단백;38조함당기린지선기순(glycosylphosphatidylinositol,GPI)묘정위점;12조정위우선립체.최종획득료함518조서렬적다방극구조충분비단백조.SPSS통계분비단백적신호태장도집중우11~53개안기산,기중소수성안기산함량점61%;차분비단백적안기산함량(38~7 809개)현저소우(t=0.203,P<0.01)비분비단백안기산적함량(11~11 194개);KAAS분석현시획득주석적분비단백주요출현재인류질병、신진대사과정、배경신식처리、유궤계통、세포과정화유전신식처리통로상,기중유6조서렬이지화기생충감염직접상관. 결론 본연구획득료518조단백서렬다방극구조충분비단백조,위후속적진단、역묘화약물파점등상관연구제공료생물신식학수거.
Objective To predict the secretome and analyze the secreted proteins and signal peptides of Echinococcus multilocularis using genome-wide bioinformatics approaches thus to provide a platform for finding biomarkers for development of diagnosis and drugs.Methods Signal peptides of E.multilocularis were identified from the whole genome sequence using SignalP4.1 program,and the proteins containing signal sequences were analyzed with TMHMM v2.0,Phobius,Big-PI predictor and TargetPl.1 in a stepwise way to minimize the false-positive prediction.Subsequently,the sequence features of both the signal peptides and the secreted proteins were statistically analyzed by SPSS19.0 and Excel.Differences in the numbers of amino acid of each secreted and the non-secreted sequences were determined through kolmogorov-smirnov test (K-S test)and t test.Finally,the KAAS (KEGG Automatic Annotation Sever) functional annotation and clustering were performed for the secretome sequences.Results A total of 875 proteins encoding sequences containing signal peptides was found in 10 780 E.multilocularis genome sequences.Among them,307 sequences are membrane-binding proteins,38 proteins contain GPI anchor site and 12 proteins are located in mitochondria.Finally,a total of 518 proteins was recognized as secreted proteins.The signal sequences contain mostly 11-53amino acids,among them 61% are hydrophobic.The secreted proteins identified possess 38-7 809 amino acids,which is significantly less than the amount possessed by non-secreted proteins(11-11 194 amino acids) (t=0.203,P<0.01).KAAS analysis showed that these secreted proteins are mainly involved in human disease,metabolism,environmental information processing,organismal systems,cellular process and genetic information processing.Among them,6 sequences are related to parasitic infection.Conclusion The secretome of E.multilocularis containing 518 secreted protein sequences was predicted and analyzed thus to provide a database for further identification of diagnostics,vaccine and drug targets.