计算机应用研究
計算機應用研究
계산궤응용연구
APPLICATION RESEARCH OF COMPUTERS
2015年
9期
2634-2638
,共5页
H1N1 病毒%HA蛋白序列%模糊邻近关系%结构聚类%进化树
H1N1 病毒%HA蛋白序列%模糊鄰近關繫%結構聚類%進化樹
H1N1 병독%HA단백서렬%모호린근관계%결구취류%진화수
H1N1 virus%HA protein sequences%fuzzy proximity relations%structural clustering%evolutionary tree
为了从本质上揭示H1N1病毒分子的变异、流感流行等关系,提出一种构建H1N1病毒进化树新方法。在1902—2013年全球22455条H1N1型禽流感病毒HA蛋白质序列数据的基础上,利用其特征向量构建基于内积的HA蛋白质序列相似度。采用基于相似度的完全聚类图的方法进行数据系统粗粒化的相似信息提取。最后,利用基于模糊邻近关系的结构聚类方法进行H1N1型禽流感病毒HA蛋白质序列的进化树研究,将病毒分为33大类。进一步分析表明,H1N1病毒的变异不仅与爆发时间密切相关,还与所分布地域及地域间的距离有很大关系,即分布地域间的距离越近,爆发的病毒进化的相似程度越高。对大量的病毒进行进化树分析,从宏观角度体现了各类病毒之间的进化关系。
為瞭從本質上揭示H1N1病毒分子的變異、流感流行等關繫,提齣一種構建H1N1病毒進化樹新方法。在1902—2013年全毬22455條H1N1型禽流感病毒HA蛋白質序列數據的基礎上,利用其特徵嚮量構建基于內積的HA蛋白質序列相似度。採用基于相似度的完全聚類圖的方法進行數據繫統粗粒化的相似信息提取。最後,利用基于模糊鄰近關繫的結構聚類方法進行H1N1型禽流感病毒HA蛋白質序列的進化樹研究,將病毒分為33大類。進一步分析錶明,H1N1病毒的變異不僅與爆髮時間密切相關,還與所分佈地域及地域間的距離有很大關繫,即分佈地域間的距離越近,爆髮的病毒進化的相似程度越高。對大量的病毒進行進化樹分析,從宏觀角度體現瞭各類病毒之間的進化關繫。
위료종본질상게시H1N1병독분자적변이、류감류행등관계,제출일충구건H1N1병독진화수신방법。재1902—2013년전구22455조H1N1형금류감병독HA단백질서렬수거적기출상,이용기특정향량구건기우내적적HA단백질서렬상사도。채용기우상사도적완전취류도적방법진행수거계통조립화적상사신식제취。최후,이용기우모호린근관계적결구취류방법진행H1N1형금류감병독HA단백질서렬적진화수연구,장병독분위33대류。진일보분석표명,H1N1병독적변이불부여폭발시간밀절상관,환여소분포지역급지역간적거리유흔대관계,즉분포지역간적거리월근,폭발적병독진화적상사정도월고。대대량적병독진행진화수분석,종굉관각도체현료각류병독지간적진화관계。
This paper proposed a new method for constructing evolutionary tree of H1N1 flu virus in order to reveal the rela-tionship between the molecular variation of H1N1 and epidemics.First,according to the 22455 HA protein sequence data of H1N1 flu virus in 1902—2013,it constructed the similarity of HA protein sequences by using the eigenvectors based on inner product.Then,it extracted coarse grained similar information of data systematically by introducing complete graph clustering based on similarity.Finally,it studied the evolutionary tree for HA protein sequences of H1N1 flu virus by using the structure clustering method based on fuzzy proximity relations,and gained 33 categories of the virus.Further analysis shows that the mu-tation of the H1N1 virus not only is closely to outbreak time,but also relates to region and the distance between the distribution regionother.That is the closer the distance between the distribution region,the outbreak of the virus in the evolution have higher similarity degree.It provides a new method to analyze the evolutionary tree of large amounts of virus,and to show the evolution-ary relationships between all kinds of virus from the macro perspective.