生物信息学
BIOINFORMATICS
2010
Issue 1
23-29
7 pages in total
吴飞珍%马文丽%王旺迪%陈启龙%郑文岭
基因本体%基因注释%语义相似度
gene ontology%gene annotation%semantic similarity
基因本体(GO)数据库为基因提供了统一的注释,有效地解决了不同数据库描述相同基因的不一致问题.但是,根据基因注释如何比较基因的功能相似性,这个问题仍然没有得到有效解决.本文提出一种新的基因注释语义相似度计算方法,这种方法在本质上是基于基因的生物学特性,其特点在于结点的语义相似度与结点所在集合无关,只与结点在GO图的位置有关,语义相似度可被重复利用.它既考虑了基因所映射的GO结点深度,又考虑了两GO结点之间所有路径对结点语义相似度的影响.文中以酵母菌的异亮氨酸降解代谢通路和谷氨酸合成代谢通路为实验,实验结果表明这种算法能准确地计算基因注释语义相似度.
The Gene Ontology (GO) provides consistent gene annotation, which effectively addresses the problem of the same gene being described with different vocabularies in heterogeneous data sources. However, there is still no effective method for determining the functional similarity of genes from their annotations. This paper proposes a new method for measuring the semantic similarity of gene annotations. The approach is, in essence, based on the biological properties of genes. The semantic similarity of two GO terms computed by the method depends only on the positions of those terms in the GO graph, not on the set the terms belong to, so computed similarities can be reused. The method considers both the depth of the GO terms and the contribution of every path between the two terms. It was applied to the isoleucine degradation pathway and the glutamate biosynthesis superpathway in Saccharomyces cerevisiae. The results indicate that the method measures the semantic similarity of gene annotations accurately.