商丘师范学院学报
商丘師範學院學報
상구사범학원학보
JOURNAL OF SHANGQIU TEACHERS COLLEGE
2015年
6期
63-68
,共6页
葛芳%郭有强%王磊%马程
葛芳%郭有彊%王磊%馬程
갈방%곽유강%왕뢰%마정
半监督学习%概率转移矩阵%标记传播%基因表达谱数据
半鑑督學習%概率轉移矩陣%標記傳播%基因錶達譜數據
반감독학습%개솔전이구진%표기전파%기인표체보수거
semi -supervised learning%probability transition matrix%label propagation%gene expression profile data
提出一种改进的标签传播算法,并将其应用于基因表达谱数据分析中。首先使用概率矩阵表示基因表达数据,将少量样本标记为已知,同时定义一个标记序列表示样本的类别属性;然后通过迭代公式更新标记序列,得到标记序列的收敛解,并证明了该收敛解的唯一性;最后采用正负标记的方式,根据标记序列各分量的符号差异实现数据类别的划分。经过癌症数据集实验的验证,证明了提出的方法可以快速有效地实现基因表达数据的聚类。
提齣一種改進的標籤傳播算法,併將其應用于基因錶達譜數據分析中。首先使用概率矩陣錶示基因錶達數據,將少量樣本標記為已知,同時定義一箇標記序列錶示樣本的類彆屬性;然後通過迭代公式更新標記序列,得到標記序列的收斂解,併證明瞭該收斂解的唯一性;最後採用正負標記的方式,根據標記序列各分量的符號差異實現數據類彆的劃分。經過癌癥數據集實驗的驗證,證明瞭提齣的方法可以快速有效地實現基因錶達數據的聚類。
제출일충개진적표첨전파산법,병장기응용우기인표체보수거분석중。수선사용개솔구진표시기인표체수거,장소량양본표기위이지,동시정의일개표기서렬표시양본적유별속성;연후통과질대공식경신표기서렬,득도표기서렬적수렴해,병증명료해수렴해적유일성;최후채용정부표기적방식,근거표기서렬각분량적부호차이실현수거유별적화분。경과암증수거집실험적험증,증명료제출적방법가이쾌속유효지실현기인표체수거적취류。
In this paper , an improved label propagation algorithm was proposed and introduced into the analysis of gene expression profiles .First, the probability transition matrix was constructed with gene expression profiles . Meanwhile , the label sequence which indicates the class information was defined and several samples were marked as labeled data .Then, the label sequence was updated by an iterative formula and the convergence solution of the label sequence was obtained , which was proved to be the unique solution .Finally , the clustering problem was solved by using plus -minus label which was on the basis of the signs of the label sequence .Experiments on the cancer data demonstrate our method is feasible and effective .