计算机科学与探索
計算機科學與探索
계산궤과학여탐색
JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY
2015年
9期
1056-1065
,共10页
周作建%林文敏%王斌斌%潘金贵
週作建%林文敏%王斌斌%潘金貴
주작건%림문민%왕빈빈%반금귀
云框架%症状自查服务%海量医疗数据%Hadoop%Lucene
雲框架%癥狀自查服務%海量醫療數據%Hadoop%Lucene
운광가%증상자사복무%해량의료수거%Hadoop%Lucene
cloud framework%symptom self-inspection services%massive medical data%Hadoop%Lucene
随着当前社会“亚健康”人群的增加,症状自查服务显得愈发重要。各地基于居民健康档案的区域卫生信息平台的建立,为症状自查服务实现提供了数据基础,但是人们仍面临着海量电子病历的获取、存储、搜索以及数据分析计算等诸多挑战。鉴于上述问题,提出了一种基于云框架的症状自查服务模型。首先建立了Hadoop集群,用来存储海量医疗数据以及建立索引,提高电子病历的搜索响应时间。其次设计了基于Lucene的分布式搜索节点集群,用来对海量的电子病历进行实时检索、数据分析和隐私过滤。此外,对症状自查服务的实现进行了讨论,包括搜索节点的选择、病历索引文件的建立、病历相似度的计算及排序方法。最后,通过实验证实了症状自查服务的云框架模型具有可扩展性和有效性。
隨著噹前社會“亞健康”人群的增加,癥狀自查服務顯得愈髮重要。各地基于居民健康檔案的區域衛生信息平檯的建立,為癥狀自查服務實現提供瞭數據基礎,但是人們仍麵臨著海量電子病歷的穫取、存儲、搜索以及數據分析計算等諸多挑戰。鑒于上述問題,提齣瞭一種基于雲框架的癥狀自查服務模型。首先建立瞭Hadoop集群,用來存儲海量醫療數據以及建立索引,提高電子病歷的搜索響應時間。其次設計瞭基于Lucene的分佈式搜索節點集群,用來對海量的電子病歷進行實時檢索、數據分析和隱私過濾。此外,對癥狀自查服務的實現進行瞭討論,包括搜索節點的選擇、病歷索引文件的建立、病歷相似度的計算及排序方法。最後,通過實驗證實瞭癥狀自查服務的雲框架模型具有可擴展性和有效性。
수착당전사회“아건강”인군적증가,증상자사복무현득유발중요。각지기우거민건강당안적구역위생신식평태적건립,위증상자사복무실현제공료수거기출,단시인문잉면림착해량전자병력적획취、존저、수색이급수거분석계산등제다도전。감우상술문제,제출료일충기우운광가적증상자사복무모형。수선건립료Hadoop집군,용래존저해량의료수거이급건립색인,제고전자병력적수색향응시간。기차설계료기우Lucene적분포식수색절점집군,용래대해량적전자병력진행실시검색、수거분석화은사과려。차외,대증상자사복무적실현진행료토론,포괄수색절점적선택、병력색인문건적건립、병력상사도적계산급배서방법。최후,통과실험증실료증상자사복무적운광가모형구유가확전성화유효성。
With the increase of the sub-health people in current society, symptom self-inspection services have become more and more important. However, the establishment of the regional health cloud platform based on the EHR (elec-tronic health record) provides the data support of the symptom self-inspection services. For example, people can find similar EHRs through this platform. As proposing the symptom self-inspection services based on the cloud frame-work, people face the challenge that a large amount of the EHRs should be acquired, stored, searched and analyzed. To solving those problems, this paper proposes the symptom self-inspection services based on the cloud framework. Firstly, this paper builds a Hadoop cluster to store and retrieve the massive medical data so that the response time of searching the EHRs can be improved. Secondly, this paper designs the distributed search node cluster based on the Lucene project to retrieve, analyze and filter the massive EHRs in real time. Moreover, this paper discusses the implementation of the symptom self-inspection services which includes the selection of searching nodes, the indexing of EHRs, the method of calculating the similarity of EHRs and the sorting algorithm. In the end, this paper conducts an experiment which proves the scalability and effectiveness of the proposed cloud framework.