科研信息化技术与应用
E-science Technology & Application
2013
Issue 1
pp. 67-73
7 pages in total
Geocoding; Address matching; Address standardization; Address similarity algorithm; Spatial scene similarity assessment
Geocoding converts textual descriptions of locations into longitude and latitude coordinates, providing data support for e-Science research that depends on geographic location information. During geocoding, address descriptions often contain incorrect or incomplete content, inaccuracies, misspellings, and homophone (phonetic) errors, any of which can prevent accurate address matching. To address these problems, this paper proposes an address standardization method: an address similarity algorithm matches each address to be standardized against the records of a standard gazetteer, and the candidate matches are then evaluated by spatial scene similarity, which improves the accuracy of address matching. Finally, the method is applied to public health data, and the results indicate that it is feasible and accurate.
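The two-stage idea in the abstract (name similarity first, then a spatial check on the candidates) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the abstract does not specify the similarity algorithm or how spatial scene similarity is computed, so the sketch substitutes `difflib.SequenceMatcher` for the former and a simple distance-to-context score for the latter. The gazetteer entries, coordinates, threshold, and weight are hypothetical.

```python
from difflib import SequenceMatcher
from math import hypot

# Hypothetical standard gazetteer: standard name -> (longitude, latitude).
GAZETTEER = {
    "Zhongguancun South Street": (116.32, 39.96),
    "Zhongguancun East Road": (116.33, 39.98),
    "Zhongshan South Road": (118.78, 32.03),
}


def name_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in, not the paper's algorithm."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def candidates(raw: str, gazetteer: dict, top_k: int = 3, threshold: float = 0.6):
    """Return up to top_k (similarity, name) pairs at or above the threshold."""
    scored = [(name_similarity(raw, name), name) for name in gazetteer]
    kept = [pair for pair in scored if pair[0] >= threshold]
    return sorted(kept, reverse=True)[:top_k]


def spatial_score(coord, context_coord, scale: float = 1.0) -> float:
    """Crude spatial plausibility: 1 at zero distance, decaying with distance (degrees)."""
    d = hypot(coord[0] - context_coord[0], coord[1] - context_coord[1])
    return 1.0 / (1.0 + d / scale)


def standardize(raw: str, context_coord, gazetteer: dict = GAZETTEER, weight: float = 0.5):
    """Pick the candidate with the best blend of name and spatial scores, or None."""
    best = None
    for sim, name in candidates(raw, gazetteer):
        score = (1 - weight) * sim + weight * spatial_score(gazetteer[name], context_coord)
        if best is None or score > best[0]:
            best = (score, name, gazetteer[name])
    return best


# A misspelled address plus a rough context location (e.g. a nearby known record).
result = standardize("Zhonguancun Sout Street", (116.30, 39.90))
```

In this sketch the context coordinate plays the role of the surrounding "scene": a candidate whose name matches well but which lies far from the other known locations is penalized. A real system would likely replace both scoring functions with domain-specific ones.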