地球信息科学学报
GEO-INFORMATION SCIENCE
2015
Issue 2
pp. 127-134
8 pages in total
网络文本%地理信息%自然语言处理%信息抽取%地理定位
web text%geographic information%natural language processing%information extraction%geographical location
互联网的普及产生了大量蕴含着丰富地理语义的文本,为地理信息的深度挖掘和知识发现带来了巨大机遇。同时,蕴含地理语义文本的异构性和动态性,使得地理实体的属性数量和种类激增、地理语义关系复杂,对地理信息检索、空间分析和推理、智能化位置服务等提出了严峻的挑战。本文阐述了网络文本蕴含地理信息抽取的技术流程,从地理实体识别、地理实体定位、地理实体属性抽取、地理实体关系构建、地理事件抽取5个方面总结了网络文本蕴含地理信息抽取的进展和关键技术瓶颈,分析了可用于网络文本蕴含地理信息抽取的开放资源,并展望了未来的发展方向。
The spread of the Internet has produced a large volume of text rich in geographic semantics, creating major opportunities for deep mining and knowledge discovery of geographic information. At the same time, the heterogeneity and dynamic nature of such text cause a surge in the number and types of geographic entity attributes and make geographic semantic relations more complex, which poses serious challenges for geographic information retrieval, spatial analysis and reasoning, and intelligent location-based services. This paper describes the technical workflow for extracting geographic information from web text. It summarizes progress and key technical bottlenecks in five areas: geographic entity recognition, geographic entity locating, geographic entity attribute extraction, geographic entity relation construction, and geographic event extraction. It then reviews open resources that can be used for geographic information extraction from web text and outlines future directions for the field.