地球信息科学学报
GEO-INFORMATION SCIENCE
2014
Issue 3
pp. 341-348
8 pages
桑鹏%唐新明%艾波%王华斌
新闻事件%RSS%多维描述%时空可视化
news event%RSS%multi-dimensional description%spatio-temporal visualization
百度等按照时间或焦点的传统新闻检索方式,缺少对新闻事件在时间维度和空间维度及时空发展规律上的组织和表达。鉴此,本文提出了一种在时间和空间维度对在线简易信息聚合(Really Simple Syndication,RSS)新闻进行多维描述和时空可视化的方法,帮助用户全面、直观理解焦点新闻事件的时空发展过程及趋势。该方法从新浪、百度和Google等多家网站的RSS新闻服务中抽取新闻,将新闻报道时间近似为新闻事件发生时间进行时间维度描述,动态解析并识别新闻概要中的中文地名词汇,进行地址匹配和空间定位,完成新闻事件空间维度描述。以H7N9禽流感热点新闻为例,本文通过过渡颜色、统计折线图进行时间维可视化表达,以大小渐变的圆形符号进行空间维可视化表达,多维度描述并展示了H7N9禽流感新闻事件的发展过程和趋势。
Traditional news retrieval services such as Baidu return lists of related news sorted by time or by event. They lack an intuitive description of news events in the temporal and spatial dimensions, and of the spatio-temporal development of those events. This paper presents a method for the multi-dimensional description and spatio-temporal visualization of online RSS news events, which helps readers understand the spatio-temporal development of a whole news event. The method first pulls news from the RSS (Really Simple Syndication) services of several well-known websites, such as Baidu, Sina and Google News, and then applies a multi-dimensional description method to mark the temporal and spatial dimensions of each news item. The temporal description takes the publishing time of a news item as an approximation of the event's occurrence time. The spatial description dynamically parses and identifies Chinese place names in the news description and then matches them to geographic coordinates. Spatial description is the primary content of this article and proceeds in four stages: (i) XSL transformation, which uses XSL (eXtensible Stylesheet Language) to transform a news RSS document into an HTML (Hypertext Markup Language) document; (ii) description extraction, which uses regular expressions to extract the news description from the HTML document; (iii) Chinese place name extraction, which uses ICTCLAS to extract place names from the description; and (iv) geocoding, which uses the Google Geocoder API to obtain the geographic coordinates of each place name. Finally, the paper demonstrates the spatio-temporal visualization of news events and gives a brief analysis, using H7N9 hot news as an example.
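The description stages summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation: it reads the RSS items directly with the standard library instead of going through an XSL transformation, and it replaces ICTCLAS and the Google Geocoder API with a simple lookup in a small, hypothetical gazetteer. The sample feed, the `GAZETTEER` table (with approximate coordinates) and the function names are all assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Invented sample feed, for illustration only (not real news data).
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>sample feed</title>
    <item>
      <title>H7N9 update 1</title>
      <pubDate>Tue, 02 Apr 2013 08:00:00 +0800</pubDate>
      <description>&lt;p&gt;上海新增2例H7N9禽流感病例&lt;/p&gt;</description>
    </item>
    <item>
      <title>H7N9 update 2</title>
      <pubDate>Wed, 03 Apr 2013 09:30:00 +0800</pubDate>
      <description>浙江杭州报告H7N9病例</description>
    </item>
  </channel>
</rss>"""

# Hypothetical gazetteer standing in for ICTCLAS plus a geocoding service:
# place name -> (province, approximate latitude, approximate longitude).
GAZETTEER = {
    "上海": ("上海", 31.23, 121.47),
    "杭州": ("浙江", 30.27, 120.15),
    "浙江": ("浙江", 30.27, 120.15),
}

TAG_RE = re.compile(r"<[^>]+>")  # strips HTML tags left in the summary


def find_places(text):
    """Return gazetteer names found in text, ordered by first occurrence."""
    hits = [(text.find(name), name) for name in GAZETTEER if name in text]
    return [name for _, name in sorted(hits)]


def describe_items(rss_text):
    """Attach a temporal and a spatial description to every RSS item."""
    records = []
    for item in ET.fromstring(rss_text).iter("item"):
        # Temporal dimension: publishing time approximates occurrence time.
        published = parsedate_to_datetime(item.findtext("pubDate"))
        # Spatial dimension: place names in the summary, then coordinates.
        summary = TAG_RE.sub("", item.findtext("description", default=""))
        places = [(name, *GAZETTEER[name]) for name in find_places(summary)]
        records.append({"time": published, "summary": summary, "places": places})
    return records


if __name__ == "__main__":
    for record in describe_items(SAMPLE_RSS):
        print(record["time"].date(), record["places"])
```

Substring lookup is used here only to keep the sketch self-contained; a word segmenter with named-entity tagging, as the abstract describes, would avoid false matches inside longer words.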
In the analysis, the temporal visualization uses transition colors to show the change in the amount of news between two time nodes, and a line chart to show the trend in the total amount of news. The spatial visualization clusters news by province and uses graduated circle symbols of different sizes to indicate the differences in news amounts between provinces.
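One plausible way to turn news counts into these visual variables is sketched below in Python. The abstract does not give the actual color ramp or symbol scaling, so the linear color interpolation, the square-root radius scaling, the default colors and all function names are assumptions; the counts in the usage lines are invented.

```python
from math import sqrt


def lerp_color(start, end, t):
    """Linearly interpolate between two RGB colors, with t clamped to [0, 1]."""
    t = max(0.0, min(1.0, t))
    return tuple(round(s + (e - s) * t) for s, e in zip(start, end))


def day_colors(daily_counts, low=(255, 255, 204), high=(189, 0, 38)):
    """Map each time node to a transition color by its share of the peak count."""
    peak = max(daily_counts.values())
    if peak == 0:
        return {day: low for day in daily_counts}
    return {day: lerp_color(low, high, n / peak) for day, n in daily_counts.items()}


def symbol_radius(count, max_count, max_radius=30.0):
    """Scale the circle radius so that circle area is proportional to the count."""
    if max_count <= 0:
        return 0.0
    return max_radius * sqrt(count / max_count)


if __name__ == "__main__":
    # Invented counts, for illustration only.
    daily = {"2013-04-01": 2, "2013-04-02": 5, "2013-04-03": 10}
    by_province = {"上海": 16, "浙江": 4}

    print(day_colors(daily))
    top = max(by_province.values())
    print({p: symbol_radius(n, top) for p, n in by_province.items()})
```

Square-root scaling is a common choice for proportional symbols because readers tend to compare circle areas, not radii; a linear radius scale would exaggerate the differences between provinces.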