中国中医药信息杂志
CHINESE JOURNAL OF INFORMATION ON TRADITIONAL CHINESE MEDICINE
2013, Issue 10, pp. 16-18 (3 pages)
温先荣%张晶%刘静%雷蕾%杨策%李海燕
Traditional Chinese Medicine Subject Headings Thesaurus%revision%literature indexing%term frequency statistics
Objective To provide a basis for term selection in the revision of the Traditional Chinese Medicine Subject Headings (TCMeSH) Thesaurus through statistical analysis of indexing term frequency. Methods Indexing terms from the most recent five years of the Traditional Chinese Medical Literature Analysis and Retrieval System served as the data source. MS Access was used to compute frequency statistics for subject headings and keywords, and the results were then classified and analyzed. Results The 245 680 articles used 18 796 subject headings, of which 6940 were TCM subject headings, accounting for 83.47% of the subject headings in the 2007 edition of the TCMeSH Thesaurus. Among the 15 subject heading categories, the utilization rate was lowest for medicinal animals and plants (69.97%), followed by natural sciences (71.01%) and TCM mental diseases and psychology (82.81%). The same 245 680 articles contained 136 832 keywords, of which 3485 occurred 10 or more times. After 576 meaningless terms were removed, 368 keywords were preliminarily recommended as candidate new subject headings or entry terms, and the remaining 2541 were left for selection according to actual needs during the revision. Conclusion The term frequency statistics and their analysis provide a basis for term selection in the revision of the new edition of the thesaurus.
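The keyword frequency step described in the Methods can be illustrated with a minimal sketch. The authors used MS Access; the Python version below is only an illustration, and the function names, the toy records, and the choice to count each keyword once per article are assumptions rather than details from the paper. The final lines check that the reported keyword figures are internally consistent (3485 - 576 - 368 = 2541).

```python
from collections import Counter


def keyword_frequencies(records):
    """Count the number of records in which each keyword appears.

    Assumption: a keyword is counted once per record, even if repeated.
    """
    counts = Counter()
    for keywords in records:
        counts.update(set(keywords))
    return counts


def frequent_keywords(counts, min_count=10):
    """Keep keywords whose frequency is at least min_count."""
    return {term: n for term, n in counts.items() if n >= min_count}


# Hypothetical toy data: one list of indexed keywords per article.
sample_records = [
    ["acupuncture", "literature indexing"],
    ["acupuncture", "term frequency"],
    ["acupuncture"],
]

counts = keyword_frequencies(sample_records)
# The paper's threshold is 10; 3 is used here only because the toy data is tiny.
frequent = frequent_keywords(counts, min_count=3)

# Consistency check on the figures reported in the abstract:
# high-frequency keywords minus meaningless terms minus recommended terms.
remaining = 3485 - 576 - 368
```

On a real export, `records` would hold one keyword list per indexed article, and `min_count` would stay at its default of 10 to match the threshold reported in the Results.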