系统工程与电子技术(英文版)
繫統工程與電子技術(英文版)
계통공정여전자기술(영문판)
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS
2002年
1期
87-92
,共6页
Text data mining%Natural language processing%Keyword clustering
With the development of large-scale text processing, the dimension of text feature space has become larger and larger, which has added a lot of difficulties to natural language processing. How to reduce the dimension has become a practical problem in the field. Here we present two clustering methods, i.e. concept association and concept abstract, to achieve the goal. The first refers to the keyword clustering based on the co-occurrence of keywords in the same text, and the second refers to that in the same category. Then we compare the difference between them. Our experiment results show that they are efficient to reduce the dimension of text feature space.