系统工程理论与实践
繫統工程理論與實踐
계통공정이론여실천
Systems Engineering—Theory & Practice
2011年
12期
2367~2372
,共null页
符号数据分析 区间数据 描述统计 一般分布
符號數據分析 區間數據 描述統計 一般分佈
부호수거분석 구간수거 묘술통계 일반분포
symbolic data analysis; interval valued data; descriptive statistics; general distribution
以对大规模个体数据通过打包形成的区间型符号数据为研究对象,针对个体在区间内往往不服从均匀分布的实际情况,研究一般分布的区间型符号数据的描述统计和分析方法.对符号数据分析进行了概述,并定义了一般分布的区间变量.研究了一般分布的区间变量的经验分布函数和经验联合分布函数.在此基础上,讨论了一般分布区间变量的描述统计量的求解.最后给出了算例,运用一般分布区间型符号数据的因子分析方法,以中国股市为背景进行了应用研究.结论表明:以往研究基于均匀分布假设所给出的描述统计量的计算,可看作文中所给求解公式的特例.另外,研究方法基于经验分布理论,无需知道个体在区间内服从分布函数的具体表达式,且在计算过程中充分利用了区间内的个体信息.
以對大規模箇體數據通過打包形成的區間型符號數據為研究對象,針對箇體在區間內往往不服從均勻分佈的實際情況,研究一般分佈的區間型符號數據的描述統計和分析方法.對符號數據分析進行瞭概述,併定義瞭一般分佈的區間變量.研究瞭一般分佈的區間變量的經驗分佈函數和經驗聯閤分佈函數.在此基礎上,討論瞭一般分佈區間變量的描述統計量的求解.最後給齣瞭算例,運用一般分佈區間型符號數據的因子分析方法,以中國股市為揹景進行瞭應用研究.結論錶明:以往研究基于均勻分佈假設所給齣的描述統計量的計算,可看作文中所給求解公式的特例.另外,研究方法基于經驗分佈理論,無需知道箇體在區間內服從分佈函數的具體錶達式,且在計算過程中充分利用瞭區間內的箇體信息.
이대대규모개체수거통과타포형성적구간형부호수거위연구대상,침대개체재구간내왕왕불복종균균분포적실제정황,연구일반분포적구간형부호수거적묘술통계화분석방법.대부호수거분석진행료개술,병정의료일반분포적구간변량.연구료일반분포적구간변량적경험분포함수화경험연합분포함수.재차기출상,토론료일반분포구간변량적묘술통계량적구해.최후급출료산례,운용일반분포구간형부호수거적인자분석방법,이중국고시위배경진행료응용연구.결론표명:이왕연구기우균균분포가설소급출적묘술통계량적계산,가간작문중소급구해공식적특례.령외,연구방법기우경험분포이론,무수지도개체재구간내복종분포함수적구체표체식,차재계산과정중충분이용료구간내적개체신식.
Interval symbolic data gained by data packaging on the original individuals of a sample are subjects of this paper. The individuals are always non-uniformly distributed within the intervals. Regarding this situation, this paper concentrates on descriptive statistics and analysis of generally distributed interval data, within which each individual is arbitrarily distributed. The basic theory of symbolic data analysis was first introduced. Then the definition of generally distributed interval was proposed. In the following, the study on empirical distribution function and empirical joint distribution function for generally distributed interval symbolic data were put forward. Based on this, the descriptive statistics of generally distributed interval variables were obtained. Finally a numerical example was given. And an application study in Chinese stock market was carried through using factor analysis of generally distributed interval symbolic data. Research shows that the previous works supposing uniform distribution are especial case of this work. Besides this, the method presented in this paper does not need the exact form of distribution function, since it is obtained upon theory of empirical distribution. Furthermore, it makes the best of the individuals sample information of the intervals.