语言科学
語言科學
어언과학
Linguistic Sciences
2013年
2期
178~192
,共null页
树库 谓词-论元结构 补足语 标句词 词性标注
樹庫 謂詞-論元結構 補足語 標句詞 詞性標註
수고 위사-론원결구 보족어 표구사 사성표주
Treebank predicate argument structures complement complementizer POS tagging
树库是一种记录每个句子句法分析结果的标注语料库。文章介绍的是美国宾州大学构建的中文树库(CTB)。描写句子的谓词一沦元结构是CTB标注的一个重要目标。因此,它在句法标注中刻意强调的是以下三个抽象的语法关系:中心语补足语关系、中心语一附加语关系和并列关系。在CTB中每个短语节点所支配的括号对或子树只表示上述的一种语法关系。此外,CTB在语法体系上也有很多特点,文章仅选取补足语、汉语的标句词“(DEC)”以及遵循语杠理论的词性标注准则等三个汉语语法问题来进行讨论。如果我们同意句子的谓词~论元结构描写是树库建设的一个重要目标,那么上述三个问题不仅同这个目标紧密关联,而且将影响到基于树库的自动词性标注和句法分析系统的性能及其后续应用的结果。
樹庫是一種記錄每箇句子句法分析結果的標註語料庫。文章介紹的是美國賓州大學構建的中文樹庫(CTB)。描寫句子的謂詞一淪元結構是CTB標註的一箇重要目標。因此,它在句法標註中刻意彊調的是以下三箇抽象的語法關繫:中心語補足語關繫、中心語一附加語關繫和併列關繫。在CTB中每箇短語節點所支配的括號對或子樹隻錶示上述的一種語法關繫。此外,CTB在語法體繫上也有很多特點,文章僅選取補足語、漢語的標句詞“(DEC)”以及遵循語槓理論的詞性標註準則等三箇漢語語法問題來進行討論。如果我們同意句子的謂詞~論元結構描寫是樹庫建設的一箇重要目標,那麽上述三箇問題不僅同這箇目標緊密關聯,而且將影響到基于樹庫的自動詞性標註和句法分析繫統的性能及其後續應用的結果。
수고시일충기록매개구자구법분석결과적표주어료고。문장개소적시미국빈주대학구건적중문수고(CTB)。묘사구자적위사일륜원결구시CTB표주적일개중요목표。인차,타재구법표주중각의강조적시이하삼개추상적어법관계:중심어보족어관계、중심어일부가어관계화병렬관계。재CTB중매개단어절점소지배적괄호대혹자수지표시상술적일충어법관계。차외,CTB재어법체계상야유흔다특점,문장부선취보족어、한어적표구사“(DEC)”이급준순어강이론적사성표주준칙등삼개한어어법문제래진행토론。여과아문동의구자적위사~론원결구묘사시수고건설적일개중요목표,나요상술삼개문제불부동저개목표긴밀관련,이차장영향도기우수고적자동사성표주화구법분석계통적성능급기후속응용적결과。
sentence In It. Treebank is a kind of bracketed corpus which records the synta We will introduce the Penn Chinese Treebank (CTB) built by ctic parsing tree of each the University of Penn sylvania, USA. One of the important goals of the CTB annotation is to describe the predicate argu ment structures in sentences. During syntactic annotation, CTB intently focuses on the following three abstract grammatical relations: complementation, adjunction and coordination. Each of the a- bove grammatical relations is assigned a unique hierarchical structure. Although there are a number of characteristics in the grammar formalism of CTB, we only discuss the following three issues in this article: 1) Complement(补足语), 2) The Chinese complementizer "的 (DEC)", and 3) The criteria of part-of-speech (POS) tagging based on X-Bar Theory. If we also agree to the fact that the predicate argument description is one of the important goals of treebank construction, then the three issues above are closely related to the goal because they affect not only the performance of automatic POS tagging and par- sing systems trained on the treebank but also the results of those subsequent applications.