抄録
In this paper, we discuss how to annotate coreference and predicate-argument relations in Japanese written text. There have been research activities for building Japanese text corpora annotated with coreference and predicate-argument relations as are done in the Kyoto Text Corpus version 4.0 (Kawahara et al., 2002) and the GDATagged Corpus (Hasida, 2005). However, there is still much room for refining their specifications. For this reason, we discuss issues in annotating these two types of relations, and propose a new specification for each. In accordance with the specification, we built a large-scaled annotated corpus, and examined its reliability. As a result of our current work, we have released an annotated corpus named the NAIST Text Corpus1, which is used as the evaluation data set in the coreference and zero-anaphora resolution tasks in Iida et al. (2005) and Iida et al. (2006).
本文言語 | English |
---|---|
ページ | 132-139 |
ページ数 | 8 |
DOI | |
出版ステータス | Published - 2007 |
外部発表 | はい |
イベント | Linguistic Annotation Workshop, LAW 2007 - Prague, Czech Republic 継続期間: 2007 6月 28 → 2007 6月 29 |
Other
Other | Linguistic Annotation Workshop, LAW 2007 |
---|---|
国/地域 | Czech Republic |
City | Prague |
Period | 07/6/28 → 07/6/29 |
ASJC Scopus subject areas
- 言語および言語学
- 言語学および言語