Annotating a Japanese text corpus with predicate-argument and coreference relations

Ryu Iida, Mamoru Komachi, Kentaro Inui, Yuji Matsumoto

Research output: Contribution to conferencePaperpeer-review

56 Citations (Scopus)

Abstract

In this paper, we discuss how to annotate coreference and predicate-argument relations in Japanese written text. There have been research activities for building Japanese text corpora annotated with coreference and predicate-argument relations as are done in the Kyoto Text Corpus version 4.0 (Kawahara et al., 2002) and the GDATagged Corpus (Hasida, 2005). However, there is still much room for refining their specifications. For this reason, we discuss issues in annotating these two types of relations, and propose a new specification for each. In accordance with the specification, we built a large-scaled annotated corpus, and examined its reliability. As a result of our current work, we have released an annotated corpus named the NAIST Text Corpus1, which is used as the evaluation data set in the coreference and zero-anaphora resolution tasks in Iida et al. (2005) and Iida et al. (2006).

Original languageEnglish
Pages132-139
Number of pages8
DOIs
Publication statusPublished - 2007
Externally publishedYes
EventLinguistic Annotation Workshop, LAW 2007 - Prague, Czech Republic
Duration: 2007 Jun 282007 Jun 29

Other

OtherLinguistic Annotation Workshop, LAW 2007
CountryCzech Republic
CityPrague
Period07/6/2807/6/29

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint Dive into the research topics of 'Annotating a Japanese text corpus with predicate-argument and coreference relations'. Together they form a unique fingerprint.

Cite this