Annotating geographical entities on microblog text

Koji Matsuda, Akira Sasaki, Naoaki Okazaki, Kentaro Inui

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents a discussion of the problems surrounding the task of annotating geographical entities on microblogs and reports the preliminary results of our efforts to annotate Japanese microblog texts. Unlike prior work, we not only annotate geographical location entities but also facility entities, such as stations, restaurants, shopping stores, hospitals and schools. We discuss ways in which to build a gazetteer, the types of ambiguities that need to be considered, reasons why the annotator tends to disagree, and the problems that need to be solved to automate the task of annotating the geographical entities. All the annotation data and the annotation guidelines are publicly available for research purposes from our web site.

Original languageEnglish
Title of host publicationLAW 2015 - 9th Linguistic Annotation Workshop, held in conjuncion with NAACL 2015 - Proceedings of the Workshop
EditorsAdam Meyers, Ines Rehbein, Heike Zinsmeister
PublisherAssociation for Computational Linguistics (ACL)
Pages85-94
Number of pages10
ISBN (Electronic)9781941643471
Publication statusPublished - 2020 Jan 1
Event9th Linguistic Annotation Workshop, LAW 2015, held in conjuncion with NAACL 2015 - Denver, United States
Duration: 2015 Jun 5 → …

Publication series

NameLAW 2015 - 9th Linguistic Annotation Workshop, held in conjuncion with NAACL 2015 - Proceedings of the Workshop

Conference

Conference9th Linguistic Annotation Workshop, LAW 2015, held in conjuncion with NAACL 2015
CountryUnited States
CityDenver
Period15/6/5 → …

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint Dive into the research topics of 'Annotating geographical entities on microblog text'. Together they form a unique fingerprint.

Cite this