Developing a corpus of indirect speech act schemas

2020

Conference: Proceedings of The 12th Language Resources and Evaluation Conference

Antonio Roque and Alexander Tsuetaki and Vasanth Sarathy and Matthias Scheutz

Resolving Indirect Speech Acts (ISAs), in which the intended meaning of an utterance is not identical to its literal meaning, is essential to enabling the participation of intelligent systems in peoples’ everyday lives. Especially challenging are those cases in which the interpretation of such ISAs depends on context. To test a system’s ability to perform ISA resolution we need a corpus, but developing such a corpus is difficult, especialy given the contex-dependent requirement. This paper addresses the difficult problems of constructing a corpus of ISAs, taking inspiration from relevant work in using corpora for reasoning tasks. We present a formal representation of ISA Schemas required for such testing, including a measure of the difficulty of a particular schema. We develop an approach to authoring these schemas using corpus analysis and crowdsourcing, to maximize realism and minimize the amount of expert authoring needed. Finally, we describe several characteristics of collected data, and potential future work.

@inproceedings{roque2020developing,
  title={Developing a corpus of indirect speech act schemas},
  author={Antonio Roque and Alexander Tsuetaki and Vasanth Sarathy and Matthias Scheutz},
  year={2020},
  booktitle={Proceedings of The 12th Language Resources and Evaluation Conference},
  url={https://hrilab.tufts.edu/publications/roque2020developing.pdf}
}