Cards corpus

Version: 2.0

Release date: 2012-03-03

Maintainer: Christopher Potts

Download

CardsCorpus-v02.zip

Search and visualize

This search function allows you to extract transcripts via keywords and study them using rich HTML/Javascript visualizations.

About

The Cards Corpus of collaborative task-oriented dialogues was collected at various time in 2008, 2010, 2011, and 2012 as part of the SUBTLE project and the Pragmatic Enrichment and Contextual Inference project.

Version 2.0 consists of 1266 transcripts, in CSV format (transcripts/).

Also included are Python classes for working with the data (cards.py), some illustrative examples of how to use the classes (examples.py), and some basic code for reading the corpus in to R as a single data frame and working with it (cards.R). Running: python examples.py will print out a list of corpus statistics.

Release notes

Version 2

Version 2 contains all of the transcripts from version 1, in the transcripts/01/ directory. The version 2 release contains a few changes to the meta-data for purposes of compatibility with the newer transcripts: MAX_CARDS, MAX_TURNS, and MAX_LINEOFSIGHT are now split into player-specific versions (e.g., P1_MAX_CARDS, P2_MAX_CARDS). These values are always the same for the two players in the 01/ directory but can differ in the 02/ directory. In addition, all transcripts have a new pair of action-types, PLAYER_1_TASK_ID and PLAYER_2_TASK_ID, which are always UNKNOWN in 01/ but which have values in 02/. cards.py has been updated to handle these changes.

Citation

Version 2

Potts, Christopher. 2012. Goal-driven answers in the Cards dialogue corpus. In Nathan Arnett and Ryan Bennett, eds., Proceedings of the 30th West Coast Conference on Formal Linguistics. Somerville, MA: Cascadilla Press.

@inproceedings{Potts:2012,
  Address = {Somerville, MA},
  Author = {Potts, Christopher},
  Booktitle = { Proceedings of the 30th West Coast Conference on Formal Linguistics},
  Editor = {Arnett, Nathan  and  Bennett, Ryan},
  Publisher = {Cascadilla Press},
  Title = {Goal-Driven Answers in the {C}ards Dialogue Corpus},
  Year = {2012}}

Version 1

Djalali, Alex; Sven Lauer; and Christopher Potts. 2012. Corpus evidence for preference-driven interpretation. In Maria Aloni, Vadim Kimmelman, Floris Roelofsen, Galit Weidman Sassoon, Katrin Schulz, and Matthijs Westera, eds., Proceedings of the 18th Amsterdam Colloquium: Revised Selected Papers. Berlin: Springer.

@inproceedings{Djalali:Lauer:Potts:2012,
  Address = {Amsterdam},
  Author = {Djalali, Alex  and  Lauer, Sven  and  Potts, Christopher},
  Booktitle = {Proceedings of the 18th Amsterdam Colloquium: Revised Selected Papers},
  Editor = {Aloni, Maria  and   Kimmelman, Vadim  and  Roelofsen, Floris  and  Sassoon, Galit Weidman  and  Schulz, Katrin  and Westera, Matthijs},
  Publisher = {Springer},
  Title = {Corpus Evidence for Preference-Driven Interpretation},
  Year = {2012}}

Djalali, Alex; David Clausen; Sven Lauer; Karl Schultz; and Christopher Potts. 2011. Modeling expert effects and common ground using Questions Under Discussion. Proceedings of the AAAI Workshop on Building Representations of Common Ground with Intelligent Agents. Washington, DC: Association for the Advancement of Artificial Intelligence.

@inproceedings{Djalali-etal:2011,
  Address = {Washington, DC},
  Author = {Djalali, Alex  and  Clausen, David  and  Lauer, Sven  and  Schultz, Karl  and  Potts, Christopher},
  Booktitle = {Proceedings of the {AAAI} Workshop on Building Representations of Common Ground with Intelligent Agents},
  Month = {November},
  Publisher = {Association for the Advancement of Artificial Intelligence},
  Title = {Modeling Expert Effects and Common Ground Using {Q}uestions {U}nder {D}iscussion},
  Year = {2011}}

Sponsorship

This research was supported in part by ONR grant No. N00014-10-1-0109 and ARO grant No. W911NF-07-1-0216. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of ONR or ARO.

Previous versions

  1. Version 1

License

This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/ or send a letter to Creative Commons, 444 Castro Street, Suite 900, Mountain View, California, 94041, USA.