Introducing CICERO, a new dataset for dialogue reasoning with contextualized commonsense inference. It contains 53K inferences for five commonsense dimensions – cause, subsequent event, prerequisite, motivation, and emotional reaction collected from 5.6K dialogues. To show the usefulness of CICERO for dialogue reasoning, we design several challenging generative and multichoice answer selection tasks for state-of-the-art NLP models to solve.
The CICERO dataset can be found in the data directory. Each line of the files is a json object indicating a single instance. The json objects have the following key-value pairs:
Key | Value |
---|---|
ID | Dialogue ID with dataset indicator. |
Dialogue | Utterances of the dialogue in a list. |
Target | Target utterance. |
Question | One of the five questions (inference types). |
Choices | Five possible answer choices in a list. One of the answers is human written. The other four answers are machine generated and selected through the Adversarial Filtering (AF) algorithm. |
Human Written Answer | Index of the human written answer in a single element list. Index starts from 0. |
Correct Answers | List of all correct answers indicated as plausible or speculatively correct by the human annotators. Includes the index of the human written answer. |
An example of the data is shown below.
{
"ID": "daily-dialogue-1291",
"Dialogue": [
"A: Hello , is there anything I can do for you ?",
"B: Yes . I would like to check in .",
"A: Have you made a reservation ?",
"B: Yes . I am Belen .",
"A: So your room number is 201 . Are you a member of our hotel ?",
"B: No , what's the difference ?",
"A: Well , we offer a 10 % charge for our members ."
],
"Target": "Well , we offer a 10 % charge for our members .",
"Question": "What subsequent event happens or could happen following the target?",
"Choices": [
"For future discounts at the hotel, the listener takes a credit card at the hotel.",
"The listener is not enrolled in a hotel membership.",
"For future discounts at the airport, the listener takes a membership at the airport.",
"For future discounts at the hotel, the listener takes a membership at the hotel.",
"The listener doesn't have a membership to the hotel."
],
"Human Written Answer": [
3
],
"Correct Answers": [
3
]
}
The details of the answer selection (MCQ) experiments can be found here. The details of the answer generation (NLG) experiments can be found here.
CICERO: A Dataset for Contextualized Commonsense Inference in Dialogues. Deepanway Ghosal and Siqi Shen and Navonil Majumder and Rada Mihalcea and Soujanya Poria. ACL 2022.