Giter Site home page Giter Site logo

reflect's Introduction

Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality

Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)

[Project Website] (https://inklab.usc.edu/Reflect/)

[Paper] (https://arxiv.org/abs/2211.09267)

Reflect is a dataset that annotates dialogues with explicit CG (materialized as inferences approximating shared knowledge and beliefs) and solicits diverse human-generated responses each following one common ground.

Reflect contains 9000 diverse responses from 600 dialogue contexts, based on 5 inference dimensions for CG. We collect three responses for each inference dimension, so there are 15 diverse responses for each dialogue context.

Content

  • data contains our main dataset (data/organized_Reflect_9k_responses.json) in json file. Each dictionary in the file contains the following keywards:

    • dialogue: the dialogue history where each utterance is separated by <br>;
    • speaker: the speaker name (note that our collected responses and reactions are from the perspective of Friend);
    • reaction_1: the inference answer we collect in stage 1 following the questions "How would you describe Speaker?"
    • reaction_2: the inference answer we collect in stage 1 following the questions "What might have happened before?"
    • reaction_1: the inference answer we collect in stage 1 following the questions "What might happen after?"
    • reaction_1: the inference answer we collect in stage 1 following the questions "What is Speaker feeling now?"
    • reaction_1: the inference answer we collect in stage 1 following the questions "What are you feeling now?"
    • responses_1 to responses_5: responses (3 for each inference dimension) we collect in stage 2 following each of the corresponding inference answer/reaction.
    • utterances: the dialogue history as a list of utterances (the first speaker is always the person in speaker );
  • exps contains code we used to fine-tune BlenderBot on Reflect and GPT-3 scripts.

Contact

Feel free to directly email peiz[at]usc[dot]edu if you have any feedback.

Citation

Please cite our EMNLP 2022 paper if you find this data helpful.

@inproceedings{zhou2022reflect,
		title={Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality},
		author={Zhou, Pei and Cho, Hyundong J. and Jandaghi, Pegah and Lee, Dong-Ho and Lin, Bill Yuchen and Pujara, Jay and Ren, Xiang},
		booktitle={Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing},
		year={2022}
	  }

reflect's People

Contributors

shaoxia57 avatar

Stargazers

Renat Zayashnikov avatar Hyungjoo Chae avatar Jin Yao avatar Dong-Ho Lee avatar (Bill) Yuchen Lin avatar XU, Yan (Yana) avatar  avatar Takashi Kodama avatar  avatar Bruno Henrique avatar  avatar

Watchers

Xiang Ren avatar (Bill) Yuchen Lin avatar  avatar

reflect's Issues

Training/Testing datafiles for finetuning

Finetuning blender expects all_train_responses.json and all_test_responses.json but the only data that is provided is organized_Reflect_9k_responses.json. Based on the agents.py code, the format of organized_Reflect_9k_responses.json is different than that of the expected files for finetuning. Will training/testing data files themselves be released, or at least the preprocessing code to convert organized_Reflect_9k_responses.json?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.