microsoft / contextualsp

Open-source code for multiple papers from the Microsoft Research Asia DKI group

License: MIT

Languages: Python 87.39%, Jupyter Notebook 9.41%, Shell 1.56%, Jsonnet 1.35%, Batchfile 0.22%, Makefile 0.05%, Dockerfile 0.02%
Topics: semantic-parsing, compositional-generalization, conversational-semantic-parsing, utterance-rewriting, microsoft-research-asia, text-to-sql

contextualsp's Introduction

📫 Paper Code Collection (MSRA DKI Group)

License: MIT

This repo hosts the open-source code for multiple papers from the Microsoft Research Asia DKI Group. You can find the corresponding code below:

News

Code Release (Click Title to Locate the Code)

Reasoning

Reasoning Like Program Executors Xinyu Pi*, Qian Liu*, Bei Chen, Morteza Ziyadi, Zeqi Lin, Qiang Fu, Yan Gao, Jian-Guang Lou, Weizhu Chen, EMNLP 2022.

LEMON: Language-Based Environment Manipulation via Execution-guided Pre-training Qi Shi, Qian Liu, Bei Chen, Yu Zhang, Ting Liu, Jian-Guang Lou, EMNLP 2022 Findings.

LogiGAN: Learning Logical Reasoning via Adversarial Pre-training Xinyu Pi*, Wanjun Zhong*, Yan Gao, Nan Duan, Jian-Guang Lou, NeurIPS 2022.

Text-to-SQL

MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing Longxu Dou, Yan Gao, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Jian-Guang Lou, AAAI 2023.

Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge Longxu Dou, Yan Gao, Xuqi Liu, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Min-Yen Kan, Dechen Zhan, Jian-Guang Lou, EMNLP 2022.

UniSAr: A Unified Structure-Aware Autoregressive Language Model for Text-to-SQL Longxu Dou, Yan Gao, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Jian-Guang Lou, arXiv 2022.

Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation Xinyu Pi*, Bing Wang*, Yan Gao, Jiaqi Guo, Zhoujun Li, Jian-Guang Lou, ACL 2022.

Awakening Latent Grounding from Pretrained Language Models for Semantic Parsing Qian Liu*, Dejian Yang*, Jiahui Zhang*, Jiaqi Guo, Bin Zhou, Jian-Guang Lou, ACL 2021 Findings.

Compositional Generalization

Learning Algebraic Recombination for Compositional Generalization Chenyao Liu*, Shengnan An*, Zeqi Lin, Qian Liu, Bei Chen, Jian-Guang Lou, Lijie Wen, Nanning Zheng, Dongmei Zhang, ACL 2021 Findings.

Hierarchical Poset Decoding for Compositional Generalization in Language Yinuo Guo, Zeqi Lin, Jian-Guang Lou, Dongmei Zhang, NeurIPS 2020.

Compositional Generalization by Learning Analytical Expressions Qian Liu*, Shengnan An*, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang, NeurIPS 2020.

Conversation

"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang, EMNLP 2020

Incomplete Utterance Rewriting as Semantic Segmentation Qian Liu, Bei Chen, Jian-Guang Lou, Bin Zhou, Dongmei Zhang, EMNLP 2020

How Far are We from Effective Context Modeling ? An Exploratory Study on Semantic Parsing in Context Qian Liu, Bei Chen, Jiaqi Guo, Jian-Guang Lou, Bin Zhou, Dongmei Zhang, IJCAI 2020

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Question

If you have any questions or find any bugs, please open an issue. Issues also serve as an acceptable discussion forum.

If you want to contact the author, please email: qian DOT liu AT buaa.edu.cn.

contextualsp's People

Contributors

an1006634493, bellabei, gaoyancheerup, longxudou, microsoft-github-operations[bot], microsoftopensource, qshi95, siviltaram, zhongwanjun


contextualsp's Issues

nlp function doesn't work

Hi,
At corpus_construction/mlm_corpus/corpus_construction.py, line 46, a function named "nlp" is used to filter out instances where the "so" indicator does not signal logical reasoning (e.g., "so happy"), but it is never defined beforehand, which raises an error when producing results.
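A possible fix, as a sketch under assumptions (the original pipeline is not shown, so the spaCy model choice and the heuristic below are hypothetical): "nlp" follows the common spaCy convention of naming the loaded language pipeline, which can then be used to check whether "so" behaves as an intensifier or as a conclusion marker.

import spacy

# Assumption: `nlp` was meant to be a spaCy pipeline; the small English
# model must be installed (python -m spacy download en_core_web_sm).
nlp = spacy.load("en_core_web_sm")

def so_indicates_conclusion(sentence: str) -> bool:
    # Hypothetical filter: intensifier "so" (as in "so happy") modifies an
    # adjective or adverb, while conclusion "so" attaches elsewhere.
    doc = nlp(sentence)
    for token in doc:
        if token.text.lower() == "so" and token.head.pos_ not in ("ADJ", "ADV"):
            return True
    return False

print(so_indicates_conclusion("I was so happy."))        # False
print(so_indicates_conclusion("It rained, so we left.")) # True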

Issue while preprocessing CANARD data using download.sh

I am getting this error while running the download.sh script:

--2021-12-19 22:06:48--  https://obj.umiacs.umd.edu/elgohary/CANARD_Release.zip
Resolving obj.umiacs.umd.edu (obj.umiacs.umd.edu)... 128.8.122.11
Connecting to obj.umiacs.umd.edu (obj.umiacs.umd.edu)|128.8.122.11|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 3258983 (3.1M) [application/zip]
Saving to: ‘CANARD_Release.zip’

CANARD_Release.zip                           100%[============================================================================================>]   3.11M   458KB/s    in 7.0s

2021-12-19 22:06:57 (458 KB/s) - ‘CANARD_Release.zip’ saved [3258983/3258983]

Archive:  CANARD_Release.zip
replace ._CANARD_Release? [y]es, [n]o, [A]ll, [N]one, [r]ename: y
  inflating: ._CANARD_Release
  inflating: multiple_refs.json
replace ._multiple_refs.json? [y]es, [n]o, [A]ll, [N]one, [r]ename: y
  inflating: ._multiple_refs.json
  inflating: test.json
replace ._test.json? [y]es, [n]o, [A]ll, [N]one, [r]ename: y
  inflating: ._test.json
  inflating: dev.json
replace ._dev.json? [y]es, [n]o, [A]ll, [N]one, [r]ename: y
  inflating: ._dev.json
  inflating: train.json
replace ._train.json? [y]es, [n]o, [A]ll, [N]one, [r]ename: y
  inflating: ._train.json
  inflating: readme.txt
replace ._readme.txt? [y]es, [n]o, [A]ll, [N]one, [r]ename: y
  inflating: ._readme.txt
Traceback (most recent call last):
  File "../../preprocess.py", line 197, in <module>
    unified_dataset_format("Multi")
  File "../../preprocess.py", line 85, in unified_dataset_format
    src_f = open(src_file, "r", encoding="utf8")
FileNotFoundError: [Errno 2] No such file or directory: 'train.sr'

The requirements are not described in LEMON

I would like to reproduce the results from the LEMON paper. Since I have to pre-train and fine-tune a model using BART, but I don't know which versions of Python, fairseq, etc. were used in this work.

Can anyone help?

Questions about RUN model

Dear Dr. Liu Qian,

I appreciate your work; combining CV's semantic segmentation with NLP is a fantastic idea. I have run the code and have some questions that I hope you can help with. Thank you very much.
Below are my questions:

  1. When using the similarity function to build the feature map, the pixel values for "ellipsis" and "coreference" will be close; how can semantic segmentation distinguish them during prediction?
  2. Related to the first question: since the similarity function builds the feature map, the pixel values of replicas in the context utterance will also be close, which means they are easily predicted as the same class, resulting in replicated operations in the output. This could be due to the invariance inherent in CNNs.
  3. Do you have any other tricks? I still cannot reproduce your results after retraining with "train_multi.sh" several times.

Best Regards,
Yong

Error while training using turn.none.jsonnet

Hi,

I am trying to run the code with turn.none.jsonnet and am getting the following error:

Traceback (most recent call last):
  File "./dataset_reader/sparc_reader.py", line 143, in build_instance
    sql_query_list=sql_query_list
  File "./dataset_reader/sparc_reader.py", line 417, in text_to_instance
    action_non_terminal, action_seq, all_valid_actions = world.get_action_sequence_and_all_actions()
  File "./context/world.py", line 155, in get_action_sequence_and_all_actions
    action_sequence = self.sql_converter.translate_to_intermediate(self.sql_clause)
  File "./context/converter.py", line 87, in translate_to_intermediate
    return self._process_statement(sql_clause=sql_clause)
  File "./context/converter.py", line 117, in _process_statement
    inter_seq.extend(self._process_root(sql_clause))
  File "./context/converter.py", line 657, in _process_root
    step_inter_seq = _process_step(cur_state)
  File "./context/converter.py", line 511, in _process_step
    return call_back_mapping[step_state](sql_clause)
  File "./context/converter.py", line 262, in _process_join
    if self.col_names[col_ind].refer_table.name == join_tab_name:
KeyError: 'C'

After this, the code fails with:

site-packages/allennlp/data/vocabulary.py", line 399, in from_instances
    instance.count_vocab_items(namespace_token_counts)
AttributeError: 'NoneType' object has no attribute 'count_vocab_items'

I have downloaded the GloVe embeddings into the glove folder, and the dataset is in dataset_sparc along with the code. Do you have any suggestions as to what might be the issue?

Thanks

Semantic parsing in context predict sql

Hello everyone. In the Semantic Parsing in Context repository, predicted SQL queries with WHERE clauses are never correct.
Example: "what is the abbreviation for Jetblue?"
The predicted query is "SELECT airlines.abbreviation FROM airlines WHERE airlines.airline = 1".
As you can see, the value associated with WHERE is 1 instead of Jetblue; it is the same for all queries with WHERE.
Is there a way to resolve this?
Thanks in advance
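For context, parsers trained on Spider-style data usually ignore literal values, since the official evaluation compares SQL structure only, which is why a placeholder such as 1 appears in every WHERE clause. A crude post-processing sketch (hypothetical; fill_where_value is not part of this repository) that copies a value back from the question by string matching:

import re

def fill_where_value(sql: str, question: str) -> str:
    # Crude heuristic: grab a capitalized token from the question and use it
    # to replace the placeholder value; real systems instead match the
    # question against the database content.
    candidates = re.findall(r"[A-Z][a-zA-Z]+", question)
    if candidates and "= 1" in sql:
        return sql.replace("= 1", f"= '{candidates[-1]}'", 1)
    return sql

sql = "SELECT airlines.abbreviation FROM airlines WHERE airlines.airline = 1"
print(fill_where_value(sql, "what is the abbreviation for Jetblue?"))
# SELECT airlines.abbreviation FROM airlines WHERE airlines.airline = 'Jetblue'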

custom dataset creation for unisar

Hello @SivilTaram,

I have two tables that can be linked via primary and foreign keys, and I would like to use UniSAr on them. Could you please share steps or hints on how I can create a custom dataset for UniSAr? I appreciate your help :)

MultiSpider repo

Hi! I tried to access the MultiSpider link from the README file, but it does not exist. Where can I find the code of the model?

Reproducing LEMON

Hello,

Firstly, thanks for the great work! After reading "LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training", I wanted to reproduce the results on ProPara; however, I obtained a very low accuracy score.

Python version: 3.9.0
Fairseq version: 0.12.2

Here are the steps I followed:

1 - Cloning the repository.
2 - Downloading the data/BART models.
3 - Preprocessing propara for both the pretraining and finetuning.
4 - Pretraining with BART-large.
5 - Finetuning the pretrained BART-large model.

For preprocessing, I used the preprocess_pretrain.sh and preprocess_finetune.sh files. For pretraining and finetuning, I used the pretrain.sh and finetune.sh files without any parameter changes. These steps led to the following performance:

Correct / Total : 19 / 368, Denotation Accuracy : 0.052
path: bart_large_finetuned/checkpoint_best.pt, stage: valid, 1utts: 0.017, 3utts: 0.018, 5utts: 0.0
path: bart_large_finetuned/checkpoint_best.pt, stage: test, 1utts: 0.041, 3utts: 0.068, 5utts: 0.068

I would really appreciate your help for reproducing the results.
Thanks in advance.

about prediction problem

I'm not very familiar with the AllenNLP API. How do you use the prediction code? I wrote the following code, which reports a
"TypeError: is_bidirectional() missing 1 required positional argument: 'self'"
error:

@Predictor.register("rewrite")
class RewritePredictor(Predictor):

    @overrides
    def _json_to_instance(self, json_dict: JsonDict) -> Instance:
        """
        Expects JSON that looks like `{"source": "..."}`.
        """
        context = json_dict["context"]
        current = json_dict["current"]
        # placeholder
        # restate_utt = "hi"
        restate_utt = json_dict["restate_utt"]
        return self._dataset_reader.text_to_instance(context, current, restate_utt, training=False)


inputs = {
    "context": '浙 江 省 温 州 市 鹿 城 区 有 好 天 气 这 种 天 气 最 适 合 出 门 了 骑 骑 车 兜 兜 风',
    "current": '明 天 天 气 咋 样',
    "restate_utt": 'hi',
}
model = UnifiedFollowUp(
    Vocabulary,
    Seq2SeqEncoder,
    TextFieldEmbedder,
)
dataset_reader = RewriteDatasetReader()

pred_fun = RewritePredictor(model=model, dataset_reader=dataset_reader)
result = pred_fun._json_to_instance(inputs)
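The TypeError arises because UnifiedFollowUp is constructed with the Vocabulary, Seq2SeqEncoder, and TextFieldEmbedder classes themselves rather than instances, so methods like is_bidirectional() are called on a class with no self bound. A minimal sketch of the usual AllenNLP route, assuming a trained archive such as the ../pretrained_weights/multi_bert.tar.gz mentioned elsewhere on this page (the predictor name "rewrite" matches the registration above):

from allennlp.models.archival import load_archive
from allennlp.predictors import Predictor

# Load the trained model together with its config and vocabulary from the
# archive, then build the registered "rewrite" predictor on top of it.
archive = load_archive("../pretrained_weights/multi_bert.tar.gz")
predictor = Predictor.from_archive(archive, "rewrite")

result = predictor.predict_json({
    "context": "浙 江 省 温 州 市 鹿 城 区 有 好 天 气",  # dialogue history
    "current": "明 天 天 气 咋 样",                      # follow-up utterance
    "restate_utt": "hi",                                # placeholder at inference time
})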

semantic_parsing_in_context cuda out of memory

Hi there.
I ran the training via Colab (I have 16 GB of GPU memory).
I'm using concat.none.jsonnet for BERT and getting a "CUDA out of memory" error at 54% of epoch 0.
I would like to know how much memory is needed to launch BERT-based training, or whether there is a way to do it with the 16 GB.
Thanks

ETA with downstream (awakening_latent_grounding)

Hello, dear researchers. I would like to ask whether the code that couples ETA to downstream text-to-SQL parsers can be open-sourced.

In the paper, SLSQL (Lei et al., 2020) is used for the downstream task. However, I found that the shape of the matrix generated by the grounding module in ETA differs from that of the schema-linking module in SLSQL.

I would like to know how to apply the matrix generated by the ETA grounding module to SLSQL.

It would also be great if the code that couples ETA to ALIGN could be open-sourced, conditions permitting. Thank you!

Potential performance issue: .apply slow in pandas below 1.5 version

Issue Description:

Hello.
I have discovered a performance degradation in the .apply function of pandas versions below 1.5, and I notice that parts of the repository depend on pandas below 1.5, such as robustness_of_text_to_sql/CTA/requirements.txt. I am not sure whether this performance problem in pandas affects this repository. I found some discussions on the pandas GitHub related to this issue, including #44172 and #45404.
I also found that poset_decoding/traversal_path_prediction/MatchZoo-py/matchzoo/data_pack/data_pack.py uses the affected API. There may be more files using the affected API and more parts depending on pandas below 1.5.

Suggestion

I would recommend upgrading to pandas >= 1.5 or exploring other ways to optimize the performance of .apply.
Any other workarounds or solutions would be greatly appreciated.
Thank you!
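As a generic illustration (the DataFrame and columns below are hypothetical, not taken from the repository), row-wise .apply can often be replaced with vectorized column operations, which sidesteps the regression regardless of the pandas version:

import numpy as np
import pandas as pd

df = pd.DataFrame({"a": np.random.rand(100_000), "b": np.random.rand(100_000)})

# Row-wise .apply calls a Python function once per row; this is the slow
# path affected by the pandas issues referenced above.
slow = df.apply(lambda row: row["a"] + row["b"] ** 2, axis=1)

# Vectorized equivalent: one pass over whole columns in C, typically
# orders of magnitude faster.
fast = df["a"] + df["b"] ** 2

assert np.allclose(slow, fast)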

LogiGAN: dataset creation

Hi! Thank you for sharing the code for the LogiGAN paper.
I'm having trouble creating the training set. In particular:

  1. Here the code refers to a non-existent script. I have replaced the commands with "python corpus_construction.py --start 0 --end 500 --indicator_type conclusion &"; is this the right way to do it?
  2. elastic_search/build_gen_train/ver_train refer to files that do not exist in the BookCorpus, and there are no instructions on how to create them. Is there a script/link to generate the gan_corpus_new/beta/gen_train_B.jsonl and gan_corpus_new/beta/ver_train.jsonl files?

A question about `python predict.py`

Hello!
Thank you to the authors for open-sourcing this!
After executing python install -r requirement.txt, I ran
cd src && python predict.py and got the following error.
It seems the model "../pretrained_weights/multi_bert.tar.gz" was not loaded.

I'm really looking forward to your reply!
Happy May Day holiday!

Model name 'bert-base-chinese' was not found in model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese). We assumed 'https://s3.amazonaws.com/models.huggingface.co/bert/bert-base-chinese.tar.gz' was a path or url but couldn't find any file associated to this path or url.
Traceback (most recent call last):
  File "predict.py", line 28, in <module>
    manager = PredictManager("../pretrained_weights/multi_bert.tar.gz")
  File "predict.py", line 12, in __init__
    archive = load_archive(archive_file)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/models/archival.py", line 230, in load_archive
    cuda_device=cuda_device)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/models/model.py", line 327, in load
    return cls.by_name(model_type)._load(config, serialization_dir, weights_file, cuda_device)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/models/model.py", line 265, in _load
    model = Model.from_params(vocab=vocab, params=model_params)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/common/from_params.py", line 365, in from_params
    return subclass.from_params(params=params, **extras)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/common/from_params.py", line 386, in from_params
    kwargs = create_kwargs(cls, params, **extras)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/common/from_params.py", line 133, in create_kwargs
    kwargs[name] = construct_arg(cls, name, annotation, param.default, params, **extras)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/common/from_params.py", line 229, in construct_arg
    return annotation.from_params(params=subparams, **subextras)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/common/from_params.py", line 365, in from_params
    return subclass.from_params(params=params, **extras)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/modules/text_field_embedders/basic_text_field_embedder.py", line 168, in from_params
    token_embedders[key] = TokenEmbedder.from_params(vocab=vocab, params=embedder_params)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/common/from_params.py", line 365, in from_params
    return subclass.from_params(params=params, **extras)
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/common/from_params.py", line 388, in from_params
    return cls(**kwargs)  # type: ignore
  File "/home/cingti/anaconda3/envs/cpu-py36/lib/python3.6/site-packages/allennlp/modules/token_embedders/bert_token_embedder.py", line 272, in __init__
    for param in model.parameters():
AttributeError: 'NoneType' object has no attribute 'parameters'
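From the error message above, the archive itself is found, but the underlying bert-base-chinese weights cannot be downloaded from S3, so the BERT token embedder receives None and crashes on model.parameters(). A possible workaround, as a sketch under assumptions (the exact override key depends on the archived config, which is not shown here): download the weights manually and point the config at the local copy.

# Hypothetical workaround: first download the weights manually, e.g.
#   wget https://s3.amazonaws.com/models.huggingface.co/bert/bert-base-chinese.tar.gz
# then override the embedder's model path when loading the archive.
from allennlp.models.archival import load_archive

archive = load_archive(
    "../pretrained_weights/multi_bert.tar.gz",
    # NOTE: this override key is an assumption about the archived config.
    overrides='{"model.text_field_embedder.token_embedders.bert.pretrained_model": "/local/path/bert-base-chinese.tar.gz"}',
)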

MCD2 and MCD3 specific data processing?

Hi authors, @SivilTaram

I see there is some specialized logic for processing the CFQ dataset for the MCD2 and MCD3 splits. We are confused about why this special path is present. Why did you add this special logic? And what would the behavior be if you preprocessed MCD2 and MCD3 with the MCD1 preprocessing code path?

if query.startswith("Did M") or query.startswith("Was M") or query.startswith("Were M") or query.startswith("Was a"):
    if type in ['mcd2', 'mcd3']:
        nl_pattern = query.split()[0] + " " + query.split()[1]
        terms.append((nl_pattern, [f'?x0#is#{query.split()[1]}'], (0, 1)))
    else:
        nl_pattern = query.split()[0] + " M"
        terms.append((nl_pattern, ['?x0#is#M'], (0, 1)))

if candidate_term.count("M") == 1:
    if candidate_term.startswith("?x0 is M") and split in ['mcd2', 'mcd3']:
        candidate_triplets[candidate_skeleton] += [candidate_term]
    else:
        candidate_triplets[candidate_skeleton] += [''.join(candidate_term.replace("M", entity[0][0])) for entity in entities]

Thanks,
Paras

MultiSpider dataset availability

Hello,

I just finished reading your work on "MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing". Is the data available somewhere? Unfortunately I can't find it.

Thank you!
