Giter Site home page Giter Site logo

ukairia777 / inters Goto Github PK

View Code? Open in Web Editor NEW

This project forked from daod/inters

0.0 0.0 0.0 411 KB

This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"

License: MIT License

inters's Introduction

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

license

๐Ÿ“ƒ ArXiv Paper โ€ข ๐Ÿค— HuggingFace Model โ€ข ๐Ÿ“š Dataset

Authors: Yutao Zhu, Peitian Zhang, Chenghao Zhang, Yifei Chen, Binyu Xie, Zhicheng Dou, Zheng Liu, and Ji-Rong Wen

โญ We will release the datasets, models, templates, and codes within a month (before Feb. 15th). Thanks for your attention!

Introduction

Large language models (LLMs) have demonstrated impressive capabilities in various natural language processing tasks. Despite this, their application to information retrieval (IR) tasks is still challenging due to the infrequent occurrence of many IR-specific concepts in natural language. While prompt-based methods can provide task descriptions to LLMs, they often fall short in facilitating comprehensive understanding and execution of IR tasks, thereby limiting LLMs' applicability. To address this gap, in this work, we explore the potential of instruction tuning to enhance LLMs' proficiency in IR tasks. We introduce a novel instruction tuning dataset, \ourdata{}, encompassing 21 tasks across three fundamental IR categories: query understanding, document understanding, and query-document relationship understanding. The data are derived from 43 distinct datasets with manually written templates. Our empirical results reveal that \ourdata{} significantly boosts the performance of various publicly available LLMs, such as LLaMA, Mistral, and Phi, in search-related tasks. Furthermore, we conduct a comprehensive analysis to ascertain the effects of base model selection, instruction design, volume of instructions, and task variety on performance.

Our dataset and the models fine-tuned on it will be released soon!

Citation

Please kindly cite our paper if it helps your research:

@article{Inters,
    author={Yutao Zhu and
            Peitian Zhang and
            Chenghao Zhang and
            Yifei Chen and
            Binyu Xie and
            Zhicheng Dou and
            Zheng Liu and
            Ji-Rong Wen},
    title={INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning},
    journal={CoRR},
    volume={abs/2401.06532},
    year={2024},
    url={https://arxiv.org/abs/2401.06532},
    eprinttype={arXiv},
    eprint={}
}

inters's People

Contributors

daod avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.