Lifan Yuan's Projects
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"
Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations".
An Open-Source Package for Textual Adversarial Attack.
Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"