Comments (2)
Hi! Evaluation is notoriously fickle and prompt-sensitive.
To reproduce Meta's number, you would need to run the exact same setup (same batch size, same generation temperature, same few-shot samples in the exact same order, etc.). If you want to try to reproduce the results we get on the Open LLM Leaderboard, you can follow the steps in the reproducibility section of the About page.
from transformers.
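Since every sampling and decoding parameter has to match for numbers to line up, it can help to fix the seed and pin the generation parameters explicitly rather than relying on checkpoint defaults. A minimal sketch using `transformers` (the specific values here are illustrative, not Meta's actual evaluation settings):

```python
from transformers import GenerationConfig, set_seed

# Fix all sources of randomness before evaluating — few-shot sample
# order, sampling, and dataloader shuffling all depend on the seed.
set_seed(42)

# Pin every generation parameter explicitly instead of relying on the
# checkpoint's default generation_config, which can differ per release.
gen_config = GenerationConfig(
    do_sample=False,    # greedy decoding: temperature then has no effect
    max_new_tokens=256,
    num_beams=1,
)
print(gen_config.do_sample)  # False
```

The same `gen_config` object can then be passed to `model.generate(..., generation_config=gen_config)` for every eval run.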
Hey! I don't think it is a difference in implementation, as the model has been tested QUITE extensively over the past years 😓
What might be happening is that the `generation_config` does not include the same parameters? Or something with the prompt. We ran the evaluations on the Open LLM Leaderboard using `transformers` as a backend and made sure that we were able to reproduce the results.
cc @clefourrier, the lead of the LLM Leaderboards!
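One quick way to check the `generation_config` hypothesis is to diff the defaults shipped with a checkpoint against the parameters the harness actually uses. A hedged sketch (the "shipped" values below are made up for illustration — compare against your checkpoint's real `generation_config.json`):

```python
from transformers import GenerationConfig

# Defaults as they might ship with a checkpoint (illustrative values).
shipped = GenerationConfig(temperature=0.6, top_p=0.9, do_sample=True)

# Settings a leaderboard-style harness might use (greedy decoding).
harness = GenerationConfig(do_sample=False)

# Diff the two configs to spot mismatched parameters.
a, b = shipped.to_dict(), harness.to_dict()
mismatches = {k: (a[k], b[k]) for k in a if a[k] != b[k]}
print(sorted(mismatches))  # parameters that differ between the two runs
```

Any key that shows up here (temperature, top_p, do_sample, repetition penalties, etc.) is a candidate explanation for diverging scores.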