While generating any text with a specified value of max_length, the generated text kee

Output with higher max_length is repetition of base text about gemma_pytorch HOT 6 OPEN

google commented on August 9, 2024

Output with higher max_length is repetition of base text

from gemma_pytorch.

Comments (6)

pengchongjin commented on August 9, 2024 1

Could you please try the instruction-tuned model instead? It should give you better results.

from gemma_pytorch.

AbhishekJ24 commented on August 9, 2024 1

I am just happy to be a part of this chat

from gemma_pytorch.

azrael05 commented on August 9, 2024

Could you please try the instruction-tuned model instead? It should give you better results.

Thanks, With the instruct tuned model the output is perfect.

Btw is there any reason why the gemma_2b_en model produced repetitive output instead ks stopping ?.

from gemma_pytorch.

pengchongjin commented on August 9, 2024

It's kind of expected that the pre-trained models only try to complete text. Maybe one way you could try is to tune the sampling parameters to see if you can get a bit diversity in the output.

from gemma_pytorch.

azrael05 commented on August 9, 2024

It's kind of expected that the pre-trained models only try to complete text. Maybe one way you could try is to tune the sampling parameters to see if you can get a bit diversity in the output.

Yeah, Its expected of it to complete the text but still shouldn't repeat its text right?
Example the other text generation models might produce half ending sentence outputs depending on the max_length size but they don't producr repeating ouputs.

from gemma_pytorch.

Ittiz commented on August 9, 2024

I've noticed the 2b model repeating itself as well. Although, I found it does it when the context of my prompt would be hard even for a human to figure out.

from gemma_pytorch.

Recommend Projects

Output with higher max_length is repetition of base text about gemma_pytorch HOT 6 OPEN

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent