Comments (2)
运行代码:
/home/gfr/jxd/Huatuo-Llama-Med-Chinese/finetune.py --base_model ./model/llama-7b-hf --data_path ./data/llama_data.json --output_dir ./lora-llama-l1 --prompt_template_name med_template --micro_batch_size 128 --batch_size 128 --wandb_run_name l1
我觉得可能是这个原因:
Hey! The main issue is that they did not update the tokenizer files at "decapoda-research/llama-7b-hf" but they are using the latest version of transformers. The tokenizer was fixed see huggingface/transformers#22402 and corrected. Nothing we can do on our end...
我尝试把基模型中的tokenizer_config.json文件修改为:
{ "add_prefix_space": false, "bos_token": "<s>", "eos_token": "</s>", "model_max_length": 1000000000000000019884624838656, "pad_token": "<pad>", "padding_side": "right", "special_tokens_map_file": null, "tokenizer_class": "LlamaTokenizer", "unk_token": "<unk>" }
后可以运行,具体原因不是很清楚。
from huatuo-llama-med-chinese.
您好,请您完整描述一下您所运行的代码,并尽量提供报错的完整内容,谢谢
from huatuo-llama-med-chinese.
Related Issues (20)
- 答非所问 HOT 3
- 如何使用微调后的模型 HOT 1
- 报错BrokenPipeError: [Errno 32] Broken pipe,完整报错如下,请问这是哪里的问题 HOT 1
- ValueError: We need an `offload_dir` to dispatch this model according to this `device_map`...
- 请问有打算开发在线体验版吗? HOT 1
- 如何多卡部署 HOT 1
- 希望取得联系 HOT 1
- 您好,您这个项目如何运行起来? HOT 3
- 指令微调的训练集的数据分布 HOT 1
- 使用huozi模型时出现错误 HOT 4
- 请问数据集什么时候完整公布 HOT 1
- RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select) HOT 2
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xaf in position 93: illegal multibyte sequence HOT 2
- How to finetune on huozi? HOT 2
- 生成答案重复 HOT 2
- 但节点多GPU训练 HOT 1
- 为何使用Bloom model测试结果有问题? HOT 1
- 使用cpu进行推理,accelerate报KeyError的问题,key的值随机
- 请问什么时候公开测试集 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from huatuo-llama-med-chinese.