Comments (1)
看似是转换出来模型的词表对不上。需要注意的是,将.pth
模型转换为FP16-ggml时,tokenizer.model要用我们模型压缩包中提供的。另外,alpaca/llama的词表不能混用。麻烦再仔细对照流程检查一下。
也推荐你看一下在线的notebook,对照查看哪一步有出入:https://github.com/ymcui/Chinese-LLaMA-Alpaca/blob/main/notebooks/convert_and_quantize_chinese_llama.ipynb
from chinese-llama-alpaca.
Related Issues (20)
- 合并 lora 模型与原版 LLaMA 模型,合并文件夹里缺少部分文件,如config文件等 HOT 3
- 关于使用SentencePiece进行词表合并的问题 HOT 2
- Qwen72(int4)版本的 执行server为什么会这么慢呀? HOT 1
- TypeError: __init__() got an unexpected keyword argument 'merge_weights' HOT 2
- 训练保存checkpoint之后如何完成推理、评估 HOT 2
- 我的回答简直就是乱码一堆 HOT 2
- 报错 HOT 2
- 合并词表的时候,为何不重新统计spm分数? HOT 2
- 当运行时出现了tensor 'token_embd.weight' has wrong shape; expected 4096, 32001, got 4096, 32000, 1, 1 HOT 3
- About vocabulary extension HOT 2
- 中文token扩张的model是和llama绑定还是通用型model,如果我想新增token进行调优,请问有操作建议吗? HOT 1
- 运行预训练,一直卡顿 HOT 2
- 指令微调发生错误 HOT 2
- 扩充词表后只修改embedding_size,没有修改lm_head的维度 HOT 2
- lora 模型合并的几个问题 HOT 2
- ollama怎么集成 HOT 2
- 结合官方给定的Lora模型,推理不
- 结合官方给定的Lora,推理不准确 HOT 2
- Using Chinese-LLaMA-Alpaca For Low Resource Language such as Pashto HOT 2
- 在运行scripts/inference/inference_hf.py时,在seq_len> self.max_seq_len_cached部分,会出现RuntimeError: Boolean value of Tensor with more than one value is ambiguous HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from chinese-llama-alpaca.