Comments (5)
Thank you for your suggestion. We will definitely consider adding this feature in future updates. In the meantime, you can refer to this link to fine-tune.
from qwen-vl.
Were you able to fine-tune? If so, how? Is there a tutorial or any documentation? @monksgoyal @ShuaiBai623
The link mentioned above is not working.
@ShuaiBai623 The link is not working.
With bnb (bitsandbytes) quantization, I get the following error:
RuntimeError: Input type (torch.cuda.ByteTensor) and weight type (torch.cuda.HalfTensor) should be the same
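The message says that a uint8 tensor reached a layer whose weights are float16. The sketch below reproduces that kind of mismatch on a small stand-in layer and shows that casting the input to the layer's weight dtype clears it. It is only an illustration: `conv`, `pixels`, and `forward_with_matching_dtype` are hypothetical names that do not come from the Qwen-VL code, it runs on CPU with float32 weights (so the error wording differs slightly), and it is not a confirmed fix for the bnb case reported here.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for a vision layer; not taken from the Qwen-VL code.
conv = nn.Conv2d(in_channels=3, out_channels=4, kernel_size=3)

# A raw image batch stored as uint8 (a ByteTensor), as decoded pixels often are.
pixels = torch.randint(0, 256, (1, 3, 8, 8), dtype=torch.uint8)


def forward_with_matching_dtype(layer: nn.Module, x: torch.Tensor) -> torch.Tensor:
    """Cast the input to the layer's weight device and dtype before calling it."""
    weight = layer.weight
    return layer(x.to(device=weight.device, dtype=weight.dtype))


# Passing the uint8 batch directly triggers the dtype mismatch.
try:
    conv(pixels)
    mismatch_raised = False
except RuntimeError as err:
    mismatch_raised = True
    print(f"reproduced: {err}")

# After the cast, the forward pass goes through.
out = forward_with_matching_dtype(conv, pixels)
print(out.dtype, tuple(out.shape))
```

If the real failure comes from quantized weights rather than from the input, the cast alone may not help; in that case it is worth checking which modules were quantized.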
The LoRA fine-tuning code for both distributed and single-GPU training is now ready; please refer to Finetuning.