ChatLaw discussions from PKU-YuanGroup

Where can I find config.json?

The error says that JessyTsu1/ChatLaw-13B does not appear to have a file named config.json.

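Question about meta_instruction in the training data demo

Thank you for contributing such an excellent open-source project.
In the training data demo, the instruction in meta_instruction is not grammatical ("你一个名叫" is missing the verb "是"):
"meta_instruction": "你一个名叫ChatLAW
Is this an oversight or intentional? Will it affect the model's performance?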

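text2vec dataset construction

Judging from the example on Hugging Face, the raw text2vec data looks like QA data. How did you turn the QA data into training data for text embeddings? Does CoSENT training require three samples (sentence1, sentence2, sentence3)?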
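
For context on the CoSENT question: as commonly described, CoSENT trains on sentence pairs that carry a similarity label, not on triplets, and ranks the cosine similarities of the pairs against each other. The sketch below illustrates that loss in plain Python. It is not the ChatLaw authors' pipeline; the function names, the toy 2-d embeddings, and the scale of 20 are assumptions, and how QA data was turned into labeled pairs for Law Text2Vec is not stated in this thread.

```python
import math
from itertools import product


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cosent_loss(pairs, labels, scale=20.0):
    """CoSENT-style ranking loss over labeled sentence pairs.

    pairs:  list of (embedding1, embedding2) tuples
    labels: one similarity label per pair (higher means more similar)
    """
    sims = [cosine(u, v) for u, v in pairs]
    # The leading 0.0 is the "1" inside log(1 + sum(exp(...))).
    terms = [0.0]
    for i, j in product(range(len(sims)), repeat=2):
        if labels[i] > labels[j]:
            # Penalize a less similar pair j scoring above a more similar pair i.
            terms.append(scale * (sims[j] - sims[i]))
    # Numerically stable log-sum-exp.
    m = max(terms)
    return m + math.log(sum(math.exp(t - m) for t in terms))


# Toy example: an identical pair and an orthogonal pair.
pairs = [([1.0, 0.0], [1.0, 0.0]), ([1.0, 0.0], [0.0, 1.0])]
loss_correct = cosent_loss(pairs, labels=[1, 0])  # labels agree with the similarities
loss_wrong = cosent_loss(pairs, labels=[0, 1])    # labels contradict the similarities
print(loss_correct, loss_wrong)
```

With labels that agree with the cosine ordering the loss is close to zero, and with contradicting labels it is roughly equal to the scale.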

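How is the Keyword LLM trained?

Hello, my understanding is that fine-tuning the Keyword LLM requires keyword outputs paired with the input articles before training can start. Did you build a keyword dataset?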

Will the corpus for the Law Text2Vec model be open-sourced?

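Why does the Hugging Face repository have no config.json file? The model cannot run even after merging.

As in the title.
Also: if I plug in the merged Ziya model it runs right away, so the only thing missing is the config file. Why was it not uploaded to the HF repository?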

After merging, the model outputs only <unk>

import torch
from peft import PeftModel
from transformers import LlamaForCausalLM, LlamaTokenizer

ziya_model_path = "/Ziya-LLaMA-13B-v1/"  # path to the full Ziya model weights
chatlaw_model_path = "JessyTsu1_ChatLaw-13B_adapter"  # ChatLaw adapter weights

# Load the base Ziya tokenizer and model in half precision.
tokenizer = LlamaTokenizer.from_pretrained(ziya_model_path)
model = LlamaForCausalLM.from_pretrained(
    ziya_model_path,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Attach the ChatLaw adapter to the base model.
model = PeftModel.from_pretrained(model, chatlaw_model_path)

# LLaMA tokenizers ship without a pad token, so fall back to <unk>.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.unk_token

# model.half()
model.eval()

Generation:

def law_answer(query):
    # Earlier settings; overridden by the assignment below.
    gen_conf = {
        "bos_token_id": tokenizer.bos_token_id,
        "do_sample": True,
        # "num_beams": 4,
        "eos_token_id": tokenizer.eos_token_id,
        "max_new_tokens": 256,
        "pad_token_id": 0,
        "penalty_alpha": 0.3,
        "repetition_penalty": 1.0,
        "temperature": 0.65,
        "top_k": 15,
        "top_p": 0.85,
    }

    # Settings actually passed to model.generate().
    gen_conf = {
        "max_new_tokens": 512,
        "temperature": 0.1,
        "top_p": 0.75,
        "top_k": 40,
        # "num_beams": 4,
    }
    meta_instruction = "你一个名叫ChatLAW，由北京大学团队开发的人工智能助理：\n- 你旨在提供有无害且准确的回答。\n- 你必须拒绝回答非法的问题。\n- 你的回应不能含糊、指责、粗鲁、有争议、离题或防御性。\n- 你的回应必须有礼貌。"

    # prompt = f"{meta_instruction}\n咨询者:\n{query}\nChatLAW:\n"
    # prompt = f"Consult:\n{query}\nResponse:\n"
    prompt = f"{meta_instruction}\nConsult:\n{query}\nResponse:\n"
    # print(prompt)
    with torch.no_grad():
        # Tokenize the full prompt and move it to the model's device.
        inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
        output = model.generate(**inputs, **gen_conf)
    # Keep only the newly generated tokens, dropping the prompt.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    decoded_text = tokenizer.decode(new_tokens)
    return decoded_text.replace("<s>", "").replace("</s>", "")

Generation result:

query= "公司无故辞退摸鱼员工，是否触犯法律？"

law_answer(query)

<unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk>

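Is your website developed and deployed with Create React App?

The Create React App homepage carries a statement of support for Ukraine. Could that raise political-sensitivity concerns?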

Comparison with existing models

Impressive work! I believe it would achieve SOTA performance. It would be even better if the authors included examples in the README that compare the outputs of different methods (e.g., LawGPT and the different variants in your repo). Just a kind suggestion :).

How much GPU memory is needed to run the 13B model?
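
As a rough lower bound, the memory needed just to hold the weights can be estimated from the parameter count and the bytes per parameter. The sketch below is back-of-the-envelope arithmetic only: the helper name `weight_memory_gib` is hypothetical, 13e9 is a nominal parameter count, and activations, the KV cache, and framework overhead all come on top of these figures.

```python
def weight_memory_gib(num_params, bytes_per_param):
    """Memory in GiB needed to store the model weights alone."""
    return num_params * bytes_per_param / 2**30


NUM_PARAMS = 13e9  # nominal size of a "13B" model (assumption)

# Common storage precisions and their size in bytes per parameter.
for name, nbytes in [("fp32", 4), ("fp16", 2), ("int8", 1)]:
    print(f"{name}: {weight_memory_gib(NUM_PARAMS, nbytes):.1f} GiB")
```

In fp16, the precision used in the example code elsewhere in this thread, the weights alone come to roughly 24 GiB.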

Will the prior-knowledge constraint algorithm you mention be made public?

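Hoping to get in touch

Dear ChatLaw developers, I am 尖米, a developer and volunteer in the InternLM community. Your open-source work has inspired me greatly, and I would like to discuss the feasibility of implementing ChatLaw with InternLM and a possible path to doing so. My WeChat ID is mzm312; I hope we can get in touch for a deeper exchange.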

Self-Suggestion does not seem to be provided in the demo code

"We propose a self-attention method to enhance the ability of large models to overcome errors present in reference data, further optimizing the issue of model hallucinations at the model level and improving the problem-solving capabilities of large models. "

Could you provide this part of the code?

Will the weights be open-sourced? Or is there something like delta weights that can be merged with the original LLaMA weights?

Training data for the keyword extraction model

How was the training data for the keyword extraction model obtained?

Great project; I hope it will be maintained long term

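Could you share the general training approach for the Keyword model?

Could you share the general training approach for the Keyword model? Was it trained as a large model or a small one, and how much data is needed?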

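Will the evaluation data be open-sourced?

We have recently been working on the legal vertical domain as well. Could the evaluation data be open-sourced to make side-by-side comparison easier? Thanks!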

README link problem

The link https://chatlaw.cloud/lawchat/ in the README returns a 404.

Is retrieval used for the other models as well?

As in the title.

The LLaMA 2 weights have been released; is there a corresponding merge?

Or a matching upgrade of your model.

Will the code for the keyword LLM and self-suggestion be open-sourced?

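How to preserve the base model's original question-answering ability

When fine-tuning on instruction data, did you also add general-purpose data?
When I tried fine-tuning:
1. Instruction + vertical-domain questions / no instruction + general questions
==> There is no way to tell whether a user's question needs the instruction. With the instruction the model answers correctly; without it the model cannot answer.
2. Vertical-domain questions only
==> The model can only answer vertical-domain questions, and its general ability is completely lost.

Thanks!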
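
The first setup above, instruction-tagged domain samples mixed with general samples, can be sketched as follows. This only illustrates the data mixing itself and is not the ChatLaw authors' recipe: the function name `mix_datasets`, the 30% share of general data, and the string sample format are all assumptions.

```python
import random


def mix_datasets(domain, general, general_ratio=0.3, seed=0):
    """Return a shuffled training list in which roughly `general_ratio`
    of the samples come from the general-purpose pool.

    `general_ratio` must be in [0, 1).
    """
    rng = random.Random(seed)
    # Number of general samples needed to reach the target share.
    n_general = round(len(domain) * general_ratio / (1.0 - general_ratio))
    n_general = min(n_general, len(general))
    mixed = list(domain) + rng.sample(general, n_general)
    rng.shuffle(mixed)
    return mixed


# Hypothetical toy data: 70 domain samples and a pool of 100 general samples.
domain_samples = [f"domain-{i}" for i in range(70)]
general_samples = [f"general-{i}" for i in range(100)]
train_set = mix_datasets(domain_samples, general_samples, general_ratio=0.3)
print(len(train_set))
```

Whether the two sources share one prompt format, and what ratio works, would have to be determined experimentally.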

Discussion group 4 is full and I cannot join. Is there a new group?

The model does not work

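Did the Contract Law cease to be effective after January 1, 2021? Have the authors updated for this?

After the Civil Code took effect on January 1, 2021, legal texts including the General Principles of the Civil Law of the People's Republic of China, the Guarantee Law of the People's Republic of China, the Contract Law of the People's Republic of China, the Property Law of the People's Republic of China, and the General Provisions of the Civil Law of the People's Republic of China ceased to be effective.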

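Question about ChatLaw training

In Section 3.1 of the paper, the authors write "Additionally, we introduced the self-suggestion role to further alleviate model hallucination issues". I do not quite understand this sentence. How is the self-suggestion role reflected in training? Is it through a specially designed loss function, or is the model's self-suggestion achieved through prompts in the training samples? If the latter, could you give an example?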

Training corpus data

Will the training corpus data be open-sourced?

ImportError: cannot import name 'LlamaForCausalLM' from 'transformers'

When I run demo/web.py, I get the following error:
ImportError: cannot import name 'LlamaForCausalLM' from 'transformers'
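
This error usually indicates a transformers release that predates LLaMA support, which to my knowledge arrived in version 4.28.0. The check below is a minimal sketch for confirming that without importing the package; the helper name `supports_llama` is hypothetical and the 4.28 threshold should be treated as an assumption to verify against the release notes.

```python
from importlib.metadata import PackageNotFoundError, version


def supports_llama(version_string):
    """Return True if a transformers version string is at least 4.28."""
    major, minor = (int(part) for part in version_string.split(".")[:2])
    return (major, minor) >= (4, 28)


try:
    installed = version("transformers")
    print(f"transformers {installed}: LLaMA classes expected = {supports_llama(installed)}")
except PackageNotFoundError:
    print("transformers is not installed")
```

If the check reports False, upgrading transformers and rerunning demo/web.py is the first thing to try.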

“we fine-tuned an LLM to extract the keywords from user queries.”

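How are keywords extracted from the input (some extracted words do not even appear in the input)?

After fine-tuning, is the LLM's output for a given input used directly as the keywords, or is the output post-processed to obtain them?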
@JessyTsu1

The model merged fine, but the example code raises an error

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)