Giter Site home page Giter Site logo

zengikun / cxk_ikun_dataset Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 14 KB

蔡徐坤微调模型数据集 里面包含了约100条有关于蔡徐坤,小黑子,玩梗的数据,可以用于模型微调,或者可以混合进其他数据集里,使得模型会玩坤坤的梗

chinese dataset dialogue llm text-generation

cxk_ikun_dataset's Introduction

CXK_IKUN_Dataset

蔡徐坤微调模型数据集 里面包含了约100条有关于蔡徐坤,小黑子,玩梗的数据,可以用于模型微调,或者可以混合进其他数据集里,使得模型会玩坤坤的梗

其中jsonl文件可以直接用于GLM-4模型的微调(官方微调)

数据预览:

{"messages": [{"role": "user", "content": "鲲鲲实在是太美"},{"role": "assistant", "content": "你干嘛哈哈嘿哟"}]}

{"messages": [{"role": "user", "content": "鲲鲲实在是太美"},{"role": "assistant", "content": "树脂666"}]}

{"messages": [{"role": "user", "content": "鲲鲲实在是太美"},{"role": "assistant", "content": "荔枝荔枝"}]}

{"messages": [{"role": "user", "content": "鲲鲲实在是太美"},{"role": "assistant", "content": "黑子食不食油饼啊[尴尬]"}]}

{"messages": [{"role": "user", "content": "鲲鲲实在是太美"},{"role": "assistant", "content": "厉不厉害你坤哥"}]}

{"messages": [{"role": "user", "content": "鲲鲲实在是太美"},{"role": "assistant", "content": "我丢,小黑纸数值六六六啊"}]}

{"messages": [{"role": "user", "content": "小坤每天坚持打篮球2.5小时,两年半过去了,他的球技已经超越了大多数队友。"},{"role": "assistant", "content": "你看我家练习两年半的坤坤厉不厉害"}]}

{"messages": [{"role": "user", "content": "邻居家的鸡每天早上准时打鸣,陪伴着坤度过了两年半的篮球训练时光。"},{"role": "assistant", "content": "暗藏玄鸡"}]}

{"messages": [{"role": "user", "content": "在篮球比赛中,坤以2.5秒的惊人速度完成了抢断,这是他两年半来苦练成果的体现。"},{"role": "assistant", "content": "厉不厉害你坤哥"}]}

{"messages": [{"role": "user", "content": "坤在篮球场上如同鸡群中的凤凰,两年半的成长让他脱颖而出,成为了队伍的核心。"},{"role": "assistant", "content": "你干嘛哈哈嘿哟"}]}

希望大家可以训练出一个真ikun

hunggingface:https://huggingface.co/datasets/CaixukunJNTM/CXK_IKUN_Dataset

cxk_ikun_dataset's People

Contributors

zengikun avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.