
zero_nlp's Introduction

zero to nlp

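Features

  1. Focuses on building NLP resources for Chinese
  2. Built on the transformers library
  3. Every project includes data, model training, and deployment modules
  4. Provides public datasets
  5. Works out of the box: all code is guaranteed to run, and you can train your own model simply by swapping in your own data
  6. Source-level walkthroughs of the transformers library
  7. Based on PyTorch; TensorFlow is not used
  8. On the model side, covers only large models; traditional text models are no longer covered
  9. Every project lists the related pitfalls and caveats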

Contents

Model training

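Name | Folder | Data | Data cleaning | Large model | Model deployment | Diagram
Chinese text classification | chinese_classifier | ☑️
Chinese gpt2 | chinese_gpt2 | ☑️
Chinese clip | chinese_clip | ☑️
Generating Chinese text from images | VisionEncoderDecoderModel | ☑️
vit core source code walkthrough | vit model | ☑️ ☑️ ☑️ ☑️
Training Thu-ChatGlm-6b | simple_thu_chatglm6b | ☑️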

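Data flow diagrams

I have always found that a data flow is clearest when drawn as a diagram, so I try to diagram every task wherever possible.

Text classification data flow diagram

Chinese gpt2

Chinese clip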

model

Generating Chinese text from images

model

vit source code
