Giter Site home page Giter Site logo

bilivideos's Introduction

Hi I'm Xinyu 👋

  • 🔭 I’m currently working at Bytedance.

  • 🌱 I’m currently learning Graph-Embedding/NLP and more.

  • ✨ Recently I'm interested in LLM, and here is some Modest Understandings on LLM

  • 💬 Ask me about

    • 🐍 Python
    • 🚀 CUDA/C++
    • 🐛 Write bugs
  • 📫 How to reach me: [email protected]

  • 📱 Tel: 18338224727

  • 📺 Bilibili: 小杨不努力0v0

  • ⚡ Fun fact: My cat's name is Aphelios, which happens to be the name of my favorite hero in League of Legends.

Anurag's github stats

bilivideos's People

Contributors

cauyxy avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

bilivideos's Issues

FastTokenizer下的乱码问题

首先感谢作者提供的可视化工具

使用FastTokenizer时(其实也是我猜的),比如bloom直接AutoTokenizer.from_pretrained()加载进来后,在这一行:

raw_str = tokenizer.convert_ids_to_tokens([tid])[0]
如果使用convert_ids_to_tokens的话会解出一大堆乱码。改用tokenizer.decode能解决这个问题

LLAMA 中文乱码

这个有办法解决吗?+1
例如:LLAMA

 我 <0xE5> <0x88> <0x9A> <0xE5> <0x88> <0x9A> 

希望可以做个 多个 token 映射 汉字 时候,可以加权求和

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.