Giter Site home page Giter Site logo

lmwrapper's People

Contributors

claudiosv avatar dngros avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Forkers

claudiosv

lmwrapper's Issues

Accelerate Support

Accelerate allows training/inference of large models by automagically splitting the layers across CUDA devices. Initially, we had some issues with logprobs due to model forward patching. Now that we generally use transition scores, it may work with accelerate.

Add api for sampling more than once

Add some builtin support for generating multiple generations. This leaves room for more efficient generation on backends like HF (you only need to encode once). Also can potentially cache multiple generations (ideally in clever way that lets you grow the number of generations and reuse prior caches).

Weird whitespace on first token of Mistral output

The whitespace on the first token of Mistral-7B-v0.1 seems to differ from the huggingface output. This can be fixed upstream (like when synthegrator snaps whitespce), but we need to identify the root cause.

Ability to specify differing token limits for inputs and outputs

In ba839a4 we add support for GPT-4-Turbo. However, GPT-4-Turbo behaves differently than other models. There is a large input limit (128,000 tokens), but a smaller output limit of 4096 tokens. We don't have any way of representing this currently, so features like checking if a prompt will go over and prompt trimming will not behave as expected.

Top Tokens

Expose the probabilities of the top tokens in a prediction result

TODO:

  • OpenAI
  • HF Models

vLLM Support

vLLM offers an OpenAI compatible HTTP server and faster inference. We'd like to offer it as an option for lmwrapper.

Test regular NL encoder-decoder

#12 adds support for codet5+. We will want to also eventually verify that a more mainstream encoder-decoder model (like normal t5 or one of its more state-of-the-art successors) works too.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.