Comments (13)

maksymdolgikh commented on July 17, 2024

I found a workaround, at least for T4. Set the model to half-precision to avoid this error:

from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel

model = MambaLMHeadModel.from_pretrained('state-spaces/mamba2-130m')
model = model.half()

Let me know if you understand why this works :)

from mamba.

tridao commented on July 17, 2024

Looks like a Triton error. Which GPU are you using?

Prophet-Kathleen commented on July 17, 2024

An RTX 2080 Ti with driver version 535.171.04. Right now I am trying different versions of Python, the newest Triton (2.3.1), and PyTorch to fix this problem.

tridao commented on July 17, 2024

I'm not sure Triton supports GPUs before Ampere (e.g. the 2080) very well.

Prophet-Kathleen commented on July 17, 2024

I just borrowed an RTX 3060 (driver version 535.171.04) to test the code, and it works with Triton 2.3.1. Thanks.

ghaddarAbs commented on July 17, 2024

Same issue here, but on a V100 :(

jsie7 commented on July 17, 2024

With the newest Triton version (2.3.1), this seems to depend mainly on the GPU. I also ran into this error on an RTX 2080 Ti, so I tried to reproduce it on the different GPUs I have available.

Setup: Nvidia driver 535.161.07, CUDA 11.8, Triton 2.3.1, mamba-ssm v2.0.3
Working GPUs: V100, RTX 3090, RTX 4090, A100 (40GB & 80GB)
Index Error (map::at): RTX 2080 Ti, Titan RTX, Quadro RTX 6000
--> It seems the error only occurs on the Turing microarchitecture.

@ghaddarAbs V100 works for me; maybe update triton to 2.3.1?
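
One way to check up front whether a machine falls into the failing group is to look at the CUDA compute capability: the Turing cards listed above (RTX 2080 Ti, Titan RTX, Quadro RTX 6000) report 7.5, the V100 reports 7.0, and Ampere or newer cards report 8.0 and above. A minimal sketch, assuming PyTorch is installed for the optional live query; `arch_name` and `current_gpu_arch` are illustrative helper names, not part of mamba-ssm or Triton:

```python
# Compute capability -> NVIDIA microarchitecture (only the entries relevant here).
_ARCH_BY_CAPABILITY = {
    (7, 0): "Volta",         # V100
    (7, 5): "Turing",        # RTX 2080 Ti, Titan RTX, Quadro RTX 6000, Tesla T4
    (8, 0): "Ampere",        # A100
    (8, 6): "Ampere",        # RTX 3060, RTX 3090
    (8, 9): "Ada Lovelace",  # RTX 4090
}


def arch_name(capability):
    """Return the architecture name for a (major, minor) compute capability."""
    return _ARCH_BY_CAPABILITY.get(tuple(capability), "unknown")


def current_gpu_arch(device=0):
    """Query the local GPU through PyTorch; returns None if no CUDA device is visible."""
    import torch  # imported lazily so arch_name() works without PyTorch

    if not torch.cuda.is_available():
        return None
    return arch_name(torch.cuda.get_device_capability(device))
```

If `current_gpu_arch()` returns "Turing", the results above suggest the error is likely to show up with this setup.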

ghaddarAbs commented on July 17, 2024

@jsie7 thanks for the suggestion, I will try it out. Which torch version did you use?

jsie7 commented on July 17, 2024

> @jsie7 thanks for suggesting I will try it out ... which torch version you used ?

I'm using v2.0.1

SolomidHero commented on July 17, 2024

I experienced the same problem :(

Driver Version: 550.54.14      
CUDA Version: 12.4 # by nvcc -V
Tesla T4  
triton==2.3.1

I only installed torch with cuda12.1 support.
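
Reports like this one are easier to compare when they are collected the same way. Note that the CUDA version printed by `nvcc -V` is the locally installed toolkit, which can differ from the CUDA version the PyTorch wheel was built against (12.4 versus 12.1 here). A minimal sketch for gathering the relevant versions, assuming the packages are installed under the distribution names `torch`, `triton` and `mamba-ssm`; the helper names are illustrative:

```python
from importlib import metadata


def package_versions(names=("torch", "triton", "mamba-ssm")):
    """Return installed versions of the given distributions, or 'not installed'."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = "not installed"
    return versions


def torch_cuda_info():
    """Return the CUDA build of PyTorch and GPU details, if PyTorch is importable."""
    try:
        import torch
    except ImportError:
        return {}
    info = {"torch CUDA build": torch.version.cuda}
    if torch.cuda.is_available():
        info["GPU"] = torch.cuda.get_device_name(0)
        info["compute capability"] = torch.cuda.get_device_capability(0)
    return info


if __name__ == "__main__":
    for key, value in {**package_versions(), **torch_cuda_info()}.items():
        print(f"{key}: {value}")
```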

jsie7 commented on July 17, 2024

> I experienced same problem :(
>
> Driver Version: 550.54.14
> CUDA Version: 12.4 # by nvcc -V
> Tesla T4
> triton==2.3.1
>
> I only installed torch with cuda12.1 support.

The Tesla T4 is also based on the Turing microarchitecture. This just further confirms that it's an issue with that architecture.

JulienSiems commented on July 17, 2024

Thanks @maksymdolgikh, this worked for me. I also have no idea why it works, though.

samjoc commented on July 17, 2024

> I found a workaround, at least for T4. Set the model to half-precision to avoid this error:
>
> model = MambaLMHeadModel.from_pretrained('state-spaces/mamba2-130m')
> model = model.half()
>
> Let me know if you understand why this works :)

worked on 2080Ti for me too.
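
For code that has to run on both Turing and newer GPUs, the workaround can be applied only when the error actually appears. This is a rough sketch, not something taken from mamba-ssm: it assumes the failure reaches Python as an `IndexError` whose message contains `map::at` (the thread only describes it as "Index Error (map::at)"), and `run_with_half_fallback` and `run` are hypothetical names.

```python
def run_with_half_fallback(model, run):
    """Call run(model); if the 'map::at' IndexError appears, retry once in half precision.

    `run` is any callable that takes the model and performs the forward pass.
    """
    try:
        return run(model)
    except IndexError as err:
        # Assumption: the Triton failure surfaces as IndexError mentioning 'map::at'.
        if "map::at" not in str(err):
            raise  # unrelated IndexError: do not hide it
        # torch.nn.Module.half() casts floating-point parameters and buffers to float16.
        return run(model.half())
```

With a model and `input_ids` already on the GPU, usage would look like `run_with_half_fallback(model, lambda m: m(input_ids))`. Half precision changes numerics, so this trades some accuracy for being able to run at all.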
