Comments (13)
I found a workaround, at least for T4. Set the model to half-precision to avoid this error:
from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel
model = MambaLMHeadModel.from_pretrained('state-spaces/mamba2-130m')
model = model.half()
Let me know if you understand why this works :)
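A slightly more defensive sketch of the same workaround, assuming (as later comments in this thread suggest) that the failure is specific to Turing GPUs, which report CUDA compute capability 7.5. The `halve_if_turing` helper is illustrative, not part of mamba_ssm:

```python
import torch

def halve_if_turing(model, capability):
    """Cast the model to fp16 only on Turing (sm_75), where this thread
    reports the Triton fp32 path failing; leave other GPUs in fp32."""
    return model.half() if capability == (7, 5) else model

if torch.cuda.is_available():
    # Loading the real model needs mamba_ssm and downloaded weights.
    from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel
    model = MambaLMHeadModel.from_pretrained(
        'state-spaces/mamba2-130m', device='cuda'
    )
    model = halve_if_turing(model, torch.cuda.get_device_capability(0))
```

This keeps full fp32 precision on architectures that don't exhibit the bug, instead of always downcasting.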
from mamba.
Looks like a Triton error, which GPU do you use?
RTX 2080 Ti with driver version 535.171.04. Right now I am trying different Python versions, the newest Triton (2.3.1), and different PyTorch versions to fix this problem.
I'm not sure Triton supports GPUs older than Ampere (e.g. the 2080) very well
I just borrowed an RTX 3060 (driver version 535.171.04) to test the code, and it works; the Triton version is 2.3.1. Thanks!
Same issue here, but using a V100 :(
With the newest Triton version (2.3.1), this seems mainly related to the GPU used. I also ran into this error on an RTX 2080 Ti, so I tried to reproduce it on the different GPUs I have available.
Setup: Nvidia driver 535.161.07, CUDA 11.8, Triton 2.3.1, mamba-ssm v2.0.3
Working GPUs: V100, RTX 3090, RTX 4090, A100 (40GB & 80GB)
IndexError (map::at): RTX 2080 Ti, Titan RTX, Quadro RTX 6000
--> it seems the error only occurs on the Turing microarchitecture
@ghaddarAbs V100 works for me; maybe update triton to 2.3.1?
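The pattern above lines up with CUDA compute capability: the failing cards (RTX 2080 Ti, Titan RTX, Quadro RTX 6000, and the Tesla T4 reported below) are all Turing sm_75, while the working V100 is Volta sm_70 and the rest are sm_80+. A quick sketch to check whether your GPU is affected; the `is_turing` helper is illustrative, not part of mamba_ssm or Triton:

```python
def is_turing(major, minor):
    """Turing GPUs (RTX 20xx, Titan RTX, Quadro RTX, Tesla T4) all report
    CUDA compute capability 7.5; Volta (V100) is 7.0, Ampere is 8.x."""
    return (major, minor) == (7, 5)

try:
    # Optional: query the local GPU when torch with CUDA is available.
    import torch
    if torch.cuda.is_available():
        major, minor = torch.cuda.get_device_capability(0)
        print(f"sm_{major}{minor} -> Turing: {is_turing(major, minor)}")
except ImportError:
    pass
```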
@jsie7 thanks for the suggestion, I will try it out ... which torch version did you use?
> @jsie7 thanks for the suggestion, I will try it out ... which torch version did you use?

I'm using v2.0.1
I experienced the same problem :(
Driver Version: 550.54.14
CUDA Version: 12.4 (per nvcc -V)
Tesla T4
triton==2.3.1
I only installed torch with CUDA 12.1 support.
> I experienced same problem :(
> Driver Version: 550.54.14, CUDA Version: 12.4 (per nvcc -V), Tesla T4, triton==2.3.1
> I only installed torch with cuda12.1 support.
The Tesla T4 is also based on the Turing microarchitecture, which further confirms that the issue lies with that architecture.
Thanks @maksymdolgikh, this worked for me too, though I also have no idea why.
> I found a workaround, at least for T4. Set the model to half-precision to avoid this error:
> model = MambaLMHeadModel.from_pretrained('state-spaces/mamba2-130m')
> model = model.half()
> Let me know if you understand why this works :)
Worked on a 2080 Ti for me too.
Related Issues (20)
- Questions about Chunk_size using Triton optimization in SSD kernel
- When I run mamba2 : ImportError: libcudart.so.11.0: cannot open shared object file: No such file or directory
- Possible bug when running evaluation with self.use_mem_eff_path=False
- Typo of dconv at Line 231 of modules/mamba2.py
- How to load mamba1's weight to mamba2 ?
- Small datasets
- Help with _chunk_state_fwd.
- Assertion error in ssd_minimal
- Questions regarding pretrained Mamba2-Attention Hybrid Model
- (about the paper) In the Section5.1, I have a question: Why M matrix, whose element is also matrix, can finally be (T, T) size?
- A mamba scaling problem given the perplexity score curves shown in the TTT paper
- Passing an initial_conv_state in mamba_split_conv1d_scan_combined?
- Self-distillation technique
- Question for 'self.use_mem_eff_path and inference_params'
- triton.runtime.autotuner.OutOfResources: out of resource: shared memory, Required: 254208, Hardware limit: 101376.
- I want to ask does anyone know how to solve this problem
- /anaconda3/lib/python3.11/site-packages/causal_conv1d_cuda.cpython-311-x86_64-linux-gnu.so: undefined symbol: _ZN3c107WarningC1ENS_7variantIJNS0_11UserWarningENS0_18DeprecationWarningEEEERKNS_14SourceLocationENSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEEb
- Mamba-2 Error: `'NoneType' object has no attribute 'causal_conv1d_fwd'`
- Used selective_scan_cuda and causal_conv1d_cuda, but still very slow to train
- mamba / self-attention hybrid generation