Comments (7)
Hello,
Thank you for your interest.
We fixed the dimension for our purposes to get a minor speed improvement. If you're interested, you can modify the line #define DIM 32, change it to 64, and recompile.
We plan to release either a separate version of the kernel with dynamic dims, or merge that into the kernel directly.
from neighborhood-attention-transformer.
Hi, I want to double-check: no matter the values of dim and num_heads, is the per-head dim always 32?
Hi,
No, we specifically kept the per-head dim at 32 for our four variants and scaled the number of heads for the larger variants. That's why we kept it fixed in the kernel.
Just an update:
You can now use arbitrary dims per head with v0.11 (PR #23).
@PeiqinZhuang If that resolves your question, feel free to close the issue.
Hi, I have one question: should I also change the block size from 32 to 64 if I change the default dimension from 32 to 64?
Sorry, to what exactly are you referring by block size?
Closing this due to inactivity. If you still have questions feel free to open it back up.
Related Issues (20)
- Can you release your training log of NAT? I mean, the summary.csv in output folder. HOT 3
- ONNX HOT 2
- How to visualize the attention map? HOT 3
- Welcome update to OpenMMLab 2.0 HOT 1
- Is it possible to do upsampling using NAT ? HOT 2
- Where is natten.py
- May I ask whether the code of coco instance segmentation mask2former is dinat or NAT? HOT 1
- some problem during train HOT 9
- Is DiNAT code is runnable? HOT 2
- Is dectect model available? HOT 2
- freeze_at be set to 2 to freeze the pretrained weight downloaded from the official website? HOT 2
- About the receptive field of image pixel HOT 4
- NAT Tiny performance on ImageNet 1k HOT 7
- training from scratch with different size for height and width HOT 3
- Cannot repeat the results of Mask2Former+DiNAT-Large on ADE20K HOT 12
- mmdetection on COCO2017 not converge HOT 1
- How to calculate the number of params? HOT 1
- For 3D segmentation HOT 2
- instance segmentation mask2former + dinat HOT 1
- Some comparisons against Deformable Attention HOT 4