Giter Site home page Giter Site logo

Comments (20)

liberty-hit avatar liberty-hit commented on May 25, 2024

By the way, there is a typo in your table, the supervision for works including yours should be I (Image) not S (Saliency) I think

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

There are a lot of things to prepare recently and I hope to release the code in the coming months.

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

By the way, there is a typo in your table, the supervision for works including yours should be I (Image) not S (Saliency) I think

Table 2?

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

By the way, there is a typo in your table, the supervision for works including yours should be I (Image) not S (Saliency) I think

Table 2?

yes

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

I'm trying to reproduce your result, but the training loss doesn't decrease at all...
In the Sec 3.3, you said the loss is log(s), in which s is a cosine similarity, so $s \in [-1, 1]$, and log(s) could be nan, did you really mean this, or you use other format losses

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

When I trying to reproduce, I use logit_scale to scale cosine similarity to logits \in [-100/7, 100/7] and then use logsigmoid to compute the loss, I wonder whether I use the correct format.

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

When I trying to reproduce, I use logit_scale to scale cosine similarity to logits \in [-100/7, 100/7] and then use logsigmoid to compute the loss, I wonder whether I use the correct format.

I just use the cosine similarity between two vectors, norm(v1) * norm(v2).T

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

When I trying to reproduce, I use logit_scale to scale cosine similarity to logits \in [-100/7, 100/7] and then use logsigmoid to compute the loss, I wonder whether I use the correct format.

In my experiments, the loss slightly decreased but the activation map gradually approached the target category.

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

When I trying to reproduce, I use logit_scale to scale cosine similarity to logits \in [-100/7, 100/7] and then use logsigmoid to compute the loss, I wonder whether I use the correct format.

I just use the cosine similarity between two vectors, norm(v1) * norm(v2).T

but how to compute log on the cosine similarity less than 0

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

clamp (0, 1)

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

clamp (0, 1)

ok, I'll try that formation, thanks for your reply

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

I hope to release the code in the coming months :-)

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

I hope to release the code in the coming months :-)

Great!, by the way, could you please provide the pre-defined cbs set

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

{'boat': ['a photo of river', 'a photo of water', 'a photo of lake', 'a photo of sea', 'a photo of building'],
'train': ['a photo of railroad', 'a photo of railway', 'a photo of branches', 'a photo of tree'], You can try this one.

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

{'boat': ['a photo of river', 'a photo of water', 'a photo of lake', 'a photo of sea', 'a photo of building'], 'train': ['a photo of railroad', 'a photo of railway', 'a photo of branches', 'a photo of tree'], You can try this one.

Nice

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

@Sierkinhane Sry to bother you again, could you please provide the finetuned CLIP model, I've finetuned the CLIP model using RN50 as backbone and got ~96% accuracy on VOC12 train_aug, but still failed to train CLIMS even only using loss_otm, the miou of CAMs on train set is only 19%

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

I've found the code to finetune CLIP (openai/CLIP#83), I'll try it

from clims.

liberty-hit avatar liberty-hit commented on May 25, 2024

But I wonder whether I use F.multilabel_soft_margin_loss and treat finetuning CLIP as a multi-label classification task is correct

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

Details were provided in email. : )

from clims.

Sierkinhane avatar Sierkinhane commented on May 25, 2024

@liberty-hit hello, the source code is released now.

from clims.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.