Here we are going to add the <a href="https://github.com/MilaNLProc/contextualized-top

Adding a neural topic model baseline about lady HOT 11 CLOSED

fani-lab commented on September 7, 2024

Adding a neural topic model baseline

from lady.

Comments (11)

hosseinfani commented on September 7, 2024 2

@farinamhz
We'll talk tomorrow.

Nonetheless, it's time to switch your experiments to Compute Canada. We have a doc in General > Files > Library > Compute Canada guide that can help you.

@smh997 did you convert that doc into https://github.com/fani-lab/Library/blob/main/ComputeCanada.md?

smh997 commented on September 7, 2024 2

@hosseinfani It is still in progress and still needs to be finalized. I am adding the GPU part. I expect to finish it by tomorrow (at least the first version as a draft). However, I can share my experience with @farinamhz before I update the repo.

farinamhz commented on September 7, 2024 1

Hi @hosseinfani,
I added the CTM baseline and added the hide-function percentages to the evaluation section.
However, there is a problem with this new model: its evaluation takes too long. Their paper reports a much shorter time per epoch, but we see ~16 minutes for training.
At the end of the day, we can handle the training time, but the evaluation is taking an unusually long time.
For example, we evaluate on 15% of 350 reviews, each of which has on average ~3 documents (sentences), and inference for each of these reviews takes almost 2 minutes.
This means that with 5 folds and 11 different evaluations (hiding the aspect at 0, 10, 20, ..., 100 percent), it takes almost 4 days in total to evaluate just the results before back-translation!
I am running on a GPU, and if evaluation takes this long, we will certainly not have time to test different values for each parameter!
Finally, I think there is a problem somewhere that is causing the slowdown, even though I followed their documentation.
That is the whole problem, and I would appreciate it if you had time for a meeting to talk about it.
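
As a rough check on the 4-day figure, the arithmetic can be sketched as follows. This is only a back-of-the-envelope estimate. It assumes the evaluations run one after another and that every test review takes the full 2 minutes. The function and parameter names are illustrative and are not part of the LADy or CTM code.

```python
def estimate_eval_minutes(
    n_reviews=350,             # total reviews, from the numbers above
    test_fraction=0.15,        # share of reviews used for evaluation
    minutes_per_review=2.0,    # observed inference time per review (approximate)
    n_folds=5,
    hide_percentages=range(0, 101, 10),  # 0, 10, ..., 100 -> 11 settings
):
    """Estimate total sequential evaluation time in minutes (assumption: no parallelism)."""
    reviews_per_run = n_reviews * test_fraction    # about 52.5 reviews per evaluation
    n_runs = n_folds * len(hide_percentages)       # 5 folds * 11 settings = 55 runs
    return reviews_per_run * minutes_per_review * n_runs


total_minutes = estimate_eval_minutes()
total_days = total_minutes / (60 * 24)
print(f"{total_minutes:.0f} minutes, about {total_days:.1f} days")
```

Under these assumptions the total comes to roughly 5,775 minutes, which is consistent with the "almost 4 days" estimate. Since the cost scales linearly with the per-review inference time, that is the number worth reducing first.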
