Comments (11)
@farinamhz
We'll talk tomorrow.
nonetheless, it's time to switch your experiments to computecanada then. we have a doc in General > Files > Library > Compute Canada guide that helps you.
@smh997 did you convert that doc into https://github.com/fani-lab/Library/blob/main/ComputeCanada.md?
from lady.
@hosseinfani It is still in progress and still needs to be finalized. I am adding the GPU part. I expect to finish it by tomorrow (at least the first version as a draft). However, I can share my experience with @farinamhz before I update the repo.
from lady.
Hi @hosseinfani,
I added the CTM baseline and added the percentages of the hide function for the evaluation section.
However, there is a problem with this new model that its evaluation takes too much time. In their paper, they said much lesser time for each epoch. But we have ~16 minutes for training.
At the end of the day, we can handle the training, but the evaluation is taking unusual time.
For example, we are going to evaluate 15% of 350 reviews that each of them has avg ~3 documents or sentences, and inference for each of these reviews takes almost 2 minutes.
It means that if we have 5 folds and 11 different evaluations for 0, 10, 20,...,100 percent of hide the aspect, in total, it takes almost 4 days to evaluate just the results before back-translation!
I am running on GPU, and for sure, if it takes this amount of time, we would not have time to test different values for each param!
Finally, I think that there is a problem somewhere that is taking too much time, even when I have done it from their document.
This was the whole problem, and I would appreciate it if you had time for a meeting to talk about this.
from lady.
Hi @hosseinfani,
Results and code for the CTM model and changes in the evaluation have been added.
Also, all the results with their aggregation have been added, and you can see it in ../tree/main/output/English/Semeval-2016/25
#31
from lady.
@hosseinfani
Result for CTM:
from lady.
Fortunately, we have reasonable results like other baselines for the first 5 selections, which is good news!
from lady.
But in comparison with other models, the results of CTM are less than other baselines.
from lady.
Hi @hosseinfani,
These are the results for epoch = 10 and epoch = 100.
Unfortunately, with increasing epoch to 100, although we have an improvement in success values, the results after back-translation decrease!
(10 means epoch=10 and 100 means epoch=100)
from lady.
@farinamhz
Interesting! Can you do [10, 100, 200, 300, 400, 500, 1000] epochs and draw the same diagram?
from lady.
We found https://github.com/MIND-Lab/OCTIS that includes neural and non-neural topic modeling.
There is an issue installing scikit-learn == 0.24.2 when installing on python 3.10. I reduced the python to 3.7 and it's been installed with no issue.
b.py
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for scikit-learn
Failed to build scikit-learn
ERROR: Could not build wheels for scikit-learn, which is required to install pyproject.toml-based projects
from lady.
@farinamhz
we can close this issue. let me know otherwise.
from lady.
Related Issues (20)
- Adding HAST as a supervised baseline HOT 10
- Classification baseline for aspect term extraction HOT 18
- pipeline progress flow
- Check the existing readme and codeline HOT 2
- Gif image/video for illustrating the pipeline HOT 2
- Dockerize and fix installation on linux HOT 13
- a server for the web app
- Setup and Quickstart HOT 5
- Aspect based sentiment analysis + Running Bert and Cat library
- Adding Twitter Reviews Dataset HOT 1
- Needing for update in OCTIS library HOT 2
- Aspect Sentiment Triplet Extraction Baseline HOT 20
- New baseline for Aspect-Based Sentiment Analysis HOT 2
- Literature Review on Aspect and Sentiment Extraction HOT 1
- Adding a new tanslation model to the pipeline HOT 1
- OCTIS.CTM throws a value error during the training phase HOT 3
- Updating stats on quality of translation HOT 4
- Incorporating Underrepresented Languages: A Focus on Low-Resource Languages HOT 1
- Dataset for LADy (Story Board) HOT 22
- LADy Progress Documentation HOT 16
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lady.