Comments (8)
In my case step 3 was unintentional, after labelling some images I had to split them into two projects, so I moved all images into two folders and renamed them, I also splitted the json array and renamed the task.data, unfortunately I missed a lower case letter. After importing the labels, all seemed fine until I found out that there were duplicated tasks and I was labelling images twice. In my user experience it would have helped me save some time if the front end didn't display the images having a 'wrong' case filepath.
Thank you, I didn't know about this feature to modify fields, I managed to fix filepaths in the json file directly and after reimporting it it worked fine.
As a newbie I spent some time to figure it out so I thought that if this happens to someone else this issue might be helpful.
from label-studio.
oh nice, I can filter by 'Storage filename', that is helpful too!
from label-studio.
Why do you need this step?
- rename local storage folder containing referenced data with different case (e.g. .\Images-> .\images)
What paths do you have in your original json file? Images or images?
Remove duplicates will work only if your task.data will be absolutely the same with other tasks. It sounds like you need to fix paths in your database, you can try running label studio this way: EXPERIMENTAL_FEATURES=1 label-studio
and try to use Add or Modify Field
in the data manager actions. There you can type something like replace("image", "Image")
and it should rename your paths. Then you will be able to run remove duplicates.
from label-studio.
Also you could filter your tasks by path using Data manager filters and remove all images that you don't need.
from label-studio.
Is this issue solved now?
from label-studio.
I think it's still an issue, images should be displayed case-sensitively, it shouldn't happen to have duplicated tasks and 'remove duplicated tasks' does not remove them. Do you agree?
Would it be helpful a pull request to display images case-sensitively?
from label-studio.
in my opinion 'duplicated task' means that it points to the same data source, I think this is the case.
from label-studio.
unfortunately I don't have enough time to contribute to this project, I will close this issue. If anyone else finds useful to fix it feel free to reopen. Thanks @makseq!
from label-studio.
Related Issues (20)
- Region hiding freezes screen when mouse is moving over the region HOT 5
- Auto-Annotation Not Appearing in Label Studio After Configuring Tesseract as ML Backend
- Smart Polygon and Brush don't exist despite mention in docs HOT 2
- strokeWidth=1 on masks? HOT 4
- offline cannot start label-studio HOT 1
- How can a second annotator annotate using the queue? HOT 1
- Adding Elestio as deployment option
- Tasks no longer showing up in label-studio projects HOT 14
- `Filters` --> `Pin to sidebar` not persisting when switching between projects. HOT 1
- Empty project UI on Google Chrome HOT 27
- Custom region label
- UI Project List Bug HOT 3
- NoneType Object error during drafting annotations After Upgrade HOT 2
- Can not reset password
- Filter is unstable
- Group for removed labels
- How to label linestrip?
- Brush-type predictions won't display by the defined attribute after upgrade LS
- How to have predefined value for datetime tag HOT 1
- Cant Able to Add Annotations HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from label-studio.