Comments (2)
Yes. Cleanlab does this automatically. If your classifier is appropriate for the data (reasonably high cross-validation accuracy), any image that is not labeled correctly will be included in the error set, regardless of its underlying true class.
This is easy to test: add an image of the letter A to the MNIST digit dataset and see whether cleanlab identifies the letter as an error.
If your dataset contains lots of birds, or if your model isn't very good (for example, a simple naive Bayes on image data), then the model won't be able to train well enough on cats and dogs to produce good predicted probabilities, which will hurt cleanlab's ability to find the errors. But for the most part, if you use a reasonable model that has high accuracy (> 90 percent) on the data when it's clean, and no more than 40 percent of the labels are wrong, cleanlab will find most of the birds in your cat / dog set.
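To make the intuition concrete, here is a minimal pure-Python sketch of ranking examples by self-confidence, meaning the predicted probability the model assigns to each example's given label. This is a simplification for illustration, not cleanlab's actual implementation. The function name `rank_by_self_confidence` and the toy probabilities are made up; in practice the probabilities should be out-of-sample (for example, from cross-validation).

```python
def rank_by_self_confidence(labels, pred_probs):
    """Return example indices ordered from least to most trustworthy label.

    labels:     list of integer class labels as given in the dataset
    pred_probs: list of per-class probability lists, one per example
    """
    # Self-confidence: probability the model assigns to the given label.
    scores = [probs[label] for label, probs in zip(labels, pred_probs)]
    # Lowest self-confidence first: these are the most suspicious labels.
    return sorted(range(len(labels)), key=lambda i: scores[i])


# Hypothetical out-of-sample probabilities from a cat (0) / dog (1) classifier.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
pred_probs = [
    [0.95, 0.05],
    [0.90, 0.10],
    [0.92, 0.08],
    [0.50, 0.50],  # a bird labeled "cat": the model is unsure either way
    [0.05, 0.95],
    [0.10, 0.90],
    [0.08, 0.92],
    [0.90, 0.10],  # a cat labeled "dog": the model confidently disagrees
]

ranking = rank_by_self_confidence(labels, pred_probs)
suspects = ranking[:2]  # the two least trustworthy labels
print(suspects)
```

With a good model, the bird stands out because neither class gets a high probability. With a weak model, every example looks uncertain and the bird no longer separates from the clean data, which is why model quality matters here.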
Thank you. This answered my question!