Comments (6)
The functions in utils.py should already be taking this into account.
from autowebcompat.
@marco-c Do all of them do this already? Also, what do you think about adding a data coverage function? I think a measure of how successful we are at dataset collection could help us in the long run.
from autowebcompat.
> @marco-c Do all of them do this already?

Yes, the ones we care about.

> Also, what do you think about adding a data coverage function? I think a measure of how successful we are at dataset collection could help us in the long run.

Do you mean the number of webcompat bugs we haven't taken screenshots for yet? Yes, that sounds useful.
from autowebcompat.
> Do you mean the number of webcompat bugs we haven't taken screenshots for yet? Yes, that sounds useful.

Yeah, that's what I meant. Do we keep track of which bugs we've already collected screenshots for?
But I was also pointing out that if we collect about 1000 sets of images through our crawler and, say, 100 of those sets are inconsistent, we are only using 90% of the collection. I think a measure of this could help us in the long run.
@marco-c
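To make the two measures discussed here concrete (bugs still missing screenshots, and the share of collected sets that is actually usable), here is a minimal sketch. All names in it (`CoverageReport`, `coverage_report`, and the three ID collections) are hypothetical and not part of autowebcompat; how the IDs would be gathered in practice is left open.

```python
from dataclasses import dataclass


@dataclass
class CoverageReport:
    """Hypothetical summary of dataset collection; not an autowebcompat API."""
    total_bugs: int     # webcompat bugs we know about
    collected: int      # bugs with a collected screenshot set
    inconsistent: int   # collected sets flagged as inconsistent

    @property
    def missing(self) -> int:
        # Bugs we have not taken screenshots for yet.
        return self.total_bugs - self.collected

    @property
    def usable(self) -> int:
        # Collected sets that are not flagged as inconsistent.
        return self.collected - self.inconsistent

    @property
    def usable_fraction(self) -> float:
        # Share of the collection we can actually use; 0.0 if nothing is collected.
        return self.usable / self.collected if self.collected else 0.0


def coverage_report(all_bug_ids, collected_ids, inconsistent_ids):
    """Build a report from three iterables of bug IDs (assumed inputs)."""
    all_bugs = set(all_bug_ids)
    collected = set(collected_ids) & all_bugs
    inconsistent = set(inconsistent_ids) & collected
    return CoverageReport(len(all_bugs), len(collected), len(inconsistent))


# The example from the comment above: 1000 sets collected, 100 inconsistent.
# The total of 1200 known bugs is an invented figure for illustration.
report = coverage_report(range(1200), range(1000), range(100))
print(report.missing, report.usable, f"{report.usable_fraction:.0%}")
# prints: 200 900 90%
```

Intersecting the sets first keeps the counts sensible even if an ID shows up in the inconsistent list without being in the collected list.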
from autowebcompat.
Yes, a script to see how many we have collected can definitely be useful.
from autowebcompat.
This can be part of the data_inconsistencies.py script.
from autowebcompat.
Related Issues (20)
- Sort labels when saving them
- test_labels should validate all labels files
- test_labels.py is not actually testing the screenshots actually exist
- Limit size of full page screenshot
- Script to rename images and labels according to new convention
- Implementing Object Segmentation networks for bounding box annotations
- Throw a meaningful error in utils.read_labels when labels.csv is empty
- Running pretrain.py gives FileNotFoundError.
- Try training a neural network using the responses from a DOM-based tool as features
- Try using the responses from a DOM-based tool as additional features
- Create a web-based tool to show predicted differences
- Move the labeling tool to be web-based
- Use multiple releases of each browser
- Try using mdn/browser-compat-data to automatically label screenshot pairs
- Investigate training a model to detect regressions in a browser
- When prefilling an issue on webcompat.com, prefill as much as possible
- Add possibility to navigate to websites on demand
- Use Docker to run browsers and collect screenshots
- Train a baseline classifier
- Out of memory error while training vgg16 and vgg19 with imagenet weights on Colab