Comments (4)
By classes, do you mean which shape the objects are (e.g. rectangle/triangle/circle)? In that case, you simply use a longer vector for each bounding box (the class and the color are encoded as one-hot vectors in each bounding box).
Or do you mean the number of objects on each image? That is very tricky to do. Part of the reason why the examples in here are so simple is because they contain a fixed number of objects per image. There are methods to recognize a variable number of objects per image, but they are way more sophisticated. See also the last chapter in my blogpost here.
from shape-detection.
Hello? I meant the number of objects and classes.
Currently I made some example codes with keras to contain max.100 classes or objecs per a scene; their class IDs as well as ROI (x, y, w, h) information is embedded as a 1D data in 'class number, x, y, w, h' order. So 5 data x 100 objects = 500 + dummy 12 = 512. And all other blanks are filled with zeros.
But accuracy is around 67% with 32x32 size VOC data. And its accuracy is dropped after around 14,000 epochs. Maybe I have to increase size of conv2D input to 224x224 or add more conv2d layers. Currently I used only 4 of conv2d layers.
Also, I haven't used encoded one-hot vectors for classes. Maybe I would try it with one-hot vectors as well....
Thanks,
from shape-detection.
With up to 100 objects, I think it will get very hard to get good results with the methods presented here. You might want to have a closer look at the papers I mention in the blogpost linked above (and there are also some more recent approaches).
Using one-hot encoding should definitely improve the situation for the object classification though.
from shape-detection.
Oh and just to be clear: There's no possibility to have a variable number of objects per image within the framework presented here (again, see the blogpost).
from shape-detection.
Related Issues (5)
- dist wrong HOT 1
- Accuracy is much lower in my case HOT 1
- data set HOT 1
- Rotated Regtangle
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from shape-detection.