Comments (8)
Initial dim of the tensor is always reserved for the batch size. So if you want to load only one clip, you need to expand initial dim to 1. Moreover, you are concatenating the frames in the wrong dim. Your final tensor shape should be [1,3,16,h,w] such that line 'x_2d = input[:, :, -1, :, :]' in "model.py" can successfully takes the last frame of the clip.
For a successful inference you need also processing of clip (such as normalization etc) same as the test phase.
from yowo.
@jinfagang
Thanks for your interest. Our 3D-CNN model extracts spatial-temporal information from an input clip consisting of several successive frames, thus you need to concatenate them (8/16 frames) together as a clip.
from yowo.
How to specific using 2d or 3d? it seems default use them all. 8/16 means 8~16 frames?
from yowo.
@jinfagang
3D model helps to understand an action, while 2D model boosts the localization precision. Our algorithm fuses both 3D and 2D information to achieve the spatial-temporal localization task. If only a single model is employed, the result will be worse. You can find the corresponding ablation study in our paper.
We provide two options: 8 frames or 16 frames. Model with 8 frames performs a little bit worse than 16 frames yet more efficient. The experiment results are also presented in the paper.
from yowo.
thanks, I got it. That means the input video at least 16 frames for inference?
from yowo.
@jinfagang
For the model with 16 frames, yes.
from yowo.
@wei-tim can I manually edit clip size to 32?
from yowo.
@jinfagang Running into the same error as you. I read in an image frame as an np.array and made the shape of the image [3,h,w]. Then I concatenated 16 consecutive frames into an array with shape [16, 3, h, w] before converting to Tensor.
I am still missing a dimension (shape length is current 4 and not 5). Did you find a fix?
from yowo.
Related Issues (20)
- Working on dataset other than humans HOT 3
- Can you provide each type of map in ava
- About the envs setup HOT 2
- KeyError: 'exp_avg' HOT 1
- YOWO not using GPUs? HOT 1
- model is None
- how is the yolo.weights trained?
- Training YOWO on a custom dataset HOT 1
- Dropbox links for pre-trained models for J-HMDB, UCF and their annotations give 404 error HOT 1
- The [email protected] results reported in the paper are for validation or test splits of the AVA dataset?
- A stronger YOWO achieved by us. HOT 4
- not find yowo_jhmdb21_32f_best.pth
- plz,how to make and train my own dataset?
- test_video_ava.py error
- ava_detection_val_boxes_and_labels.csv is missing
- ava_classnames.json is missing
- /usr/home/sut path HOT 1
- animal action reconginition
- Training YOWO on a customized dataset HOT 2
- This code lacks any Conda environment or usage instructions.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from yowo.