Giter Site home page Giter Site logo

Comments (3)

hughperkins avatar hughperkins commented on June 12, 2024

You would need 3-d convolutions I think? Current convolutions are 2-d, over images. But I think you'd be convolving over time too, is that right?

I think this will be quite hard to shoe-horn into deepcl, which was originally intended to have a very strictly limited scope of handling Go-boards, and as such handles only square images. It could be upgraded to handle non-square videos, but would be a fair amount of work.

If I was in your position, I might plausibly look at porting across the similar layers in cuda torch into cl torch. That should be fairly straightforward to do. I can probably actually handle that if you are interested? cl torch is at https://github.com/hughperkins/clnn Let me know if this could be interesting to you.

from deepcl.

Pax1601 avatar Pax1601 commented on June 12, 2024

Indeed it requires 3-d convolutions, but as you can read in the webpage I have linked above, it also provides a Caffe model where 3 subsequent grayscale frames are merged in a single RGB frame. Do you think such model may work?

My problem is portability. The final program must run on a drone, and right now I'm not sure about the architecture of the GPU. The only information I have is that it should run OpenCL. I'd like to be more precise, but I'm asking you this question because I will require optical flow computation for a university project and I still don't exactly know what machine the code will run on.

from deepcl.

hughperkins avatar hughperkins commented on June 12, 2024

2d convolution takes a stack of 2d images, and convolves them together, using an arbitrary number of filters, to give a number of output 2d images equal to the number of filters. Each filter is 3d: taking a stack of 2d images.

It sounds like your model will have 3 incoming image planes, is that right? In which case, it's just a standard convolution. Actually, when I say '2d', each image is 2d, but the convolution filters are 3d: taking a stack of input images. Its plausible that for video, one would actually need stacks of 4d filters actually, not 3d as I implied earlier.

DeepCL has the following requirements to run:

  • an opencl 1.1 compatible gpu
  • c and c++ runtime library
  • cmake build system (this bit is not very set in stone though, if you're willing to hack together some other way of building it)

from deepcl.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.