Comments (3)

daerduoCarey commented on August 15, 2024

Hello, thanks for your interest in my code.

The PowerFile layer reads values from a file and uses them to shift its bottom data (the thetas in this case). The purpose of the shift is to initialize the thetas at a chosen location from the very beginning, based on prior knowledge. For example, we may want one crop to focus on the head of the bird and another crop to focus on the tail.

The purpose of the LocLoss layer is completely unrelated to that of the PowerFile layer. As I said in the README, LocLoss is designed as a loss function that keeps the crop from falling too far from the center of the image space. The two layers are independent of each other.
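To make the idea concrete, here is a minimal Python sketch of a penalty that is zero while the crop center stays within a threshold distance of the image center and grows quadratically beyond it. The function name `loc_penalty`, the `threshold` parameter, and the quadratic form are assumptions for illustration; the actual LocLoss layer in the repository may compute its loss differently.

```python
import math


def loc_penalty(tx, ty, threshold=0.5):
    """Hypothetical hinge-style penalty on the crop center (tx, ty).

    Coordinates are assumed to be normalized so that the image center is (0, 0).
    """
    dist = math.hypot(tx, ty)            # distance of the crop center from the image center
    excess = max(0.0, dist - threshold)  # only the part beyond the threshold is penalized
    return 0.5 * excess ** 2


print(loc_penalty(0.2, 0.1))  # inside the threshold, so no penalty
print(loc_penalty(0.9, 0.0))  # outside the threshold, so a positive penalty
```

Note that a penalty of this shape only pushes the crop back toward the center; it says nothing about how large or small the crop is.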

Sorry, I cannot find my crop_init_1.txt, but I guess it was something like 0.5, 0 or -0.5, 0, so that the initial crops focus on the left part and the right part of the original image.
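As a rough illustration of how such a shift moves the initial crop, the sketch below assumes the usual spatial transformer convention: a 2x3 affine theta maps normalized output coordinates in [-1, 1] to source coordinates in the input image, so the third entry of each row acts as a translation. The helper names, the 0.5 scale, and the reading of `0.5, 0` as an (x, y) translation offset are assumptions, not the real contents of crop_init_1.txt.

```python
def shift_theta(theta, offsets):
    """Add per-element offsets to a flat 6-value affine theta (hypothetical helper)."""
    return [t + o for t, o in zip(theta, offsets)]


def crop_bounds(theta):
    """Return (x_min, x_max, y_min, y_max) of the source region sampled by theta.

    Assumes theta = [a, b, tx, c, d, ty] maps output (x, y) in [-1, 1]^2 to
    source coordinates (a*x + b*y + tx, c*x + d*y + ty).
    """
    a, b, tx, c, d, ty = theta
    corners = [(-1, -1), (-1, 1), (1, -1), (1, 1)]
    xs = [a * x + b * y + tx for x, y in corners]
    ys = [c * x + d * y + ty for x, y in corners]
    return min(xs), max(xs), min(ys), max(ys)


# A half-size crop centered on the image, standing in for the localization net output.
base = [0.5, 0.0, 0.0, 0.0, 0.5, 0.0]

# Shift only the x translation, one crop to each side.
left = shift_theta(base, [0, 0, -0.5, 0, 0, 0])
right = shift_theta(base, [0, 0, 0.5, 0, 0, 0])

print(crop_bounds(left))   # x range runs from -1.0 to 0.0 (left half)
print(crop_bounds(right))  # x range runs from 0.0 to 1.0 (right half)
```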

Ask more if you are still confused. I hope my implementation helps with your research projects.

from spatialtransformerlayer.

naranjuelo commented on August 15, 2024

OK, I see. I wanted to use the LocLoss layer so that the crop does not fall too far away, as you said, but in my case the crop focuses too much, so my net is not learning anything. As I understand it, there is no way to limit this with a minimum value (a minimum threshold), am I right?

Instead, I also tried the STLoss layer, but I wanted to fix some theta values, for example two of them to avoid rotation (in the same way that this is possible for the LocLoss layer). The problem is that the STLoss layer needs 6 input parameters, so I cannot fix 2 thetas in the SpatialTransformer layer and then set the number of outputs of the layer before STLoss to 4. Is there any way to do this? Thank you!

daerduoCarey commented on August 15, 2024

First of all, sorry that I may have released too many confusing layers. They are helper layers I added while running experiments. In the end, when I got everything working (learning something useful), I did not activate any of them. The difficulty in training networks with STLayer is that the per-layer learning rates are hard to tune. Note that in the runs that finally worked, the learning rate for the small net (the localization net) was very small (around 1e-5), while the rates for the other layers were much bigger (around 1e-2). So please do not rely too much on my helper layers.
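In Caffe the effective rate of a layer is the solver's `base_lr` multiplied by that layer's `lr_mult`, so a setup like this amounts to giving the localization net a very small multiplier. The plain-Python sketch below shows the effect on a single SGD step; the group names, gradient values, and multipliers are made up for illustration and are not taken from the repository's configuration.

```python
def sgd_step(params, grads, base_lr, lr_mult):
    """Apply one plain SGD update with a per-group learning-rate multiplier."""
    updated = {}
    for name, values in params.items():
        rate = base_lr * lr_mult[name]  # effective rate for this group
        updated[name] = [w - rate * g for w, g in zip(values, grads[name])]
    return updated


base_lr = 1e-2
lr_mult = {"loc_net": 1e-3, "classifier": 1.0}  # effective rates: about 1e-5 and 1e-2

params = {"loc_net": [0.5, 0.0], "classifier": [1.0, -1.0]}
grads = {"loc_net": [1.0, 1.0], "classifier": [1.0, 1.0]}

new_params = sgd_step(params, grads, base_lr, lr_mult)
print(new_params)  # the loc_net weights barely move compared with the classifier weights
```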

Besides, you can edit STLossLayer as you need. Sorry that I didn't do that work.
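One possible direction, sketched in Python rather than in the layer's C++: keep the two rotation-related entries fixed and rebuild the full 6-value theta from the 4 predicted values before computing any loss that expects 6 inputs. The function name, the ordering of the entries, and the choice of which entries to fix are assumptions for illustration; this is not the STLossLayer code.

```python
def expand_theta(free, fixed=None):
    """Build [t11, t12, t13, t21, t22, t23] from free values plus fixed entries.

    `fixed` maps a position (0..5) to the constant held at that position.
    """
    if fixed is None:
        fixed = {1: 0.0, 3: 0.0}  # assumed: t12 and t21 held at 0 to rule out rotation
    if len(free) + len(fixed) != 6:
        raise ValueError("free and fixed entries must add up to 6 values")
    free_iter = iter(free)
    # Fill each position from the fixed map if present, otherwise from the next free value.
    return [fixed[i] if i in fixed else next(free_iter) for i in range(6)]


# Four predicted values: x scale, x translation, y scale, y translation.
print(expand_theta([0.5, -0.2, 0.5, 0.1]))  # [0.5, 0.0, -0.2, 0.0, 0.5, 0.1]
```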
