imatge-upc / egocentric-2016-saliency

Research on the prediction of visual saliency in egocentric vision.

Home Page: http://imatge-upc.github.io/egocentric-2016-saliency/

Shell 3.11% Jupyter Notebook 1.61% Python 64.34% MATLAB 28.72% M 2.22%

egocentric-2016-saliency's People

Contributors

monicachs, xavigiro

egocentric-2016-saliency's Issues

State of the art on egocentric saliency prediction

You should identify and read a few (3-5) scientific papers or other works that are similar to your research.

I think that in the egocentric vision community the term "attention" is often used as a concept similar to saliency.

I have found some papers that I would like you to read and summarize briefly (one paragraph each):

Yamada, Kentaro, Yusuke Sugano, Takahiro Okabe, Yoichi Sato, Akihiro Sugimoto, and Kazuo Hiraki. "Can saliency map models predict human egocentric visual attention?." In Computer Vision–ACCV 2010 Workshops, pp. 420-429. Springer Berlin Heidelberg, 2010.

Matsuo, Kenji, Kentaro Yamada, Satoshi Ueno, and Sei Naito. "An attention-based activity recognition for egocentric video." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 551-556. 2014.

Bettadapura, Vinay, Irfan Essa, and Caroline Pantofaru. "Egocentric field-of-view localization using first-person point-of-view devices." In Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on, pp. 626-633. IEEE, 2015.

Fathi, Alireza, Yin Li, and James M. Rehg. "Learning to recognize daily actions using gaze." In Computer Vision–ECCV 2012, pp. 314-327. Springer Berlin Heidelberg, 2012.

In particular, I want you to answer these questions:

  • Which dataset did they use?
  • Which metric did they use?
  • Is there any aspect of their methodology we could adopt?
  • Could we compare our results with theirs somehow?

Reply to this issue with a paragraph each time you finish reading a paper. Make sure you answer the questions I posed.

Privacy issues for the dataset recordings

Professor O'Connor wrote:

I think that the suggested data set would be very valuable and if we succeed in creating it, it would be great to be able to release it for others to use. Given this, we should try to keep privacy considerations in mind when creating the data set so that there are no problems if we decide to go this route, e.g. we could make sure that if people appear in the video, they are members of the group here who can give formal consent, make sure there are no car license plates, etc. From our ACM Grand Challenge data sets we learnt that it's much better to consider these things up front rather than try to fix them later.

Extract frames from the Tobii video

Learn how to conveniently extract frames from the video sequence provided by the Tobii glasses. Check Sergi Imedio's thesis as well as the 'srun' command to be used on the GPI servers.
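
As a starting point for this task, here is a minimal sketch of one possible approach: building an ffmpeg call that dumps the glasses' video to numbered PNG frames, optionally prefixed with 'srun' so that it runs on a compute node. It is not the method from the thesis mentioned above. The file name "recording.mp4", the output directory "frames", the sampling rate and the helper name build_extract_command are all hypothetical, and the GPI servers may require extra 'srun' options (memory, partition) that are not shown here.

```python
import shlex
from pathlib import PurePosixPath


def build_extract_command(video, out_dir, fps=None, use_srun=False):
    """Return the argument list for an ffmpeg frame-extraction call.

    video    -- path to the recording (hypothetical, e.g. "recording.mp4")
    out_dir  -- existing directory that will receive the numbered PNG frames
    fps      -- if given, sample this many frames per second; otherwise
                ffmpeg writes every decoded frame
    use_srun -- prefix the call with SLURM's srun (assumed to be available)
    """
    # frame_%06d.png makes ffmpeg number the frames 000001, 000002, ...
    pattern = str(PurePosixPath(out_dir) / "frame_%06d.png")
    cmd = ["ffmpeg", "-i", str(video)]
    if fps is not None:
        # The fps video filter resamples the stream to the requested rate.
        cmd += ["-vf", f"fps={fps}"]
    cmd.append(pattern)
    if use_srun:
        cmd = ["srun"] + cmd
    return cmd


if __name__ == "__main__":
    cmd = build_extract_command("recording.mp4", "frames", fps=5, use_srun=True)
    # Print the command so it can be checked before running it by hand
    # or passing the list to subprocess.run(cmd, check=True).
    print(shlex.join(cmd))
```

The function only builds the command, so it can be inspected on any machine. Create the output directory before running it, since ffmpeg does not create it.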

Locations to record a new dataset

Places where we could record:

  • University
  • Residence
  • Supermarket
  • Gym
  • Tourist sites: Phoenix Park, Trinity College, the Spire, the Liffey, ...
