Giter Site home page Giter Site logo

peterwangcn / egofinger.hcii.scut Goto Github PK

View Code? Open in Web Editor NEW

This project forked from hyichao/egofinger.hcii.scut

0.0 1.0 0.0 4.74 MB

This repo is about a dataset called EgoFinger, which is maintained by Yichao Huang in SCUT HCII Lab.

C++ 89.06% C 0.01% QMake 0.07% Shell 0.18% CMake 0.25% Makefile 0.66% Cuda 4.37% MATLAB 0.94% M 0.01% Python 3.01% Protocol Buffer 1.45%

egofinger.hcii.scut's Introduction

EgoFinger

This repo is about a dataset called EgoFinger, which is maintained by Yichao Huang in SCUT HCII Lab.

Description

SCUT-Ego-Finger Dataset is for research on egocentric vision hand detection and fingertip detection. It includes 93729 frames from 24 videos captured and labeled by 24 volunteers. Detail scenes include: Avenue (4058), Basketball Field (2894), Chinese Book (3001), Classroom (4868), Computer Screen (5088), East Canteen (3738), East Lake (4151), Football Field (4145), Fountain (4158), Lcy-Lab (4084), Liwu Building (5488), NO.1 North Dorm (4281), NO3. North Dorm (4679), North Canteen (4314), North Lake (3419), North Library (2495), Renwen Building (3611), Supermarket (1682), Teaching Building (4368), Tennis Field (5124), West Canteen (4185), Wyx-Lab (3222), Yifu Building 1st Floor (3806), Yifu Building 2nd Floor (2870).

The dataset is collected for solving problem of ego-vision fingertip detection and hand detection, which may conduct challenges such as background complexity, Illumination change, hand shape and hand color diversity, motion blur, so on and forth. Therefore the dataset deliberately covers different type of frames. It is feasible to applied Deep Learning on the dataset. For further evaluation of the dataset, please visit use a tool in this repo named Performer, or contact [email protected]

In the repo there are three folders. The one named Collector is the tool for data labeling, executed on Windows in convienience, which is written by Qt framework with OpenCV. The one named Performer is for data evaluation, simply like spacial distribution of hand, color distribution and so on. The Application folder contains personal used demo codes (FYI, the demo video is uploaded to Youtube, please click the link or search Egocentric Air Writing in Youtube), and due to some stupid policy I have to keep the models myself and therefore the Application codes cannot run directly until caffemodels are filled.

Install

In this repo we build the UI mainly by Qt environment and OpenCV library. So,

  1. Intall OpenCV The easiest way might be
sudo apt-get install libopencv-dev python-opencv
  1. Install Qt Go to Qt download page and download the .run file the follow the instruction

  2. Clone the repo Using git the clone the repo

Then you may do whatever you want...except runing them :( Again, sorry for not providing caffemodel... However, hope this will help for somebody and I will be glad.

Publication

It't not a revolutional work, though, we are proud that we do something on this topic and push the area further a little bit.

Y. Huang, et al. "DeepFinger: A Cascade Convolutional Neuron Network Approach to Finger Key Point Detection in Egocentric Vision with Mobile Camera." Systems, Man, and Cybernetics (SMC), 2015 IEEE International Conference on. IEEE, 2015.


Y. Huang. et al. "A Pointing Gesture Based Egocentric Interaction System: Dataset, Approach and Application." Computer Vision and Pattern Recognition (CVPR) Workshop, 2016 IEEE Internaional Conference on. IEEE, 2015.


Link

The dataset is availble in the following link (which is the data page of our lab)

http://www.hcii-lab.net/data/SCUTEgoFinger/index.htm

By using the dataset, we establish a demo application named Ego-Air-Writing, and the video has been uploaded to Youtube

https://www.youtube.com/embed/PeSrRxKghv8

If you are one of the researchers interested in hand or egocentric vision, please read our paper, which will provide a straight-forward illustration of dataset, approach, and application. Or, if you need further information, please feel free to contact

Yichao Huang: [email protected]

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.