Giter Site home page Giter Site logo

cdac's Introduction

A Cross-Dimension Annotations Method for 3D Structural Facial Landmark Extraction

Background

This repository is for A Cross-Dimension Annotations Method for 3D Structural Facial Landmark Extraction (COMPUTER GRAPHICS forum),

by Xun Gong , Ping Chen, Zhemin Zhang, Ke Chen, Yue Xiang and Xin Li from Southwest Jiaotong University.

citation

If you find our work useful in your research, please consider to cite:

@article{doi:10.1111/cgf.13895,
author = {Gong, Xun and Chen, Ping and Zhang, Zhemin and Chen, Ke and Xiang, Yue and Li, Xin},
title = {A Cross-Dimension Annotations Method for 3D Structural Facial Landmark Extraction},
journal = {Computer Graphics Forum},
volume = {39},
number = {1},
pages = {623-636},
doi = {10.1111/cgf.13895},
url = {https://onlinelibrary.wiley.com/doi/abs/10.1111/cgf.13895},
year = {2020}
}

Contents

Introduction

Recent methods for 2D facial landmark localization perform well on close-to-frontal faces, but 2D landmarks are insufficient to represent 3D structure of a facial shape. For applications that require better accuracy, such as facial motion capture and 3D shape recovery, 3DA-2D (2D Projections of 3D Facial Annotations) is preferred. Inferring the 3D structure from a single image is an ill-posed problem whose accuracy and robustness are not always guaranteed. This paper aims to solve accurate 2D facial landmark localization and the transformation between 2D and 3DA-2D landmarks. One way to increase the accuracy is to input more precisely annotated facial images. The traditional cascaded regressions cannot effectively handle large or noisy training data sets. In this paper, we propose a Mini-Batch Cascaded Regressions (MBCR) method that can iteratively train a robust model from a large data set. Benefiting from the incremental learning strategy and a small learning rate, MBCR is robust to noise in training data. We also propose a new Cross-Dimension Annotations Conversion (CDAC) method to map facial landmarks from 2D to 3DA-2D coordinates and vice versa. The experimental results showed that CDAC combined with MBCR outperforms the-state-of-the-art methods in 3DA-2D facial landmark localization. Moreover, CDAC can run efficiently at up to 110 fps on a 3.4 GHz-CPU workstation. Thus, CDAC provides a solution to transform existing 2D alignment methods into 3DA-2D ones without slowing down the speed.

Requirements

1.opencv2.4.13

Usage

Organize your training and test sets as follows, and enter the paths of the training and test sets.
H:\68gedian\afw
H:\68gedian\helen\trainset
H:\68gedian\ibug
H:\68gedian\lfpw\trainset
end
H:\68gedian\lfpw\testset
end

demo

Baidu Cloud:https://pan.baidu.com/s/1VSCAuMHHopy6Szf6nxHe3w

cdac's People

Contributors

swjtu-3dvision avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.