baochaozhu Goto Github PK
Name: baochao.zhu
Type: User
Company: HeFei University of Technology
Location: HeFei Anhui
Name: baochao.zhu
Type: User
Company: HeFei University of Technology
Location: HeFei Anhui
Codes and data for the TIP 2023 paper: Towards 3D Face Reconstruction in Perspective Projection: Estimating 6DoF Face Pose from Monocular Image
A self-supervised learning framework for audio-visual speech
A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.
🔊 Text-Prompted Generative Audio Model
Bark Voice Cloning and Voice Cloning for Chinese Speech
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Dense Prediction Transformers
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
[CVPR19] FSA-Net: Learning Fine-Grained Structure Aggregation for Head Pose Estimation from a Single Image
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
High-Resolution Image Synthesis with Latent Diffusion Models
LAVIS - A One-stop Library for Language-Vision Intelligence
贵校课程资料民间整理
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Modern Computer Vision with PyTorch, published by Packt
nanobind: tiny and efficient C++/Python bindings
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion
The Introduction of the OLKAVS Dataset
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
An open source implementation of CLIP.
Train your custom LLMs like Llama, baichuan-7b, GPT
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.