yumianhuli1
Name: yumianhuli
Type: User
Company: China
Bio: I am a graduate student from Tongji University in Shanghai, China
Location: shanghai
Auto detecting, masking and inpainting with detection model.
AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
3D animation editor (with ai mocap, mixamorig)
33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU
Official implementation of AnimateDiff.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
A selection of state-of-the-art research materials on trajectory prediction
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
Semantic Segmentation on PyTorch (includes FCN, PSPNet, Deeplabv3, Deeplabv3+, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet, DFANet)
A curated list of deep learning resources for video-text retrieval.
addon for blender to import mocap data from tools like easymocap, frankmocap and Vibe
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
ChatDev IDE is a tool for building your AI agent. Whether it's NPCs in games or powerful agent tools, you can design what you want on this platform.
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
My undergraduate thesis is on Chinese named entity recognition based on bi-RNN (LSTM) + CRF; it won the excellent thesis award (top 1%) in July 2017.
Chinese CLIP pretrained model
Image captioning model based on ClipCap
Convert Github Copilot to ChatGPT, free to use the GPT-4 model
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Draw a mockup and generate html for it
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Make human motion capture easier.
EasyMocap Chinese documentation
Microsoft text-to-speech tool: a UI version of edge-tts with an added pause feature
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Real-Time Face Recognition using SCRFD, ArcFace, ByteTrack, Similarity Measure
Next generation face swapper and enhancer