natyren Goto Github PK
Name: George
Type: User
Bio: Interested in machine learning (especially in representation learning), AGI, math and vector databases.
Name: George
Type: User
Bio: Interested in machine learning (especially in representation learning), AGI, math and vector databases.
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
An open source business card designer and sharing platform
Cassava Leaf 2020 competition repository
ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.
Easily compute clip embeddings and build a clip retrieval system with them
a state-of-the-art-level open visual language model | 多模态预训练模型
TorchCFM: a Conditional Flow Matching library
A mini-library for training consistency models.
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
Official repo for the paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
This is open-source implementation of DoRA
Open-Source implementation of FlexPredict paper (https://arxiv.org/pdf/2308.00566.pdf)
Open-source attempt to implement tiny vision-language model which works well with text-rich images
GPT based autonomous agent that does online comprehensive research on any given topic
Convert a HTML string to LATEX using Python
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Implementation of Action Matching
Script to easy (from the bbox inference and deployment) of kosmos-2.5
🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch
Official PyTorch implementation of the paper: Flow Matching in Latent Space
A python library for self-supervised learning on images.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.