zmykevin
Name: Mingyang Zhou
Type: User
Bio: Postdoctoral researcher at Columbia University. My research interests lie in multimodal learning with vision and language.
Location: New York
The code repository for our multimodal machine translation project: A Visual Attention Grounding Neural Network
The official code implementation of the ACL 2023 Findings paper: Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs
A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)
For the code from ChartLlama-code
A WeChat chatbot based on ChatGPT, built with the OpenAI GPT-3.5 API and the itchat library.
Contrastive Language-Image Pretraining
Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", published at ICCV'23.
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
This repo stores my work for Project 4 of the Udacity Deep Learning Nanodegree: a deep neural network that translates English into French.
A face generation project using a GAN, for the Udacity Deep Learning Foundation Nanodegree.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
The repository for the baselines on the GanDraw dataset
Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction"
Scripts to generate the CoDraw and i-CLEVR datasets used for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction"
Find "People Also Ask" questions
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
An open source implementation of CLIP.
Open Source Neural Machine Translation in PyTorch
Code that'll help you kickstart a personal website that showcases your work as a software developer.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
A Chinese chatbot based on PyTorch, with an integrated beam search algorithm
Social_Chat_Bot