zmykevin Goto Github PK

followers: 36.0 following: 10.0 repos: 42.0 gists: 0.0

Name: Mingyang Zhou

Type: User

Bio: Post-Doc Researcher at Columbia University. My research interest lies on Multimodality Learning with vision and language.

Location: New York

Mingyang Zhou's Projects

a-visual-attention-grounding-neural-model

The code repository for our multimodal machine translation project: A Visual Attention Grounding Neural Netowork

acl2023_chartt5

The official code implementation of the ACL 2023 Finding paper: Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs

bert_nli

A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)

chatgpt-on-wechat

Wechat robot based on ChatGPT, which using OpenAI api and itchat library. 使用ChatGPT搭建微信聊天机器人，基于GPT3.5 API和itchat实现

cliptrans

Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", published at ICCV'23.

detectron

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

detectron2

Detectron2 is FAIR's next-generation platform for object detection and segmentation.

dlnd_project_4

This repo stores my work for project 4 for Udacity Deep Learning Nanodegree, which is a deep_neural_network that can translate English into French.

dlnd_project_5

The face generation project using GAN for Udacity deep learning foundation nanodegree

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

gandraw

The Repository for the baselines for GanDraw Dataset

geneva

Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction"

Scripts to generate the CoDraw and i-CLEVR datasets used for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction"

gquestions

Find "People Also Ask" questions

llava

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

open_clip

An open source implementation of CLIP.

opennmt-py

Open Source Neural Machine Translation in PyTorch

personal-website

Code that'll help you kickstart a personal website that showcases your work as a software developer.

zmykevin Goto Github PK

Mingyang Zhou's Projects

Recommend Projects

Recommend Topics

Recommend Org