Giter Site home page Giter Site logo

Maxy's Projects

3am icon 3am

Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"

bottom-up-attention icon bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

camp_iccv19 icon camp_iccv19

CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval

cider icon cider

python codes for CIDEr - Consensus-based Image Caption Evaluation

decaf icon decaf

The new Decaf compiler, rewritten in "modern" Java

df-gan icon df-gan

Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis

fairseq_mmt icon fairseq_mmt

This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and scripts for the proposed probing tasks. We hope the code could help those who want to research on the multimodal machine translation task.

genforce icon genforce

GenForce: an efficient PyTorch library for deep generative modeling (StyleGANv1v2, PGGAN, etc)

glm icon glm

GLM (General Language Model)

glove icon glove

GloVe model for distributed word representation

imagecaptioning.pytorch icon imagecaptioning.pytorch

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)

kotnews icon kotnews

A naive Android App for learning Kotlin and Android jetpack

llava icon llava

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

mattnet icon mattnet

MAttNet: Modular Attention Network for Referring Expression Comprehension

mm-cot icon mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.