chwlsunny,github

multi-source-sound-localization

This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.

multirelational-poincare

Multi-relational Poincaré Graph Embeddings

nvim-config

My custom Neovim configuration with full battery for Python, Markdown, LaTeX and more...

openvqa

A lightweight, scalable, and general framework for visual question answering (VQA) research

pointrcnn

The PyTorch Implementation of PointRCNN for 3D Object Detection from Raw Point Cloud, CVPR 2019.

pythia

A modular framework for Visual Question Answering research by the FAIR A-STAR team

pytorch-cnn-visualizations

Pytorch implementation of convolutional neural network visualization techniques

pytorch-faster-rcnn

pytorch1.0 updated. Support cpu test and demo.

pytorch-gan

PyTorch implementations of Generative Adversarial Networks.

pytorch-grad-cam

PyTorch implementation of Grad-CAM

pytorchtricks

Some tricks of pytorch... :star:

qix

Machine Learning、Deep Learning、PostgreSQL、Distributed System、Node.Js、Golang

recipe_semantic_flickraudio

Semantic speech retrieval with a visually grounded model of untranscribed speech.

referringrelationships

resdavenet-vq

Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

ros_exploring

《ROS机器人开发实践》源码

rubi.bootstrap.pytorch

RUBi : Reducing Unimodal Biases for Visual Question Answering

scan

PyTorch source code for "Stacked Cross Attention for Image-Text Matching"

semantics-assistedvideocaptioning

Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy

show-control-and-tell

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019

speaksee

PyTorch library for Visual-Semantic tasks

speech-to-text-wavenet

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

speech2image

Neural network implementation of a speech to image system. Networks are trained to embed images and corresponding captions to the same vector space.

summary

summaries of all the papers I read

theano-rnn

Demonstration of recurrent neural network implemented with Theano

up-down-captioner

Automatic image captioning model based on Caffe, using features from bottom-up attention.

vase

visual-semantic-embedding

Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models"

visual-semantic-embedding-1

Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models"

visual7w-qa-models

Visual7W visual question answering models

chwlsunny Goto Github PK

chwlsunny's Projects

Recommend Projects

Recommend Topics

Recommend Org