acodec Goto Github PK
Name: AcodeC
Type: User
Company: Zhejiang University
Location: HangZhou
Name: AcodeC
Type: User
Company: Zhejiang University
Location: HangZhou
3D ResNets for Action Recognition
3D ResNets for Action Recognition (CVPR 2018)
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
:hammer:AI 方向好用的科研工具
算法模板 From https://www.acwing.com
A PyTorch implementation of the Transformer model in "Attention is All You Need".
A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
This repository contains all the papers accepted in top conference of computer vision, with convenience to search related papers.
A curated list of image captioning and related area resources. :-)
Reading list for research topics in multimodal machine learning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.
The official Codes for NeurIPS 2019 paper. Quanfu Fan, Ricarhd Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Efficient Video Representations by Temporal Aggregation Modules"
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
Bottom-up features extractor implemented in PyTorch.
Caffe: a fast open framework for deep learning.
包含Caffe-SSD-Mobilenet(DepthwiseConvolution) 和 Caffe-SSD 和 Classification
Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)
Adds SPICE metric to coco-caption evaluation server codes.[THIS iS NON-OFFICIAL Python 3.x support]
《剑指Offer》第二版源代码
2019-2020 International Conferences in Artificial Intelligence, Machine Learning, Computer Vision, Data Mining, Natural Language Processing and Robotics
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计、Java、Python、C++
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.