tongbaochen,github

2d-tan

AAAI‘20 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language

adpn-mm

Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding"

adventures-in-ml-code

This repository holds all the code for the site http://www.adventuresinmachinelearning.com

adversarialnetspapers

The classical papers and codes about generative adversarial nets

android-training-course-in-chinese

Android官方培训课程中文版

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

audio-visual-event-localization-in-unconstrained-videos

audio visual event localization

avmr

ACM MULTIMEDIA CONFERENCE 2020

avset

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

awesome-cross-modal-video-moment-retrieval

前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。

awesome-github

A curated list of awesome GitHub guides, articles, sites, tools, projects and resources. 收集这个列表，只是为了更好地使用亲爱的GitHub,欢迎提交pr和issue。

awesome-llms-for-video-understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

awesome-temporal-sentence-grounding-in-videos

A curated list of grounding natural language in video and related area. :-)

big_data_hw1

基于Hadoop_hbase的Distinct实现

bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

cg-blog

:octocat: Welcome to my blog, watch, star and fork. In my blog, you can know the latest techniques and anecdotes.

cmbs

audio-visual event localization

cmfnet

Compound Multi-branch Feature Fusion for Real Image Restoration

cmin_moment_retrieval

Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos

cnn-text-classification-tf

Convolutional Neural Network for Text Classification in Tensorflow

coco-caption

collection-of-visual-storytelling-storynlp

This repository aims to collect the articles and codes for the Visual Storytelling (VIST) task. VIST is a vision-and-language task. It aims to summarize the idea of a photo stream and tells a story about it (in natural language). Be careful about its difference from the "storytelling with data", which is more related to data visualization.

tongbaochen Goto Github PK

tongbaochen's Projects

Recommend Projects

Recommend Topics

Recommend Org