tongbaochen
Type: User
AAAI'20 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
Acm Cheat Sheet
Repository for the MM'23 accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding"
This repository holds all the code for the site http://www.adventuresinmachinelearning.com
The classical papers and codes about generative adversarial nets
Chinese edition of the official Android training course
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
audio visual event localization
ACM MULTIMEDIA CONFERENCE 2020
A curated list of different papers and datasets in various areas of audio-visual processing
Continuously updated cutting-edge papers on video moment localization, temporal language grounding, or video clip retrieval.
A curated list of awesome GitHub guides, articles, sites, tools, projects and resources. This list was collected simply to make better use of our beloved GitHub; PRs and issues are welcome.
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
A curated list of grounding natural language in video and related area. :-)
An implementation of Distinct based on Hadoop_hbase
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Welcome to my blog; watch, star, and fork. In my blog, you can learn about the latest techniques and anecdotes.
audio-visual event localization
Compound Multi-branch Feature Fusion for Real Image Restoration
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
Convolutional Neural Network for Text Classification in Tensorflow
This repository aims to collect the articles and code for the Visual Storytelling (VIST) task. VIST is a vision-and-language task that aims to summarize the idea of a photo stream and tell a story about it in natural language. Note that it differs from "storytelling with data", which is more closely related to data visualization.
compositional-temporal-ground
[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Exploring and Forecasting Country Indicators using Big Data Approach