This is the official project website of our paper: Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model
Peng Wu, Jing Liu Senior Member, IEEE, Xiangteng He, Yuxin Peng Senior Member, IEEE, Peng Wang, and Yanning Zhang Senior Member, IEEE
UCFCrime-AR Captions are released to facilitate future research.
I3D&VGGish features of XDViolence-AR
- We introduce a new task named video anomaly retrieval to bridge the gap between the literature and real-world applications in terms of video anomaly analysis. To our knowledge, this is the first work moves towards VAR from VAD;
- We present two large-scale benchmarks, i.e., UCFCrime- AR and XDViolence-AR, based on public VAD datasets. The former is applied to video-text VAR, the latter is to video-audio VAR;
- We propose a model called ALAN, aiming at challenges in VAR, where anomaly-led sampling, video prompt based masked phrase modeling, and cross-modal align- ment are introduced for the attention of anomalous segments, enhancement of fine-grained associations, and multi-perspective match, respectively.
If you find this repo useful for your research, please consider citing our paper:
@article{wu2023towards,
title={Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model},
author={Wu, Peng and Liu, Jing and He, Xiangteng and Peng, Yuxin and Wang, Peng and Zhang, Yanning},
journal={arXiv preprint arXiv:2307.12545},
year={2023}
}