Giter Site home page Giter Site logo

mac's Introduction

MAC

By Runzhou Ge, Jiyang Gao, Kan Chen, Ram Nevatia.

University of Southern California (USC).

Introduction

This repository contains the code for the WACV 2019 paper, MAC: Mining Activity Concepts for Language-based Temporal Localization. arXiv

Requirements

  • Python 2.7
  • Tensorflow 1.0 or higher
  • others

Download

The code is for Charades-STA dataset.

After cloning this repo, please donwload:

ref_info contains Charades-STA annotations, semantic activity concepts, checkpoints and others. After downloading ref_info.tar, untar it and move the folder to the root/ directory of this repo.

Please also change the visual feature and visual activity concepts directories in the main.py.

Training

For the paper results on Charades-STA dataset, run

python main.py --is_only_test True \
--checkpoint_path ./ref_info/charades_sta_wacv_2019_paper_ACL_k_results/trained_model.ckpt-10000 \
--test_name paper_results

You will get similar results listed in the row "ACL-K" of the following table.

Model R@1,IoU=0.7 R@1,IoU=0.5 R@5,IoU=0.7 R@5,IoU=0.5
CTRL 7.15 21.42 26.91 59.11
ACL-K 12.20 30.48 35.13 64.84

To train the model from scratch, run

python main.py

The results and checkpoints will appear in root/results_history/ and root/trained_save/, respectively.

Results Visualization

Citation

If you find this work is helpful, please cite:

@InProceedings{Ge_2019_WACV,
  author = {Ge, Runzhou and Gao, Jiyang and Chen, Kan and Nevatia, Ram},
  title = {MAC: Mining Activity Concepts for Language-based Temporal Localization},
  booktitle = {The IEEE Winter Conference on Applications of Computer Vision (WACV)},
  month = {January},
  year = {2019}
}

License

MIT License

Acknowledgements

This research was supported, in part, by the Office of Naval Research under grant N00014-18-1-2050 and by an Amazon Research Award.

mac's People

Contributors

runzhouge avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

mac's Issues

about the dataset

Hello,

I wanna to know the fps when you process the datasets, namely how many frames one second?

Looking forward to your reply.

training problem

hello!Why did my loss is quite low when I start training,and in the end the loss didn't change too much?
like:
Step 0: loss = 0.46 (1.248 sec)
Step 5: loss = 0.47 (0.273 sec)
Step 10: loss = 0.48 (0.405 sec)
Step 15: loss = 0.48 (0.325 sec)
Step 20: loss = 0.50 (0.372 sec)
Step 25: loss = 0.50 (0.297 sec)
Step 30: loss = 0.54 (0.295 sec)
Step 35: loss = 0.53 (0.320 sec)
Step 40: loss = 0.55 (0.313 sec)
Step 45: loss = 0.52 (0.289 sec)
Step 50: loss = 0.49 (0.303 sec)
...but my test results are similar to those in the paper......

Confirm Python 3 support

Python 2.7 is just a year away from being permanently retired. It is surprising that it was used. Please ensure the code works with Python 3, and confirm the same.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.