Giter Site home page Giter Site logo

caption-eval's Introduction

Caption Evaluator

Sentence/Caption evaluation using automated metrics.

This code is released as supplementary material with S2VT[1].

This code can be used to

  1. evaluate sentences/captions for any dataset,
  2. it provides BLEU, METEOR, ROUGE-L and CIDEr scores.

This uses the MSCOCO caption evaluation code [2].

Getting started

  1. Get this code. git clone https://github.com/vsubhashini/caption-eval.git
  2. Get the coco evaluation scripts. ./get_coco_scripts.sh

To ensure you have all the dependencies for the evaluation scripts, please refer to the COCO Caption Evaluation page.

Evaluating predicted sentences against groundtruth references

Make sure you have the coco scripts

    ./get_coco_scripts.sh

Create your groundtruth references in the desired format

Here's a sample file with several reference sentences: data/references.txt

    python create_json_references.py -i data/references.txt -o data/references.json

Evaluate the model predictions against the references

Sample file with predictions from a model is in data/predicted_sentences.txt

    python run_evaluations.py -i data/predicted_sentences.txt -r data/references.json

References

[1] Sequence to Sequence - Video to Text

Sequence to Sequence - Video to Text
S. Venugopalan, M. Rohrbach, J. Donahue, T. Darrell, R. Mooney, K. Saenko
The IEEE International Conference on Computer Vision (ICCV) 2015

[2] Microsoft COCO Captions: Data Collection and Evaluation Server

Microsoft COCO Captions: Data Collection and Evaluation Server
X. Chen, H. Fang, T.Y. Lin, R. Vedantam, S. Gupta, P. Dollar, C.L. Zitnick
arXiv preprint arXiv:1504.00325

caption-eval's People

Contributors

vsubhashini avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

caption-eval's Issues

beam_size_1

What beam_size_1 means and why it should be there in the .txt file?

python s2vt_captioner.py -m s2vt_vgg_rgb error

When I run the s2vt_captioner.py I get the following error:

mkldnn_concat_layer.cpp:139] Check failed: dim_src == bottom[i]->shape().size() (4 vs. 3)

I am running on intel caffe version 1.1.0 in cpu only mode.

How can I resolve the same?

Unable to compile s2vt

I am using cuda-8 and tried cudnn 5.1 and 6 but was unable to compile
https://github.com/vsubhashini/caffe/tree/recurrent/examples/s2vt.
(Cannot create an issue there so raising it here)

Also, can you elaborate on the caffe model shared on your page is trained on all only MVSD dataset?.

Lastly, is there any specific steps to follow to test on new videos other than extracting VGG features.

Thanks for your work.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.