Giter Site home page Giter Site logo

meeteval's People

Contributors

boeddeker avatar hbredin avatar jkienegger avatar s0h3yl avatar thequilo avatar vieting avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

meeteval's Issues

Computing time-constrained WER

I am thinking of a metric for long-form ASR and segmentation. Consider the following scenario:

  • The input is a long recording (either single speaker or multi speaker).
  • References may be with word-level timestamps (CTM file) or segment-level (STM).
  • Hypothesis may be word-level or segment-level (CTM or STM).

If reference is STM and hypothesis is CTM, this may correspond to computing the asclite aWER metric, but we also want to support (i) other kinds of systems that may not provide word-level timestamps, and (ii) tighter penalty on segmentation by providing reference CTM.

Additionally, we also want to be able to include multiple possible references (e.g., references may be orthographic or normalized in some way), although I understand that this may be beyond the scope of this toolkit.

I am looking for suggestions about what would be a good metric (if one exists) for this scenario.

(cc @MartinKocour since we were having related discussions.)

Missing `LICENSE` file

Hi folks,

Thanks for your work here - much appreciated!

Would you please consider adding a LICENSE text file to the repository?

Thanks in advance!

Does mdeval computes DER taking overlapped speech into account?

Thank you very much for sharing this repository. It is very useful to have a single repo with many audio metrics :)

Tools like pyannote allows us to choose the collar and if we want to compute DER on overlapped speech regions or not.

With mdeval, we can specify the collar but it seems like there is no option for including overlapped speech in the metric or not.
Does that mean that by default it computes over overlapped regions? Or are they excluded for the calculations?

Thank you for your answer !

circular import error

Hi,

Thank you for open sourcing these multispeaker ASR metrics :).

Just reporting some issue I encountered:

Installing as follows works fine

pip install cython
git clone [email protected]:fgnt/meeteval.git
pip install -e ./meeteval[cli]

However installing like:

pip install cython
pip install https://github.com/fgnt/meeteval/archive/refs/heads/main.zip

Causes circular import issue:

import meeteval

Traceback (most recent call last):
File "", line 1, in
File "/opt/conda/lib/python3.7/site-packages/meeteval/init.py", line 1, in
from . import io
ImportError: cannot import name 'io' from 'meeteval' (/opt/conda/lib/python3.7/site-packages/meeteval/init.py)

Very strange, wondering if you have any ideas for workarounds for this? The build software we are using doesn't support installation via the first method.

ToDo: fix mimo_matching_v4 for python 3.9

Python 3.9.17 (main, Jul 28 2023, 05:54:52) 
Type 'copyright', 'credits' or 'license' for more information
IPython 8.14.0 -- An enhanced Interactive Python. Type '?' for help.

In [1]: from meeteval.wer.matching import orc_matching, mimo_matching
   ...: ref=[['a']]
   ...: hyps=['aaabb']
   ...: mimo_matching.mimo_matching_v4(ref, hyps)
   ...: 
Out[1]: (2, [(0, 0)])
Python 3.11.4 (main, Jul  5 2023, 13:45:01) [GCC 11.2.0]
Type 'copyright', 'credits' or 'license' for more information
IPython 8.14.0 -- An enhanced Interactive Python. Type '?' for help.

In [1]: from meeteval.wer.matching import orc_mat
ching, mimo_matching
   ...: ref=[['a']]
   ...: hyps=['aaabb']
   ...: mimo_matching.mimo_matching_v4(ref, hyps)
   ...: 
Out[1]: (4, [(0, 0)])

See GitHub Action log: logs_144.zip

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.