Light

codeneuro / spikefinder-python Goto Github PK

View Code? Open in Web Editor NEW

1.0 4.0 3.0 17 KB

python module for testing spike detection algorithms

License: MIT License

Python 100.00%

spikefinder-python's Introduction

spikefinder-python

python module for testing spike detection algorithms

This repository contains a module and a CLI for working with spike finding algorithm results. Spike finding is defined as identifying the timing of discrete spiking events from continuous-valued calcium flouresence data. This module is used by the spikefinder benchmarking challenge to compare ground truth results to results from submitted algorithms.

The included functions compute evaluation statistics. You can use it to compare ground truth against algorithm results and load result files in standard formats, either as a command line tool, or as a module inside a python project (supports Python 2.7 and 3.4).

If you have any questions about these metrics or want to suggest others, please open an issue or submit a pull request!

use as a command line tool

To evaluate a pair of results, just pass two CSV files as arguments

spikefinder evaluate a.csv b.csv

use as a module

You can also use this module inside a Python project, for example

from spikefinder import load, score

a = load('a.csv')
b = load('b.csv')
score(a, b)

methods

`spikefinder.load(file)`

Load results from a CSV file.

`spikefinder.score(a, b)`

Estimate similarity scores between two results.

format

Input data should be formatted as comma-separated values in a table, where each row corresponds to a time point and each column corresponds to a neuron, and the header row are integer labels, e.g.

0,1,2,3,4,5,6,7,8,9,10
0,0,0,0.0,0,0,0,0,0,0,0
0,0,0,0.0,0,1,0,0,0,0,0
0,0,0,0.0,0,0,0,0,0,0,0
0,0,0,0.0,0,1,0,0,0,0,0
0,0,0,0.0,0,0,0,0,0,0,0
...
0,0,,0,0,0,0,0,0,,0
0,0,,,0,0,0,,0,,0

It is fine if different columns have different lengths, just leave these entries empty as in the last two rows of the above example. If you have your time point x neuron table of results as an array in Python named mat, with nans for any missing entries, you can write it to a correctly formatted output file using

from pandas import DataFrame

df = DataFrame(mat)
df.to_csv('filename.csv', index=False)

spikefinder-python's People

Stargazers

Watchers

Forkers

philippberens thomasdeneux afcarl

spikefinder-python's Issues

Length checking

We should make sure all the columns are the correct length before they are scored.

evaluation metrics

Let's discuss evaluation metrics for spikefinder!

The data for doing evaluation will be, for each neuron, a fluorescence trace and an estimated spike rate. Presumably the rate will be either binary (a spike), integer (a count), or continuous valued (a probability).

Options include information gain, area under an ROC curve, and correlation. For anyone with a lot of experience here, it'd be great to add user experience with these statistics (cc @philippberens).

Correlation might be simplest because it's easy to compute and still gives reasonable results. But there's also no reason not to compute multiple metrics, as we currently do for neurofinder.

NaN handling

We should decide how to handle NaNs in submission (or how many to allow) and implement it

Sampling rate

For evaluation, we probably don't want to evaluate at the full 100 Hz that the signals are provided in. For the paper we used 25 Hz. Also, we want to be able to vary the resolution for making plots.

We have to find a way to pass as argument.

Multiple submissions

How do we prevent people from overfitting on the test data by performing many submissions?

Maybe the leaderboard should initially only show the performance on the training data and only after submission is close show the performance on the real data?

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.