vocalpy / vak

A neural network framework for researchers studying acoustic communication

Home Page: https://vak.readthedocs.io

License: BSD 3-Clause "New" or "Revised" License

Python 100.00%

python python3 vocalizations speech-processing birdsong torch torchvision pytorch bioacoustics bioacoustic-analysis

vak's People

bollwyvl yardencsgithub luke-poeppel tubbz-alt kaiyaprovost dadotouterelle jaeronga eltociear khoality-dev trellixvulnteam nickledave neuralsyntaxlab ja-sonyun nosrednab marisbasha zhileiz1992

vak's Issues

change 'train_data_dict' and 'train_data_path' etc to just 'train_data'? in config / code

Less verbose, no confusion from slightly different names

apply `os.path.expanduser` to all paths in config
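
A minimal sketch of what this could look like, assuming the config is read with the standard-library `configparser`. The `PATH_OPTIONS` set and the `expand_paths` helper are hypothetical names for illustration, not part of vak; the real list of path-valued options would come from vak's own config specification.

```python
import configparser
import os

# Hypothetical set of option names that hold paths (assumption, not vak's list).
PATH_OPTIONS = {"output_dir", "data_dir", "train_data_path"}


def expand_paths(config, path_options=PATH_OPTIONS):
    """Expand a leading '~' in every path-valued option, in place."""
    for section in config.sections():
        # raw=True returns values without interpolation
        for option, value in config.items(section, raw=True):
            if option in path_options and value:
                config.set(section, option, os.path.expanduser(value))
    return config


# interpolation=None so that '%' characters in paths are left alone
config = configparser.ConfigParser(interpolation=None)
config.read_string("[DATA]\noutput_dir = ~/vak_output\n")
expand_paths(config)
```

Doing the expansion once, right after parsing, would mean the rest of the code never has to handle `~` itself.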

change config to attrs, write docstring for sub-config classes, use those in docs

add 'copy_training_data' argument for learncurve

add 'save_data' parameter to vak.cli.summary; add option to config

cleanly separate general `dataset.split` from more specific `train_test_dur_split`

see comments in #57

config parsing code raises an error when 'train_data_dict' is not set, even when running 'prep'

as discovered by @yardencsGitHub

learncurve: grab all replicates and durations and only then run the training

to avoid crash halfway through training, as per @yardencsGitHub

add 'spectrogram_files' option to [DATA]

add contrib info + covenant to docs

adding detail here from @Luke-Poeppel
https://github.com/NickleDave/vak/issues/365#issue-906829778

It would be great to have a CONTRIBUTING.md file in the repo, or notes in the README, for new contributors to the package. This could contain info on:

Desired commit message formatting

Style guide

Testing

See here for a nice example.

Fix "experiments.md" in README.md

add ability for paths in VocalizationDataset to be relative

so that a vds generated in one place doesn't crash when used in another place.

Maybe add a root or location attribute and have all other paths be relative to that?
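
One way the root-attribute idea might look, as a rough sketch using `pathlib`. `RelativePathDataset`, `from_absolute`, `resolve`, and `with_root` are hypothetical names standing in for whatever `VocalizationDataset` would actually expose.

```python
from pathlib import Path


class RelativePathDataset:
    """Hypothetical stand-in for VocalizationDataset: stores one root
    directory plus file paths relative to that root."""

    def __init__(self, root, rel_paths):
        self.root = Path(root)
        self.rel_paths = [Path(p) for p in rel_paths]

    @classmethod
    def from_absolute(cls, root, abs_paths):
        # relative_to raises ValueError if a path is not under root
        root = Path(root)
        return cls(root, [Path(p).relative_to(root) for p in abs_paths])

    def resolve(self):
        """Return full paths, built from the current root."""
        return [self.root / p for p in self.rel_paths]

    def with_root(self, new_root):
        """Return a copy that points at the same files under a new root."""
        return RelativePathDataset(new_root, self.rel_paths)


ds = RelativePathDataset.from_absolute(
    "/data/birds", ["/data/birds/day1/a.wav", "/data/birds/day2/b.wav"]
)
moved = ds.with_root("/mnt/lab")
```

Under this design, moving a dataset only requires changing `root`; the stored relative paths stay valid.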

Improve train/test(/val) split functions?

Making this into a general issue / discussion of how to improve these functions.
Will collect thoughts + ideas from other issues here (to close those and consolidate).

add separate config section for learncurve

add 'save_transformed_data' argument in docs

learncurve, train should use size of X_train, not what's in config

add DeepChirp models as built-ins

https://github.com/kylerbrown/deepchirp

use tf.data, get rid of 'reshape_data_for_batching'

change 'n_syllables' to 'n_classes'

make Librosa a dependency, deprecate current spectrogram functions

remove `input_vec_size` arg from vak.utils.data.reshape_data_for_batching

and just use "height" of data

fix default None for [DATA] output_dir that gives confusing error

fix repr for VocalizationDataset

currently dumps all attributes for every Vocalization in voc_list

Quick fix would be to override __repr__
Might be worth figuring out why attrs doesn't come up with a good __repr__ though
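
A sketch of the quick fix, written with plain classes so it runs without attrs installed. The `Vocalization` fields shown here (`audio_path`, `duration`) are assumptions for illustration. With attrs, marking the `voc_list` attribute with `repr=False` would be another route to a shorter repr.

```python
class Vocalization:
    """Hypothetical minimal Vocalization with just two attributes."""

    def __init__(self, audio_path, duration):
        self.audio_path = audio_path
        self.duration = duration


class VocalizationDataset:
    def __init__(self, voc_list):
        self.voc_list = list(voc_list)

    def __repr__(self):
        # Summarize instead of dumping every Vocalization in voc_list
        total_dur = sum(voc.duration for voc in self.voc_list)
        return (
            f"{type(self).__name__}(n_vocalizations={len(self.voc_list)}, "
            f"total_duration={total_dur:.1f} s)"
        )


vds = VocalizationDataset(
    [Vocalization("a.wav", 1.5), Vocalization("b.wav", 2.0)]
)
```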

add 'annotation_format' and 'audio_format' options to [DATA]

change absolute imports to relative where possible

automate updating vak-test-data repo on osf with datalad-osf

possibly with
https://github.com/cognoma/figshare
or
https://github.com/rmcgibbo/figshare

make it so invalid sections/options in config.ini raise human-interpretable errors

i.e. validate those sections and options
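
A possible shape for that validation, assuming the config is parsed with the standard-library `configparser`. The `VALID` mapping and the `validate` function are hypothetical; the option names are borrowed from other issues on this page and may not match the real specification.

```python
import configparser

# Hypothetical specification: section name -> allowed option names.
VALID = {
    "DATA": {"output_dir", "audio_format", "annotation_format",
             "spectrogram_files"},
    "TRAIN": {"train_data_path"},
}


def validate(config, valid=VALID):
    """Raise ValueError naming the first invalid section or option."""
    for section in config.sections():
        if section not in valid:
            raise ValueError(
                f"invalid section [{section}] in config; "
                f"valid sections are: {sorted(valid)}"
            )
        for option in config.options(section):
            if option not in valid[section]:
                raise ValueError(
                    f"invalid option '{option}' in section [{section}]; "
                    f"valid options are: {sorted(valid[section])}"
                )


good = configparser.ConfigParser()
good.read_string("[DATA]\noutput_dir = /tmp/out\n")
validate(good)  # passes silently
```

A typo such as `ouput_dir` would then fail at parse time with a message that names both the option and its section, instead of surfacing later as an unrelated error.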

remove 'log_device_placement=True' from learncurve and summary

Not helping anymore + the major reason CI logs are too verbose

rename `get_inds_for_dur` to `subset_inds` and then write wrapper `subset`

that accepts X_train (stack of spects) and returns X_train_subset, using subset_inds
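
A rough sketch of the proposed split into an index function and a thin wrapper. The signatures are assumptions: here `X_train` is a plain list of spectrograms and `durations` gives each one's length in seconds, which may differ from how `get_inds_for_dur` actually selects data.

```python
import random


def subset_inds(durations, target_dur, seed=None):
    """Return sorted indices of randomly chosen items whose summed
    duration reaches target_dur (in seconds)."""
    rng = random.Random(seed)
    inds = list(range(len(durations)))
    rng.shuffle(inds)
    chosen, total = [], 0.0
    for ind in inds:
        if total >= target_dur:
            break
        chosen.append(ind)
        total += durations[ind]
    if total < target_dur:
        raise ValueError(
            f"total duration {total} s is less than target {target_dur} s"
        )
    return sorted(chosen)


def subset(X_train, durations, target_dur, seed=None):
    """Wrapper: return the subset of X_train picked by subset_inds."""
    inds = subset_inds(durations, target_dur, seed)
    return [X_train[ind] for ind in inds]


durations = [2.0, 3.0, 1.0, 4.0]
X_train = ["spect0", "spect1", "spect2", "spect3"]  # placeholders for arrays
X_train_subset = subset(X_train, durations, target_dur=5.0, seed=0)
```

Keeping the index selection separate means the same indices could also be applied to labels or other arrays that run parallel to `X_train`.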

make 'make_spects_from_dir' func to wrap repeated logic from make_data/train/predict

One argument should be "purpose" or something similar, with valid values {'learncurve', 'train', 'predict'}, which makes the logic within the function more explicit.
That is, `if purpose == 'predict'` reads better than a Boolean flag that makes you mentally invert the logic, as in `if is_not_predict == True`.
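
To illustrate the string-valued `purpose` argument, here is a small sketch. The body is a placeholder: it returns a description of what would be done rather than making spectrograms, and the `load_labels` rule (annotations needed for everything except prediction) is an assumption.

```python
VALID_PURPOSES = {"learncurve", "train", "predict"}


def make_spects_from_dir(data_dir, purpose):
    """Placeholder showing explicit branching on `purpose`."""
    if purpose not in VALID_PURPOSES:
        raise ValueError(
            f"purpose must be one of {sorted(VALID_PURPOSES)}, "
            f"got {purpose!r}"
        )
    # Assumption: labels are only unavailable when predicting
    load_labels = purpose != "predict"
    return {
        "data_dir": data_dir,
        "purpose": purpose,
        "load_labels": load_labels,
    }


plan = make_spects_from_dir("./audio", purpose="predict")
```

Validating `purpose` up front also gives a clear error for a misspelled value, which a Boolean flag cannot do.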