Giter Site home page Giter Site logo

neurodata / connectocross Goto Github PK

View Code? Open in Web Editor NEW
3.0 4.0 3.0 137.2 MB

Connectocross: statistical characterizations and comparisons of nanoscale connectomes across taxa (A paper in progress)

Home Page: https://docs.neurodata.io/connectocross/

License: MIT License

Python 21.64% Jupyter Notebook 78.36%
connectome

connectocross's Issues

(decide if we need a) lightweight graph + metadata object

i often find something like this helpful.

Just a light graph object that stores adjacency matrix + pandas metadata on the nodes dataframe

havent done anything smart for multigraph or edges with features

likely a better way to implement the above

unsure if it is even necessary or networkx is enough. but end up using adjacency matrix representation so much that it was convenient

define a file spec for attributed graphs

  • should be well suited to the data we care about here as a first use case
  • in the past we have discussed a csv edgelist for the graph itself with separate json(s) for metadata

fit a priori SBMs to the connectomes

Spencer:

  • seems like something that is easy to do and we could do for all of them
  • could just use the metadata, find some features that we care about.

Ben:

  • maybe we fit using various node metadata columns and just report likelihood, number of parameters, bic or something like that
  • maybe also just plot them and show that we can fit these models

graph matching stuff

we could run graph matching on a lot of these connectomes
maybe even match some of them to other connectomes

write data pulling functions for the connectomes of interest

for the connectomes listed in the readme (roughly, exact ones may change)

  • want a simple, clear script to pull necessary graph + metadata from wherever it's hosted online
  • saves that data to the format specified in #1

Note: obviously a bit downstream of #1, but some work on pulling the data could probably start concurrently. configuring how to save to whatever format we pick is likely not the bottleneck for this issue.

write data pulling functions for the connectomes of interest - xlsx

for the connectomes listed in the readme (roughly, exact ones may change)

  • want a simple, clear script to pull necessary graph + metadata from wherever it's hosted online
  • saves that data to the format specified in #1

Note: Sub-issue of #2

Specifically, focus on datasets stored as .xlsx files.

Consistent styling

Palettes

As a group, please decide on a consistent color palette for species/dataset:
https://seaborn.pydata.org/tutorial/color_palettes.html
https://matplotlib.org/stable/tutorials/colors/colormaps.html

It may make sense to do something like have two different shades of the same color for multiple related datasets (e.g. two C. elegans could be two shades of blue or something) as long as this is less distinct than the species annotation.

I'd like this palette to just be saved somewhere as a json that any script can just import

Style

As a group, please decide on a consistent style for matplotlib. This is also something that can be saved and import easily. One example is here which you are welcome to use or modify.

E.g. each notebook you each separately make can then just call set_theme() and everyone's plots will look the same

simple statistics

  • number of nodes
  • number of edges
  • max degree
  • graph density

chart of the above for each connectome we have

L/R homotypic affinity?

to what extent are edges between L/R pairs more probable than any L/R connection?

(which datasets have L/R pairs?)

(If they don't have pairs, can we use graph matching to predict?)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.