Giter Site home page Giter Site logo

wt-biodiversity-summer-2018's Introduction

WholeTale Summer Internship 2018

Taxonomy alignment as a key to enhance reproducibility in biodiversity research: a case study of Magnolia

Author: Yi-Yun Cheng (Jessica), University of Illinois at Urbana-Champaign

Mentors: Dr. Bertram Ludaescher, UIUC ; Dr. Nico Franz, ASU

Goal of this project

Oftentimes in biodiversity research, we expect the scientific names of species to be unique identifiers, but actually they may not be. Why is that?

(1) The scientific names can vary over time

(2) The names stay the same, but the semantics of the names change

Other complicated issues:

(1) Different people may have different perceptions to the taxonomy of a same topic

(2) Species distribution datasets oftentimes only include information on a species ‘name’ without crediting the authorship of that taxonomy

This is why we are in a pressing need to align diffferent taxonomies that is addressing the same topic, not to only make the names more interoperable, but also to make way for further datasets usage.

Overview of the tasks for this project

Step 1: Decide which species (or genus) to examine

Step 2: Domain experts provide a mapping table for the taxonomies used over time for that particular species

Step 3: Researcher transpose domain experts’ table into Euler/X or LeanEuler input file

Step 4: Gather species distribution dataset from biodiverisity portals

Step 5: Concept mapping of the taxonomies and create new datasets based on different taxonomies

Step 6: Data cleaning - geocode missing lat-long information

Step 7: Visualizing species co-occurrence distribution & synthesized taxonomy alignment distribution

Step 8: Niche modeling and further analyisis

Refer to the following for details

  1. Step 5: Concept mapping process. Refer to this notebook.

  2. Step 6: Filling in missing geo-location information. Clone this repository and run the geocode.py along with testgeocode.py.

  3. Step 7: Species co-occurrence distribution visualization. Refer to this notebook.

wt-biodiversity-summer-2018's People

Contributors

yiyunyc2 avatar snunezcr avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.