
dmh's Introduction

Data Management Handbook

This handbook (available here) describes the general data management practices of the S-ENDA partners:

  • Norwegian Meteorological Institute (Meteorologisk institutt - MET)
  • Norwegian Institute for Air Research (Norsk institutt for luftforskning - NILU)
  • Norwegian Institute for Nature Research (Norsk institutt for naturforskning - NINA)
  • Norwegian Institute for Water Research (Norsk institutt for vannforskning - NIVA)

It focuses on the management and sharing of dynamic geodata in line with the FAIR (Findable, Accessible, Interoperable, and Reusable) guiding principles for scientific data management.

Suggestions for improvement are welcome: create an issue or open a pull request.

dmh's People

Contributors

elinahkk, ivanah8, knl88, lisemurberg, mdsnor, vangnina, vivigirardin

Forkers

mdsnor

dmh's Issues

Information from MET

  • Update Morten about our work
  • Investigate if there is any material in MET’s DMH that must be kept in S-ENDA’s handbook. If so, take examples from MET’s official DMH.
  • Add some MET-specific information to the Practical Guides

Access to data (without downloading it)

GDAL supports accessing data remotely, without having to download it in advance.

Should we mention it and provide some code snippets on how to do it in Python/R?

Here is an example that uses the GDAL virtual file system to access a remote NetCDF file without downloading it:

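library(stars)  # read_stars(), st_extract()
library(sf)     # st_as_sf(), st_transform(), st_crs()
library(dplyr)  # %>%

# GDAL virtual file system path: the NetCDF driver reads the file over HTTP via /vsicurl/
url <- 'NETCDF:/vsicurl/https://thredds.met.no/thredds/fileServer/senorge/seNorge_2018/Archive/seNorge2018_2006.nc'
# proxy=TRUE creates a lazy proxy object, so the pixel values are not read up front
data2006 <- stars::read_stars(url, sub='rr', proxy=TRUE)
[...]
st_crs(data2006)
# reg1 is assumed to be a data frame of points with longitude, latitude and date columns
reg1_st <- reg1 %>% st_as_sf(coords = c("longitude", "latitude"),
                             crs = 4326)

# Reproject the points to the CRS of the grid, then extract 'rr' at each point and date
reg1_st <- st_transform(reg1_st, crs=st_crs(data2006))
reg1_st$rr <- st_extract(data2006, reg1_st, time_column="date")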

This snippet uses the R stars library.

Such a URL also works with gdalinfo on the command line and in a wide variety of software that uses GDAL.
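For the Python side of the question, a minimal sketch could look like the following. It assumes the GDAL Python bindings (osgeo) are installed with NetCDF and /vsicurl/ support. The helper names gdal_netcdf_path and describe_remote are hypothetical, introduced here only for illustration. The quoted NETCDF:"...":variable form is GDAL's subdataset syntax, used here in place of the sub='rr' argument in the R snippet.

```python
from urllib.parse import urlparse


def gdal_netcdf_path(url, variable=None):
    """Build a GDAL dataset name that reads a remote NetCDF file via /vsicurl/."""
    if urlparse(url).scheme not in ("http", "https"):
        raise ValueError(f"expected an http(s) URL, got: {url!r}")
    path = f'NETCDF:"/vsicurl/{url}"'
    # Appending ":<variable>" selects a single subdataset (for example rr)
    return f"{path}:{variable}" if variable else path


def describe_remote(url, variable=None):
    """Return a gdalinfo-style text report for the remote dataset."""
    from osgeo import gdal  # imported lazily; needs the GDAL Python bindings

    gdal.UseExceptions()  # raise Python exceptions instead of returning None
    dataset = gdal.Open(gdal_netcdf_path(url, variable))
    return gdal.Info(dataset)


SENORGE_2006 = (
    "https://thredds.met.no/thredds/fileServer/senorge/"
    "seNorge_2018/Archive/seNorge2018_2006.nc"
)

# Example call (needs network access and GDAL):
# print(describe_remote(SENORGE_2006, variable="rr"))
```

Keeping the path construction separate from the GDAL call means the same string can be passed to gdalinfo or to any other GDAL-based tool.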

Maybe Benjamin Cretois can help with that as well?

Find data using Geonorge CSW

MET's DMH explains how to use MET's CSW endpoint to find data. Should we have a common section on using the Geonorge CSW endpoint to find S-ENDA data?

It looks like QGIS MetaSearch often comes with the Geonorge CSW endpoint already configured.
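If such a section is written, a free-text query against a CSW endpoint could be sketched as below. This is only an illustration: the endpoint URL is an assumption that should be verified against Geonorge's documentation, the helper name csw_getrecords_url is hypothetical, and the search term "S-ENDA" is a placeholder rather than a keyword known to be used in the catalogue. The query parameters follow the key-value encoding of a CSW 2.0.2 GetRecords request.

```python
from urllib.parse import urlencode

# Assumption: Geonorge's public CSW endpoint; verify before relying on it.
GEONORGE_CSW = "https://www.geonorge.no/geonetwork/srv/nor/csw"


def csw_getrecords_url(endpoint, search_text, max_records=10):
    """Build a CSW 2.0.2 GetRecords URL (KVP encoding) for a free-text search."""
    params = {
        "service": "CSW",
        "version": "2.0.2",
        "request": "GetRecords",
        "typeNames": "csw:Record",
        "elementSetName": "summary",
        "resultType": "results",
        "constraintLanguage": "CQL_TEXT",
        "constraint_language_version": "1.1.0",
        # search_text must not contain single quotes in this simple sketch
        "constraint": f"AnyText like '%{search_text}%'",
        "maxRecords": max_records,
    }
    return f"{endpoint}?{urlencode(params)}"


# Placeholder search term; the response is an XML document of matching records.
url = csw_getrecords_url(GEONORGE_CSW, "S-ENDA", max_records=5)
```

The resulting URL can be opened in a browser or fetched with any HTTP client. A dedicated CSW client library would be a reasonable alternative for parsing the returned records.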

Data governance - Introduction

This section should be moved to the top of the DMH and kept quite general, since we consider it introductory.

The sections from MET are more or less OK, but need to be rewritten a bit.

It's important to explain both what data governance is and why it's important.

Example from NILU, the ACTRIS Data Management Plan: https://github.com/actris/data-management-plan

NIVA Tasks

  • Create issues for actions in the agenda on GitHub
  • Correct formatting in FAIR.xlsx (sheets Glossary and metadata)
  • Update README.md

Add chapter 7 from MET's DMH

Add relevant links from chapter 7 - How to find data

  • Important to add context to links. Ex: Introduction to OpenSearch.
    The template hasn't been updated, need output from @mortenwh

DMH content

Look into the content of MET’s DMH and see what can be reused.

NINA's tasks

  • Practical guides example
  • Add information about vocabulary

Data services

Data Services

3.1. Purpose
3.2. Implementation

User Portals and Documentation

4.1. Purpose
4.2. Implementation …

The user portal section should be placed inside Data Services.

Issue assignment!

Could you please assign yourself or someone you believe is the right person for the task? I have assigned myself for #63, #69, and #70, and left a suggestion on #59 and #67. Please mark the box(es) with the issues you have assigned.

Content from MET’s DMH that should be maintained in S-ENDA’s handbook. To be re-written to fit and structured accordingly

Move this information to our handbook so we can start working on changes:

  • 1.1. The principles of data management for dynamic geodata.
  • Structuring and Documenting Data for Efficient Discovery and Use
    • 2.1. Purpose.
    • 2.2. Implementation
    • 2.2.1. Current practice
  • Data Services
    • 3.1. Purpose
    • 3.2. Implementation
  • User Portals and Documentation
    • 4.1. Purpose
    • 4.2. Implementation …
  • Data Governance
    • 5.1. Purpose
    • 5.2. Data life cycle management
    • 5.2.1. Data Management Plan.
    • 5.3. Implementation …
  • Practical Guides

Originally posted by @elinahkk in #2 (comment)

Practical guides

  • Add some MET-specific information to the Practical Guides (from issue #2)
  • NINA to provide examples for the practical guides (from issue #5)

Practical guides - what do we need?

It would be good to have a guide on how to find S-ENDA's datasets in Geonorge. We need suggestions from the other groups, especially the Dataset group.

Clean up old files

There are some old files (e.g. DMH.Rmd, FAIR.xlsx) in the main branch. These should be cleaned up or removed.

Update general data management text

Update the general data management text in chapter 4 so that it works for the whole chapter, not just the information about data management roles.

Should cover:
4.1 Institute specific data management
4.2 S-ENDA Architecture
4.3 Contact and roles
