Giter Site home page Giter Site logo

3dheatmapdatapreprosses's Introduction

3D Heat Map Data Preprocess

This project is about retrieving mineral data from OpenMindat API and cleansing the data into our designed data structure for the 3D heat map to analyze co-relationships between minerals and elements.

For users interested in using Mindat API, please refer to https://github.com/ChuBL/How-to-Use-Mindat-API.

Get Started

Prerequisite

You will need an api_key.txt file in the root path of your cloned repository to run the codes. The API key will not be included in this repository. Please reach out to Mindat administrators for help.

Run the code

The whole data stream is wrapped in mindat_data_processor.py. You can walk through all the steps from data retrieving to data export by running this single .py file.

File structures

Data Description

The retrieved data are saved in ./mindat_data/raw_data, in the naming format of mindat_items_IMA_00000000000000.json.

The exported CSV files are saved in ./mindat_data/csv/ Under this directory we provided 8 generated datasets derived from OpenMindat IMA-approved mineral species.

Subset 30 Common Elements Datasets

  • 30_elements.csv Elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of 30 top frequency elements in Mindat attribute elements.

  • 30_sigelements.csv Elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of 30 top frequency elements in Mindat attribute sigelements.

  • normalized_30_elements.csv Normalized elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of 30 top frequency elements in Mindat attribute elements.

  • normalized_30_sigelements.csv Normalized elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of 30 top frequency elements in Mindat attribute sigelements.

All Elements Datasets

  • 73_elements.csv Elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of all elements in Mindat attribute elements.

  • 73_sigelements.csv Elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of all elements in Mindat attribute sigelements.

  • normalized_73_elements.csv Normalized elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of all elements in Mindat attribute elements.

  • normalized_73_sigelements.csv Normalized elements cooccurrence 3D matrix, comprises of as a concatenated 2D matrices of all elements in Mindat attribute sigelements.

Auxiliary code

Data retrieving

mindat_api.py for retrieving data from Mindat api.

Csv Normalization

csv_normalizer.py for generating normalized version of the cleaned CSV file.

3dheatmapdatapreprosses's People

Contributors

chubl avatar xgmachina avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

richardscottoz

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.