Giter Site home page Giter Site logo

getmetamapper / metamapper Goto Github PK

View Code? Open in Web Editor NEW
76.0 6.0 5.0 28.75 MB

Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.

Home Page: https://www.metamapper.io

License: BSD 2-Clause "Simplified" License

Dockerfile 0.13% Makefile 0.11% Python 60.23% Shell 0.35% HTML 0.23% JavaScript 34.96% CSS 0.05% TSQL 1.96% SCSS 1.99%
metamapper data-catalog data-discovery schema-inspection data-warehouse python django data-curation metadata

metamapper's Introduction

Metamapper

CircleCI Vulnerabilities latest version python discord

Metamapper is an open-source metadata management platform that aims to make it easier to share data and its context across your organization. It's a self-updating data catalog complete with full-text search, an integrated commenting system, and much more.

What we're trying to accomplish

Growing organizations rely on data and analytics to drive decisions. With the emergence of tools like Airflow and companies like Segment and Fivetran, it's never been easier to get data into your warehouse.

But with all of this data comes a lot of noise. It can become difficult to keep track of things like business purpose and/or timeliness of your data, amongst other things. Plus, writing and maintaing that sort of documentation is just plain boring.

Metamapper aims to automate those boring documentation tasks and reduce the time that data engineers spend answering redundant questions. Just connect your data warehouse and Metamapper will periodically scan the datastore and maintain a commentable data catalog that can be viewed by your team via the UI.

Think of it as Google for your data warehouse โ€“ perform a search and it'll find the data that best fits your needs.

Here are a few features of Metamapper:

  • Browser-based: Everything in your browser, with a shareable URL you can give to your team.
  • Schema inspection: Metamapper crawls your database schema(s) every few hours and maintains a comprehensive data catalog.
  • Change detection: Detects when data definitions change between schema inspection runs. Useful for alerting uncommunicated changes.
  • Annotations: Supports comments on almost every object so your team can crowdsource knowledge about data assets.
  • Custom Properties: Easily attach custom metadata to databases and tables, such as data steward or ETL process references.
  • Search: Everything is indexed and searchable. Self-service data analytics, here we come!

Quickstart

You can try out a default version of Metamapper with sample data using Docker and Docker-Compose.

Clone the repository:

git clone [email protected]:getmetamapper/metamapper.git

From the repository root:

docker-compose -f docker-quickstart.yml up

Head to http://localhost:5555 to view the Metamapper UI. Try searching for "clickstream events" and see what happens!

Installation

Use our pre-baked Docker images. Detailed setup instructions can be found here: https://github.com/getmetamapper/metamapper-setup

Documentation

Supported datastores

Metamapper currently supports automatic crawling and indexing of these database management systems with plans to add more in the near future.

  • Amazon Redshift
  • AWS Athena
  • AWS Glue
  • Azure SQL Database
  • Azure Synapse (formerly Azure DW)
  • Google BigQuery
  • Hive Metastore
  • Microsoft SQL Server
  • MySQL
  • Oracle
  • PostgreSQL
  • Snowflake

Community / Get Involved

Other Resources

metamapper's People

Contributors

cn86 avatar scruwys avatar snyk-bot avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

metamapper's Issues

Exact search feature

Feature

Return exact matches when the search term is enclosed in double-quotes.

Motivation

This feature would enable the user to narrow down the search results.

Search results do not acknowledge workspace roles

Bug Report

Search results currently do not honor role-based permissions, such as the Owner role.

Have you read the Contributing Guidelines on issues?

Yes

Actual Behavior

If John Doe is an owner of a workspace, he would have access to the Postgres datastore. If object-level permissions are enabled and he is not explicitly granted the view_datastore permission, the Postgres datastore will not appear in the search results.

This should be a pretty quick fix in the elastic_backend.py if we just allow all datastores within a workspace if the requesting user is an owner.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.