Giter Site home page Giter Site logo

data2health / data-harmonization Goto Github PK

View Code? Open in Web Editor NEW
16.0 11.0 3.0 3.93 MB

Supporting multi-center research requires combining data created in different data models; this community coordination project aims to provide an data model adaptor for CTSA hubs.

data2health cd2hpm

data-harmonization's Introduction

Github.io

data-harmonization

Clinical data in CTSA hubs are not readily queryable in a federated fashion. Many efforts exist to address this, including TriNetX, ACT, PCORNet, and OHDSI among others. Unifying these with an HL7 FHIR framework is an aspiration.

Enabling the CTSA to function as a federated network of clinical data, supporting multicenter research is among the core goals of the program. This project advances that agenda through common data model harmonization.

Problem statement

Data respositories across CTSA hubs need to have semantic and syntactic alignment to support federated query. This must impose a minimal maintenence burden on CTSA hub sites. Leveraging the native FHIR APIs, no proposed as required for US EHRs by CMS, would mitigate ETL costs and maintenence issues.

Project description

Harmonize the data ecosystem. An improved data ecosystem will enhance and extend existing work being performed on the NCATS Data Translator system, which integrates clinical and translational data at scale for mechanistic discovery, as well as other emergent systems such as the NIH Commons. We will apply our strengths and existing activities to make data FAIR-TLC: Findable, Accessible, Interoperable, and Reusable, as well as Traceable, Licensable, and Connected. We will assist contributors and users to develop and apply data standards, Common Data Elements (CDEs), and other commonly utilized data models such as FHIR and OHDSI. We will extend and supplement infrastructure, training, and collaborative environments to enable data to be shared openly, so that groups can collaborate on its harmonization based on specific needs or standards. The data ecosystem will provision CTSA-wide quality assurance reports and data quality assessment, as well as gold-standard datasets and synthetic clinical data sets. Fundamentally, we aim to develop an open-science ethos and unite CTSA community data sharing with broader global efforts.

Alignment to program objectives

TODO see here

Sub-projects

Related Projects

Contact person

Point person (github handle) Site Program Director
Tricia Francis (@tricfran) JHU Chris Chute (@cgchute)

Leads

Project scientific leadership:

Lead(s) (github handle) Site
Chris Chute (@cgchute) JHU

Team members

Team members are listed here.

Repositories

Many repositories could be listed here, including FHIR sites and CDM data models. However, for parsimony, we presently list the main FHIR project and the NCATS supported clinicalprofiles.org.

Deliverables

Key long-term deliverables

  • A coherent common data model across CTSA hubs, arising naturally from their EHR sources.
  • Shared terminology services across the CTSA community

Milestones

Milestones are listed, though at present are quite general.

Evaluation

Evaluation of data harmonization will ultimately rest with its impact on our community. The goal is to enable federated query and inferencing at scale across the CTSA community. There are likley to be many lesser advantages and consequences. Several evaluation issues are in place, though we expect they will evolve with time.

Education

We anticipate substantial need to educated the CTSA community about elements of the well-known FHIR specification relevent to translational research. In particular, the notion of managing FHIR as a canonical model, with migration paths to traditional common data models (e.g. OMOP/OHDSI, PCORNet, ACT, etc.)

Explore Our Work

Get involved

We encourage the community to get involved.

We are looking for community participation in the following areas:

If you are interested in participating, please onboard here or contact Tricia Francis at [email protected] with any questions.

Working documents

Documentation for the various data harmonization task teams can be found at this Google drive folder and project specific work may be in this GitHub using the wiki or .md files.

data-harmonization's People

Contributors

cgchute avatar cgcook avatar eichmann avatar hsolbrig avatar jmcmurry avatar mellybelly avatar richard1933 avatar tricfran avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

data-harmonization's Issues

5.2.19 HL7 and FHIR Coordination with Translational Research

Hopkins hosted Grahame Grieve, the original developer and present leader of the HL7 FHIR initiative, Wayne Kubick, the CTO at HL7, and Chuck Jaffe, the CEO at HL7, in Baltimore, Thursday, May 2nd. HL7 has prioritized translational research, from precision medicine to clinical trials, as a priority area for engagement and expansion. In particular, they seek to work with CTSAs and we are taking this opportunity to learn more about translational medicine, the CTSA, PMAP, and our work on a new FHIR resource for the NCATS Translator program. NCATS representatives also attended: Ken Gersing, Xinzhi Zhang, Qian Zhu and Elaine Collier.

CAMP-FHIR access, CDM to FHIR

CAMP FHIR is a lightweight application intended to transform clinical data stored in a relational database (RDBMS) to HL7's Fast Healthcare Interoperability Resources. Once your clinical data are in FHIR, they can be used for a variety of downstream applications.

image

Project spearheaded at UNC.
CAMP-FHIR github repo received by CDM-FHIR Task Team co-lead Emily Pfaff.

HL7 and FHIR Coordination with Translational Research Summit

Hopkins hosted Grahame Grieve, the original developer and present leader of the HL7 FHIR initiative, Wayne Kubick, the CTO at HL7, and Chuck Jaffe, the CEO at HL7 in Baltimore on Thursday, May 2nd. HL7 has prioritized translational research, from precision medicine to clinical trials, as a priority area for engagement and expansion. NCATS representatives attended - including Ken Gersing, Xinzhi Zhang, Qian Zhu and Elaine Collier.

7.17.19 CDM-FHIR Gap Analysis Task Team Meeting outcomes

7.17.19 CDM-FHIR Gap Analysis Task Team met.

Attendees: Emily Pfaff (lead), Guoqian Jiang, Brian Furner, Thomas Oberst, Andrew Williams, Chris Chute, Svetlana Rojevsky, Umit Topaloglu, Jack Chang, Catherine Craven, Les Lenert, Tony Solomonides, Adam Lee, Aaron Zhang, Tricia Francis (3 other members had conflicts).

Determined to collected all known mappings already done by next meeting.
Looking at option to have a working meeting around AMIA in November.

SMILE CDR demonstrated

Demonstrated by Leslie Lenert at South Carolina (MUSC) at the Federated Data Query Workshop in May 20 & 21, 2019

5.20&21.19 Federated Data Query Workshop

Workshop included determination of five sub-groups and drafting of a manuscript.
HL7 Engagement
CDM-FHIR Gap Analysis
FHIR Education
FHIR server options
Sustainability and Change Management

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.