Giter Site home page Giter Site logo

lasigebiotm / biont Goto Github PK

View Code? Open in Web Editor NEW
26.0 7.0 6.0 16.39 MB

BiOnt: Deep Learning using Multiple Biomedical Ontologies for Relation Extraction

License: Apache License 2.0

Dockerfile 2.98% Shell 3.64% Python 93.38%
biomedical-text-mining deep-learning ontologies relation-extraction text-mining

biont's Introduction

BiOnt: Deep Learning using Multiple Biomedical Ontologies for Relation Extraction

To perform relation extraction, our deep learning system, BiOnt, employs four types of biomedical ontologies, namely, the Gene Ontology, the Human Phenotype Ontology, the Human Disease Ontology, and the Chemical Entities of Biological Interest, regarding gene-products, phenotypes, diseases, and chemical compounds, respectively.

Our academic paper which describes BiOnt in detail can be found here.

Downloading Pre-Trained Weights

Available versions of pre-trained weights are as follows:

The training details are described in our academic paper.

Getting Started

 cd bin/
 git clone [email protected]:lasigeBioTM/DiShIn.git

Use the Dockerfile to set up the rest of the experimental environment or the BiOnt Image available at Docker Hub.

Preparing Data

  • $2: type_of_action
  • $3: pair_type
  • $4: preprocess_what
  • $5: input_path

Example:

 python3 src/ontologies_embeddings.py preprocess DRUG-DRUG train corpora/drug_drug/train
 python3 src/ontologies_embeddings.py preprocess DRUG-DRUG test corpora/drug_drug/test

For more options check model.sh.

Train Model

  • $2: type_of_action
  • $3: pair_type
  • $4: model_name
  • $6:: channels

Example:

 python3 src/ontologies_embeddings.py train DRUG-DRUG model_name words wordnet concatenation_ancestors common_ancestors

For more options check model.sh.

Predict New Data

  • $2: type_of_action
  • $3: pair_type
  • $4: model_name
  • $5: gold_standard OR data_to_test
  • $6:: channels

Example:

 python3 src/ontologies_embeddings.py test DRUG-DRUG model_name corpora/drug_drug/test words wordnet concatenation_ancestors common_ancestors

For more options check model.sh.

Reference

  • Diana Sousa and Francisco M. Couto. 2020. BiOnt: Deep Learning using Multiple Biomedical Ontologies for Relation Extraction. In Jose J. et al. (eds) Advances in Information Retrieval. ECIR 2020. Lecture Notes in Computer Science, Volume 12036, pages 367-374. Springer, Cham.

biont's People

Contributors

dpavot avatar sirconceicao avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

biont's Issues

bin/sst-light-0.4/sst

image
hello,I want to know the location of these files above picture, the directory of "bin" is nothing,can you tell me where them is?

Dataset

Hello,

Thank you for your work. Could you please share the datasets? There are no ontologies in your data as the folders are empty and just have .gitkeep. Your ontologies are OWL/RDF-based?

Assumed bug in preprocess_ids

In ontologies_embeddings.py in the method preprocess_ids, I assume that it's intended to collect the list of all ontology parents/ancestors. But wat it does instead is to take only the last ancestor. I guess correct would be something like:

 idxs.append(id_to_index[d.replace('_', ':')])

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.