Giter Site home page Giter Site logo

2021-npbert-antimalaria's Introduction

Predicting Antimalarial Activity in Natural Products using Pre-trainded BERT

T-H Nguyen-Vo, Q. Trinh, L. Nguyen, T. T. T. Do, M. C. H. Chua*, B. P. Nguyen*

alt text

Motivation

Malaria is one of the most dangerous diseases leading to thousands of deaths and millions of infected cases annually. For years, many studies have been conducted to discover potent antimalarial compounds to treat this disease. Along with chemically synthesized compounds, natural products are also demonstrated to have strong antimalarial activities. To investigate antimalarial activity in natural products, besides experimental approaches, computational methods have been developed with satisfactory outcomes obtained. In our study, we construct various prediction models to identify antimalarial natural products using pre-trained Bidirectional Encoder Representations from Transformers (so-called NPBERT) incorporated with four machine learning algorithms, including k-Nearest Neighbours (k-NN), Support Vector Machines (SVM), eXtreme Gradient Boosting (XGB), and Random Forest (RF).

Results

The results show that SVM models are the best-performed classifiers, followed by the XGB, k-NN, and RF models. Additionally, comparative analysis between our proposed molecular encoding schemes and existing state-of-the-arts indicates that NPBERT work more effectively compared to the others. Moreover, the employment of Transformers in constructing molecular encoders is not limited to this study but can be expanded to address numerous biochemical issues.

Availability and Implementation

Source code and data are available on GitHub

Citation

Nguyen-Vo, T. H., Trinh, Q. H., Nguyen, L., Do, T. T., Chua, M. C. H., & Nguyen, B. P. (2021). Predicting Antimalarial Activity in Natural Products Using Pretrained Bidirectional Encoder Representations from Transformers. Journal of Chemical Information and Modeling. DOI: 10.1021/acs.jcim.1c00584

Contact

Go to contact information

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.