manas02 / fbdd

FragmentBERT: Masked Language Models are Fragment Based Drug Designers

License: Apache License 2.0

Python 2.13% Shell 0.20% Jupyter Notebook 97.67%
bert cheminformatics drug-discovery

fbdd's Introduction

Masked Language Models are Fragment Based Drug Designers

Download, Process and Train FragmentBERT

bash run.sh

Steps to reproduce:

  • fragment_tokenizer/tokenizer.py: defines the fragment augmentation, encoder and decoder.
  • moses_train_test_split.py: takes moses_dataset.csv and produces the train and test MOSES data.
  • encode_moses.py: produces the failed and the augmented encoded MOSES SMILES files.
  • model/tokenizer.py: creates the FragmentBERT tokenizer.
  • model.py: trains the model and logs to WandB.
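
As a rough illustration of the train/test split step, here is a minimal sketch that groups CSV rows by a split label. It is not the code in moses_train_test_split.py: the column names SMILES and SPLIT, the helper name split_by_column and the sample rows are assumptions made for this example, and the actual script may split the data differently.

```python
import csv
import io
from collections import defaultdict


def split_by_column(csv_text, split_column="SPLIT"):
    """Group CSV rows by the value found in `split_column` (assumed column name)."""
    groups = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        groups[row[split_column]].append(row)
    return dict(groups)


# Tiny made-up sample standing in for moses_dataset.csv.
sample = "SMILES,SPLIT\nCCO,train\nc1ccccc1,test\nCCN,train\n"
parts = split_by_column(sample)

# Number of rows that ended up in each split.
counts = {name: len(rows) for name, rows in parts.items()}
print(counts)
```

Each group could then be written to its own file with csv.DictWriter.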

Conda

conda env create -f environment.yml
conda activate fbdd

fbdd's People

Contributors

manas02, gitbutler-client

fbdd's Issues

Help with WandB error

Running src/model/model.py...
wandb: ERROR api_key not configured (no-tty). call wandb.login(key=[your_api_key])
Traceback (most recent call last):
File "/home/martis-e/work/code-tuts/fbdd/fbdd/src/model/model.py", line 16, in <module>
wandb.init(project="FragmentBERT")
File "/home/martis-e/miniconda3/lib/python3.11/site-packages/wandb/sdk/wandb_init.py", line 1195, in init
raise e
File "/home/martis-e/miniconda3/lib/python3.11/site-packages/wandb/sdk/wandb_init.py", line 1172, in init
wi.setup(kwargs)
File "/home/martis-e/miniconda3/lib/python3.11/site-packages/wandb/sdk/wandb_init.py", line 306, in setup
wandb_login._login(
File "/home/martis-e/miniconda3/lib/python3.11/site-packages/wandb/sdk/wandb_login.py", line 317, in _login
wlogin.prompt_api_key()
File "/home/martis-e/miniconda3/lib/python3.11/site-packages/wandb/sdk/wandb_login.py", line 247, in prompt_api_key
raise UsageError("api_key not configured (no-tty). call " + directive)
wandb.errors.UsageError: api_key not configured (no-tty). call wandb.login(key=[your_api_key])
/home/martis-e/work/code-tuts/fbdd/fbdd
Finished running src/model/model.py.
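
A possible workaround, sketched under assumptions: the error says that no API key is configured and that there is no terminal to prompt for one. One option is to export WANDB_API_KEY before running the script and to fall back to offline logging when it is missing. The helper names resolve_wandb_mode and init_run below are hypothetical and are not part of the repository; wandb.login(key=...) and the mode argument of wandb.init are the only wandb calls used.

```python
import os


def resolve_wandb_mode(env=None):
    """Return "online" when an API key is available, otherwise "offline"."""
    env = os.environ if env is None else env
    return "online" if env.get("WANDB_API_KEY") else "offline"


def init_run(project="FragmentBERT"):
    """Start a WandB run without needing an interactive login prompt."""
    # Imported lazily so resolve_wandb_mode works even without wandb installed.
    import wandb

    mode = resolve_wandb_mode()
    if mode == "online":
        # Log in explicitly, as the error message suggests.
        wandb.login(key=os.environ["WANDB_API_KEY"])
    # In offline mode the run is stored locally and no key is required.
    return wandb.init(project=project, mode=mode)
```

With this in place, the wandb.init(project="FragmentBERT") call in model.py could be replaced by init_run(), assuming the rest of the script only uses the returned run object and wandb.log.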

Implement Molecule and MolOps

Implement Molecule & Fragment Ops

  • Create a Molecule class that handles the basic conversions between an rdkit.Mol object and SMILES, SMARTS, BRICS fragments, etc.
  • Add the type alias Molecules = list[Molecule].
  • Create a Fragment class that holds fragment data together with its connectivity information.
  • Add the type alias Fragments = list[Fragment].
  • Add error handling to both classes.
  • Implement __hash__ & __eq__ for both classes.
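
One way the items above could fit together is sketched below. It is only a starting point, not the planned implementation: the field names (smiles, attachment_points), the MoleculeError exception and the method names are assumptions. Frozen dataclasses provide __eq__ and __hash__ automatically, and RDKit is imported lazily so that the value types can be used without it.

```python
from __future__ import annotations

from dataclasses import dataclass


class MoleculeError(ValueError):
    """Raised when a SMILES string cannot be parsed (hypothetical name)."""


@dataclass(frozen=True)
class Fragment:
    """A fragment SMILES plus the atom indices where it can be attached."""
    smiles: str
    attachment_points: tuple[int, ...] = ()


@dataclass(frozen=True)
class Molecule:
    """A molecule identified by its SMILES; frozen=True gives __eq__ and __hash__."""
    smiles: str

    @classmethod
    def from_smiles(cls, smiles: str) -> Molecule:
        from rdkit import Chem  # lazy import; requires RDKit
        mol = Chem.MolFromSmiles(smiles)
        if mol is None:  # RDKit returns None for unparsable input
            raise MoleculeError(f"could not parse SMILES: {smiles!r}")
        return cls(Chem.MolToSmiles(mol))  # store the canonical form

    def to_mol(self):
        from rdkit import Chem
        return Chem.MolFromSmiles(self.smiles)

    def to_smarts(self) -> str:
        from rdkit import Chem
        return Chem.MolToSmarts(self.to_mol())

    def brics_fragments(self) -> Fragments:
        from rdkit.Chem import BRICS
        # BRICSDecompose returns a set of fragment SMILES; sort for stable order.
        return [Fragment(s) for s in sorted(BRICS.BRICSDecompose(self.to_mol()))]


Molecules = list[Molecule]
Fragments = list[Fragment]
```

Because equality is based on the stored string, two Molecule objects only compare equal across different spellings of the same structure when both are built through from_smiles, which canonicalizes the input.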
