Skip to content
/ MoIma Public

Molecule Imagination: A framework aims to accelare machine learning in Chemistry

License

Notifications You must be signed in to change notification settings

fate1997/MoIma

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

${\color{#FF9EAA}\rm{Mo}\color{black}\rm{lecule} \space \color{#FF9EAA}\rm{Ima}\color{black}\rm{gination}}$

This repository contains a framework for easily building, training, and evaluating machine learning models for molecular property prediction and molecule generation. The overview of the framework is shown below. alt text

The main idea is to use Simple Factory Pattern to easily create featurizer, dataset, splitter, model, optimizer, scheduler, and loss function. Then the Pipe class will take care of the training, validation, and testing process. Coupled with Pipe, there is a Config class. All the configurations will be controled via a Config class, which could parse inputs from files, command line, or python arguments. Currently, the framework supports the following pipes:

The usage of the frameworks is simple, for example, to train a VAE model, you can simply run the following command:

from moima.pipeline.vae_pipe import VAEPipe, VAEPipeConfig

# Specify the parameters for the VAEPipe
config = VAEPipeConfig(
    raw_path='./example/database/zinc1k.csv', 
    desc='pytest_run', 
    seq_len=120,
    save_interval=50000,
    log_interval=5,
    vocab_size=100,
    num_epochs=50
)


pipe = VAEPipe(config)
pipe.train()

You can also easily evaluate the model or generate molecules using the following commands:

pipe.eval()
sampled_smiles = pipe.sample(10)

# Save/Load the Pipe
pipe.save(name='test')
pipe.from_pretrained(path='test.pt')

The framework is still under development, and more features will be added in the future.

Installation

First, install the required dependencies:

# Install torch and torch_geometric
conda install -y pytorch==2.3.0 torchvision==0.18.0 torchaudio==2.3.0 \
                 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install torch_geometric
pip install torch_scatter torch_sparse -f https://data.pyg.org/whl/torch-2.3.0+cu118.html

# Install other dependencies
pip install e3nn faenet egnn_pytorch mace_torch meeko numpy pandas pytest selfies

Then, clone the repository and install the package:

git clone https://github.com/fate1997/MoIma.git
cd MoIma
pip install -e .

About

Molecule Imagination: A framework aims to accelare machine learning in Chemistry

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages