TenGAN

A PyTorch implementation of "TenGAN: Pure Transformer Encoders Make an Efficient Discrete GAN for De Novo Molecular Generation." The paper has been accepted by AISTATS 2024.

Installation

Execute the following commands:

$ conda env create -n tengan_env -f env.yml
$ source activate tengan_env

File Description

- dataset: contains the training datasets. Each dataset is a single column of SMILES strings.
  - QM9.csv
  - ZINC.csv
- res: all generated datasets, saved models, and experimental results are saved in this folder.
  - save_models: all training results, including the pre-trained and trained generator and discriminator models, are saved in this folder.
- main.py: defines all hyper-parameters and runs the pretraining of the generator, the pretraining of the discriminator, and the adversarial training of TenGAN and Ten(W)GAN.
- mol_metrics.py: defines the vocabulary, the tokenization of SMILES strings, and the objective functions for the chemical properties.
- data_iter.py: loads data for the generator and the discriminator.
- generator.py: defines the generator.
- discriminator.py: defines the discriminator.
- rollout.py: defines the Monte Carlo method.
- utils.py: defines the performance evaluation methods for the generated molecules, such as validity, uniqueness, novelty, and diversity.
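
The evaluation metrics listed for utils.py can be illustrated with a minimal sketch. It is not the code in utils.py: the function name `generation_metrics`, the `is_valid` callback, and the toy validity check are hypothetical, and the definitions used here (validity over all generated strings, uniqueness over valid ones, novelty over unique valid ones not seen in training) are the commonly used ones, assumed rather than taken from this repository. Diversity is left out because it needs molecular fingerprints.

```python
from typing import Callable, Dict, Iterable


def generation_metrics(
    generated: Iterable[str],
    training: Iterable[str],
    is_valid: Callable[[str], bool],
) -> Dict[str, float]:
    """Validity, uniqueness, and novelty of generated SMILES strings.

    Assumed definitions (not confirmed against utils.py):
      validity   = valid / generated
      uniqueness = unique valid / valid
      novelty    = unique valid not in training / unique valid
    """
    generated = list(generated)
    valid = [s for s in generated if is_valid(s)]
    unique = set(valid)
    novel = unique - set(training)
    return {
        "validity": len(valid) / len(generated) if generated else 0.0,
        "uniqueness": len(unique) / len(valid) if valid else 0.0,
        "novelty": len(novel) / len(unique) if unique else 0.0,
    }


def toy_is_valid(smiles: str) -> bool:
    """Hypothetical stand-in for a real check: non-empty with balanced parentheses."""
    depth = 0
    for ch in smiles:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                return False
    return bool(smiles) and depth == 0


scores = generation_metrics(
    generated=["CCO", "CCO", "C(C", "c1ccccc1"],
    training=["CCO"],
    is_valid=toy_is_valid,
)
print(scores)
```

In practice the validity check would come from a cheminformatics toolkit (for example, testing whether RDKit's `Chem.MolFromSmiles` returns a molecule), and SMILES strings would be canonicalized before the uniqueness and novelty comparisons so that different spellings of the same molecule are not counted twice.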

Available Chemical Properties at Present:

- solubility
- druglikeness
- synthesizability

Experimental Reproduction

TenGAN on the ZINC dataset with drug-likeness as the optimized property:

$ python main.py

Citation

C. Li and Y. Yamanishi (2024). TenGAN: Pure transformer encoders make an efficient discrete GAN for de novo molecular generation. AISTATS 2024.

BibTeX format:

@inproceedings{li2024tengan,
title={TenGAN: Pure Transformer Encoders Make an Efficient Discrete GAN for De Novo Molecular Generation},
author={Li, Chen and Yamanishi, Yoshihiro},
booktitle={27th International Conference on Artificial Intelligence and Statistics (AISTATS)},
volume={238},
year={2024}
}
