PROUD: PaRetO-gUided diffusion model for multi-objective generation


Abstract

Recent advancements in the realm of deep generative models focus on generating samples that satisfy multiple desired properties. However, prevalent approaches optimize these property functions independently, thus omitting the trade-offs among them. In addition, the property optimization is often improperly integrated into the generative models, resulting in an unnecessary compromise on generation quality (i.e., the quality of generated samples). To address these issues, we formulate a constrained optimization problem that seeks to optimize generation quality while ensuring that generated samples reside on the Pareto front of multiple property objectives. Such a formulation enables the generation of samples that cannot be further improved simultaneously on the conflicting property functions and preserves good quality of the generated samples. Building upon this formulation, we introduce the ParetO-gUided Diffusion model (PROUD), wherein the gradients in the denoising process are dynamically adjusted to enhance generation quality while the generated samples adhere to Pareto optimality. Experimental evaluations on image generation and protein generation tasks demonstrate that our PROUD consistently maintains superior generation quality while approaching Pareto optimality across multiple property functions compared to various baselines.
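For readers who want a concrete picture of the kind of gradient adjustment referred to above, the following minimal PyTorch sketch combines two property gradients at a noisy sample via the closed-form minimum-norm direction for two objectives (multiple-gradient descent in the sense of Désidéri, 2018). This is only a conceptual illustration, not the PROUD algorithm itself: the callables f1, f2 and the sample x_t are assumed placeholders, and the paper's noise schedule, quality objective, and Pareto-guided switching rule are omitted.

import torch

def common_descent_direction(g1, g2):
    # Minimum-norm convex combination alpha*g1 + (1-alpha)*g2 of two gradients
    # (closed form for m = 2). It decreases both objectives unless the current
    # point is already Pareto-stationary.
    diff = g1 - g2
    denom = diff.pow(2).sum()
    if denom < 1e-12:                      # gradients (almost) coincide
        return 0.5 * (g1 + g2)
    alpha = torch.clamp(((g2 - g1) * g2).sum() / denom, 0.0, 1.0)
    return alpha * g1 + (1.0 - alpha) * g2

def property_guidance(x_t, f1, f2):
    # Common-descent direction of the two property objectives at the current
    # noisy sample x_t; a guided sampler would subtract a scaled version of this
    # direction in its denoising update (step sizes are omitted in this sketch).
    x_t = x_t.detach().requires_grad_(True)
    g1 = torch.autograd.grad(f1(x_t).sum(), x_t)[0]
    g2 = torch.autograd.grad(f2(x_t).sum(), x_t)[0]
    return common_descent_direction(g1, g2)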


Availability of data and materials

All datasets used in this work are available online and clearly cited.

Code availability

The code of this work is available at https://github.com/EvaFlower/Pareto-guided-diffusion-model.

Notes

  1. This relates to the manifold hypothesis that many real-world high-dimensional datasets lie on low-dimensional latent manifolds in the high-dimensional space (Fefferman et al., 2016).

  2. In other words, the generated samples are as realistic as the samples in the given dataset \(\mathcal {X}\).


  3. As demonstrated in Sect. 3 and Fig. 3b of their study, an objective that forces the center of generated images to be a black square can be used for constrained sampling on CIFAR10. Accordingly, they obtain samples that lie on the CIFAR10 data manifold and exhibit a black square in the middle, such as “black plane” and “black dog” images that contain a black square (smaller than the object) in the middle. This task can be considered image outpainting (Yao et al., 2022), namely, extrapolating images based on specified color patches on CIFAR10.

  4. RGB values [0, 255] are divided by 255.

  5. We only sample 5,000 protein sequences since the computational cost of evaluating SASA values is very high.

  6. Our problem setting is slightly different in that we take the squared distance in order to obtain a non-linear Pareto front. We also refer the reader to Example 1 in Liu et al. (2021a), which defines the same two-objective problem but with a 1-D decision variable, for ease of understanding.

  7. We use \([0.5_{\Omega }, 1_{\Omega }]\) to denote image patches in normalized RGB color values between [0.5, 0.5, 0.5] (grey) and [1, 1, 1] (white).

References


Funding

This work was supported by the National Research Foundation, Singapore and DSO National Laboratories under the AI Singapore Programme (AISG Award No: AISG2-GC-2023-010-T), the A*STAR GAP project (Grant No. I23D1AG079), the A*STAR Career Development Fund (Grant No. C222812019), the A*STAR Pitchfest for ECR 232D800027, the A*STAR Centre for Frontier AI Research, the Program for Guangdong Introducing Innovative and Entrepreneurial Teams (Grant No. 2017ZT07X386), NSFC (Grant No. 62250710682), and the Program for Guangdong Provincial Key Laboratory (Grant No. 2020B121201001).

Author information

Authors and Affiliations

Authors

Contributions

Idea: YY; Methodology and Experiment: YY, YP; Writing—comments/edits: all.

Corresponding author

Correspondence to Yuangang Pan.

Ethics declarations

Conflict of interest

The authors have no financial or non-financial interests to disclose that are relevant to the content of this article.

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent to publish

Not applicable.

Additional information

Editor: Myra Spiliopoulou.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Complete sensitivity analysis for single-objective generation

We set the weight coefficient w for combining the two objectives in DM+single, \(w\times f_1(x)+(1-w)\times f_2(x)\), from 0 to 1 with a step of 0.1. The results are shown in Fig. 7:

  • when \(w < 0.5\), the resultant final objective is dominated by \(f_2(x)\). Consequently, this dominating objective is optimized the most: all the generated samples attain the smallest value for \(f_2(x)\) but the largest one for \(f_1(x)\).

  • when \(w > 0.5\), the resultant final objective is dominated by \(f_1(x)\). Therefore, the generated samples achieve the smallest value for the first objective but the largest one for the second objective.

  • when \(w = 0.5 =\frac{1}{m}\), the generated samples are expected to attain the compromise value between \(f_1(x)\) and \(f_2(x)\), i.e., (0.0625, 0.0625); see the short derivation after this list. We notice that the generated samples cover a small range around this point. This diversity could result from the diffusion noise in diffusion models (Figs. 8, 9, 10).
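The compromise value (0.0625, 0.0625) can be verified directly if, as in Footnote 6, each objective is taken to be the squared distance of the patch value to its target (white and grey, respectively, in the 1-D illustration):

\[
f_1(x) = (x - 1)^2, \qquad f_2(x) = (x - 0.5)^2 .
\]

Minimizing the equally weighted sum \(0.5 f_1(x) + 0.5 f_2(x)\) gives the midpoint \(x^\star = 0.75\), so that

\[
f_1(x^\star) = f_2(x^\star) = (0.25)^2 = 0.0625 .
\]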

Fig. 7

Sensitivity analysis on the weight coefficient for combining the two objectives, i.e., \((1-w)f_1 + wf_2\), in DM+single. The depth of color represents sample density (the deeper, the higher)

Fig. 8

Different diversity coefficient \(\gamma\) for DM+m-MGD on CIFAR10 optimized with three objectives. 1000 generated samples are randomly selected for visualization

Fig. 9

Different diversity coefficient \(\gamma\) for m-MGD on CIFAR10 optimized with three objectives. 1000 generated samples are randomly selected for visualization

Fig. 10

Approximation of Pareto front of various methods on CIFAR10 optimized with three objectives. The first row presents 50,000 generated samples while the second row presents non-dominated points out of 50,000 sample points, verifying the HV results obtained in Table 2

Appendix B: More experimental settings and analyses

Image Generation

According to Ishibuchi et al. (2013) and Li et al. (2017),Footnote 6 we obtain that: (1) the Pareto solutions of the two-objective setting are the points on the line segment between \(1_{\Omega }\) and \(0.5_{\Omega }\), i.e., \(\{x \mid x_{\Omega }=\kappa _{\Omega },\ \kappa _{\Omega } \in [0.5_{\Omega }, 1_{\Omega }]\}\).Footnote 7 When selecting images from CIFAR10 based on this Pareto set (Fig. 12), we follow Liu et al. (2021b) and sample images in a small neighborhood around \(\kappa _{\Omega }\), namely \(\Vert x_\Omega -\kappa _\Omega \Vert _2^2 \le \epsilon\), where \(\epsilon =8\times 10^{-4}\). (2) The Pareto solutions of the three-objective setting are the points on the convex polygon formed by the three points \(a_{\Omega }, b_{\Omega }, c_{\Omega }\). For ease of understanding, we assume \(\Omega =3\times 1\times 1\), which amounts to constraining the middle pixel of CIFAR10 images to certain colors.

We visualize the Pareto fronts of these two settings in Fig. 11. Specifically, for the two-objective setting, the Pareto optimal points lie on the line between [1, 1, 1] and [0.5, 0.5, 0.5] (Fig. 11a), which denote normalized RGB values (RGB values in [0, 255] divided by 255). We then calculate the objective values \([f_1(x), f_2(x)]\) for these points, shown in Fig. 11b. Figure 11c, d are plotted for the three-objective setting in a similar way. According to their Pareto fronts, we select [0.25, 0.25] and [0.2, 0.1, 0.2] as reference points to calculate the hypervolume (HV) for the two-objective setting and the three-objective setting in Table 2, respectively.
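For the two-objective setting, the reference-point HV computation can be illustrated with a short numerical sketch. It assumes, as in Footnote 6, squared-distance objectives to the white and grey targets and parameterizes the Pareto set by the 1-D patch value \(\kappa \in [0.5, 1]\); the exact objective scaling used in our experiments may differ.

import numpy as np

# Assumed 1-D form of the two objectives (cf. Footnote 6): squared distances
# to the white (1.0) and grey (0.5) targets for a patch value kappa in [0.5, 1].
kappa = np.linspace(0.5, 1.0, 200)
front = np.stack([(kappa - 1.0) ** 2, (kappa - 0.5) ** 2], axis=1)  # ideal Pareto front

def hypervolume_2d(points, ref):
    # Hypervolume dominated by a 2-D non-dominated front w.r.t. a reference
    # point (minimization): sweep the points sorted by f1 and accumulate the
    # rectangular strips they dominate.
    pts = points[np.all(points <= ref, axis=1)]
    pts = pts[np.argsort(pts[:, 0])]
    hv, prev_f2 = 0.0, ref[1]
    for f1, f2 in pts:
        hv += (ref[0] - f1) * (prev_f2 - f2)
        prev_f2 = f2
    return hv

print(hypervolume_2d(front, np.array([0.25, 0.25])))  # HV of the ideal front w.r.t. [0.25, 0.25]

Applying the same routine to the non-dominated points of a method's generated samples gives its HV with respect to this reference point.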

We sample CIFAR10 images under constraints with different patch sizes to demonstrate the effect of the patch size in Fig. 13. With a smaller region \(\Omega\), more CIFAR10 images meet the constraint.

Fig. 11

Pareto front of two and three objectives in data space and functionality space optimized for CIFAR10 image generation

Protein Sequence Generation

Our experiments in Section 5.2 adopted the same dataset and objectives as those in Section 5.2 of Gruver et al. (2023). Note that we did not include their other experiments because the experiment in their Section 5.1 is not a generation task equipped with property optimization, and the datasets for the experiments in their Sections 5.3 and 5.4 have not been released as they contain private data. We select \([1\times 10^4, 0]\) as the reference point to calculate the HV for this task.

Justification of Our Experiment Designs

Our experiment designs appropriately reflect the motivation of the MOG problem. Both the CIFAR10 and protein datasets are real-world datasets whose data lie on low-dimensional manifolds in a high-dimensional space (Krizhevsky and Hinton, 2009; Gruver et al., 2023), and are thus applicable to our MOG problem setting. Meanwhile, the objectives considered for CIFAR10 are benchmark multi-objective optimization problems with clear evaluations (Ishibuchi et al., 2013), and the objectives considered for the protein design task represent real-world scenarios (Gruver et al., 2023). Lastly, Fig. 2 and Table 2 demonstrate the necessity of considering generation quality, as the generation quality of all baseline methods suffers to some extent when optimizing multiple properties.

Significance Test

We apply the Friedman test under the null hypothesis that all methods perform similarly, together with the Nemenyi post-hoc test for pairwise comparisons among the four methods (Demšar, 2006). The number of compared methods was set to four, since m-MGD failed to produce qualified samples and was excluded. The comparison comprised 30 instances: each of the four methods was independently evaluated five times across three datasets under two evaluation criteria. The Friedman test yields \(\tau _F=18.24\), which is greater than the critical value \(F_{3,87} = 2.709\) at \(\alpha = 0.05\). Therefore, the null hypothesis is rejected, indicating a statistically significant difference among the four methods at the 0.05 significance level. The subsequent Nemenyi post-hoc test in Fig. 14 demonstrates that our PROUD is markedly superior to the three baseline methods.
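The test statistics above can be reproduced from a score matrix of shape 30 x 4 (instances x methods) using the standard formulas in Demšar (2006). The sketch below (the per-instance scores themselves are not listed here) computes the Iman–Davenport corrected Friedman statistic \(\tau_F\), its critical value \(F_{3,87}\), and the Nemenyi critical difference.

import numpy as np
from scipy import stats

def friedman_iman_davenport(scores, alpha=0.05):
    # scores: (N instances x k methods); lower score is assumed to be better,
    # so rank 1 goes to the best method on each instance.
    N, k = scores.shape
    ranks = np.apply_along_axis(stats.rankdata, 1, scores)  # rank methods per instance
    R = ranks.mean(axis=0)                                   # average rank of each method
    chi2_f = 12.0 * N / (k * (k + 1)) * (np.sum(R ** 2) - k * (k + 1) ** 2 / 4.0)
    tau_f = (N - 1) * chi2_f / (N * (k - 1) - chi2_f)        # Iman-Davenport correction
    crit = stats.f.ppf(1 - alpha, k - 1, (k - 1) * (N - 1))  # F_{3,87} ~= 2.709 for N=30, k=4
    return tau_f, crit

def nemenyi_cd(k=4, N=30, q_alpha=2.569):
    # Critical difference of average ranks; q_alpha = 2.569 for k = 4, alpha = 0.05
    # (Demsar, 2006). Two methods differ significantly if their average ranks
    # differ by more than this value.
    return q_alpha * np.sqrt(k * (k + 1) / (6.0 * N))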

Fig. 12

Full resolution CIFAR10 images (\(3\times 32\times 32\)) in Fig. 1b of the manuscript. The red box denotes the region \(\Omega\) (\(3\times 8\times 8\)) in the two objectives in Sect. 5.1

Fig. 13

Sampling CIFAR-10 images with regions of different patch sizes

Fig. 14

Nemenyi post-hoc test over four methods

Appendix C: Discussions

The constrained MOO problem defines its decision space S as a constrained subset of \(\mathbb {R}^d\) expressed using specified linear, nonlinear, or box constraints (Afshari et al., 2019; Désidéri, 2018). It is therefore different from our MOG problem, whose feasible set is the manifold delineated by a given dataset \(\mathcal {X}\). Nevertheless, MOG problems could be understood as a type of constrained MOO problem in a broader context (Table 5).

Table 5 Comparison of the MOG problem with the relevant MOO problems

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Yao, Y., Pan, Y., Li, J. et al. PROUD: PaRetO-gUided diffusion model for multi-objective generation. Mach Learn 113, 6511–6538 (2024). https://doi.org/10.1007/s10994-024-06575-2


  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10994-024-06575-2

Keywords