$\textbf{CHA}_2$: CHemistry Aware Convex Hull Autoencoder Towards Inverse Molecular Design

Ghaemi, Mohammad Sajjad; Hu, Hang; Hu, Anguang; Ooi, Hsu Kiang

doi:10.1007/978-3-031-42608-7_3

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14236)

Included in the following conference series:

German Conference on Artificial Intelligence (Künstliche Intelligenz)


Abstract

Optimizing molecular design and discovering novel chemical structures that meet specific objectives, such as quantitative estimate of drug-likeness scores (QEDs), is NP-hard because of the vast combinatorial design space of discrete molecular structures. This makes it nearly impossible to explore the entire search space comprehensively and find de novo structures with properties of interest. To address this challenge, reducing the intractable search space to a lower-dimensional latent volume makes it more feasible to examine molecular candidates via inverse design. Autoencoders are a suitable deep learning technique for this: an encoder compresses the discrete molecular structure into a latent space, and a decoder maps points in that latent space back to molecular designs. The continuity of the latent space, which characterizes the discrete chemical structures, provides a flexible representation in which inverse design can discover novel molecules. However, exploring this latent space requires particular insight to generate new structures. We therefore propose using a convex hull (CH) surrounding the top molecules ranked by QED to enclose a tight subspace of the latent representation, as an efficient way to reveal novel molecules with high QEDs. We demonstrate the effectiveness of the proposed method by using QM9 as the training dataset, together with the Self-Referencing Embedded Strings (SELFIES) representation, to calibrate the autoencoder and carry out inverse molecular design that yields novel chemical structures.
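The convex hull idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how points inside the hull are drawn, so the code below makes an assumption: it forms random convex combinations of the top-scoring latent vectors using Dirichlet weights, which always fall inside their convex hull. The function name `sample_in_convex_hull`, the parameters `top_k` and `n_samples`, and the random toy data standing in for encoder outputs and QED scores are all hypothetical.

```python
import numpy as np

def sample_in_convex_hull(latents, scores, top_k=10, n_samples=5, seed=0):
    """Draw points inside the convex hull of the top_k highest-scoring latent vectors.

    Assumption: convex combinations with Dirichlet weights are used as the
    sampling scheme; the paper's actual procedure is not given in the abstract.
    """
    rng = np.random.default_rng(seed)
    latents = np.asarray(latents, dtype=float)
    scores = np.asarray(scores, dtype=float)

    # Indices of the top_k highest scores (stand-in for QED values).
    top_idx = np.argsort(scores)[-top_k:]
    vertices = latents[top_idx]  # shape (top_k, latent_dim)

    # Each weight row is non-negative and sums to 1, so weights @ vertices
    # is a convex combination and therefore lies inside the hull.
    weights = rng.dirichlet(np.ones(len(vertices)), size=n_samples)
    return weights @ vertices, vertices

# Toy data: random vectors stand in for encoder outputs and QED scores.
toy_rng = np.random.default_rng(42)
latents = toy_rng.normal(size=(100, 8))
scores = toy_rng.uniform(size=100)

samples, vertices = sample_in_convex_hull(latents, scores)
print(samples.shape)  # (5, 8)
```

In a full pipeline each sampled latent point would be passed through the trained decoder to obtain a SELFIES string; that step is omitted here because it needs a trained model. Dirichlet weights do not sample the hull uniformly by volume, so a different weighting may be preferable when coverage of the hull matters.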

This project is supported by the National Research Council Canada (NRC) and the Defence Research and Development Canada (DRDC).



Author information

Authors and Affiliations

National Research Council Canada, Toronto, ON, Canada
Mohammad Sajjad Ghaemi, Hang Hu & Hsu Kiang Ooi
Suffield Research Centre, DRDC, Alberta, Canada
Anguang Hu

Corresponding author

Correspondence to Mohammad Sajjad Ghaemi.

Editor information

Editors and Affiliations

Universität Würzburg, Würzburg, Germany
Dietmar Seipel
University of Greifswald, Greifswald, Germany
Alexander Steen

Cite this paper

Ghaemi, M.S., Hu, H., Hu, A., Ooi, H.K. (2023). $\textbf{CHA}_2$: CHemistry Aware Convex Hull Autoencoder Towards Inverse Molecular Design. In: Seipel, D., Steen, A. (eds) KI 2023: Advances in Artificial Intelligence. KI 2023. Lecture Notes in Computer Science, vol 14236. Springer, Cham. https://doi.org/10.1007/978-3-031-42608-7_3

DOI: https://doi.org/10.1007/978-3-031-42608-7_3
Published: 18 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-42607-0
Online ISBN: 978-3-031-42608-7
eBook Packages: Computer Science; Computer Science (R0); Springer Nature Proceedings Computer Science
