Generic Context-Aware Group Contributions

Flamm C, Hellmuth M, Merkle D, Nojgaard N, Stadler PF (2022)
IEEE/ACM Transactions on Computational Biology and Bioinformatics 19(1): 429-442.

Zeitschriftenaufsatz | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Flamm, Christoph; Hellmuth, Marc; Merkle, DanielUniBi ; Nojgaard, Nikolai; Stadler, Peter F.
Abstract / Bemerkung
Many properties of molecules vary systematically with changes in the structural formula and can thus be estimated from regression models defined on small structural building blocks, usually functional groups. Typically, such approaches are limited to a particular class of compounds and requires hand-curated lists of chemically plausible groups. This limits their use in particular in the context of generative approaches to explore large chemical spaces. Here we overcome this limitation by proposing a generic group contribution method that iteratively identifies significant regressors of increasing size. To this end, LASSO regression is used and the context-dependent contributions are “anchored” around a reference edge to reduce ambiguities and prevent overcounting due to multiple embeddings. We benchmark our approach, which is available as “Context AwaRe Group cOntribution” ( CARGO ), on artificial data, typical applications from chemical thermodynamics. As we shall see, this method yields stable results with accuracies comparable to other regression techniques. As a by-product, we obtain interpretable additive contributions for individual chemical bonds and correction terms depending on local contexts.
Erscheinungsjahr
2022
Zeitschriftentitel
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Band
19
Ausgabe
1
Seite(n)
429-442
ISSN
1545-5963
eISSN
1557-9964, 2374-0043
Page URI
https://pub.uni-bielefeld.de/record/2987396

Zitieren

Flamm C, Hellmuth M, Merkle D, Nojgaard N, Stadler PF. Generic Context-Aware Group Contributions. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2022;19(1):429-442.
Flamm, C., Hellmuth, M., Merkle, D., Nojgaard, N., & Stadler, P. F. (2022). Generic Context-Aware Group Contributions. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 19(1), 429-442. https://doi.org/10.1109/TCBB.2020.2998948
Flamm, Christoph, Hellmuth, Marc, Merkle, Daniel, Nojgaard, Nikolai, and Stadler, Peter F. 2022. “Generic Context-Aware Group Contributions”. IEEE/ACM Transactions on Computational Biology and Bioinformatics 19 (1): 429-442.
Flamm, C., Hellmuth, M., Merkle, D., Nojgaard, N., and Stadler, P. F. (2022). Generic Context-Aware Group Contributions. IEEE/ACM Transactions on Computational Biology and Bioinformatics 19, 429-442.
Flamm, C., et al., 2022. Generic Context-Aware Group Contributions. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 19(1), p 429-442.
C. Flamm, et al., “Generic Context-Aware Group Contributions”, IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 19, 2022, pp. 429-442.
Flamm, C., Hellmuth, M., Merkle, D., Nojgaard, N., Stadler, P.F.: Generic Context-Aware Group Contributions. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 19, 429-442 (2022).
Flamm, Christoph, Hellmuth, Marc, Merkle, Daniel, Nojgaard, Nikolai, and Stadler, Peter F. “Generic Context-Aware Group Contributions”. IEEE/ACM Transactions on Computational Biology and Bioinformatics 19.1 (2022): 429-442.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar