Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns
Böcker S, Letzel M, Lipták Z, Pervukhin A (2007) Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld.
Bielefeld: Technical Faculty, Bielefeld University.
Report
| Veröffentlicht | Englisch
Download
Autor*in
Böcker, Sebastian;
Letzel, MatthiasUniBi;
Lipták, ZsuzsannaUniBi;
Pervukhin, Anton
Einrichtung
Abstract / Bemerkung
Metabolites, small molecules that are intermediates and products of the metabolism, participate in almost all cellular processes such as signal transduction and stress response. There exist several thousand metabolites for every species, the overwhelming majority still being uncharacterized. Mass spectrometry has become a method of choice to analyze the metabolites of a cell. High resolution mass spectrometry allows us to determine the mass and isotopic distribution of sample molecules with outstanding accuracy. Here, we provide a method to determine the sum formula of an unidentified metabolite (or, more generally, any chemical compound) solely from its mass and isotopic pattern. This is a crucial step in the identification of an unknown metabolite, as it reduces its possible structures to a finite and, hopefully, manageable set.
In Part I, we show how to use integer decomposition techniques, introduced earlier by two of the authors, for decomposing real valued molecule masses, with large improvements over naïve methods that are currently best known for this problem. We then show how to rapidly match and rank simulated spectra against the measured spectrum. Our method is computationally efficient and can be applied to metabolites and other chemical compounds with mass up to 1000 Dalton. First results on experimental data indicate good identification rates for chemical compounds up to 700 Dalton.
In Part II, we present our method for rapid computation of isotope distributions and mean masses of isotope peaks, i.e., for simulation of isotopic spectra, improving on best-known results. Fast simulation of isotope patterns is vital due to the large search space. Above 1000 Dalton, however, the number of molecules with a certain mass increases rapidly. Since the size of the search space thus becomes prohibitive, generating all potential solutions, simulating their isotope patterns, and matching them against the input is often not feasible. Instead, we define several "additive invariants" extracted from the input and then propose to solve a "joint decomposition problem": Given a finite weighted alphabet with character masses {a_1,...,a_sigma} and a query "m", a "decomposition" of "m" is a non-negative integer vector (c_1,..., c_sigma) such that sum_i c_i a_i = m. Here, we have the problem of finding a "joint" decomposition "c" for a set of queries, where each query has to be decomposed over a different weighted alphabet. We present an efficient algorithm for producing all joint decompositions of the query vector and demonstrate its fitness on real data extracted from a metabolite database.
Stichworte
Metabolomics;
Efficient algorithms;
Mass spectrometry;
Integer decomposition;
Isotopic pattern
Erscheinungsjahr
2007
Serientitel
Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld
ISSN
0946-7831
Page URI
https://pub.uni-bielefeld.de/record/2308810
Zitieren
Böcker S, Letzel M, Lipták Z, Pervukhin A. Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns. Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld. Bielefeld: Technical Faculty, Bielefeld University; 2007.
Böcker, S., Letzel, M., Lipták, Z., & Pervukhin, A. (2007). Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns (Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld). Bielefeld: Technical Faculty, Bielefeld University.
Böcker, Sebastian, Letzel, Matthias, Lipták, Zsuzsanna, and Pervukhin, Anton. 2007. Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns. Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld. Bielefeld: Technical Faculty, Bielefeld University.
Böcker, S., Letzel, M., Lipták, Z., and Pervukhin, A. (2007). Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns. Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld, Bielefeld: Technical Faculty, Bielefeld University.
Böcker, S., et al., 2007. Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns, Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld, Bielefeld: Technical Faculty, Bielefeld University.
S. Böcker, et al., Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns, Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld, Bielefeld: Technical Faculty, Bielefeld University, 2007.
Böcker, S., Letzel, M., Lipták, Z., Pervukhin, A.: Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns. Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld. Technical Faculty, Bielefeld University, Bielefeld (2007).
Böcker, Sebastian, Letzel, Matthias, Lipták, Zsuzsanna, and Pervukhin, Anton. Identifying metabolites with integer decomposition techniques, using only their mass spectrometric isotope patterns. Bielefeld: Technical Faculty, Bielefeld University, 2007. Forschungsberichte der Technischen Fakultät, Abteilung Informationstechnik / Universität Bielefeld.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Copyright Statement:
Dieses Objekt ist durch das Urheberrecht und/oder verwandte Schutzrechte geschützt. [...]
Volltext(e)
Name
Access Level
Open Access
Zuletzt Hochgeladen
2019-09-06T08:57:52Z
MD5 Prüfsumme
bcaa9d797e33f17abf2e97b79aa7c271