The E. coli Metabolome Database (ECMDB) is a comprehensive, high-quality, freely accessible, online database of small molecule metabolites found in or produced by Escherichia coli. Escherichia coli is perhaps the best studied bacterium on earth and has served as the "model microbe" in microbiology research for more than 60 years. The ECMDB is essentially an E. coli "omics" encyclopedia containing detailed data on E. coli's genome, proteome and metabolome. It is part of a suite of organism-specific metabolomics databases that includes DrugBank, HMDB, YMDB and SMPDB. As a metabolomics resource, the ECMDB is designed to facilitate research in the areas of gut/microbiome metabolomics and environmental metabolomics.
The ECMDB contains two kinds of data: 1) chemical data and 2) molecular biology and/or biochemical data. The chemical data includes more than 2700 metabolite structures with detailed metabolite descriptions, along with nearly 5000 NMR, GC-MS and LC-MS spectra corresponding to these metabolites. The biochemical data includes nearly 1600 protein sequences and more than 3100 biochemical reactions that are linked to these metabolite entries. Each metabolite entry in the ECMDB contains more than 80 data fields, with approximately 65% of the information devoted to chemical data and the other 35% devoted to enzymatic or biochemical data. Many data fields are hyperlinked to other databases. The ECMDB also has a variety of structure and pathway viewing applets, and it offers a number of text, sequence, spectral, chemical structure and relational query searches. These are described in more detail below.
Accessing the database
The ECMDB's content may be explored or searched using a variety of database-specific tools. In addition to the text search box, the navigation menu offers seven options: 1) Home; 2) Browse; 3) Search; 4) About; 5) Help; 6) Downloads; and 7) Contact Us.
The ECMDB's browser provides six views: 1) Metabolite Browse; 2) Protein Browse; 3) Reaction Browse; 4) Pathway Browse; 5) Class Browse; and 6) Concentration Browse. Selecting a specific Browse option displays the ECMDB's content in a synoptic tabular format, with the ECMDB identifiers, names and other data shown in re-sortable tables. Clicking on an ECMDB MetaboCard or ProteinCard button brings up the full data content for the corresponding metabolite or protein.
The ECMDB's search options include: 1) Chem Query; 2) Text Query; 3) Sequence Search; 4) Data Extractor; and four other MS or NMR spectral search tools. The Chem Query option allows users to sketch or type a chemical compound and to search the ECMDB for metabolites similar or identical to the query compound. The Sequence Search can be used to perform BLAST sequence searches against all the protein sequences contained in the ECMDB; both single and multiple sequence BLAST queries are supported. It is also possible to perform detailed spectral searches of the ECMDB's reference compound NMR and MS spectral data through the MS, MS/MS, GC/MS and NMR Spectra Search links. These tools are intended to support the identification and characterization of bacterial metabolites using NMR spectroscopy, GC-MS and LC-MS.
The ECMDB also contains a large number of statistical tables, with detailed information not only about its content but also about E. coli in general. In particular, under the "About" tab, a section called "E. coli numbers and stats" contains hundreds of facts about E. coli and E. coli physiology. Many components of the ECMDB are fully downloadable, including most of the textual data, chemical structures and sequence data. These may be retrieved by clicking on the Download button, scrolling through the different files and selecting the appropriate hyperlinks.
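To make the idea of searching for metabolites "similar or identical to the query compound" more concrete, the sketch below ranks a small library by the Tanimoto coefficient, a widely used measure of overlap between structural feature sets. This is only an illustration: the text does not state which similarity measure Chem Query uses, and the compound names, feature labels, function names and the 0.5 threshold are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Tuple

# A "fingerprint" here is simply a set of structural feature labels.
Fingerprint = FrozenSet[str]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto coefficient: shared features divided by all features."""
    if not a and not b:
        return 1.0  # two empty fingerprints are treated as identical
    return len(a & b) / len(a | b)


def rank_by_similarity(
    query: Fingerprint,
    library: Dict[str, Fingerprint],
    threshold: float = 0.5,  # hypothetical cut-off, chosen for illustration
) -> List[Tuple[str, float]]:
    """Return (name, score) pairs at or above the threshold, best first."""
    scored = [(name, tanimoto(query, fp)) for name, fp in library.items()]
    hits = [(name, score) for name, score in scored if score >= threshold]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Hand-made toy feature sets; these are NOT real ECMDB records.
library = {
    "compound_A": frozenset({"carboxyl", "hydroxyl", "C3_chain"}),
    "compound_B": frozenset({"carboxyl", "amine", "C3_chain"}),
    "compound_C": frozenset({"phosphate", "ribose", "purine"}),
}
query = frozenset({"carboxyl", "hydroxyl", "C3_chain"})

hits = rank_by_similarity(query, library)
for name, score in hits:
    print(f"{name}: {score:.2f}")
```

A score of 1.0 corresponds to an identical feature set, so one ranked list can cover both the "identical" and "similar" cases; production tools typically derive fingerprints from the chemical structure itself rather than from hand-written labels.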
Scope and access
All data in ECMDB is non-proprietary or is derived from a non-proprietary source. It is freely accessible and available to anyone. In addition, nearly every data item is fully traceable and explicitly referenced to the original source. ECMDB data is available through a public web interface and downloads.