Skip to content


  • Database
  • Open Access
  • Open Peer Review

Comprehensive database of Chorismate synthase enzyme from shikimate pathway in pathogenic bacteria

BMC Pharmacology and Toxicology201314:29

  • Received: 27 August 2012
  • Accepted: 24 April 2013
  • Published:
Open Peer Review reports



Infectious diseases are major public health problem. It is increasingly affecting more than 50 million people worldwide. Targeting shikimate pathway could be efficiently used for the development of broad spectrum antimicrobial compound against variety of infectious diseases. Chorismate synthase is an enzyme in shikimate pathway that catalyzes Phosphoenol pyruvate to chorismate in most of the prokaryotic bacteria. This step is crucial for its growth, since Chorismate acts as a precursor molecule for the synthesis of aromatic amino acids. Hence, we present a comprehensive database of Chorismate Synthase Database (CSDB) which is a manually curated database. It provides information on the sequence, structure and biological activity of chorismate synthase from shikimate pathway of pathogenic bacteria. Design of suitable inhibitors for this enzyme, hence could be a probable solution to destroy its proteomic machinery and thereby inhibit the bacterial growth.


The aim of this study was to characterise chorismate synthase enzyme belonging to pathogenic bacteria to analysis the functional and structural characterization of chorismate synthase is very important for both structure-based and ligand based drug design.


The broad range of data easy to use user interface makes a useful database for researchers in designing drugs.


  • Biological database
  • Shikimate pathway
  • Chorismate synthase
  • Pathogenic bacteria
  • Drug design
  • IC50


Biomolecules databases, in general contain gene function, structure and localization of cell and chromosome. This also includes clinical effects of mutations, sequence and structural properties of proteins, domains, motifs and their functional roles in a protein and pathway information [1]. Targeting the seven enzymes of shikimate pathway could be an effective target for the development of antimicrobial and herbicidal compounds as it is a crucial pathway for synthesis of aromatic amino acid in bacteria and plants but not in mammals [27]. Chorismate synthase (CS) catalyzes conversion of 5- enolpyruvylshikimate 3-phosphate (EPSP) to chorismate, is the final step of shikimate pathway [3, 8]. It is also an essential precursor for the synthesis of p-aminobenzoic acid and folate [9].

Chorismate synthase also plays a remarkable role in the biosynthesis of nucleotides. The reaction of chorismate synthase is unique in nature, involves a 1, 4 elimination of phosphate and loss of proton of the C-6 hydrogen. The formation of two out of three necessary double bonds to build an aromatic amino acid is aided by CS and activity of this enzyme requires reduced FMN molecule which is not consumed during the reaction. In the elimination reaction the most accepted mechanism suggests a direct role of reduced FMN that transfers the electron to phosphate and the substrate donates an electron for the regeneration of FMN. Furthermore, the monofunctional form of chorismate synthase is found in plants and bacteria whereas in bi-functional it occurs in fungi [3].

Characterization of this pathway in bacteria was achieved largely by studying mutants lacking the individual enzyme activities. Shikimate pathway is essential in bacteria since enzymatic mutations in this pathway completely inhibit the growth in culture unless aromatic supplements are provided [10]. Studies of Barea and Giles showed that shikimate pathway in fungi play an essential role in synthesis of aromatic amino acid [11]. The genomic studies confirmed that this pathway could be efficiently used for the development of broad spectrum antimicrobial compound against variety of infectious diseases [10]. Earlier reports have shown that inhibition of one of the enzyme of shikimate pathway could efficiently treat the opportunistic pathogens such as Pneumocystis carinii, Mycobacterium tuberculosis, Cryptosporidium parvum and Toxoplasma gondii, which may simultaneously infect AIDS and other immune compromised patients [12].

A promising drug target for bacterial pathogenic diseases could be developed by blocking any enzymes of this pathway. Designing inhibitors for this reaction would greatly facilitate researchers to block multiple pathways essential for the survival of micro organism. The Chorismate Synthase Database provides data incorporating all the parameters required for the inhibition of chorismate synthase in 42 pathogenic bacterial species which is a potential drug target for blocking the shikimate pathway. A list of 48 inhibitors reported in literature with their IC50 and Ki values are shown in Table 1.
Table 1

List of chorismate synthase inhibitors

Sl no

Pathogen name

Inhibitors name

IC50 nM

Ki nM


Escherichia coli


500 and 250000



Salmonella typhimurium

2-(3-(-(S)-5-((S)-1-Amino-3-(3-chlorophenyl)-1-oxopropan-2-ylamino)-4-(3-hydroxy-4-methyl-2-nitrobenzamido)-5-oxopentylcarbamoyl)-phenoxy)acetic Acid



2-(3-(3-((R)-3-((S)-1-Amino-3-(3-chlorophenyl)-1-oxopropan-2-ylamino)-2-(3-hydroxy-4-methyl-2-nitrobenzamido)-3-oxopropylthio) propylcarbamoyl)phenoxy)acetic Acid


2500000 ± 120000



5700000 ± 1000000



6800000 ±2000000



(2E)-6,7-dihydroxy-2-[(2-hydroxy-4-pentoxyphenyl)methylidene]-1-benzofuran-3- one



(2E)-2-[(4-hexoxy-2-hydroxyphenyl)methylidene]-6, 7-dihydroxy-1-benzofuran-3-one



(2E)-2-[(4-butoxy-2-hydroxyphenyl)methylidene]-6, 7-dihydroxy-1-benzofuran-3-one



(2E)-6,7-dihydroxy-2-[(2-hydroxy-4-propoxyphenyl)methylidene]-1-benzofuran-3- one



(2E)-6,7-dihydroxy-2-[[2-hydroxy-4-(2-methylpropoxy)phenyl]methylidene]-1- benzofuran-3-one



ethyl 4-[4-[(E)-(6, 7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]-3-hydroxyphenoxy] butanoate






(2E)-6,7-dihydroxy-2-[(2-hydroxy-3-methoxyphenyl)methylidene]-1-benzofuran-3- one



(2E)-6,7-dihydroxy-2-[[2-hydroxy-4-(4-hydroxybutoxy)phenyl]methylidene]-1- benzofuran-3-one



(2E)-6,7-dihydroxy-2-[(2-hydroxy-4-propan-2-yloxyphenyl)methylidene]-1- benzofuran-3-one



(2E)-6,7-dihydroxy-2-[(2-hydroxy-4-phenylmethoxyphenyl)methylidene]-1- benzofuran-3-one



4-[4-[(E)-(6,7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]-3-hydroxyphenoxy] butanenitrile



4-[4-[(E)-(6,7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]-3-hydroxyphenoxy] butanoic acid






(2E)-6,7-dihydroxy-2-[[2-hydroxy-4-(6-hydroxyhexoxy)phenyl]methylidene]-1- benzofuran-3-one



(2E)-6,7-dihydroxy-2-[(2-hydroxy-4-methoxyphenyl)methylidene]-1-benzofuran-3- one



2-[2-[(E)-(6,7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]phenoxy]acetic acid






(2E)-6,7-dihydroxy-2-[[2-hydroxy-4-(3-hydroxypropoxy)phenyl]methylidene]-1- benzofuran-3-one



2-[4-[(E)-(6,7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]-3-hydroxyphenoxy]acetic acid



(2E)-2-[[4-[ethyl(2-hydroxyethyl)amino]-2-methylphenyl]methylidene]-6, 7-dihydroxy-1-benzofuran-3-one



























(2E)-6,7-dihydroxy-2-[[4-(trifluoromethyl)phenyl]methylidene]-1-benzofuran-3- one






ethyl 2-[4-[(E)-(6, 7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]-3-hydroxyphenoxy] acetate



2-[(E)-(6,7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]benzoic acid









(2E)-6,7-dihydroxy-2-[[2-(trifluoromethyl)phenyl]methylidene]-1-benzofuran-3- one



4-[(E)-(6,7-dihydroxy-3-oxo-1-benzofuran-2-ylidene)methyl]benzoic acid



(2E)-2-[[4-(dimethylamino)phenyl]methylidene]-6, 7-dihydroxy-1-benzofuran-3-one













[(2S,3R,4R)-5-(7,8-dimethyl-2,4-dioxobenzo[g]pteridin-10-yl)-2,3, 4-trihydroxypentyl] dihydrogen phosphate



Construction and content

Data sources and curation

The starting point for data curation in Chorismate Synthase Database is a manual curation of all publicly available sequence, structure and functional information for pathogens from UniProtKB [13, 14]. Other database identifiers (e.g. NCBI taxonomy codes, Gene Ontology classifications, InterPro and Pfam accessions, super family, SCOP, prosite, KEGG, Pubchem Substance, etc.,) were also imported apart from the literature references, annotations of sequence and structure features. CSDB taxonomy is derived from the NCBI taxonomy database.

The data in CSDB is organized into 7 fields (Figure 1) such as protein resources, gene annotations, features, gene and nucleotide sequence, pathways, molecular target, taxonomical ID and literature references. The classification of pathogenic bacteria used in CSDB is similar to that of the already available pathogenic bacteria listed in “Classification of Pathogenic Bacteria” available at the weblink ( Links are provided to access further information on the Pathogenic Bacteria, if present in external databases like Swiss-Prot, NCBI Taxonomy Browser, EMBL-EBI, Sanger institute, chemical database, PDB and Pubmed reference etc.,
Figure 1
Figure 1

Structure of Chorismate synthase database.

An extensive literature survey was carried out using PUBMED and MEDLINE to extract information about human diseases caused by various bacterial pathogens. Critical features related to chorismate synthase for each bacterial strain such as gene sequence, gene id, protein sequence in fasta format, domain and motif information were retrieved from domain and motif databases. The structure related information were retrieved from PDB, CATH, and SCOP, kinetic data from literature, pathway information from KEGG, and its Gene Ontology information were retrieved from GO database. A database was constructed using these information by integrating them appropriately in a flat file format.

The features of this database can be categorized in to three broad areas:
  1. 1.

    Query interface: The query interface is a collection of all the pathogenic bacteria with their strain information available in literature and relates to the disease it causes to humans.

  2. 2.

    Feature enrichment: Feature enrichment category is sequence annotation from well curated databases, multiple sequence alignment in chorismate synthase of all strains and 3D structure determination using Modeller v.9.10 and its validation using GNR plot.

  3. 3.

    External references/links: This category includes pathogenic organism database, Genome databases, Database of protein-protein interactions, Systems Biology pathways, Drug bank and Structure prediction servers.

The molecular modeling in this work was performed by the MODELLER version 9.10. The MODELLER program was completely automated to calculate comparative models for a large number of protein sequences, by using many different template structures and sequence-structure alignments [1517]. Sequence-structure matches are established by aligning SALIGN [18, 19]. Sequence profile of the target sequence against each of the template sequences extracted from PDB [14] (Figure 2).
Figure 2
Figure 2

Schematic workflow for homology modeling.

Database architecture

CSDB is built on Apache HTTP Server 2.2.11 with MySQL Server 5.1.36 as the back-end and PHP 5.3.0, HTML and JavaScript, CSS as the front-end. Apache, MySQL and PHP technology were preferred as they are open-source software’s and platform independent. Besides these advantages, MySQL is the most popular open source SQL (Structured Query Language) database over the internet. MySQL (Figure 3) is a relational database management system that works much faster which also supports multi-user and multi-threading. It can work both on Windows and Linux. It comes with Triggers, Cursors and stored procedures to improve the productivity of developers.
Figure 3
Figure 3

Schematic draw showing the Interaction of web client interface.

Utility and discussion

Data access

Data stored in CSDB can be accessed in the following ways: (i) Search options in CSDB: CSDB can be queried to obtain pathogen information. In order to facilitate this, simple search options or manual browse option have been provided in the ‘Search’ section.

Select pathogenic bacteria: the user can select pathogenic bacteria to obtain related information on bacteria. (Figure 4) illustrates the result of organism-based search).
Figure 4
Figure 4

Chorismate synthase database search section. (A) Organism based selection. (B) The list of proteins found in a selected organism. (C) The list of selected protein with their major features.

External links

External database links are provided in the web portal by using hyperlinks to other useful bioinformatics resources such as genome database, protein-protein interactions databases, system biology pathways, pathogenic organism databases, microarray databases, structure prediction server and GENE CARDS.


Users can submit their suggestions/comments/queries using this feature.


A detailed description on the use of the various features incorporated in CSDB is provided in this section for the benefit of users.

Future work

The resource will be updated constantly with further enhanced features. We also intend to add some bioinformatics tools on structural and sequence analysis in future versions. We would also like to extend this database for other pathogens.


The CSDB provides manually curated information on analysis of chorismate synthase in 42 pathogenic bacterial species. This database provides information useful for designing a drug in both ligand as well as structure based methods. For structure based drug design, information on the protein’s motif and Interpro’s/PFAM domain categorization are been added and 48 inhibitors with IC50/Ki values are made available for designing inhibitors using Ligand based drug design strategies [Table 1]. In addition to this, this database also contains information about the protein’s superfamily, SCOP IDs, GO IDs, active site residues pathway information using KEGG, taxonomy, and structural models using modeler 9.10. This facilitates their usage in drug design for researchers. This database is freely available at the website

Availability and requirements

CSDB is freely available at


CSDB database contents can be downloaded easily from the ‘Download Database’ section. Users can obtain the entire collection of ID’s at SQL format with a single mouse click.



We thank SRM University, Chennai and Apsara Innovations for their constant encourage and support.

Authors’ Affiliations

Department of Bioinformatics, School of Bioengineering, Faculty of Engineering and Technology, SRM University, Kattankulathur, 603203, Tamil, Nadu, India
Sr. Application Scientist, Apsara Innovations, #218/1, 2nd floor, Kammanahalli main road, 3rd block, Kalyana nagar, Bangalore, 560 043, India


  1. Altman RB: Building successful biological databases. Brief Bioinformatics. 2004, 5 (1): 4-5. 10.1093/bib/5.1.4.View ArticlePubMedGoogle Scholar
  2. Bentley R: The shikimate pathway a metabolic tree with many branches. Crit Rev Biochem Mol Biol. 1990, 25: 307-384. 10.3109/10409239009090615.View ArticlePubMedGoogle Scholar
  3. Macheroux P, Bomemann S, Ghisla S, Roger NF, Thrneley : Studies with Flavin Analogs Provide Evidence That a Protonated Reduced FMN is the Substrate induced Transient Intermediate in the Reaction of Escherichia coli Chorismate Synthase. J Biol Chem. 1996, 271 (42): 25858-25858.Google Scholar
  4. Roberts CW, Finnerty J, Johnson JJ, Roberts F, Kyle DE, Krell T, Coggins JR, Coombs GH, Milhous WK, Tzipori S, Ferguson DJP, Chakrabarti D, McLeod R: Evidence for the shikimate pathway in apicomplexan parasites. Nature. 1998, 393: 801-805. 10.1038/31723.View ArticlePubMedGoogle Scholar
  5. Gibson F, Pittard J: Pathways of biosynthesis of aromatic amino acids and vitamins and their control in micro organisms. Bacteriol Rev. 1968, 32: 465-492.PubMedPubMed CentralGoogle Scholar
  6. Kishore GM, Shah DM: Amino acid biosynthesis inhibitors as herbicides. Annu Rev Biochem. 1988, 57: 627-663. 10.1146/ ArticlePubMedGoogle Scholar
  7. Baillie A, Orbett JR, Dowsett JR, McCloskey P: Inhibitors of shikimate dehydrogenase as potential herbicides. Pesticide Sci. 1972, 3: 113-120. 10.1002/ps.2780030202.View ArticleGoogle Scholar
  8. Payne DJ, Wallis NG, Gentry DG, Rosenberg M: The impact of genomics on novel antibacterial targets. Curr Opin Drug Discovery Dev. 2000, 3: 177-190.Google Scholar
  9. Ratledge C: Nutrition, growth and metabolism. The Biology of the Mycobacteria Volume 1. 1982, London: Academic, 185-271.Google Scholar
  10. De JJ, Voos K, Rutter BG, Schroder H, Su , Zhu Y, Barry CE: The salicylate derived mycobactinsiderophores of Mycobacterium tuberculosis are essential for growth in macrophages. Proc Natl Acad Sci USA. 2000, 97: 1252-1257. 10.1073/pnas.97.3.1252.View ArticleGoogle Scholar
  11. Barea JL, Giles NH: Purification and characterization of quinate (shikimate) dehydrogenase, an enzyme in the inducible quinic acid catabolic pathway of Neurospora crassa. Biochim Biophys Acta. 1978, 524 (1): 1-14. 10.1016/0005-2744(78)90097-9.View ArticlePubMedGoogle Scholar
  12. Roberts CW, Roberts F, Lyons RE, Kirisits MJ, Mui EJ, Finnerty J, Johnson JJ, Ferguson DJP, Coggins JR, Krell T: The shikimate pathway and its branches in apicomplexan parasites. J Infect Dis. 2002, 185: S25-S36. 10.1086/338004.View ArticlePubMedGoogle Scholar
  13. Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O’Donovan , Phan I: The SWISS-PROT protein knowledgebase and its supplement TrEMBL. Nucleic Acids Res. 2003, 31: 365-370. 10.1093/nar/gkg095.View ArticlePubMedPubMed CentralGoogle Scholar
  14. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissing H, Shindyalov IN, Bourne PE: The Protein Data bank. Nucleic Acids Res. 2000, 28: 235-242. 10.1093/nar/28.1.235.View ArticlePubMedPubMed CentralGoogle Scholar
  15. Sanchez R, Sali A: Large-scale protein structure modeling of the saccharomyces cerevisiae genome. Proc Natl Acad Sci USA. 1998, 95: 13597-13602. 10.1073/pnas.95.23.13597.View ArticlePubMedPubMed CentralGoogle Scholar
  16. Marti-Renom MA, Stuart AC, Fiser A, Sanchez R, Melo F, Sali : Comparative protein structure modeling of genes and genomes. Annu Rev Biophys Biomol Struct. 2000, 29: 291-325. 10.1146/annurev.biophys.29.1.291.View ArticlePubMedGoogle Scholar
  17. Pieper U, Eswar N, Braberg H, Madhusudhan MS, Davis FP, Stuart AC, Mirkovic N, Rossi A, Marti-Renom MA, Fiser A, Webb B, Greenblatt D, Huang CC, Ferrin TE, Sali A: ModBase a database of annotated comparative protein structure models. Nucleic Acids Res. 2002, 30: 255-259. 10.1093/nar/30.1.255.View ArticlePubMedPubMed CentralGoogle Scholar
  18. Arcuri HA, Zafalon GFD, Marucci EA, Bonalumi CE, da Silveira NJF, Machado JM, de Azevedo WF, Palma MS: SKPDB: a structural database of shikimate pathway enzymes. BMC Bioinformatics. 2010, 12:  -10.1186/1471-2105-11-12.Google Scholar
  19. Needleman S, Wunsch C: A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970, 48: 443-453. 10.1016/0022-2836(70)90057-4.View ArticlePubMedGoogle Scholar
  20. Pre-publication history

    1. The pre-publication history for this paper can be accessed here:


© Pitchandi et al.; licensee BioMed Central Ltd. 2013

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.