- Research article
- Open Open Peer Review
Differences between murine arylamine N-acetyltransferase type 1 and human arylamine N-acetyltransferase type 2 defined by substrate specificity and inhibitor binding
BMC Pharmacology and Toxicologyvolume 15, Article number: 68 (2014)
The mouse has three arylamine N-acetyltransferase genes, (MOUSE)Nat1, (MOUSE)Nat2 and (MOUSE)Nat3. These are believed to correspond to (HUMAN)NAT1, (HUMAN)NAT2 and NATP in humans. (MOUSE)Nat3 encodes an enzyme with poor activity and human NATP is a pseudogene. (MOUSE)Nat2 is orthologous to (HUMAN)NAT1 and their corresponding proteins are functionally similar, but the relationship between (MOUSE)Nat1 and (HUMAN)NAT2 is less clear-cut.
To determine whether the (MOUSE)NAT1 and (HUMAN)NAT2 enzymes are functionally equivalent, we expressed and purified (MOUSE)NAT1*1 and analysed its substrate specificity using a panel of arylamines and hydrazines. To understand how specific residues contribute to substrate selectivity, three site-directed mutants of (MOUSE)NAT2*1 were prepared: these were (MOUSE)NAT2_F125S, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L. All three exhibited diminished activity towards “(MOUSE)NAT2-specific” arylamines but were more active against hydrazines than (MOUSE)NAT1*1. The inhibitory and colorimetric properties of a selective naphthoquinone inhibitor of (HUMAN)NAT1 and (MOUSE)NAT2 were investigated.
Comparing (MOUSE)NAT1*1 with other mammalian NAT enzymes demonstrated that the substrate profiles of (MOUSE)NAT1 and (HUMAN)NAT2 are less similar than previously believed. Three key residues (F125, R127 and Y129) in (HUMAN)NAT1*4 and (MOUSE)NAT2*1 were required for enzyme inhibition and the associated colour change on naphthoquinone binding. In silico modelling of selective ligands into the appropriate NAT active sites further implicated these residues in substrate and inhibitor specificity in mouse and human NAT isoenzymes.
Three non-catalytic residues within (HUMAN)NAT1*4 (F125, R127 and Y129) contribute both to substrate recognition and inhibitor binding by participating in distinctive intermolecular interactions and maintaining the steric conformation of the catalytic pocket. These active site residues contribute to the definition of substrate and inhibitor selectivity, an understanding of which is essential for facilitating the design of second generation (HUMAN)NAT1-selective inhibitors for diagnostic, prognostic and therapeutic purposes. In particular, since the expression of (HUMAN)NAT1 is related to the development and progression of oestrogen-receptor-positive breast cancer, these structure-based tools will facilitate the ongoing design of candidate compounds for use in (HUMAN)NAT1-positive breast tumours.
Arylamine N- acetyltransferases (NATs, EC. 22.214.171.124)  are drug metabolising enzymes which catalyse the conjugation of an acetyl group from acetyl Coenzyme A (AcCoA) to arylamines, hydrazines and N- hydroxyarylamines. They participate in the detoxification and metabolic activation of xenobiotics and are found in a wide variety of prokaryotic and eukaryotic species .
The human genome contains two polymorphic NAT genes, (HUMAN)NAT1 and (HUMAN)NAT2[3, 4], which play important pharmacogenetic roles in cancer susceptibility and have the potential to contribute to personalised medicine [5, 6]. The corresponding enzymes possess distinct substrate profiles: HUMAN(NAT1) preferentially N- acetylates the arylamines 4-aminobenzoic acid (4ABA), 4-aminobenzoyl glutamate (4ABglu) and 4-aminosalicylate (4AS), whereas (HUMAN)NAT2 has higher activity towards the hydrazines isoniazid (INH) and hydralazine (HDZ) and the arylamine sulfamethazine (SMZ) [7, 8]. The crystal structures of human NATs  resemble the three-domain conformation observed in prokaryotic NATs . Like their prokaryotic counterparts, each has a catalytic triad comprising cysteine (C), histidine (H) and aspartic acid (D) at the active site; they also contain an additional loop of 17 residues which many prokaryotic enzymes lack. These observations helped to lay the foundations for studies addressing the structural determinants of their catalytic selectivity .
Human NATs are characterised by differing tissue distributions and patterns of gene expression during development [12–14]. In particular, while (HUMAN)NAT2 appears to be a well-established drug metabolising enzyme and is mainly expressed in the liver , the selective N-acetylation of the folate catabolites 4ABglu and 4ABA by (HUMAN)NAT1 suggests that this isoform may have an endogenous role in folate homeostasis [16, 17]. Furthermore, recent studies indicate that (HUMAN)NAT1 has a novel catalytic function as a folate-dependent AcCoA hydrolase .
Rodents also have multiple NAT isoforms which have been investigated as animal models for the corresponding human enzymes. Mice, for example, have three NAT genes, (MOUSE)Nat1, (MOUSE)Nat2 and (MOUSE)Nat3. All of these are polymorphic; the (MOUSE)Nat3 gene exhibits the greatest extent of polymorphism and may effectively be considered a pseudogene like (HUMAN)NAT3, although it does have enzymatic activity [20, 21].
Historically, (MOUSE)NAT2 has been considered to correspond to (HUMAN)NAT1 because their primary sequences are 82% homologous and they exhibit similarities in terms of expression profile, tissue distribution and substrate preference [8, 22–24]. Furthermore, inhibition studies have identified a (HUMAN)NAT1/(MOUSE)NAT2 selective inhibitor, naphthoquinone 1 (N-(3-(3,5-dimethylphenylamino)-1,4-dioxo-1,4-dihydronaphthalen-2-yl)benzenesulphonamide; Additional file 1: Figure S1), which undergoes a distinctive colour change from red to blue upon binding to either of these two enzymes but not to other mammalian NAT isoenzymes [24, 25]. Virtual modelling studies [25, 26] indicate that the interaction between naphthoquinone 1 and (HUMAN)NAT1 or (MOUSE)NAT2 depends on selective ionic interactions between the conjugate base of compound 1 and the guanidinium moiety of an active site residue, arginine 127 (R127). This is part of a group of active site residues which had previously been identified as being involved, together with phenylalanine 125 (F125) and tyrosine 129 (Y129), in substrate specificity [27, 28]. These residues differ between (HUMAN)NAT1*4 and (HUMAN)NAT2*4 and between (MOUSE)NAT1*1 and (MOUSE)NAT2*1, while the catalytic triad of C, H and D is identical.
The (MOUSE)NAT1 enzyme is commonly assumed to be functionally equivalent to (HUMAN)NAT2 because both can metabolise isoniazid (INH) and sulfamethazine (SMZ) [29–31]. In order to determine whether these enzymes are true functional equivalents, we used a previously reported (MOUSE)Nat1*1 clone  to express and purify the (MOUSE)NAT1*1 enzyme and compared its substrate profile with those of other rodent and human NAT enzymes using a broad panel of aromatic amines and hydrazines. In addition, we used three site-directed mutants ((MOUSE)NAT2_F125S, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L) to investigate the effects of key active site residues on the substrate specificity of (MOUSE)NAT2. In the present study we focused on residues 125 and 127; the role of the Y129 residue found in (HUMAN)NAT1 and (MOUSE)NAT2, at least with respect to inhibitor binding, has previously been investigated using (MESAU)NAT2, which has identical active site residues except for a leucine (L) at location 129 . The results of this earlier study  suggest that Y129 is functionally important, at least in inhibitor recognition, and illustrate the value of (MESAU)NAT2 as a protein model for comparative studies with (HUMAN)NAT1 and (MOUSE)NAT2.
Finally, we modelled the binding of representative arylamine substrates within the active sites of reference and mutant mammalian NATs in order to elucidate key interactions within the NAT/substrate complex. The identification of (HUMAN)NAT1 as a potential therapeutic target in cancer means that understanding the molecular details of this series of enzymes in humans and potential animal models is important for their potential exploitation in both diagnostics  and therapy .
Chemicals and reagents
All chemicals were purchased from Sigma-Aldrich unless otherwise stated. Molecular biology reagents were obtained from Promega.
Expression of pure recombinant NATs
Expression and purification of (MOUSE)NAT1*1
The open reading frame of (MOUSE)Nat1*1 was subcloned into the Nde I and Eco RI sites of pET28b(+) (Novagen) and transformed into Escherichia coli BL21(DE3)CodonPlus-RIL (Stratagene). BL21 cells carrying the expression plasmid were grown in auto-induction medium at 27°C in the presence of kanamycin (30 μg.mL−1) and chloramphenicol (34 μg.mL−1) and harvested after 24 hrs by centrifugation (5,000 g, 4°C, 20 min). The cell pellet was resuspended in lysis buffer (300 mM NaCl, 20 mM Tris–HCl (pH 8.0), 1 × EDTA-free Complete Protease Inhibitor (Roche)) and stored at −80°C. Cells were thawed, lysed by sonication and the soluble protein fraction was separated from cell debris by centrifugation (12,000 g, 4°C, 20 min). The soluble fraction was incubated with pre-equilibrated Ni-NTA resin (Qiagen) for 5 min, loaded on a chromatography column and washed with buffer solutions containing increasing imidazole concentrations at 4°C (two washes each of 0 mM, 10 mM, 20 mM, 50 mM and 100 mM imidazole). Fractions containing (MOUSE)NAT1*1 were identified by sodium dodecyl sulphate-polyacrylamide gel electrophoresis and NAT activity assays using 5-aminosalicylate (5AS) . The hexa-His tag was removed by thrombin cleavage (5 U thrombin/mg of protein, 16 h incubation, 4°C) and the cleaved protein was dialysed against 20 mM Tris–HCl (pH 8.0), 1 mM DTT, 1 mM EDTA buffer. Glycerol was added to 5%. Samples were concentrated by centrifugal ultrafiltration (Amicon), snap frozen in liquid nitrogen and stored at −80°C (1 mg.mL−1 in 20 mM Tris–HCl (pH 7.0), 5 mM DTT, 5% glycerol).
Site-directed mutagenesis, expression and purification of (MOUSE)NAT2_F125S
QuikChange II (Stratagene) was used to mutate codon 125 (TTT; F125) of (MOUSE)Nat2*1 to TCT, which encodes serine (F125S) (Additional file 2: Figure S2). The resulting mutant was expressed in Rosetta(DE3)pLysS as described previously [22, 25]. (MOUSE)NAT2_F125S was purified by Ni-NTA affinity chromatography and thrombin cleavage, as described. Fractions containing mutant (MOUSE)NAT2_F125S protein were identified by sodium dodecyl sulphate-polyacrylamide gel electrophoresis and NAT activity assays using 4ABA .
Preparation of pure recombinant mammalian NATs
Substrate and inhibitor selectivity and spectrometry of (MOUSE)NAT1*1 and three (MOUSE)NAT2 mutants
Substrate specific N-acetylation profiles were determined according to published methods with minor modifications [8, 36]. Recombinant proteins were used immediately after purification. The quantity of each protein used in assay mixes (100 μL) was: (MOUSE)NAT1*1, 0.633 μg; (MOUSE)NAT2*1, 1.4 μg; (MOUSE)NAT2_F125S, 1 μg; (MOUSE)NAT2_R127G, 0.8 μg; and (MOUSE)NAT2_R127L, 5.5 μg. The substrates used were 4ABglu, 4ABA, 4AS, 5AS, 4-chloroaniline (4CA), 4-bromoaniline (4BA), 4-iodoaniline (4IA), 4-methoxyaniline (ANS), 4-aminoveratrole (4AV), 4-hexyloxyaniline (HOA), 4-phenoxyaniline (POA), SMZ, INH and HDZ. In each case the final concentration in the reaction mix was 500 μM. All measurements were performed in triplicate and are expressed as mean ± standard deviation. For each enzyme tested, average measures of specific activity were calculated relative to the substrate exhibiting the highest specific activity (100%) for that enzyme.
In order to highlight differences in substrate preference among murine NAT proteins, the substrate profiles of (MOUSE)NAT2*1, (MOUSE)NAT1*1, (MOUSE)NAT2_F125S, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L were compared by ANOVA (confidence limit 95%) using the Shapiro-Wilk test to verify normal distribution and Cochran’s test to check the equality of variances. Statistical significance was evaluated using Student’s t-test with a Bonferroni adjustment if required.
Inhibition of NAT activity was determined as the ratio of specific activity in the presence and absence of the inhibitor naphthoquinone 1 using the preferred substrate for the enzyme under investigation. The final concentration of dimethyl sulphoxide (DMSO) in reaction mixes was 5% (v/v). IC50 values were estimated using a dose–response-based regression model in Kyplot® software. Curves were fitted by least squares analysis with confidence limits of 95%. Visible spectra (λ = 800 to 326 nm) of naphthoquinone 1 in the presence of pure recombinant NATs were recorded with a U-2001 spectrophotometer (Hitachi) using 50 μL UVettes® (Eppendorf).
Structural modelling and docking simulations
Structural models of wild type (MOUSE)NAT2*1, (MOUSE)NAT2 mutants and (MESAU)NAT2*1 were generated based on the structure of (HUMAN)NAT1*4 [PDB:2PQT]  using SwissModel software (http://swissmodel.expasy.org/) in automated mode [37–39]. Each substrate was drawn in 3D using ChemBio3D Ultra 12.0 and its ground state conformation predicted before it was docked into the catalytic pocket of the appropriate NAT structure ((HUMAN)NAT1*4: 2PQT, (HUMAN)NAT2*4: 2PFR ) or NAT model ((MOUSE)NAT2*1, (MOUSE)NAT2 mutants and (MESAU)NAT2*1). Protein and substrate structures were defined as pdbqt files and protein-structure interactions were analysed using Autodock Vina . After adding polar hydrogen atoms to the NAT target and defining the rotatable bonds of the ligand, a docking site was defined within the active site pocket and possible solutions were ranked according to affinity energy (lowest to highest; kcal.mol−1). The docking results were visualised in 3D using PyMOL .
Substrate selectivity of (MOUSE)NAT1*1 and (MOUSE)NAT2*1
We expressed (MOUSE)NAT1*1 in E. coli Rosetta (DE3)pLysS and used the resulting recombinant protein to determine the activity of (MOUSE)NAT1*1 towards a panel of chemicals commonly used as NAT substrates (Additional file 3: Table S1). The highest specific activities were with the hydrazines INH and HDZ and the arylamine SMZ (Figure 1). A parallel screening experiment with (MOUSE)NAT2*1 yielded results corresponding to those published previously . The differences between (MOUSE)NAT1*1 and (MOUSE)NAT2*1 for most substrates were statistically significant. (MOUSE)NAT1*1 did, however, have low but measurable activity towards certain arylamine substrates which are usually considered to be (MOUSE)NAT2-specific (4ABA and 4AS but not 4ABglu) and towards halogenated arylamines and alkyloxy- and aryloxy-substituted arylamines.
Site-directed mutagenesis of (MOUSE)NAT2
Three (MOUSE)NAT2 mutants ((MOUSE)NAT2_F125S, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L) were used to explore the effects of mutating a single residue within the active site on the substrate specificity of (MOUSE)NAT2. Two of these, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L, were reported previously [8, 22]. The third, (MOUSE)NAT2_F125S was generated by site-directed mutagenesis, as described above and illustrated in Additional file 2: Figure S2.
The substrate panel listed in Additional file 3: Table S1 was used to characterise the activity profiles of the three mutants in comparison with that of (MOUSE)NAT2*1. The maximum specific activity of (MOUSE)NAT2_F125S (observed with EOA) was around 7.5 times lower than the maximal value of (MOUSE)NAT2*1 (observed with 4ABA), but the substrate preferences of (MOUSE)NAT2_F125S and (MOUSE)NAT2*1 were similar. Some discrepancies were, however, observed: the main differences observed were marked reductions in the relative rate of N- acetylation of the arylamines 4ABglu and 4ABA and an augmentation of activities towards the hydrazines INH and HDZ.
Substituting R127 with glycine (R127G) or leucine (R127L) caused a dramatic loss of N- acetylation activity towards 4ABA and 4ABglu (Figure 2). Aminosalicylic acids were differentially N- acetylated by (MOUSE)NAT2*1 and its R127 mutants: (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L had significantly lower N- acetylation activity towards the (MOUSE)NAT2-specific substrate 4AS, but higher catalytic activity with 5AS compared with (MOUSE)NAT2*1. These mutations had no marked effects on the N- acetylation of halogenated arylamines and alkyloxy-substituted arylamines with small alkyl chains (ANS, 4AV), but those with extended alkyloxy or bulkier aryloxy substituents (hexyloxy in HOA and phenoxy in POA) were strongly preferred by both R127 mutants compared with (MOUSE)NAT2*1. The mutants also had much higher N- acetylation activity towards hydrazines such as HDZ.
Comparison of the substrate profiles of the two R127 mutants did not identify significant differences in their activity profiles. Overall, the best substrate for (MOUSE)NAT2_R127G was HDZ. The observed specific activity (1,385 μM.sec−1.mg−1) was 7 fold higher than the maximum specific activity of (MOUSE)NAT2*1 with 4ABA (196 μM.sec−1.mg−1), suggesting that (MOUSE)NAT2_R127G mutant had greater catalytic efficiency than (MOUSE)NAT2*1. In contrast, comparing the maximum specific activity value of (MOUSE)NAT2*1 (196 μM.sec−1.mg−1 with 4ABA) with that of (MOUSE)NAT2_R127L (82.5 μM.sec−1.mg−1 with 4AV) indicated that the maximum catalytic turnover of (MOUSE)NAT2_R127L was ~2.5 fold lower than that of (MOUSE)NAT2*1.
Comparison of substrate selectivity among mammalian NATs
The assays described here were performed under the conditions described in our previous publications [8, 22]. We were therefore able to compare previously published substrate selectivity profiles of (HUMAN)NAT1*4, (HUMAN)NAT2*4 and (MESAU)NAT2*1 with those of (MOUSE)NAT1*1, (MOUSE)NAT2*1 and the three engineered (MOUSE)NAT2 mutants reported here (Table 1). In Table 1, the NAT enzymes are arranged according to divergence of the active site sequence from that of (HUMAN)NAT1*4 (Figure 3)  and plotted against arylamine and hydrazine substrates organised according to their chemical functionalities and physicochemical properties. Overall, as the active site diverged further from that of (HUMAN)NAT1*4, substrate preference moved from arylamines with negatively charged para-substituents towards aromatic amines and hydrazines without ionised, polar or para-substituents around the aromatic core. In particular, 4ABglu, 4ABA and 4AS have an acetyl acceptor amine with pKaH <3 (Additional file 3: Table S1) and are subject to N-acetylation by (HUMAN)NAT1*4 and its homologues (MOUSE)NAT2*1 and (MESAU)NAT2*1. The other substrates have an acceptor amine with pKaH ≥3. These are better substrates for (HUMAN)NAT2*4, (MOUSE)NAT1*1 and the two R127-mutated (MOUSE)NAT2 enzymes.
In general, the results obtained corresponded with previous reports of functional similarities between (HUMAN)NAT1*4, (MOUSE)NAT2*1 and (MESAU)NAT2*1 [8, 22]; however, the substrate profile of (MOUSE)NAT1*1 did not, as expected, correspond to that of (HUMAN)NAT2*4 [29, 31]. Indeed, (MOUSE)NAT1*1 was able to N- acetylate substrates characteristic of both (HUMAN)NAT1*4 (4ABA and 4AS) and (HUMAN)NAT2*4 (SMZ, INH and HDZ).
The inhibitory potency and colorimetric properties of the (HUMAN)NAT1-selective inhibitor naphthoquinone 1 were also explored in relation to alterations at positions 125, 127 and 129 using (MOUSE)NAT2_F125S, (MOUSE)NAT1*1 and (HUMAN)NAT2*4, comparing the results with those of previous studies using (HUMAN)NAT1*4, (MESAU)NAT2*1, (MOUSE)NAT2*1, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L [24–26] (Table 2). The lowest IC50 values for naphthoquinone 1 were exhibited by (HUMAN)NAT1*4 and (MOUSE)NAT2*1 enzymes, both of which possess the triad F125, R127 and Y129. Naphthoquinone 1 was at least an order of magnitude more potent against (HUMAN)NAT1*4 and (MOUSE)NAT2*1 (IC50 values of 5.3 and 1.3 μM, respectively) than the other NATs used in this study. When pure recombinant (MOUSE)NAT2_F125S was incubated with naphthoquinone 1 under the conditions reported previously, it did not shift the λmax of naphthoquinone 1 towards longer wavelengths (585–625 nm), in contrast to previous observations with (HUMAN)NAT1*4 and (MOUSE)NAT2*1 (Additional file 4: Figure S3, Table 2).
In silico analysis of substrate selectivity in mammalian NATs
The structural features underlying substrate selectivity were investigated by in silico analysis of interactions between the NAT proteins and their arylamine substrates.
Structural models of (MOUSE)NAT2*1 and (MESAU)NAT2*1 were generated using the crystal structure of (HUMAN)NAT1*4 [PDB:2PQT]  because these three isoforms have >80% amino acid identity (Additional file 5: Table S2). Swiss-Model simulations generated models of (MOUSE)NAT2*1 and (MESAU)NAT2*1 which were of high quality according to their scoring functions and background noise, thus confirming the reliability of the template used. The mutations (MOUSE)NAT2_F125S, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L did not abolish the catalytic reactivity compared with the reference enzyme, so each individual residue modification was considered unlikely to have altered overall protein folding. The same modelling procedure was therefore used to create structural models of (MOUSE)NAT2_F125S, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L. However, attempts to model (MOUSE)NAT1*1 (64% identity; Additional file 5: Table S2) based on the structure of (HUMAN)NAT2*4 [PDB:2PFR]  did not yield reliable results.
Substrates were docked into the catalytic site of each enzyme as shown in the selectivity profile summarised in Table 1. 4ABglu was docked in the active sites of (HUMAN)NAT1*4, (MOUSE)NAT2*1, (MESAU)NAT2*1 and (MOUSE)NAT2_F125S (Figure 4). The affinity energies of all these simulations were low (−8.0/-7.0 kcal.mol−1) and probable polar and hydrophobic interactions were revealed within the enzyme-substrate complexes. For example, the amide functionality of 4ABglu could point towards the guanidinium of R127 via hydrogen bonds (<3.2 Å); the Cγ carboxylic group of glutamate could form a hydrogen bridge with the hydroxyl tail of Y129 (<3.1 Å) in (HUMAN)NAT1*4, (MOUSE)NAT2*1 and (MOUSE)NAT2_F125S; and/or the aromatic substrate core could interact via π-π stacking (<3.8 Å) with the hydrophobic plane defined by the isopropyl moiety of valine 93 (V93) and the phenyl ring of F125 in all the reference enzymes. The results obtained with 4ABA support the postulated interactions inferred from 4ABglu, suggesting a polar interaction between the carboxylate of 4ABA and the guanidinium of R127 at pH 8.0 (~3.6 Å) and hydrophobic stacking of the aromatic portion of 4ABA on the apolar flat surface defined by the side chains of V93 and F125 (~4 Å) (Additional file 6: Figure S4).
We also attempted to model the selective hydrazine substrates INH and HDZ within the active site of (HUMAN)NAT2*4 and the (MOUSE)NAT2 mutants. No proximity between the hydrazine functionality and C68 thiolate compatible with the NAT catalytic mechanism  was observed, possibly because of the smaller steric size of the substrate docked. However, when a bulkier selective arylamine substrate such as POA was docked into (HUMAN)NAT2*4 and the three (MOUSE)NAT2 mutants, the results indicated proximity between the primary amine of the substrate and the key active site residues C68 and H106 (<4 Å) (Figure 5). These simulations indicated that the 4-aminoarene of POA could be sandwiched between the benzyl ring of F217 and the apolar side chain of F93 in (HUMAN)NAT2*4 and F125 in (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L via hydrophobic interactions. In the (MOUSE)NAT2 mutants, the phenoxy group of POA was predicted to make further π-stacking interactions with the side chain of Y129 (~3.9 Å), whereas the equivalent residue in (HUMAN)NAT2*4, S129, is incapable of this interaction.
The present study has extended our understanding of (MOUSE)NAT1 by demonstrating that it is functionally different from (HUMAN)NAT2. In addition to its known activity in N-acetylating the hydrazine INH and the arylamine SMZ [29, 30], (MOUSE)NAT1*1 was shown to have significant activity towards the hydrazine HDZ and two arylamine substrates which were previously considered to be (HUMAN)NAT1 and (MOUSE)NAT2-specific (4ABA and 4AS).
The substrate-binding interactions involved in N-acetylation were previously characterised using nuclear magnetic resonance [33, 45], demonstrating that an arylamine or hydrazine could bind to a non-acetylated NAT active site. In the present study, arylamine substrates were docked within the catalytic pockets of different non-acetylated NATs in silico in order to identify possible selective interactions underlying the formation of the enzyme-substrate complex.
In (HUMAN)NAT1*4, the positively charged guanidinium (at pH 8.0) of R127 plays an essential role in recognising the negatively charged carboxylate of 4ABA, as illustrated by modelling 4ABA (Additional file 6: Figure S4) or 4AS  within the active site. The significant N-acetylation activity of (MOUSE)NAT1*1 towards 4ABA, 4AS and ANS in the present study was difficult to reconcile with the presence of an apolar G127 instead of an R127. However, it is possible that the presence of a sterically smaller residue (G) at this position in (MOUSE)NAT1*1 permits bulky or highly polar substrates to enter and allows their charged para-substituents to interact with other polar side chains in the active site (e.g. Y125 and Y129). Similarly, essential π-π stacking interactions observed between the aromatic core of 4ABA and F125, as seen in (HUMAN)NAT1*4, may be permitted by the side chain arene of Y125 within the active site of (MOUSE)NAT1*1. The lack of a structural model suitable for docking studies and the restricted commonality of substrate selectivity between (MOUSE)NAT1*1 and either of the two human NATs precluded the construction of a structural model for (MOUSE)NAT1*1, so this hypothesis could not be tested.
When 4ABglu was docked into the structure of (HUMAN)NAT1*4 and models of (MOUSE)NAT2*1 and (MESAU)NAT2*1, the essential role of R127 in forming ionic interactions with the electron-negative para-substituent of the substrate was evident; this may explain why a bulky charged arylamine such as 4ABglu can enter a small, very hydrophobic microenvironment such as the active site crevice of (HUMAN)NAT1*4 . The preference of (HUMAN)NAT1*4 and its homologues (MOUSE)NAT2*1 and (MESAU)NAT2*1 for arylamine substrates with a negatively charged para-substituent (e.g. 4ABA and 4AS) may therefore be due to the positively charged guanidinium moiety of R127.
Previous studies have shown that mutation of F125 to S125 modifies the catalytic preference of (HUMAN)NAT1*4 from the conventional probe substrate 4AS to SMZ . When we examined the effect of the equivalent mutation on (MOUSE)NAT2, no such shift in substrate preference was observed; neither did the general substrate preferences of (MOUSE)NAT2 change to resemble those of (MOUSE)NAT1*1. The higher reactivity of (MOUSE)NAT2_F125S with conformationally flexible hydrazines than with planar arylamines could be associated with increased active site space in (MOUSE)NAT2*1 after the substitution of the bulky benzyl group of F with the less hindering hydroxymethyl of S.
Site-directed mutagenesis of R127 to G127 or L127 within (MOUSE)NAT2 markedly altered the enzyme’s substrate selectivity. Overall, the non-polar side chains of G and L appeared to have similar effects in terms of modifying the substrate preferences of (MOUSE)NAT2: the metabolism of 4ABglu and 4ABA was dramatically decreased whereas N-acetylation of the hydrazines INH and HDZ, which are commonly used as probe substrates for (MOUSE)NAT1 and (HUMAN)NAT2, was augmented. R127 seemed to play a crucial role in discriminating between arylamines with highly hydrophilic substituents and hydrazines with less polar aromatic functional groups. Moreover, the (MOUSE)NAT2_R127G mutation improved the overall catalytic activity of the enzyme, possibly due to the larger size of the active site cavity generated after substitution. Similarly, when the arylamine POA was docked into the active sites of (MOUSE)NAT2_F125S, (MOUSE)NAT2_R127G and (MOUSE)NAT2_R127L, the results suggested that accommodation of this substrate via stacking interactions is facilitated by the larger cavity created by these single substitutions. When POA was docked into the active pocket of (HUMAN)NAT2*4, the results of substrate fitting were very similar to previous modelling results obtained with SMZ , consistent with the preference of (HUMAN)NAT2*4 for apolar and flexible substrates .
The individual (MOUSE)NAT2 mutants tested did not affect the specificity of (MOUSE)NAT2 for SMZ, which is a very poor substrate for this isoform. This may be related to the need for a larger active site to accommodate SMZ, which is a bulky arylamine; a single amino acid change may not be sufficient to permit access to the active site. Further 3D structural data and thermal stability studies on the mutants would help to ascertain the extent of the folding perturbations produced by each mutation. However, it is also intriguing that small hydrazines such as INH and phenylhydrazine are poor substrates for (HUMAN)NAT1 and its rodent homologues [8, 22], whereas arylamines of similar steric size such as 4ABA, 4AS and ANS are good substrates.
While the amine nitrogen and the aromatic carbons of an arylamine substrate are on the same plane, crystallographic studies of INH, HDZ and phenylhydrazine as single molecules [46–50] and protein co-crystallised ligands [51, 52], showed that the hydrazine bond could also be off the plane defined by the remaining aryl and acyl carbon atoms, thereby giving the hydrazine substrate a non-planar conformation. Our docking simulations using the crystal structure of (HUMAN)NAT1*4 and the model of (MOUSE)NAT2*1 show that the bulky side chain of R127, the benzyl ring of F125 and the 4-hydroxybenzyl of Y129 create a characteristic constrained microenvironment which appears to allow preferable accommodation of planar arylamines rather than conformationally flexible hydrazines in the active site of (HUMAN)NAT1*4. This is consistent with the volumes of the active sites in human NAT structures: the (HUMAN)NAT1*4 cavity has a volume of 162 Å3, whereas (HUMAN)NAT2*4 (with S at positions 125, 127 and 129) has a larger active pocket (257 Å3). In this context it is also noteworthy that the crystal structures of prokaryotic NATs from Mycobacterium tuberculosis, M. marinum and M. smegmatis, which preferentially N-acetylate conformationally flexible hydrazine substrates, have much larger active sites (>200 Å3) than (HUMAN)NAT1*4 [53–55].
Recent studies have suggested that F125, R127 and Y129 in the active sites of (HUMAN)NAT1*4 and (MOUSE)NAT2*1 are essential for selective recognition and binding of the potent inhibitor naphthoquinone 1[24–26]. In particular, ionic recognition between the conjugate base of naphthoquinone 1 and the guanidinium moiety of R127 is known to be essential for inhibitor binding. The results shown here confirm that these key residues are required for the interaction with naphthoquinone 1, since the absence of any of them results in a protein which is resistant to inhibition and colour shifting in response to naphthoquinone 1 (Table 2).
Overall, our results suggest that the substrate selectivity of the mammalian NATs investigated in this study is influenced by two major factors; firstly, the planarity of the acceptor arylamine and the polarity of its para substituent (which appear to affect both the type of the intermolecular interactions within the enzyme and the pKaH strength and nucleophilicity of the acceptor amine); and secondly, the size of the catalytic cavity and its overall polarity, as determined by the residues at positions 125, 127 and 129. In particular, F125 in (HUMAN)NAT1*4 and (MOUSE)NAT2*1 appears to play a key role in discriminating between planar arylamines and conformationally flexible hydrazines, while the ionised (at physiological pH) side chain of R127 perturbs the characteristic hydrophobicity of the NAT active site and, in cooperation with the 4-hydroxybenzyl side chain of Y129, contributes to the narrowness of the (HUMAN)NAT1*4 active site.
In conclusion, this evaluation of the substrate profiles of various native and engineered mammalian NATs in the context of their structures has highlighted the features which influence NAT substrate selectivity in mammals. The virtual models of mammalian NATs generated in this study, used in conjunction with X-ray structures of human NATs, constitute a rich resource for investigating the roles of particular residues within the NAT active site in relation to both NAT activity and inhibitor selectivity. Three non-catalytic residues within (HUMAN)NAT1*4 (F125, R127 and Y129) contribute both to substrate recognition and inhibitor binding by participating in distinctive intermolecular interactions and maintaining the steric conformation of the catalytic pocket. These active site residues contribute to the definition of substrate and inhibitor selectivity, an understanding of which is essential for facilitating the design of second generation (HUMAN)NAT1-selective inhibitors for diagnostic, prognostic and therapeutic purposes. In particular, since the expression of (HUMAN)NAT1 is related to the development and progression of oestrogen-receptor-positive breast cancer, these structure-based tools will facilitate the ongoing design of candidate compounds for use in (HUMAN)NAT1-positive breast tumours.
Chemical compounds studied in this article
Aminobenzoyl glutamate (PubChem CID: 5103842), 4-aminobenzoic acid (PubChem CID: 978), 4-aminosalicylate (PubChem CID: 4649), 5-aminosalicylate (PubChem CID: 4075), 4-chloroaniline (PubChem CID: 7812), 4-methoxyaniline (PubChem CID: 7732), 4-phenoxyaniline (PubChem CID: 8764), hydralazine (PubChem CID: 3637), isoniazid (PubChem CID: 3767), sulfamethazine (PubChem CID: 5327).
Acetyl coenzyme A
Arylamine N- acetyltransferase
Weber WW, Hein DW: N- acetylation pharmacogenetics. Pharmacol Rev. 1985, 37: 25-79.
Sim E, Abuhammad A, Ryan A: Arylamine N- acetyltransferases: from drug metabolism and pharmacogenetics to drug discovery. Br J Pharmacol. 2014, 171: 2705-2725. 10.1111/bph.12598.
Blum M, Grant DM, McBride W, Heim M, Meyer UA: Human arylamine N- acetyltransferase genes: isolation, chromosomal localization, and functional expression. DNA Cell Biol. 1990, 9: 193-203. 10.1089/dna.1990.9.193.
Hickman D, Risch A, Buckle V, Spurr NK, Jeremiah SJ, McCarthy A, Sim E: Chromosomal localization of human genes for arylamine N- acetyltransferase. Biochem J. 1994, 297 (Pt 3): 441-445.
Hein DW: N- acetyltransferase SNPs: emerging concepts serve as a paradigm for understanding complexities of personalized medicine. Expert Opin Drug Metab Toxicol. 2009, 5: 353-366. 10.1517/17425250902877698.
Hein DW, Doll MA, Fretland AJ, Leff MA, Webb SJ, Xiao GH, Devanaboyina US, Nangju NA, Feng Y: Molecular genetics and epidemiology of the NAT1 and NAT2 acetylation polymorphisms. Cancer Epidemiol Biomarkers Prev. 2000, 9: 29-42.
Ohsako S, Deguchi T: Cloning and expression of cDNAs for polymorphic and monomorphic arylamine N- acetyltransferases from human liver. J Biol Chem. 1990, 265: 4630-4634.
Kawamura A, Graham J, Mushtaq A, Tsiftsoglou SA, Vath GM, Hanna PE, Wagner CR, Sim E: Eukaryotic arylamine N- acetyltransferase: investigation of substrate specificity by high-throughput screening. Biochem Pharmacol. 2005, 69: 347-359. 10.1016/j.bcp.2004.09.014.
Wu H, Dombrovsky L, Tempel W, Martin F, Loppnau P, Goodfellow GH, Grant DM, Plotnikov AN: Structural basis of substrate-binding specificity of human arylamine N- acetyltransferases. J Biol Chem. 2007, 282: 30189-30197. 10.1074/jbc.M704138200.
Sim E, Lack N, Wang CJ, Long H, Westwood I, Fullam E, Kawamura A: Arylamine N- acetyltransferases: structural and functional implications of polymorphisms. Toxicology. 2008, 254: 170-183. 10.1016/j.tox.2008.08.022.
Zhou X, Ma Z, Dong D, Wu B: Arylamine N- acetyltransferases: a structural perspective. Br J Pharmacol. 2013, 169: 748-760. 10.1111/bph.12182.
Butcher NJ, Arulpragasam A, Pope C, Minchin RF: Identification of a minimal promoter sequence for the human N- acetyltransferase type I gene that binds AP-1 (activator protein 1) and YY-1 (Yin and Yang 1). Biochem J. 2003, 376: 441-448. 10.1042/BJ20030650.
Minchin RF, Hanna PE, Dupret JM, Wagner CR, Rodrigues-Lima F, Butcher NJ: Arylamine N- acetyltransferase I. Int J Biochem Cell Biol. 2007, 39: 1999-2005. 10.1016/j.biocel.2006.12.006.
Sim E, Walters K, Boukouvala S: Arylamine N- acetyltransferases: from structure to function. Drug Metab Rev. 2008, 40: 479-510. 10.1080/03602530802186603.
Boukouvala S, Fakis G: Arylamine N- acetyltransferases: what we learn from genes and genomes. Drug Metab Rev. 2005, 37: 511-564. 10.1080/03602530500251204.
Minchin RF: Acetylation of p- aminobenzoylglutamate, a folic acid catabolite, by recombinant human arylamine N- acetyltransferase and U937 cells. Biochem J. 1995, 307 (Pt 1): 1-3.
Ward A, Summers MJ, Sim E: Purification of recombinant human N- acetyltransferase type 1 (NAT1) expressed in E. coli and characterization of its potential role in folate metabolism. Biochem Pharmacol. 1995, 49: 1759-1767. 10.1016/0006-2952(95)00087-G.
Laurieri N, Dairou J, Egleton JE, Stanley LA, Russell AJ, Dupret JM, Sim E, Rodrigues-Lima F: From arylamine N- Acetyltransferase to folate-dependent acetyl CoA hydrolase: impact of folic acid on the activity of (HUMAN)NAT1 and its homologue (MOUSE)NAT2. PLoS One. 2014, 9: e96370-10.1371/journal.pone.0096370.
Sugamori KS, Brenneman D, Wong S, Gaedigk A, Yu V, Abramovici H, Rozmahel R, Grant DM: Effect of arylamine acetyltransferase Nat3 gene knockout on N- acetylation in the mouse. Drug Metab Dispos. 2007, 35: 1064-1070. 10.1124/dmd.107.015396.
Boukouvala S, Price N, Sim E: Identification and functional characterization of novel polymorphisms associated with the genes for arylamine N- acetyltransferases in mice. Pharmacogenetics. 2002, 12: 385-394.
Fretland AJ, Doll MA, Gray K, Feng Y, Hein DW: Cloning, sequencing, and recombinant expression of NAT1, NAT2, and NAT3 derived from the C3H/HeJ (rapid) and A/HeJ (slow) acetylator inbred mouse: functional characterization of the activation and deactivation of aromatic amine carcinogens. Toxicol Appl Pharmacol. 1997, 142: 360-366.
Kawamura A, Westwood I, Wakefield L, Long H, Zhang N, Walters K, Redfield C, Sim E: Mouse N- acetyltransferase type 2, the homologue of human N- acetyltransferase type 1. Biochem Pharmacol. 2008, 75: 1550-1560.
Stanley LA, Mills IG, Sim E: Localization of polymorphic N- acetyltransferase (NAT2) in tissues of inbred mice. Pharmacogenetics. 1997, 7 (2): 121-130.
Laurieri N, Crawford MH, Kawamura A, Westwood IM, Robinson J, Fletcher AM, Davies SG, Sim E, Russell AJ: Small molecule colorimetric probes for specific detection of human arylamine N- acetyltransferase 1, a potential breast cancer biomarker. J Am Chem Soc. 2010, 132: 3238-3239.
Laurieri N, Egleton JE, Varney A, Thinnes CC, Quevedo CE, Seden PT, Thompson S, Rodrigues-Lima F, Dairou J, Dupret JM, Russell AJ, Sim E: A novel color change mechanism for breast cancer biomarker detection: naphthoquinones as specific ligands of human arylamine N- acetyltransferase 1. PLoS One. 2013, 8: e70600-
Egleton JE, Thinnes CC, Seden PT, Laurieri N, Lee SP, Hadavizadeh KS, Measures AR, Jones AM, Thompson S, Varney A, Wynne GM, Ryan A, Sim E, Russell AJ: Structure–activity relationships and colorimetric properties of specific probes for the putative cancer biomarker human arylamine N- acetyltransferase 1. Bioorg Med Chem. 2014, 22: 3030-3054.
Goodfellow GH, Dupret JM, Grant DM: Identification of amino acids imparting acceptor substrate selectivity to human arylamine acetyltransferases NAT1 and NAT2. Biochem J. 2000, 348 (Pt 1): 159-166.
Dupret JM, Goodfellow GH, Janezic SA, Grant DM: Structure-function studies of human arylamine N- acetyltransferases NAT1 and NAT2. Functional analysis of recombinant NAT1/NAT2 chimeras expressed in Escherichia coli. J Biol Chem. 1994, 269: 26830-26835.
Estrada-Rodgers L, Levy GN, Weber WW: Substrate selectivity of mouse N- acetyltransferases 1, 2, and 3 expressed in COS-1 cells. Drug Metab Dispos. 1998, 26: 502-505.
Martell KJ, Levy GN, Weber WW: Cloned mouse N- acetyltransferases: enzymatic properties of expressed Nat-1 and Nat-2 gene products. Mol Pharmacol. 1992, 42: 265-272.
Sugamori KS, Wong S, Gaedigk A, Yu V, Abramovici H, Rozmahel R, Grant DM: Generation and functional characterization of arylamine N- acetyltransferase Nat1/Nat2 double-knockout mice. Mol Pharmacol. 2003, 64: 170-179.
Kelly SL, Sim E: Arylamine N- acetyltransferase in Balb/c mice: identification of a novel mouse isoenzyme by cloning and expression in vitro. Biochem J. 1994, 302 (Pt 2): 347-353.
Zhang N, Liu L, Liu F, Wagner CR, Hanna PE, Walters KJ: NMR-based model reveals the structural determinants of mammalian arylamine N- acetyltransferase substrate specificity. J Mol Biol. 2006, 363: 188-200.
Butcher NJ, Minchin RF: Arylamine N- acetyltransferase 1: a novel drug target in cancer development. Pharmacol Rev. 2012, 64: 147-165.
Andres HH, Klem AJ, Szabo SM, Weber WW: New spectrophotometric and radiochemical assays for acetyl-CoA: arylamine N- acetyltransferase applicable to a variety of arylamines. Anal Biochem. 1985, 145: 367-375.
Brooke EW, Davies SG, Mulvaney AW, Pompeo F, Sim E, Vickers RJ: An approach to identifying novel substrates of bacterial arylamine N- acetyltransferases. Bioorg Med Chem. 2003, 11: 1227-1234.
Arnold K, Bordoli L, Kopp J, Schwede T: The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics. 2006, 22: 195-201.
Kiefer F, Arnold K, Kunzli M, Bordoli L, Schwede T: The SWISS-MODEL repository and associated resources. Nucleic Acids Res. 2009, 37: D387-D392.
Peitsch MC: Protein modeling by E-Mail. Bio-Technol. 1995, 13: 658-660.
Trott O, Olson AJ: AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem. 2010, 31: 455-461.
DeLano WL: PyMOL: An open-source molecular graphics tool. DeLano Scientific, San Carlos, California, USA. 2002,http://www.ccp4.ac.uk/newsletters/newsletter40/11_pymol.pdf,
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680.
Gouet P, Courcelle E, Stuart DI, Metoz F: ESPript: analysis of multiple sequence alignments in PostScript. Bioinformatics. 1999, 15: 305-308.
Westwood IM, Kawamura A, Fullam E, Russell AJ, Davies SG, Sim E: Structure and mechanism of arylamine N- acetyltransferases. Curr Top Med Chem. 2006, 6: 1641-1654.
Delgoda R, Lian LY, Sandy J, Sim E: NMR investigation of the catalytic mechanism of arylamine N- acetyltransferase from Salmonella typhimurium. Biochim Biophys Acta. 2003, 1620: 8-14.
Abboud KA, Smith DW, Wagener KB: Structure of cyclo-1,1′,4,4′-bis(1,1,3,3-tetramethyl-1,3-disiloxanediyl)dibenzene. Acta Crystallogr C. 1993, 49: 1845-1848.
Borba A, Gomez-Zavaglia A, Fausto R: Molecular structure, infrared spectra, and photochemistry of isoniazid under cryogenic conditions. J Phys Chem A. 2009, 113: 9220-9230.
Srinivasan S, Swaminathan S: The crystal structure of phenyl hydrazine, C6H5 · NH · NH2. Zeitschrift für Kristallographie. 1968, 127: 442-449.
Okabe N, Fukuda H, Nakamura T: Structure of hydralazine hydrochloride. Acta Crystallogr C. 1993, 49: 1844-1845.
Lemmerer A: Covalent assistance to supramolecular synthesis: modifying the drug functionality of the antituberculosis API isoniazid in situ during co-crystallization with GRAS and API compounds. Cryst Eng Comm. 2012, 14: 2465-2478.
Abuhammad AM, Lowe ED, Fullam E, Noble M, Garman EF, Sim E: Probing the architecture of the Mycobacterium marinum arylamine N- acetyltransferase active site. Protein Cell. 2010, 1: 384-392.
Sandy J, Mushtaq A, Kawamura A, Sinclair J, Sim E, Noble M: The structure of arylamine N-acetyltransferase from Mycobacterium smegmatis–an enzyme which inactivates the anti-tubercular drug, isoniazid. J Mol Biol. 2002, 318: 1071-1083.
Sinclair JC, Sandy J, Delgoda R, Sim E, Noble ME: Structure of arylamine N- acetyltransferase reveals a catalytic triad. Nat Struct Biol. 2000, 7: 560-564.
Fullam E, Westwood IM, Anderton MC, Lowe ED, Sim E, Noble ME: Divergence of cofactor recognition across evolution: coenzyme A binding in a prokaryotic arylamine N- acetyltransferase. J Mol Biol. 2008, 375: 178-191.
Abuhammad A, Lowe ED, McDonough MA, Shaw Stewart PD, Kolek SA, Sim E, Garman EF: Structure of arylamine N- acetyltransferase from Mycobacterium tuberculosis determined by cross-seeding with the homologous protein from M. marinum: triumph over adversity. Acta Crystallogr D Biol Crystallogr. 2013, 69: 1433-1446.
Szakacs Z, Noszal B: Determination of dissociation constants of folic acid, methotrexate, and other photolabile pteridines by pressure-assisted capillary electrophoresis. Electrophoresis. 2006, 27: 3399-3409.
Brown HC, McDaniel DH, Haflinger P: Determination of Organic Structures by Physical Methods. 1955, New York: Academic Press
Nobilis M, Vybiralova Z, Sladkova K, Lisa M, Holcapek M, Kvetina J: High-performance liquid-chromatographic determination of 5-aminosalicylic acid and its metabolites in blood plasma. J Chromatogr A. 2006, 1119: 299-308.
Lukács M, Barcsa G, Kovács-Hadady K: The effects of pH, ionic strength and buffer concentration of mobile phase on R F of acidic compounds in ion-pair TLC. Chromatographia. 1998, 48: 511-516.
Machado JD, Gomez JF, Betancor G, Camacho M, Brioso MA, Borges R: Hydralazine reduces the quantal size of secretory events by displacement of catecholamines from adrenomedullary chromaffin secretory vesicles. Circ Res. 2002, 91: 830-836.
Becker C, Dressman JB, Amidon GL, Junginger HE, Kopp S, Midha KK, Shah VP, Stavchansky S, Barends DM: Biowaiver monographs for immediate release solid oral dosage forms: isoniazid. J Pharm Sci. 2007, 96: 522-531.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/2050-6511/15/68/prepub
Financial support for this work was provided by Servier. NL held a Cancer Research UK Cancer Medicinal Chemistry Studentship and AJR was the recipient of a Research Councils UK fellowship. The authors thank Dr. Ali Ryan and Hilary Long for technical support and helpful discussions.
The authors declare that they have no competing interests.
The authors contributed to this manuscript as follows: NL, AK, IMW and ES conceived and designed the experiments; NL, AK, IMW, AV and EM performed the experiments; NL, AK, IMW, AM, EM, AJR, LAS and ES analyzed the data; AJR and ES contributed reagents, materials and/or analysis tools; and NL, LAS and ES wrote the manuscript. All authors read and approved the final manuscript.