Extensible Systematic Force Field Study of Photoactive Protein Nitrile Hydratase

Karina KUBIAK , Bartek Dobrzelecki and Wieslaw NOWAK

Institute of Physics, N. Copernicus University, ul. Grudziadzka 5, 87-100 Torun, Poland
Contents:
  • Abstract
  • Introduction
  • Methods
  • Results and Discussion
  • Conclusions
  • References

  • Abstract

    Keywords: Nitrile Hydratase, Molecular Dynamics, ESFF, Biotechnology

    Nitrile Hydratase (Nhase, EC. 4.2.1.84) is an enzyme used in biotechnological industrial production of acrylamide. Recently its structure has been elucidated [1]. The active site containing non-heme iron is buried in the protein core just at the interface of two domains. Hydrogen bonds between Arg56 (subunit B) and Cys114 sulfenic acid (subunit A) are important to maintain the activity of the protein. In order to propose a plausible mechanism of the catalytic activity possible structural changes of the active site should be studied [2]. We have recently found that the iron may undergo "inverted doming" after the nitric oxide ligand photodissociation [3]. In this work results of the ESFF modeling of two domain Nhase are presented. In particular, an interaction of 1,4-dioxane with the entry channel is analyzed. The molecular dynamics study using the DISCOVER module of InsightII (Accelrys) may provide data for calculations of substrat/product reaction paths in this important enzyme. Preliminary results of 1ns NHase dynamics obtained using MOIL forcefield [4] are also presented.

     

    Introduction.

    Biotechnology bings a great promise for a new ways of doing chemistry. In particular industrial production of certain chemicals may be achived without charmful and expensive processes. A good example of the advantages of using microorganisms in chemistry is production of acrylamide. Japaneese companies use bacteria Rhodococcus Sp. for routine production of kilotons of this commodity. The reaction (see Fig. 1) is catalysed by the protein nitrile hydratase (NHase).

    Fig. 1

    Fig.1. Reaction catalyzed by nitrile hydratase.

    The molecular structure of NHase from R. Sp. N-771 has been recently solved [1] - Fig. 2. The crystals of the inactive form of the enzyme was obtained thanks to applying a dioxane molecule as a molecular "flap" that kept the inactivating factor nitric oxide (NO) in a close proximity of the iron (FEIII) center. The interesting observation was that this ligand is possibly stabilized by the so called "oxygen claws" [1]. The Cys112 and Cys114 residues coordinating the Fe center located on the border between A and B subunits of NHase, were found to be oxidized to sufinic and sulfenic acids (CSD112, CEA114), respectively. Studies of model systems of NHase are quite common [2,3], since the mechanism of its catalytic activity is not known.

    Fig. 2

    Fig.2 NHase structure.

    In order to get better insight into dynamical features of the protein, MD simulations are often used [4]. So far only a few theoretical studies of NHase were published. Boone et al. [5] calculated electronic properties and structure of a series of NHase models using the PUHF INDO/S method. Nowak et al. [3] optimized structures of 5- and 6-coordinated NHase active site models using the DFT method. The authors observed "inverted doming" upon NO ligand binding to the iron center. Quantum-chemical calculations did not reveal any additional stablization of the NO molecule related to the oxidized cysteine ligands. Since no data on molecular dynamics (MD) simulations of NHase are avaliable, we perfomed such study of NHase using the Extensible Systematic Force Field (ESFF) [6] and the MOIL [4] forcefields. The preliminary results are presented in this paper.

     

    Methods

    The initial structure of NHase was adopted from PDB - file 2AHJ.

    The ESFF forcefield (see for example [ 7 and references therein] )as implemented in InsightII v. 97.0 package [6] was chosen for its ease in parametization. It is a diagonal forcefield, with Morse bond potentials and based on density functional theory calculations of atomic electronegativity and hardness. The charges are determined in complex way, by minimizing the electrostatic energy under the constraints that the sum of charges is equal to the net charge of the molecule. In our model the system have a charge of -23 since no counterions were used on this stage of the study. The ESFF uses the 6-9 potential for the van der Waals interactions. In this forcefield, van der Waals parameters are affected by the charge of the atom. An important part of modeling is correctly setting up all the bond orders in the system under study.

    All crystallographic water molecules were retained in the model (>200). Hydrogen atoms were added using the "Biopolymer" module of the InsightII package. The Builder module was used to substitute Cys112 and Cys114 for CSD and CEA. One should note, that standard ESFF s4l and s3d sulfur types not necessary describe correctly the geometry of the active site.

    For molecular dynamics (MD) simulations, a small cutoff of 9.5 Ang. was used. The structure was subject to 200 steps of minimization before the 15 ps heating stage and 50 ps quillibration MD runs. A dielectric constant of 1 was used through all computations. All bond lengths were kept constant using RATTLE algorithm. The simulation timestep was 1 fs. The initial temperature was 10 K. During the dynamics simulations it was kept constant at 298 K by a velocity scaling. The sampling data were collected through 5 ps, each 0.1 ps, and analyzed using InsightII 97.0. Calculations were performed on the SGI R5000 workstation.

    The MOIL model of Nitrile hydratase was construted in the following way. The crystal structure of an inactive form of nitrile hydratase isolated from Rhodococcus sp.N-771 (2ahj.pdb, resolution 1.70A) [1] was obtained from the Protein Data Bank. All minimizations, heatings, equilibrations, dynamics runs and the analysis were performed using the package MOIL (ver. 7.1) [4] and Moil-View (ver. 9.0) [4a] software running on a Silicon Graphic Indy R5000 computer. The active site of Nhase contains four residues: Cys109, CSD112 (Cys-SO2H), Ser113 and CEA114 (Cys-SOH). The central atom, Fe(III) is coordinated by sulfur atoms of Cys109, CSD112 and CEA114, and two other coordinating atoms are amide nitrogens of Ser113 and CEA114. The sixth coordinating position is occupated by nitrogen atom of nitric oxide (NO). In order to perform molecular dynamics simmulation we prepared a new, non-standard residue called BFE (BigFe), representing the Fe-type active site of Nhase. This residue contained: Val108, Cys109, Ser110, Leu111, CSD112, Ser113, CEA114, Thr115, and the Fe(III) atom and was built by adding 68 new atom types. We described all properties of those atoms, eg. masses, charges and van der Waals parameters, and in order to construct from them the BFE residue, we have defined 71 bonds, 109 bond angles, 43 torsions and 31 improper torsions. Masses and van der Waals parameters of atoms were standardi (i.e. as used in the MOIL forcefield), charges of atoms of Val108, Ser110, Leu111 and Thr115 were standard too, but charges of atoms belonging to CSD112, Ser113 and CEA114 were the same as we the Mulliken charges caculated with the DFT method. These calculations we presented elsewere [3].

    MOIL MD protocol.

    Initially hydrogen atoms were addedto the bare pdb the Nhase molecule. Crystallographics waters were retained. A system was subject to 3000 steps of minimization by the Powell method [4]. Subsequently the heating (10-300K) step was done through 20ps and then equilibration at 300K through 30 ps. Data were collected each 100 steps through 950 ps period of simulation of the dynamics. We kept the constant temperature by the velocity scaling. Cutoff distances for electrostatic and Van der Waals interactions were of 12 and 7 Ang., respectively. To reduce a time step to 1 fs all bonds in the protein and all bonds and angles in water molecules were kept fixed by the SHAKE protocol.

     

    Results and Discussion

    The ESFF forcefield

    It is known that the RMS value is a proper tool to estimate the stability of protein structure during a simulation. Thus in the first step of analysis the dynamical behaviour of the nitrile hydratase model we checked the RMS value vs. time (Fig.3). As expeceted, in such rather short simulation a full equillibration is not achieved, but the RMS tend to be rather stable.

    Fig. 3

    Fig.3. The RMS value for CA atoms calculated during 5ps period of dynamics.

    Since the RMS is not higher than 1 Ang. we could postulate that the structure of the NHase remains stable during the dynamics in the ESFF forcefield. However the analyzed trajectory was too short to conclude that the geometry of NHase is maintained in the dynamic and/or catalytic activity. In order to descript possible changes in the protein structure and particular in the active site futher computations are required.

    Important information about catalytic mechanism of NHase could be provide by analysis of position of nitric oxide (NO) and dioxane (DIO) molecules in the active site. The substrates in catalytic reaction have similar size to the dioxane molecule. The interaction with the entry channel residues is not perhaps as strong as observed in DIO, otherwise catalytic cycle would not be possible. However, since the catalytic site is located almost on the border between alpha and betha subunits, a large conformational change - functionally important in the catalytic cycle - can not be ruled out. As we noticed, the NO molecule in the starting conformation was quite close to the iron ion, the distance between DIO molecule and FE was much higher. During the equilibration procedure (50 ps) the relative positions of NO and DIO i have switched, so NO molecule wasn't kept in the active site by the DIO molecule - see Fig.4. We were monitoring the distance between NO molecule and FE ion, we found that this value was increasing non-uniformly (data not shown) and we expect that in a longer simulation the nitric oxide will leave the active site region. Details of the NO diffusion path will be presented elsewere.

    Water is involved in postulated mechanism of the catalysis. We were looking for water molecules having locations close to the iron site. Such water might occupate the 6'th coordinate position after the dissociation of the NO molecule. Since in the radius of about 4 to 6 Ang. from the iron atom we did not find any water molecule (data not shown), perhaps DIO molecule effectively blocks an access to the active site for larger molecules. One should note that our statistics is very limited, so in longer trajectories some water presence in close proximity of Fe may occur, especially in the active forms of NHase.

    Fig. 4

    Fig.4. The environment of the DIO molecule. Distances to the nearest neighbours are also indicated.

    Environment of the DIO molecule is exhibited on Fig. 4. As mentioned earlier, the active site is located at the border of two subunits: A and B, so a position of the DIO molecule is similar. In order to evaluate possible ways to maintain the DIO molecule in the neighbourhood of iron we were looking for possibile hydrogen bonds between the DIO molecule and aminoacids placed close to the active site region. In our 5 ps snapshot of ESFF trajectory we did not notice any possibilities to create such bonds. As indicated on the Fig. 4, the closed residues to the DIO are Gln90, Trp117, Arg167 from subunit A, Met41, Val55 Arg56 from subunit B and residues belong to the active site. The shortes and the largest distances between DIO ligand and the NHraAse matrix observed in 5ps ESFF trajectory are presented in Table 1.

     

    Table 1. The distances for the nearest neighbours of the DIO molecule. Min, max and average distances between two atoms, wchich could be involved in the hydrogen bonds are listed. Distances for PDB structure are also indicated. In this case the hydrogen atoms were added using the BIOPOLYMER module of the INSIGHT package ver. 95.0 [6].

    Ligand atom - NHase

    Dist. PDB [1]

    Min. dist.

    Max. dist

    Aver.

    DIO:H21 - GlnA90:NE2

    7.68

    2.43

    5.86

    4.58

    DIO:O1 - GlnA90:HE21

    6.88

    2.32

    7.15

    5.67

    DIO:O1 - GlnA90:HE22

    6.86

    2.24

    5.75

    4.24

    DIO:O1 - GlnA90:HG1

    5.49

    5.26

    9.03

    7.08

    DIO:O1 - GlnA90:HG2

    6.48

    3.78

    8.75

    2.15

    DIO:O1' - SerA113:HG

    5.57

    3.25

    5.87

    4.20

    DIO:O1' - CEA114:HA

    5.43

    2.84

    5.15

    3.87

    DIO:O1 - CEA114:HB1

    8.25

    3.08

    6.77

    5.01

    DIO:O1 - TrpA117:HZ2

    3.66

    2.51

    5.34

    4.23

    DIO:H11 - ArgA167:NH2

    6.89

    2.89

    8.98

    6.17

    DIO:H12 - ArgA167:NH2

    7.45

    3.91

    10.09

    7.44

    DIO:O1 - ArgA167:HH22

    7.38

    3.56

    8.18

    6.26

    DIO:O1 - MetB40:HE2

    4.45

    2.62

    5.07

    4.01

    DIO:O1 - MetB40:HE3

    4.72

    2.54

    6.62

    4.69

    DIO:H1'2 - ArgB55:N

    9.47

    5.36

    7.93

    6.29

    DIO:O1 - ArgB55:HB

    5.48

    3.08

    6.35

    4.42

    DIO:O1' - ArgB56:HG2

    4.03

    3.69

    6.12

    5.03

    DIO:H1'1 - ArgB56:NE

    5.94

    3.78

    8.57

    6.34

    DIO:H12 - ArgB56:NE

    2.59

    2.60

    8.61

    6.12

    DIO:H1'2 - ArgB56:NE

    5.25

    3.14

    7.52

    4.97

    DIO:H12 - ArgB56:NH2

    6.36

    3.67

    9.16

    7.18

    DIO:O1 - ArgB56:HE

    5.96

    3.68

    7.63

    6.15

    DIO:H12 - NO:O

    3.83

    2.31

    6.15

    4.59

    To get a better insight into dynamical behaviour of the DIO molecule and to estimate possible positions for docking of this moiety we monitored distances between the DIO, FE iron and one of mentioned residues eg. ValB55. As one can see (Fig. 5) the amplitude of motions of DIO in respect to the position of FE(III) ion is about 1 Ang. (Fig. 5a and 5b) so we can say that position of DIO molecule is rather stable. As we expected, the amplitude of relative motions DIO vs. ValB55 are not so small, this is probably caused by the overlap of two kinds of motions: motions of the DIO molecule and motions of the side chain of ValB55 residue.

    Fig. 5

    Fig.5. Distances in the environment of the DIO molecule (ESFF 5 ps data). Showed distances: (a) FE - O1' DIO (b) FE - O1 DIO and (c) H1' DIO - O ValB55.

    The MOIL forcefield

    For the long 1 ns trajectory obtained with the MOIL forcefield plots of some RMS values are also presented. The RMS calculated for CA atoms (all protein) achieved the maximum value of about 6.5 Ang. (Fig. 6). Such a high value of the RMS could suggest that the structure of NHase is not well maintained in this forcefield. On the other hand long trajectory data may reveal intersiting structural transformations related to the activity of the enzyme. In order to check which regions of the NHase were undergoing the largest structural rearrangements, i we calculated the RMS for subunit A only (Fig. 6 and Fig. 7),for subunit B only (Fig. 6 and Fig. 7), for domain A of subunit B and for domain B of subunit B (Fig. 7).

    Fig. 6

    Fig.6. The RMS values for CA atoms from all protein (black line), subunit A only (red line), subunit B only (green line) and for all atoms formnig the BFE residue (blue line), 1ns trajectory, MOIL.

    Fig. 7

    Fig.7. The RMS values for CA atoms from all subunit A (black line), all subunit B (red line), domain A of subunit B (green line) and domain B of subunit B (blue line), 1ns trajectory, MOIL.

    The RMS for CA atoms from the doman B of subunit B achieved the smallest values - the max value in this case is about 4.2 Ang. Since is known the domain B of subunit B of NHase is composed by a relatively high number of beta-strands (see Fig. 2), we were not surprised by this result. It indicates that in the opposite to the beta-strands, the alpha-helices are less stable. So, we may conclude that the biggest contribution to the RMS value comes from the motions of very flexible fragments of secondary structure, eg. links, loops and turns, and some smaller contributions originate from the partial unfolding of alpha-helices. One should note, however, that this trajectory was obtained under vacum conditions, protein in the proper water environment may exhibit a different behavior.

    We were interested in dynamical behaviour of a new, non-standard residue (named here BFE) representing the active site of nitrile hydratase. This residue was constructed in order to perform the MD study of NHase using the MOIL forcelield. To check the stability of BFE we calculated the RMS for all its atoms - see Fig. 7.

    In this case the maximum value of the RMS is about 2.5 Ang. This result indicates the stability of this model of active site; RMS is rather small despite it includes possiblity for a relatively large side chains motions. Thus this is the supporting evidence that our parameters, including partial charges calculated elsevere [3], work.

    It is known that three oxygen atoms in the active site (OG Ser113, OD1 CSD112 and OD CEA114) have unusual triangular arrangement and forms so-called "oxygen claw setting". These oxygen atoms were postulated to play an important role in the stabilization of binding of NO molecule to the NHase [1]. Insight into that interactions could be provided by the monitoring distances between the mentioned oxygen atoms. As shown in Fig. 8 this distances are higher than those observed in the X-ray. Since doing parametrization of the BFE we were trying to keep as many original MOIL parameters as possible, and we did not introduced any additionas constrains to maintain the experimenatlly observed oxygen claw setting, this observation indicated that subtle crystal packing may be required to keep affectively these 3 oxygens in such a close proximity. It would be very useful to know an Xray structure of the active form of NHase with the properly oxidized Cys112 and Cys114 residues. The low resolution structure reported by Huang et al. ( pdb code 1AHJ ) has aparently a wrong interpretation of the active site electron density. On the other hand this observation make us to think that our model of active site is not ideal. Studies on the parametrization of BFE/MOIL "residue" are necessary.

    Fig. 8

    Fig.8. The geometry of the oxygen atom forming so-called "oxygen claw setting",(a) OG Ser113:OD CEA114 (b) OG Ser113:OD1 CSD112 (c) OD CEA114:OD1 CSD112 distances, geometry for the PDB structure is also indicated (red line).

    Graphical analysis of the 1ns trajectory reveals possibilities of a presence of a network of hydrogen bonds with contribution of ArgB56, ArgB141, CSD112 and CEA114. This stabilizing bonds were found in the high-resolution X-ray studies [1]. Our calculation provided further evidences for the presence of such interactions. These H-bond do modify catalytic properties of the nitrile hydratase [Odaki2001].

     

    Conclusions

    The ESFF forcefield gives rather reasonable quality results for the geometry of the nitrile hydratase and therefore might be used as an alternative approach to the modeling studies of this enzyme. However the structure of the iron active center is not correctly maintained, further studies on parametrization of this region are required. The problem is related to the proper description of non-standard sulfenic and sulfinic acid residues. The dioxane molecule is kept in its position by a network of steric interactions. We were not able to indicate any particular hydrogen bonds that might be related to dioxane binding in the entry chanel of NHase. In our ESFF model, the NO molecule could diffuse out of the Fe centre on the 50 ps timesale (300 K), despite the presence of the dioxane "flap". Our data indicate that the environment of the dioxane molecule indentified here (Fig. 4) could play a role in the catalytic activity of the enzyme and should be studied in grater detail. Mutations in this region, especially of charged residues such as Arg56 might affect NHase performance.

    The geometry of the NHase and particularity the structure and interacions in the neighborhood of the active site were studied using the MOIL forcefield. For the first time the active site model of NHase was constructed in this forcefield. Our parameters of the atoms included in the BFE residue reflected stability during the 1ns trajectory (including the equillbration and heating period of our simulation procedure). This reasonable stability is indicated in Fig. 7 and Fig.9.

    Fig. 9

    Fig.9. The overlap of non-standard BFE residue: the starting geometry (red) and the structure after 750ps (green).

    We observed a relatively high value of RMS calculated for CA atoms from all the protein. This result could be caused by dynamical unstability of numerous flexible links between two subunits and between two domains of the subunit B. Furthermore, there is a high content of short alpha-helices which are aparently relatively unstable here. We find that this high value of RMS is caused by a partial unfolding of alpha-helices and relative motions of different "subunits" of NHase. Fig. 10 supports this observation.

    Fig. 10a Fig. 10b

    Fig. 10c Fig. 10d

    Fig.10. The overlap of CA atoms from all protein. The starting geometry is indicated by red colour, the geometries after 250ps (A), 500ps (B) 750ps (C) and 1ns (D) are indicated by green.

    In order to better scrutinize dynamic behaviour of NHase and its enzymatic activity further studies on parametrization of oxidized cysteines in both used forcefields are required. Such research is currently performed in our laboratory.

     

    References

    1. S. Nagashima et al., Nature Struct. Biol. 1998, 5, 347.

    2. P.K. Mascharak, Coord. Chem. Rev. 2002, 225, 201.

    3. W. Nowak et al., Int. J. Quantum Chem. 2002, 90, 1174.

    4. R. Elber et al., Comput. Phys. Comm., 1994, 91, 159.

    4a. C. Simmerling, R.Elber et al., in "Modeling of Biomolecular`Structure and Mechanism", A.Pullman et al. eds., 241-261, Kluwer, 1995.

    5. A. Boone et al., Inorg. Chem. 2001, 40, 1837.

    6. InsightII, Accellrys USA, 1997.

    7. W. Nowak et al., Acta Bioch. Pol., 2001, 48, 903.