Professional Documents
Culture Documents
The algorithm used by the program GEPOL to compute the Molecular Surface (MS), as defined by
Richards, is presented in detail. GEPOL starts like other algorithms from a set of spheres with van der
Waals radii, centered on the atoms or group of atoms of the molecule. GEPOL computes the MS by first
searching the spaces inaccessibleto the solvent and consequently filling them with a new set of spheres.
Here we study the behavior of the method with its parameters, presenting several examples of application.
Select a
COMPUTATION METHOD
How Does the Program Decide if a New small that the effect of adding more spheres ap-
Sphere Should be Created? proaches minimum. Actually, the standard version
of the program" calculates the distance between
In this operation the program analyzes whether the two spheres with the overlapping angle fixed
between the two spheres forming the pair there is at 50", then selects the pairs with a real distance d
a space that is not occupied by another sphere of larger than the calculated one. The angle of over-
the original set, and besides it is inaccessible to lapping was fixed to 50" because after several
the solvent. The decision is made applying three tests it appeared that a larger value did not give a
tests or conditions to the pair and only when they significantly better result, and produced in an in-
are fulfilled a new sphere will be created. The crease of cpu time cost.
three tests are as follows: 3. Are there any other spheres in between? Each
1. Can the solvent get through the two spheres? pair is tested once more before creating a new
If the answer is no, the pair will pass to the next sphere. The program checks whether a sphere al-
test, because this answer means that there is a ready exists between the pair, obstructing the ac-
space inaccessible to the solvent. The computa- cess of the solvent. If so, a new sphere will not be
tion starts calculating the distance d between the created. To decide if a sphere I is between the pair
centers of the spheres that form the pair and only GP, the distance between the center of I and the
the pair that fulfill the following relation will pass line that joins the centers of G and P, is compared
to the next test: with a value that we call radius of hiding (RHID).
This radius is obtained by multiplying the radius
d < (RG+ Rp + 2Rs), of I by the parameter FRADIO given in the input
(Fig. 3). FRADIO can take values between 0 and
where Rs is the radius of the solvent represented 1and is used to control the rigidity of this test. In
by a rigid sphere, and RG and Rp the radii of the fact, the smaller the value of FRADIO, the closer
spheres G and P that form the pair. should the center of I be to the line GP consider-
2. Are the spheres well overlapping? In fact, the ing that I is between G and P. One should note
volume inaccessible to the solvent between the two that when comparing spheres created in the same
spheres decreases continuously when the overlap- iteration the condition for hiding is more rigid,
ping angle, as defined in Figure 2, increases. When i.e., the sphere I hides G from P, when its center
this angle takes values close to go", it becomes so is just over the line GP, to avoid the formation
100
2
H = -=-
PI
\ SP (SP- PI) (SP - E)(SP- IE)
Figure 3. Different radii of hiding for values of FRADIO of 0.25,
0.50, 0.75, and 1.0, and calculation of the distance H. See text for fur-
ther details.
of an asymmetrical surface if the molecule is this reason, a new test is used in our algorithm:
symmetric. if the radius of the new sphere is smaller than
RMIN the sphere will be discarded. Thus, the use
of RMIN is a decisive factor to stop the iterative
Creation of New Spheres process that the program follows.
The way to create the new spheres depends on the
type of the pair. In the Figure 4 the three types of
pairs are shown with the new spheres drawn in. STUDY OF THE VALUES OF FRADIO
The pairs of the type A have the spheres overlap-
ping. The difference between B and C is that in B As we saw in the previous section FRADIO is the
the distance between the surface of the solvent parameter that determines the values of RMIN
and axis GP is larger than the radius of the and RHID and so controls the number of spheres
smallest sphere that we allow to be created. The created in the process of smoothing the molecular
radius of this smallest sphere is called RMIN and surface. With the aim of studying the behavior of
is calculated by multiplying FRADIO by the aver- the method with RMIN and RHID, we have made
age of the radii of all initial spheres. a modified version of GEPOL, that allows inde-
If the pair is of the type A or B the new sphere pendent values of RMIN and RHID to be given
will have its center at a point on the axis GP and using two different FRADIO.
equidistant from the two spherical surfaces of the For this analysis we have used a structure that
pair, and with such a radius that it will be tangent we call NAD-LOOP, where NAD is the coenzyme
to the solvent. The methods for calculating the co- nicotinamideadenine dinucleotide and LOOP is
ordinates and radius of the new sphere for these a polypeptide loop of the LADH (Liver Alcohol
types of pair are shown in the Figures 5 and 6. As Dehydrogenase) involved in the binding of the
can be seen in the Figure 7, for the pairs of the C coenzyme. The association of the NAD and this
type the center will be in the intersection of the loop of the LADH allows the LADH to catalyze
surface of the biggest sphere of the pair with the the reaction of alcohols to aldehydes or ketones.'l
axis GP and the radius will be calculated being This structure was not selected for its special
tangent to the solvent. biochemical interest but as a real structure with
The addition of new spheres having too small a which to test the method. NAD-LOOP is composed
radius encumbers the computation without pro- of 165 nonhydrogen atoms, the initial spheres of
ducing significant improvement in the results. For radius 1.8 being centered on them. The carte-
GEPOL; IN IMPROVED DESCRIPTION OF MOLECULAR SURFACES 1051
m0 .....
sian coordinates were obtained from a molecular diagonal of Table I are drawn in bold type. We can
dynamic calculation.22The radius of the solvent see that they lie close to the continuous line, which
was taken equal to 1.5 A. is not a surprise because in this way both FRADIO
In the Table I we indicate with the symbol + (RMIN)and FRADIO(RH1D)give a similar degree
the calculations that have been made. The calcula- of freedom in the process of selection of the pairs.
tions corresponding to smaller values of FRADIO- This effect which we have observed in other cases
(RHID) and FRADIO(RM1N)have not been made is the reason for combining them in one, FRADIO,
because they do not give more information and that controls both factors at the same time.
they require a large amount of cpu time. For the In the Figure 8 we can also see that when
value of FRADIO(RM1N) equal to 0.9 there is no FRADIO decreases the area and volume approach
creation of spheres. a limit. This limit (FRADIO equal to 0.0) corre-
In the Figure 8 are shown the values of area sponds to the continuous molecular surface de-
and volume as a function of the cpu time for all of fined by Richards.17 Consequently, there is a
the calculations given in Table I. Due to the filling direct relation between the computational effort
of the space inaccessible to the solvent, the process and the quality of the result. After these calcula-
of smoothing implies an increase to the volume and tions and some others that we have made, we rec-
a decrease of the area. In these figures we can see ommend a value for FRADIO of 0.5 as giving a
that there is a limit of efficiency shown with a good ratio between the cpu time used and the re-
continuous line. The values corresponding to the sult obtained.
1052 PASCUAL-AHUIR AND SILLA
I
I
1 msa=
d2+
2 d E
2 - 2
- PS
From the graphs it can be seen that the results this method you get a basically similar area though
obtained for the MS are good, the difference with the strategy used is very different. The differences
the exact values being no larger than 1.5%in any between the exact value and those obtained with
case. GEPOL always overestimates the value of the MSDOT and GEPOL methods are shown in
area and underestimates the value of volume. This Figure 10. The MSDOT results show bigger mis-
is hardly surprising given the strategy used by takes than our results for almost all the cases.
GEPOL to build the MS. In fact, the smoothing The MSDOT results show large downward fluctu-
process implies the filling up of space inaccessible ations which confirm its oscillating nature.” In
to the solvent in successive steps, increasing in the WMS (Fig. lO(A)) the relative error for our pro-
each one the molecular volume and decreasing the gram is never larger than 0.2%while Connolly’s
molecular area until reaching the limit given by program goes up to 2.6%.In the MS (Fig. 10(B))the
the surface of Richards. relative error for GEPOL is never larger than 1.5%
As we know the exact value of the area we can while MSDOT goes up to 2.6%.The CPU time
set up a comparison with the Connolly’s method,13 consumed by our program is always smaller. The
which is the most widely used nowadsys. By using overall calculations shows that MSDOT uses a
1054 PASCUAL-AHUIR AND SILLA
l + h
ZN =-
ZG+hZp
I+h I
42.5% more time-consuming than GEPOL (41% by the MS has a maximum but its values are al-
in the WMS and 44% in the MS). ways larger than in the WMS.
In Figure 9 it is also possible to see the differ-
ences of the behavior of the WMS in respect to
Internal Rotation
the MS. The area of the WMS is bigger than the
MS up t o a distance of 5.5 A between the centers The study of the energetic barrier in an internal
of the spheres; after that it takes smaller values. rotation is a common objective for a theoretical
That is logical because the WMS is formed for chemist. In solution the changes of the area and
two isolated spheres at a distance of 4.8 A (twice volume during an internal rotation produces vari-
the radius of the spheres) and for the MS that oc- ations in the values of the electrostatic interac-
curs at a distance equal to 7.8 (4.8plus the di- tion, energy of cavitation, etc. that can alter the
ameter of the solvent). This difference in behavior form of the barrier. GEPOL has demonstrated its
is important because the interaction of the sol- capacity to describe these changes, using both MS
vent with the molecule is through the surface6@ and WMS,lg versus other well known
and any variation of that is reflected in the inter- Here we will show the differences in the area and
action. Also the variation of the volume enclosed volume of octanol molecule when either type of
GEPOL; IN IMPROVED DESCRIPTION OF MOLECULAR SURFACES 1055
2300
2250
g 2200 O
n
n
2150
2100
2050
2000
0 2000 4000 6000 8000 10000 12000 14000 16000 18000
CPU TIME
b)
surface is used. Figure 11 shows the differences From the area and volume curves, it can be
between the areas obtained with the WMS and seen that the differences between the two kinds
MS, as well as the volumes enclosed by them, as a of surfaces are significant. Besides that it is in-
function of the angle of rotation. The angle of ro- teresting to note that these differences depend on
tation corresponds to the rotation around the the angle and, thus, on the molecular geometry.
bond C4-C5 (values used are between 0" and In fact the difference between the areas increases
180", in increments of 10'). Standard values were almost 50% when the angle changes from 10 to
used for the ge~metry,'~ a solvent sphere of 1.5 70. If the envelope surface of a molecule modu-
and a value of FRADIO of 0.5. lates the interaction of the molecule with the sur-
1056 PASCUAL-AHUIR AND SILLA
Table I. The symbol + marks the calculations performed for the system NAD-LOOP.
FRADIO FRADIO(RM1N)
(RHID) .1 .2 .3 .4 .5 .6 .7 .8 .9
.1 - + +
.2 - + +
.3 - + +
.4 + + +
.5 + + +
.6 + + +
.7 + + +
.8 + + +
.9 + + +
1.0 + + +
160
150
140
130
g 120
110
100
90
80
b)
130
125
120
115
110
105
100
95
90
85
80
75
70
1,o 2,o 3,O 4,o 5.0 6,O 7,o 8,O
DISTANCE
Figure 9. Values of the area (A') and the volume (A3)enclosed by the surface formed ini-
tially for two spheres, versus the distance (A) between their centers. WMS GEPOL; MS
GEPOL; W MS Analytical.
GEPOL; IN IMPROVED DESCRIPTION OF MOLECULAR SURFACES 1057
DISTANCE
Figure 10. Absolute error in the area calculation for a two spheres model versus the
distance (A) between their centers. A MSDOT; 0 GEPOL. (A) van der Waals Molecular
Surface; (B) Richards Molecular Surface.
roundings?" the type of molecular surface used and our own. Figure 12 shows the comparison be-
will be vital in calculating the energy of interac- tween both methods. The MSDOT results show
tion. Consequently, the choice of the surface can big fluctuations which ratifies its oscillating na-
favor one conformation over others. ture2' whereas the GEPOL results show in a better
The difference between the WMS and the MS way the continuous variation that the molecular
areas as a function of the rotation angle have been area has to go through when changing the rota-
calculated according to the Connolly's method tion angle.
1058 PASCUAL-AHUIR AND SILLA
ANGLE
Figure 11. Difference between the MS and the WMS (MS minus WMS) for an internal
rotation of the octanol molecule. 0 area (A'); volume (A3).
Fractality of the Surface carbons. The geometry was taken from a molecu-
lar dynamic simulation28and we used a value of
The fractality is a good index to characterize the
FRADIO equal to 0.6. Values for the area have
irregularities of the molecular surface, which have
been obtained for radii of the probe from 0.5 to
been shown to be related to the reactivity of the
3 A. In the Figure 13 are shown the values of
biomolecules. In fact, a correlation between re-
fractality obtained for the different radii. This
gions of high fractal index and those involved in
graph exhibits similar behavior to that obtained
protein-protein interactions has been
by Lewis and Reed for other proteins.
This index can be calculated using the formula of
Lewis and Reed.25
CONCLUSION
2 - D = dlog (A)/dlog ( R )
where D is the index of fractality and A is the In conclusion, we have presented the procedure
area of the MS calculated for a given radius of used by the program GEPOL to calculate the
probe R . Molecular Surface starting from a set of spheres
We have calculated the global fractality of the centered on the atoms or group of atoms of the
Retinol Binding Protein (RBP), only in order to molecule. GEPOL computes the MS by first
show the capability of GEPOL working with searching the spaces inaccessible to the solvent
structures of this size. This molecule is formed by and then filling them with a new set of spheres,
1438 nonhydrogen atoms, where we have centered this strategy being one of the fundamental differ-
the spheres with the following radii: 1.4A for the ences between GEPOL and other method^.'^"^
oxygens, 1.5 A for the nitrogens, and 1.9A for the The parameters used by this method are princi-
GEPOL; IN IMPROVED DESCRIPTION OF MOLECULAR SURFACES 1059
ANGLE
Figure 12. Difference between the MS and the WMS (MS minus WMS) for a n inter-
nal rotation of the octanol molecule. A MSDOT; 0 GEPOL.
,1 1 10
PROBE RADIUS
Figure 13. Index of fractality for the surface of RBP versus the probe radius in A.
1060 PASCUAL-AHUIRAND SILLA
pally t h e radius of the probe sphere, and FRADIO 16. B. Lee and EM. Richards, J. Mol. Biol., 55, 379
w h i c h c o n t r o l s the c r e a t i o n p r o c e s s of n e w (1971).
spheres. The latter p a r a m e t e r c a n take values be- 17. EM. Richards, Ann Rev. Biophys. Bioeng., 6, 151
(1977).
tween 0 and 1.0. 18. J.L. Pacsual-Ahuir, E. Silla, J. Tomasi, and R.
Bonaccorsi, J. Comp. Chem., 8, 778 (1987).
19. J. L. Pascual-Ahuir and E. Silla, Quantum Chem-
References istry-Basic Aspects, Actual Trends, R. Carbb, Ed.,
Studies in Physical and Theoretical Chemistry,
1. I. Langmuir, Colloyd. Symp. Monogr., 3, 3 (1925). Elsevier, New York, 1989, Vol. 82, p. 597.
2. R.B. Hermann, J. Phys. Chem., 76, 2754 (1972). 20. J.L. Pascual-Ahuir, E. Silla, J. Tomasi, and R.
3. S.C. Valvani, S. H. Yalkowsky, G. L. Amidon, Bonaccorsi, GEPOL QCPE Program 554 (1988).
J. Phys. Chem., 80, 829 (1976). 21. 0. Tapia, H. Eklund, and (3.1. Branden, in Esteric
4. R.A. Pierotti, Chem. Rew., 76, 717 (1976). aspects of Biomolecular Interactions, G. Naray-
5. S. Miertus, E. Scrocco, and J. Tomasi, Chem. Phys., Szabo and K. Simun, Eds., CRC Press, Boca Raton,
55, 117 (1981). FL, 1986.
6. G. Floris and J. Tomasi, J. Comp. Chem., 10, 616 22. 0. Tapia and E Colonna-Cesari, personal commu-
(1989). nication.
7. L. Pauling, Nature, 161, 707 (1948). 23. G.A. Arteca and P.G. Mezey, J. Phys. Chem., 93,
8. B. Linder, Mu. Chem. Phys., 12, 225 (1967). 4746 (1989).
9. B. J. Costa Cabral, D. Rinaldi, and J.L. Rivail, 24. J . A . Pople and D.L. Beveridge, Approximate
C. R . Acad. Sc. Paris, 298, 675 (1984). Molecular Orbital Theory, McGraw-Hill, New York,
10. A. Horta, I. Fernandez-Pierola, Macromolecules, 1970.
14, 1519 (1981). 25. M. Lewis and D.C. Rees, Science, 230,1163 (1985).
11. A.Y. Meyer, J. Chem. Soc. Perkzn ZZ, 1161 (1985). 26. P. Pfeizer, U. Welz, and H. Wippermann, Chem.
12. M. L. Connolly, J. Appl. Cryst., 16, 548 (1983). Phys. Letters, 113, 535 (1985).
13. M. L. Connolly, Molecular Surface P r o g r a m , 27. J. Aquist and 0. Tapia, J. Mol. Graph., 5, 30
QCPE 429. (1987).
14. M . L . J . Drummond, J. Chem. Phys., 88, 5021 28. J. Aquist, P. Sandblom, T.A. Jones, M.E. New-
(1988). comer, W F van Gunsteren, and 0. Tapia, J. Mol.
15. H.R. Karfunkel and V. Eyraud, J. Comp. Chem., Biol., 192, 593 (1986).
10, 628 (1989). 29. A. Meyer, Chem. Soc. Rev., 15, 449 (1986).