8. Secondary structure of
proteins Kai Griebenow*, University of Puerto Rico, * Corresponding author, e-mail griebeno@adam.uprr.pr To see diagrams at full size click here
Introduction FTIR spectroscopy has been applied to study the secondary structure of proteins in aqueous solution for quite some time [13, 34,42,67,84]. However, other methods, such as crystallography in combination with X-ray structure determination and NMR spectroscopy, are frequently more suitable to gain detailed structural information, even at the atomic level, under such conditions. In addition, fluorescence-, absorption-, Raman-, CD-, and ESR-spectroscopy can frequently complement this information. Therefore, why would anyone use a methodology that typically does not provide detail information but rather an "overview" picture of global structural arrangements in proteins? The answer lays in the main strength of FTIR spectroscopy, the possibility to investigate protein structural changes under conditions not directly non-invasively accessible by other spectroscopic methods. For example, FTIR spectroscopy has proven to be a particularly useful tool in gaining a detailed understanding of protein structural changes upon protein binding to membranes or membrane-mimicking conditions [11,46,47]. The advantage of FTIR spectroscopy in such applications is that the physical state of the sample is irrelevant. This is because stray-light is surpressed due to its inappropriate modulation [11]. Based on the resulting flexibility, FTIR spectroscopy has recently been employed to determine the secondary structure of proteins under conditions where other spectroscopic methods could only be used with difficulty, if at all. These conditions include amorphous, dehydrated protein powders, suspensions of such in organic solvents, and encapsulated in polymer matrices. Understanding the structure of proteins in such unnatural environments is important for many biotechnologically relevant areas and applications. For example, pharmaceutical proteins are frequently lyophilized to increase their shelf-life and further transportability [2]. However, proteins are susceptible to solid-state aggregation under such conditions [23]. FTIR spectroscopic results on various proteins have shown that dehydration-induced structural changes occur for most proteins and are responsible for the solid-state aggregation. Furthermore, FTIR spectroscopy was used to develop improved formulations for some proteins in the solid state, e.g., for recombinant human growth hormone, and to identify mechanisms of the prevention of detrimental solid-state aggregation [29]. In another area, the synthesis of chemical compounds using enzymes under nearly non-aqueous conditions in neat organic solvents, dehydrated enzyme powders are suspended in the solvents in many of the applications [55-57]. FTIR spectroscopy has been employed to investigate the structure of the enzymes under such conditions in order to gain insight on the influence of the protein structure on the catalytic performance [44]. In addition, FTIR spectroscopy has been used to investigate why proteins denature in mixtures of organic solvents with water but not in pure solvents [45]. The conditions tested included both, dissolved and suspended protein. Finally, by utilizing the combined knowledge by the above experiments it was recently possible to stabilize the structure of two model proteins upon encapsulation in a hydrophobic polymer matrix [17,18]. Such encapsulation involves the exposure of proteins to very denaturing conditions, such as dehydration and organic solvents. This review highlights the FTIR methodologies employed in these recent studies to demonstrate the applicability of FTIR spectroscopy as a fast and reliable method for exploring structure and function of proteins under various conditions. After discussing the vibrational modes of the peptide bond, the main analytical procedures employed are introduced, and limitations of the FTIR methodology are discussed. Finally, some recent results on lyophilized proteins are used to exemplify how the developed tools were employed to address important scientific issues. Methods used to analyze the secondary The use of FTIR spectroscopy to analyze the structure of proteins has been reviewed extensively in the last years [3,7,11,49,83] and the following section is not meant to introduce the background theory comprehensively. Within this review only those techniques and methodologies employed in investigations on the structure of proteins in the amorphous solid state will be discussed. Structurally sensitive vibrational Figure 1 shows the IR spectrum of a water-soluble globular protein, human albumin, in
the spectral window feasible for measurements with aqueous solvent. (Other amide
vibrations overlap with some very strong water IR absorption bands at ca. >3000 cm-1
and <800 cm-1). The amide absorptions in this spectral region are labeled
and named amide I, II, and III. The amide group, in model systems (e.g.,
N-methylacetamide), peptides, and proteins, exhibits 9 vibrational modes, giving rise to
the amide bands A, B, and I-VII [7,58]. However, as pointed out by Arrondo et al.
[3], due to technical and theoretical limitations, only the amide I, II, and III bands can
be used to investigate the secondary structure of proteins. Normal coordinate analysis has
revealed that the amide I is primarily a C=O stretching (C=O s) vibration with some
contributions from CN s and CCN deformation (CCN def). In contrast, amide II and III are
heavily mixed vibrational modes. The amide II is an out-of-phase combination of NH
in-plane-bending (NH ib) and CN s with minor contributions from C=O ib, CC s, and NC s.
The amide III is an in-phase combination of CN s and NH ib with some contributions from CC
s and C=O ib. The interested reader is referred to the reviews of Krimm and Bandekar [58]
and Bandekar [7] on normal coordinate analysis for details.
In principle, if the frequencies of all IR-active amide modes of each element of the secondary structure together with their respective molar extinction coefficients were established, one will be able to determine the secondary structure of any given protein precisely from its vibrational spectrum. Unfortunately, this is presently not the case. In fact, there is still some discussion even on the extinction coefficients of the most common secondary structure motifs in proteins.§ Moreover, the frequencies at which the corresponding secondary structure elements absorb also vary somewhat from protein to protein. In spite of these obstacles, several strategies have successfully been applied to obtain estimates for the secondary structures of proteins from the shapes of the amide I and III region. These approaches and strategies are outlined below. Thereby we put emphasis on the amide I spectral region because theoretically it is comparatively well understood and has been employed thus far in probably more than 90% of all IR investigations of protein secondary structure [3].
Force-field calculations using "standard" conformations This approach has been pioneered by Krimm and co-workers [see 7,58 for a recent reviews of this work] and also by some other groups [20, 66]. In short, the amide frequencies for certain well-ordered homo-polypeptides were computed and compared with those experimentally determined. The basis of this work is the established fact that many homo-polypeptides adopt a well-defined secondary structure in water, such as the a-helix (a-poly-L-alanine) and b-sheet (b-poly-L-alanine) structure. These investigations have provided a vast amount of information on the normal mode frequencies of a-helices, b-sheets, and to a minor extent of turns in the amide I, II, and III region, thus providing explanations for many experimental results. However, these theoretical data still can not completely account for the complex situation in proteins. First, solvent effects are excluded from these calculations. They are expected to be small for amide groups connected by hydrogen bonds, but they could be significant in instances for "free" amide groups. This is a well-established fact obtained from investigations of model compounds in various organic solvents [30]. Second, and even more important, the calculations were done for very well ordered, highly repetitive secondary structures, which are not always an appropriate model for the secondary structure observed in proteins. For example, a-helices in proteins have a different length which may give rise to IR frequency differences [66]. Furthermore, a-helices and b-sheets in proteins may be embedded into hydrophobic or at least amphiphilic environments, while those of model peptides are completely hydrated. Since a-helices have the capability to bind water strongly [6], some dependence of the amide mode on the hydration level could be expected. Finally, a-helices in proteins frequently exhibit some structural irregularities (e.g., bends and kinks)[8] which are also expected to influence the amide vibrations for these structures. Semi-theoretical approach (amide I only) An approach compromising the theoretical one described above and the semi-empirical one described below has been developed by Jackson and Mantsch [for a review see 49 and references herein]. Each element of the secondary structure is characterized by typical dihedral angles (p and F in the Ramachandran plot). Therefore, the hydrogen bonds formed by the C=O amide groups have a characteristic strength for each individual element of the secondary structure. The frequency of the C=O s vibration, the main amide I contribution, is related to the strength of the hydrogen bond formed: the stronger the hydrogen bond, the lower the electron density of the C=O group, the lower the frequency of the amide I absorption. Intermolecular b-sheets form the strongest hydrogen bonds and therefore absorb at 1610-1625 cm-1, intramolecular b-sheets at 1630 cm-1, unordered structures (hydrogen bonded to water or side chain groups) at about 1640-1650 cm-1, and finally a-helices with quite weak hydrogen bonds at about 1653-1660 cm-1. According to Jackson and Mantsch the vibration of the free amide I C=O s is expected at ca. 1670 cm-1.% IR bands in the amide I at ca. 1695 cm-1 are caused by the strong transition dipole coupling in b-sheets. However, there are bands at 1670-1690 cm-1 observed for protein amide I spectra. As an explanation for these frequencies the author use the concept of electron back-donation occurring when C=O groups are tilted out of plane by steric reasons, e.g., out of the plane of a cyclic ring [see e.g. 48] which yields uncharacteristically high C=O s frequencies. Such C=O groups tilted out of the amide plane by steric reasons can frequently be found in turns. Therefore, it is likely that much of the amide I absorbance in the spectral region under scrutiny is caused by turns. However, this is uncertain [49]. Overall, the above explanation has been proven very useful for the assignment of amide I bands to specific secondary structure elements.
Semi-empirical approach. Byler and Susi [13] developed this approach in scholarly work. This paper is still very useful even though in the last 12 years marked technical, computational, and conceptual advances have been achieved in the infrared spectroscopic investigation of protein secondary structure. In this work, the authors investigate the amide I region of proteins. The amide I is the amide vibration after deuteration of the NH backbone groups. Technically, this can be easily (but not fully) achieved by dissolving lyophilized proteins in D2O followed by a prolonged period of incubation (typically 24 h at 4ºC). At the time of this work reliable subtraction of the strong water absorption band at ca. 1640 cm-1 was not easily possible due to limitations of the instrumentation (e.g., detector non-linearity). It should be noted that this is not the case anymore and it has become routine to determine the secondary structure of proteins in H2O [28,44,45] (vide infra). In D2O this band is downshifted and correction of the spectrum by background subtraction is quite easy in the amide I region. As already mentioned, the individual bands are strongly overlapping in the amide I and
I region. Therefore, resolution-enhancement (better called band-narrowing) is
required prior to the analysis by curve-fitting procedures. Some investigators, however,
prefer to use the original amide I spectra for curve-fitting [see e.g. 83]. We do not
follow this recommendation because too many possible solutions may arise. We recommend
instead to follow the method developed by Byler and Susi [13] who band-narrowed their
spectra by Fourier-selfdeconvolution [51,52,65,91]. The secondary structure was then
subsequently determined by Gaussian curve-fitting (see Figure 4), i.e., the spectral shape
is fitted by a sum of Gaussian bands. From the areas of these individual bands
after their assignment to the specific elements of the secondary structure their
presence in the protein structure are calculated from their fraction of the total band
area in the amide I region [13, 44,45].
In their approach, Byler and Susi [13] used proteins where the atomic coordinates were known with sufficient accuracy from X-ray crystallography and the secondary structure was calculated by others from these coordinates [50, 63]. After the Gaussian curve-fitting procedure the resulting component bands were assigned for all proteins in the same way, depending on their frequency. The band assignments were based on both, empirical and theoretical knowledge from normal coordinate analysis. Using this approach, the authors were able to demonstrate that the a-helix and b-sheet contents can be calculated with great accuracy from the amide I IR spectra of proteins. Only proteins with a very high a-helix and very low b-sheet content caused some problems. The "extended chains" in these proteins (the segments at the ends of the helices) absorb at similar frequencies as do b-sheets. Therefore, strictly spoken, theb-sheet content of a protein can not be determined with the same accuracy from the amide I spectrum as the a-helix content. The only solution to this problem is the simultaneous investigation of a second spectral region, the amide III, or employment of an alternative method, such as CD- or Raman-spectroscopy. In a similar fashion as outlined above, Singh and co-workers employed the amide III spectral region to calculate the secondary structure for various proteins [40,80,81,82]. Even though the amide III is much weaker than the amide I, better band separation allows for curve-fitting without the need of employing resolution-enhancing methods. The results by Fu et al. [40] based on a set of structurally differing proteins, have been confirmed for other proteins by Griebenow and Klibanov [43] who found that the amide III results and X-ray data were in good agreement for most proteins. Similarly, Dong et al. [34] established the band-assignments for the amide I in water. These authors employed second derivative spectra in the curve-fitting procedure. Band-narrowing by second derivatization is suspected not to be a sound approach [49]. However, again the assignments have been proven to be correct when using other proteins and employing FSD as the band narrowing methodology [17,18,29,44,45]. The methodology is exemplified in Figure 4 for the protein bovine pancreatic trypsin inhibitor at pH 7.0 in the amide I area. In this case the original spectrum was band-narrowed by FSD using the parameters of FWHM 25 cm-1 and enhancement factor k = 2.8 using the program OMNIC. The result of the Gaussian-curve fitting is shown in red and coincides with the measured spectrum. The individual Gaussian bands were assigned according to the literature. This example was chosen because the b-sheet structure of BPTI absorbs at a somewhat uncharacteristically high frequency of ca. 1643 cm-1 close to the absorbance of unordered secondary structures. This exemplifies that careful assignment of the bands is required because for some proteins atypical frequencies are found [49]. For all other bands standard assignments (see Table 1) can be used. The result of this fit suggests that BPTI has 26% of a-helix and 41% of b-sheet secondary structure. This is in excellent agreement with the X-ray structural data (26% a-helix and 45% b-sheet) and amide III analysis (20% a-helix and 43% b-sheet; )[43]. Quantification of changes in In the following, we confine ourselves on describing those methods for the quantitative analysis of protein secondary structure that were applied to dehydrated, solid protein powders. Not included are more recent approaches that use calibration sets (FTIR spectra of proteins with known X-ray structures) in combination with factor analysis procedures [see 3 for a review]. All these approaches work well but statistically not much better than the Gaussian curve-fitting procedures. The interested reader is referred to the original literature [for example, 37,61,74,77]. Correlation Coefficient The calculation of the correlation coefficient was introduced by Prestrelski et al. [72,73] to quantitate dehydration-induced structural changes in proteins. In principle, however, the method can be employed to characterize the overall-structural changes occurring in a protein upon any changes in the environment. This method is explained in detail in the section describing dehydration-induced structural changes in proteins. Area Overlap Method The "area overlap method" was introduced by Heimburg and Marsh [47] to analyze spectral changes occurring upon heat-denaturation of cytochrome c. Therein, protein vibrational spectra measured at various temperatures were resolution enhanced by FSD in the amide I region. Difference spectra were obtained by subtracting area-normalized spectra obtained at elevated temperatures from that of the native protein (in our case at 25ºC). The changes in total area above the baseline can be used to determine the melting temperature Tm. These changes are a quantitative description of overall structural changes occurring in proteins upon changes in the environment. A modification of this methodology has been applied by Kendrick et al. [54] to characterize dehydration-induced structural alterations in proteins. In short, amide I protein spectra were area-normalized after calculating the second derivative spectra followed by baseline correction. The integrated differences in the area between two samples were used as a quantitative criterion to describe overall structural changes. The rationale by Carpenter and co-workers [15] for employing this method instead of the correlation coefficient method (see above, developed by the same research group) was, that the correlation coefficient method was hampered occasionally by baseline-offsets in the second derivative spectra that could not be accounted for appropriately. However, we do not recommend the method because any quantitative use of second derivative spectra is problematic [49]. Instead, we prefer the method of Heimburg and Marsh (or at least the use of area normalized FSD spectra in the method of Kendrick et al., [54]). We have developed a method that does not rely on self-written programs and can be done with the typical software used to run FTIR instruments (e.g., OMNIC) and a good spreadsheet program (e.g., SigmaPlot). Herein, we first resolution enhance the spectra to be compared by FSD. It is important to use the same values for both samples for the bandwidth and enhancement factor. These FSD spectra are then baseline corrected in the amide I region. Next, both spectra are area normalized to 100 for the spectral region to be employed (e.g., 1700-1600 cm-1 for the amide I). The spectral region to be compared is then saved as ascii (.txt) file. The spectra to be compared have to be saved with the same data spacing and on exactly the same wavenumber scale. These spectra have then to be imported into a spreadsheet program. In SigmaPlot the 1st column contains the wavenumber scale of sample 1, the 2nd column the IR intensities of sample 1, the 3rd the wavenumber scale of sample 2, and the 4th the IR intensities of sample 2. It is then easy to compute the difference spectrum for both samples and store the difference spectrum in column 5 (e.g., col(5)] = col(2) col(4)). Next, the mathematical operation summing up the absolute differences (e.g., col(6) = max(sum(abs(col(5)))) gives the area of difference, e.g. 22. Since the spectra were normalized to an area of 100, this represents 22% area that is different between the two spectra. The area-overlap is therefore 78% for this example. This whole procedure is exemplified in Figure 5 using the above example.
Gaussian Curve-Fitting The general applicability of curve-fitting procedures to quantitate protein secondary structure has already been presented above. The following focuses in particular on important issues with respect to studying the solid-state structure of proteins. Amide III This region has been established using similar semi-empirical approaches as before described for the amide I [13] by Singh and co-workers [40,80-82]. Herein, various proteins with solved X-ray structures were measured by FTIR spectroscopy. Results on the secondary structure composition obtained by Gaussian curve-fitting were compared with the X-ray crystallographic data. Overall, the analysis revealed an excellent agreement between secondary structures derived by both methods, a finding consolidated for other proteins by Griebenow and Klibanov [43]. In addition, it has also been shown that results obtained from Gaussian curve-fitting of amide I and amide III are in good agreement [40,44,45]. However, it has to be pointed out that the theoretical understanding of the origin of the vibrations in the amide III is not as good as for the amide I [49]. While the amide I mainly consists of C=O s with minor CN s and CCN d contribution, the amide III is a more complex vibrational mode. It mainly is the in-phase combination of NH in-plane-bending and CN s with contributions from CC s and CO ib [7]. It has been suggested that there may not be a correlation between amide III bandshape and secondary structure content of proteins [49]. This is in obvious contrast to various investigations on proteins with well known structures where a clear relation of the amide III bandshape and the secondary structure has been established [40, 43]. This discrepancy between theoretical models and experiment suggests that the formers are not yet complete for the amide III. More theoretical work is required for developing a thorough understanding of the amide III. In the following, we show an example demonstrating that results of the secondary
structure determination in the amide III region are consistent with those in the amide I.
Human growth hormone, a protein with a high a-helix content of ca. 55% [16], was studied in
aqueous solution at various pH-values. The secondary structure was then determined at
these pH-values from the protein vibrational spectra. It is evident from Figure 6 that
structural changes are reflected in the amide I, I, and III to the same extent. This
result strongly suggests that in this case the a-helix bands were assigned consistently and
correctly in the amide I and III spectral region. This is in agreement with the data of
Griebenow and Klibanov [45] for the denaturation of lysozyme and subtilisin by mixtures of
organic solvents and water. Herein, the a-helix contents determined in amide III and I
were also in agreement for the native and largely denatured states of the proteins. Thus,
in IR experiments the amide III and I behave similar. In conclusion, we found that the
amide III is a very useful spectral feature to analyze the secondary structure of
proteins. Real drawbacks of the amide III spectral region result from its overlap with
vibrations of other molecules. This is a problem when investigating proteins in
co-lyophilizates with polyols and in organic solvents because appropriate correction for
the additive by subtraction is often not possible.
In summary, we are convinced that the use of the amide III is advantageous, when the following conditions are fulfilled for the specific protein to be studied:
Amide I After FSD After 2nd derivatization Which spectral region should I use? Figure. 7 exemplifies the results obtained by Gaussian curve-fitting in the amide I, I, and III in comparison. It should be noted that this comparison does only include data from some selected papers and should not be taken as being comprehensive at all. However, qualitative analysis of the plots given in Figure 7 allows for an evaluation of the different fitting procedures in comparison. The data for the amide I region were mainly obtained from Dong et al. [34] (myoglobin, cytochrome c, ribonuclease A, lysozyme, a-chymotrypsin, trypsin, alcohol dehydrogenase, concanavalin A, and immunoglobulin G). Herein, the authors quantified the secondary structure composition by Gaussian curve-fitting of amide I second derivative spectra after baseline correction. In addition, data on subtilisin Carlsberg from Griebenow and Klibanov [44] and recombinant human growth hormone [29] after band-narrowing by FSD are also included. It is evident that the a-helix content is estimated very accurately from IR spectra in the amide I. This is in contrast to some statements in the literature that a-helix and unordered secondary structure produce strongly overlapping bands in the amide I therefore artificially increasing the a-helix content [40]. We do not find this to be a significant problem. The a-helix band is typically found between 1655 and 1660 cm-1 (Table 1) and therefore well separated from unordered secondary structure ("random coil") at 1645-1650 cm-1. However, estimation of the b-sheet content is problematic when using the amide I solely for some proteins with a high a-helix and low b-sheet content, such as myoglobin or human growth hormone. This is because the extended chains of the a-helices in such proteins absorb at the same frequency as b-sheets (approximately 1630 cm-1). Therefore, from investigating the amide I IR spectrum of a protein, one can not be sure about the b-sheet content without knowledge of the atomic coordinates.
The amide III data were compiled from papers of Fu et al. [40] (lysozyme, In conclusion, both, the amide I and III region should be analyzed simultaneously, if possible. Since there are no technical limitations anymore with the water subtraction in the amide I [34, 44,45], this does not require additional measurements. Furthermore, simultaneous analysis of amide I and III is also possible for protein powders when they are dehydrated without additives [see e.g., 44,45]. Sample preparation considerations Studying the structure of a protein in the solid state does not require employing additional expensive accessories, such as those for diffuse reflectance or attenuating total reflectance measurements. The protein can simply be measured pressed into a KBr pellet. It is now established that the procedure used to produce KBr-pellets does not induce protein structural changes [26,29,72,73]. In principle, because of the employment of high-pressure during the pellet preparation, there were some concerns on procedure-induced protein structural changes. High pressure is known to affect the structure of proteins in aqueous solution [62,71,85]. However, the following works established that this is not a problem when working with dehydrated protein powders:
However, it has to be pointed out that the above scenario may only hold true for dehydrated proteins. In such powders the conformation of the protein is kinetically trapped due to the increased rigidity of proteins when the "molecular lubricant" water is removed [45]. When proteins are exposed to conditions where they are able to sorb water (experimentally in humidity chambers or accidentally by opening a vial with lyophilized powder that was stored in the refrigerator), it may be possible that the rigidity is significantly decreased. This, in consequence, could lead to a somewhat higher susceptibility of the proteins to undergo procedure-induced structural changes. Thus, under such circumstances the use of techniques that do not require sample preparation, in particular diffuse reflectance measurements using silicon carbide samplers, are recommended. Such investigations are currently ongoing in our laboratory. Lyophilization-induced structural Theoretical investigations implied for quite some time that dehydration of proteins should result in pronounced structural alterations. For example, Kunz and Kauzman [59] theorized what would happen if one would dehydrate a protein crystal. They concluded that they would expect conformational changes to occur caused by the enormous unsatisfied intermolecular forces when removing water from the spaces between the protein molecules. The authors also expected the rigidity of the proteins to increase upon dehydration thus preventing the individual molecules from reaching an equilibrium state. Therefore, the protein molecules should be trapped in various conformational states depending on the exact conditions of the dehydration. In consequence, the authors predicted a dependence of the final protein structure reached on the mode of dehydration. However, until recently, the issue on the conformation of proteins in the dehydrated state was very controversial. Early IR works were mainly performed by difference spectroscopy between one dehydrated protein powder and a set of partially hydrated protein powders (mainly employed was hen egg-white lysozyme).# Due to technical restrictions, these investigations in addition employed partially deuterated samples. Without presenting clear evidence, some groups claimed that the structure of lysozyme in aqueous solution and the dehydrated state is the same [14, 76]. However, it was easy to miss fine spectral changes in the amide I due to the lack of a proper resolution-enhancement methodology at this time and the use of difference spectra. Other studies contradicted these believes. For example, Baker et al. [5] interpreted the increase in the overall tritium-exchange rate upon dehydration as strong though indirect evidence for lyophilization-induced structural changes in lysozyme. Raman spectroscopy has also been used to study the structure of lysozyme in solution, as crystal, and in the lyophilized state. From comparison of the spectra in the amide III region the authors qualitatively concluded that lyophilization caused some main-chain conformational changes [92]. Similarly, Poole and Finey reached this conclusion using Raman- [68,69] and IR-spectroscopy [70]. Interestingly, works on this matter disappeared then for quite some time until the early 90s.
The first direct proof for lyophilization-induced
structural changes occurring in proteins was presented by Carpenter and co-workers in
scholarly work utilizing the potential of FTIR spectroscopy [72,73]. Herein, various
proteins belonging to different structural classes (mainly a-helix, mainly b-sheet,
or mainly unordered secondary structure) were measured in aqueous solution and in the
lyophilized state as powders pressed into KBr pellets(1 mg protein per 200 mg of KBr). The
spectra were then analyzed in the amide I spectral region by first calculating the second
derivative spectra. Next, the overall spectral and thus structural differences were
quantified by calculating the correlation coefficient r according to the following
equation:
This method is exemplified in Figure 8.
The second derivative spectrum of the native protein, in this case
bovine pancreatic trypsin inhibitor, is shown in Figure 8A. Several well-resolved bands
can be observed with frequencies at ca. 1691 cm-1 (b-sheet),
1775 cm-1 (shoulder, unordered), Using the correlation coefficient Prestrelski et al. [72] were able to demonstrate that lyophilization induced structural changes in all proteins involved in their study belonging to different structural classes, but to a varying extent. They also clearly showed that polyols prevented such structural changes (vide infra). The same polyol, however, worked with different efficiency for different proteins. It is important to note that the spectral changes in the amide I are (mainly) a result of protein structural changes and not just caused by the removal of water per se causing the spectral changes (e.g., because of changes in the extinction coefficients). Supporting arguments are summarized in the following:
In summary, convincing though indirect evidence has been presented that spectral changes in amide I and III reflect protein-structural changes upon dehydration. The interpretation of spectral changes being the direct result of the removal of water lacks any such evidence. However, one additional issue that has to be discussed in this context is the vibration of the so-called random coil portion of proteins. It is obvious that in random coil portions of proteins hydrogen bond donors other than the peptide NH backbone group must satisfy the H-bonding capability of the backbone amide carbonyl groups. This could be by hydrogen bonding to various suitable side chain groups or to water as the H-bond donor. Indeed, it has been shown that in crystals of hen-egg white lysozyme 42 water molecules are hydrogen bonded to peptide C=O groups [10]. In the latter case, removal of water could lead to "free" instead of H-bonded carbonyl groups and a frequency increase in the amide I is expected under such circumstances. On the other hand, H-bonding of free carbonyl groups to suitable side chain groups or NH-backbone groups could also occur and even provide the driving-force for the structural rearrangements. However, it can not be excluded that some (probably small) amount of spectral changes in the amide I are indeed a direct consequence of the removal of water because free C=O groups are generated. This is consistent with an increase in intensity for amide I bands between 1670 and 1690 cm-1, frequently observed upon lyophilization of proteins [see eg., 29]. The knowledge on structural changes upon lyophilization has been significantly extended afterwards by Griebenow and Klibanov [43]. In particular, employment of the amide III spectral region proved fruitful in this context. The problem with the exclusive use of the amide I in the work of Prestrelski et al. [72] was that the situation with already strongly overlapping amide IR-bands for proteins in aqueous solution was significantly worsened by further broadening of the individual bands upon dehydration. As the result, Prestrelski et al. [72] did not feel confident in using resolution-enhancement methods in combination with Gaussian curve-fitting to quantitate changes in the a-helix and b-sheet content. The amide III spectral region (ca. 1215-1335 cm-1) has some advantages in such applications over the amide I, in particular the better separation of the spectral components caused by the individual elements of the secondary structure [40, 80-82], as outlined earlier. Employing the amide III spectral region, Griebenow and Klibanov [43] were able to show that lyophilization induced similar structural rearrangements in the structure of various proteins. Gaussian curve-fitting analysis of the amide III band revealed for all proteins a significant drop in the a-helix content and an increase in the b-sheet content as the result of dehydration. However, the magnitude of the structural perturbations was protein dependent. For example, while drastic changes occurred for the mainly a-helical proteins myglobin and human albumin, proteins with a high b-sheet content and low a-helix content (e.g., chymotrypsinogen) appeared somewhat less susceptible. Furthermore, the use of FTIR spectroscopy to analyze the structure of proteins in the dehydrated, amorphous state was validated employing bovine pancreatic trypsin inhibitor as a model protein. The results obtained by analysis of the amide III IR spectra were in excellent qualitative agreement with those derived by Desai et al. [33] employing a completely unrelated spectroscopic technique, high-resolution H/D exchange NMR spectroscopy. It was also found for this protein, that the magnitude of the dehydration-induced structural changes depended on the mode of dehydration. Some dehydration methods (e.g., rotory evaporation) produced more structural alterations than others (e.g., lyophilization). Similarly, Costantino et al. [24] found that different methods of dehydration caused different magnitude of structural changes in human albumin. These results are in good agreement with the early predictions of Kunz and Kauzmann [59]. Finally, Figure 9 summarizes all results of the quantitative determination of the dehydration-induced protein structural changes in the last few years. It is obvious that all but one protein underwent the same structural perturbations: the a-helix content decreased and the b-sheet content increased upon dehydration. A possible explanation for this observation may lay in the differences in the amount and strength with which water is bound to a-helices and b-sheets in proteins. Analyzing the X-ray structures of various proteins Barlow and Poole [8] found that water is frequently bound to the peptide backbone in a-helices. Here, it forms quite strong "good" hydrogen bonds. Thus, when this water is removed one should expect structural changes driven by the unsatisfied hydrogen bonding potential. It is therefore not surprising that all proteins with a high a-helix content are quite susceptible to dehydration-induced structural changes. For b-sheets the authors found less bound water forming weaker hydrogen bonds with the protein backbone. Thus, proteins containing mainly b-sheet secondary structure should be less susceptible towards dehydration-induced structural alterations. Again, this is in agreement with the experimental results. For example, immunoglobulin G, a protein with a b-sheet content of more than 60% and no a-helix secondary structure, is thus far the only protein reported to be completely stable towards dehydration [27].
Prevention of lyophilization-induced In the following we present how FTIR spectroscopy can be utilized to quickly select the appropriate additive (also called excipient) to prevent deleterious structural perturbations as the result of dehydration. Such structural changes are frequently the first step leading to the solid-phase aggregation of proteins [23]. Such solid-phase aggregation of proteins can follow covalent (e.g., formation of intermolecular disulfide bonds) or non-covalent pathways. Often, solid-phase aggregation is caused by the sorption of moisture from air and is accordingly called moisture-induced aggregation of proteins [22,64]. In the following we will limit ourselves to covalent aggregate formation by thiol-disulfide interchange. It has been suggested that the first step leading to this detrimental phenomenon involves protein structural perturbations. As the result, disulfide bonds that are buried in the interior of the native protein, become exposed. The sorption of small amounts of moisture as the next step increases the molecular mobility of the dehydrated protein, but does not cause refolding. The exposed and mobile disulfide groups can then react with free thiol groups of cystein residues creating intermolecular disulfide bonds. However, solid-phase aggregation of proteins can also occur in the absence of moisture in closed vessels, e.g., for recombinant human growth hormone [29]. Prevention of the phenomenon can be achieved by co-lyophilization of the proteins with various excipients, including polyol lyoprotectants [17,18, 28,29,79]. Using FTIR spectroscopy it has been shown that there are likely two mechanisms responsible for the prevention of solid-phase aggregation: structural preservation and increased protein-protein distance in the lyophilized powders. Let us first consider how FTIR spectroscopy can easily be employed to identify efficient lyoprotectants that ameliorate protein structural perturbations. As already mentioned earlier, FTIR spectroscopy can be utilized to screen for the influence of the mode of dehydration on the structure of proteins. Similarly, this method can be employed to screen for efficient additives in co-lyophilization experiments. Herein, an excipient is added to the aqueous protein solution prior to its lyophilization. This solution is then lyophilized under standard conditions. The different co-lyophilizates are then pressed into KBr-pellets and FTIR spectra obtained. For many excipients their contribution to the collected FTIR spectrum in the amide I spectral region is insignificant. Frequently, they lack IR bands in this spectral region completely. Therefore, the amide I can often be employed without subtracting the spectrum of the additive. In contrast, the amide III band can frequently not be used to analyze the protein structure in the presence of additives. Many additives exhibit significant IR bands in the amide III region. Removal of the additive contribution from the measured spectrum by subtraction is often impossible because the IR spectrum of many additives is different when co-lyophilized with the protein or lyophilized alone. Therefore, typically only the amide I protein spectra are quantitatively analyzed in such applications. Overall structural differences between the protein in aqueous solution and in the various co-lyophilizates can be quantified by calculation of the correlation coefficient [72,73] or the area overlap [54]. Efficiency of the excipients in co-lyophilization experiments can also be judged excellently by determining the a-helix content of the protein in the various co-lyophilizates [17,18,29,43]. It has been suggested that the a-helix content is a more reliable structural parameter in such investigations. It is possible, that the b-sheet content is inflated by protein-protein contacts in dehydrated protein powders due to the removal of the bulk solvent water upon dehydration [17,18,29,45 ]. Using above quantitative methods, it has been established that lyoprotectants can prevent dehydration-induced protein structural perturbations. Prestrelski et al. [72,73] co-lyophilized various proteins belonging to different structural classes with numerous established lyoprotectants, such as monosaccharides, disaccharides and polyols (e.g., mannitol). The correlation coefficient r revealed that in particular the disaccharide sucrose was efficient in preventing lyophilization-induced structural perturbations. The correlation coefficient r calculated using the native structure as reference, was much closer to 1 for all proteins in the presence of sucrose. However, sucrose (and the other lyoprotectants used) was of varying efficiency for the different proteins. This indicates that specific lyoprotectant-protein interactions may be important in the prevention of structural alterations caused by the dehydration procedure. In addition, there was a clear correlation between prevention of lyophilization-induced structural perturbations and irreversible structural changes in the re-hydrated protein. Similarly, Costantino et al. [29] found that lyophilization-induced structural
perturbations of recombinant human growth hormone (rhGH) were prevented to a varying
extent by saccharide and polyol lyoprotectants. In addition, by employing varying ratios
of lyoprotectant:protein in combination with secondary structure determination by Gaussian
curve-fitting, this work also shed some light onto the mechanism of the prevention of the
solid-state aggregation of rhGH under accelerated storage conditions (exposure of the
protein to 50oC). Lyophilization without additives resulted in pronounced
structural changes for this protein (a decrease from 57% to 29% in the a-helix and increase from 7% to 24% in the b-sheet content). As already mentioned, the a-helix content is a straightforward structural
parameter. Indeed, it has been shown in this work that there is a correlation between
stabilization of the protein towards solid-state aggregation and structural preservation
(demonstrated by a high FTIR spectroscopy has recently been used to screen for efficient lyoprotectants for the
model protein bovine serum albumin [BSA]. Figure 10 shows the decrease in the
In addition, prevention of lyophilization-induced structural changes is
relevant for the area of the so-called non-aqueous enzymology. Enzymes can effectively be
used as catalysts in anhydrous organic solvents for synthetic reactions impossible in
water It has been established recently that important model enzymes employed in these investigations also undergo dehydration-induced structural perturbations [36, 44]. For example, the a-helix content of the serine protease subtilisin Carlsberg decreased from 34% in aqueous solution to 25% in the dry powder upon lyophilization from aqueous phosphate buffer. Suspension of the subtilisin powder in organic solvents did not lead to significant additional structural alterations [44]. In other words, the dehydration process and not the exposure to many organic solvents was what caused the structural perturbations. Such structural perturbations may lead to lower enzyme activity [55]. However, even more important, they are detrimental to some of the significant advantages enzymes have over conventional catalysts (e.g., excellent enantioselectivity for many reactions). Therefore, several additives were tested for their efficiency in preventing lyophilization-induced structural alterations in subtilisin Carlsberg, including trehalose (a disaccharide) and macrocyclic compounds (e.g., methyl-b-cyclodextrin). As the result, we were able to obtain structurally improved enzyme formulations where lyophilization-induced structural alterations were largely prevented. In case of methyl- b-cyclodextrin as the additive, suspension of the enzyme powder in selected organic solvents did not result in additional structural perturbations. In consequence, these structurally improved enzyme formulations had an improved enantioselectivity for a model reaction [K. Griebenow,Y. Diaz-Laureano, A. M. Santos, I. Montanez-Clemente, L. Rodriguez and G. Barletta, submitted to JACS]. This example also highlights the importance and applicability of FTIR spectroscopy to important biotechnological applications. In light of the above section, it would be important to understand at the molecular level how lyoprotectants work. Then one could predict which one should be efficient. As already pointed out, however, the same lyoprotectant may work excellent with one protein, but insufficient with another one. It is evident from this that it may not be possible to develop one general theory explaining all observations. In line with this unfortunate statement is also the literature: While in one work lyoprotectant-protein hydrogen bonds have been favored to be important for an efficient lyoprotectant [15], another spectroscopic work came to the conclusion that the spectroscopic results were inconclusive [9]. Factors believed to be important for an efficient additive now include a high glass-transition temperature, formation of protein-lyoprotectant hydrogen bonds (water substitution), water concentration around the protein, prevention of phase separation, and more. New experimental strategies are certainly required to shed more light onto this area of research, including experiments on how these lyoprotectants interact with each other in solids without proteins. Acknowledgements The authors acknowledge support by NIH-MBRS (S06 GM08102-26S1), NSF-EPSCoR (OSR-9452893), and the University of Puerto Rico (FIPI). We would also express our gratitude to Dr. Reinhard Schweitzer-Stenner (University of Bremen, Germany) for a critical evaluation of parts of this review. References
Received December 1998, received in revised format January
15th 1999. Accepted 15th January 1999
|
||||||||||||||||||||||||||||||||||||||||||||||||