Qualitative and Quantitative Control of Honeys Using NMR Spectroscopy and Chemometrics

1 Chemisches und Veterinäruntersuchungsamt (CVUA) Freiburg, Bissierstraße 5, 79114 Freiburg, Germany 2 Chemisches und Veterinäruntersuchungsamt (CVUA) Karlsruhe, Weissenburger Straße 3, 76187 Karlsruhe, Germany 3 Bruker BioSpin GmbH, Silberstreifen, 76287 Rheinstetten, Germany 4Department of Chemistry, Saratov State University, Astrakhanskaya Street 83, Saratov 410012, Russia 5Ministry of Rural Affairs and Consumer Protection, Kernerplatz 10, 70182 Stuttgart, Germany


Introduction
Honey is a natural, sweet, and syrupy fluid collected by bees from nectar of flowers [1].The taste and aroma of this liquid vary according to its floral origin, geographical and seasonal conditions [1].The large number of melliferous sources gives therefore the opportunity to produce many characteristical unifloral and a high number of polyfloral nectar honeys.
Each honey is unique on the basis of chemistry, amount, and combination of the various components that give each honey a unique and individual organoleptic character.The control and characterization of quality and botanical origin of unifloral honeys are of great importance and interest in apiculture.Today the most important techniques to determine or certify the unifloral origin of honeys are the melissopalynological analysis and the evaluation of organoleptic characteristics [2].Current quality assessment of honey by these methods are time-consuming and often operator dependent.Moreover, some types of adulterations (e.g., the addition of sugar concentrate to honey) can hardly be detected with such methods [3].
Nuclear magnetic resonance (NMR) spectroscopy has been also used to assess the botanical origin of honey and quantify some major compounds in it [26][27][28][29][30].It was shown that NMR has a good potential to become a useful quality control tool in the analysis of honey samples.However, the number of floral honey types and the total number of investigated samples have been insufficient to construct a good discrimination model for routine analysis.Targeted quantitative NMR analysis was limited to major carbohydrates and amino acids [27,31].Therefore, this paper further advances the investigation of a combined NMR spectroscopy ( 1 H and 13 C NMR) and chemometric data analysis approach to distinguish the botanical origin of honey.We also explored the potential of high-resolution 1 H NMR to allow the identification and the quantification of 13 selected components in honeys.

Sample Preparation and Calibration.
The water content was obtained for each honey before NMR measurement using the German reference refractometric method [32].The equivalent of 200 mg water-free honey (about 240 mg) was weighted and combined with 300 L of NMR buffer (see above), 700 L of distilled water and 100 L of an internal standard (D 2 O containing 0.1% of TSP (sodium salt of 3(trimethylsilyl)-propionate acid-d4)).Stock solutions were prepared by mixing of about 20 mg of a pure substance in 300 L of NMR buffer, 700 L of distilled water, and 100 L of an internal standard.For neutralization of organic acids, 1-2 L of 1 M NaOH were added to solutions because the buffer capacity of our NMR buffer was not otherwise sufficient to maintain a constant pH of 4.5 for these standard solutions.By diluting the stock solutions, several calibration standards were further prepared.600 L of the final solution were poured into an NMR tube for direct measurement.For quantification, linear calibration curves were constructed from the standards by integrating the specific resonances for each compound against TSP as an intensity reference. 13C NMR Measurements at 400 MHz.All NMR measurements were performed on a Bruker Avance 400 Ultrashield spectrometer (Bruker BioSpin, Rheinstetten, Germany) equipped with a 5 mm SEI probe with Z-gradient coils, using a Bruker Automatic Sample Changer (B-ACS 120). 1 H NMR spectra were acquired at 300.0 K without sample rotation.64 scans and 4 prior dummy scans of 65 k points were acquired with a spectral width of 19.9914 ppm, a receiver gain of 22.6, and an acquisition time of 4.096 s.Water suppression was achieved using the NOESY-presaturation pulse sequence (Bruker 1D noesygppr1d pulse sequence) with irradiation at the water frequency (1890.60Hz) during the recycle and mixing time delays. 13C NMR spectra were acquired using a Bruker zgpg30 pulse sequence with 1024 scans and 4 prior dummy scans.The sweep width was 238.9 ppm, the time domain of the FID was 66 k, receiver gain of 2050, and an acquisition time of 1.38 s.The data were acquired automatically under the control of ICON-NMR (Bruker BioSpin, Rheinstetten, Germany), requiring about 91 min per sample (for both 1 H and 13 C NMR).All NMR spectra were phased, baseline-corrected, and calibrated by the TSP signal at 0.0 ppm.

NMR Spectra Preprocessing and Chemometrics.
Multivariate data analysis was performed using Unscrambler X version 10.0.1 (CAMO Software AS, Oslo, Norway) and Amix version 3.9.4(Bruker BioSpin, Rheinstetten, Germany).First, to cope with small variations in pH or other sample conditions such as ionic strength or temperature, simple rectangular bucket tables were obtained from the complete sets of 1 H and 13 C NMR spectra.In both cases, scaling to total intensity was used.Further details on the bucketing process of NMR spectra for multivariate data analysis were previously described [33].Before multivariate analysis, all data were mean centered.In the context of this study, principal component analysis (PCA) was used for visualization and as a tool for a differentiation between different honey types.During PCA, several new axes instead of old variables (buckets) called principal components (PC) are calculated and each NMR spectrum is projected on the selected PCs resulting in the scatter plot.We tested several spectral regions for calculation:  0-3 ppm,  3-6 ppm,  6-10 ppm, and  0-10 ppm for 1 H NMR and  0-45 ppm,  45-135 ppm,  135-200 ppm, and  0-200 ppm for 13 C NMR.In cases when the whole spectral range was used, two preprocessing methods (scaling to unit variance and Pareto scaling) [34] as well as no scaling were tested for each data set in order to eliminate the magnitude effect of intensity variations in the  0-3 ppm and  6-10 ppm regions.The bucket width was 0.01 ppm in all cases.The technique of cross-validation was applied to determine the optimal number of principal components (PCs) required to obtain robust models.Kruskal-Wallis one-way analysis of variance, Shapiro-Wilk test, and Welch's t-test methods were used to analyze loadings plots in order to find out the most important buckets for differentiation.After the construction of the models to evaluate the classification performance, soft independent modeling of class analogy (SIMCA) and partial least squares-discriminant analysis (PLS-DA) classification methods were tested on randomly chosen test-set samples that were not included in the classification models.

Results and Discussion
3.1.Nontargeted Multivariate Analysis.Figure 1 shows the complete 1 H NMR spectra of tilia (or linden), Robinia pseudoacacia (or acacia), and fir honeys.It can be seen that the mid-low-frequency region between  4.2 and 3.0 ppm is dominated with very intensive signals of the major monosaccharides (glucose and fructose) and disaccharides (maltose and sucrose).Other less intensive resonances are also observed in the  9.0-6.0ppm and  2.5-1.0 ppm regions in 1 H NMR spectra of honey.The 13 C NMR spectra of honey investigated in our study were similar to those obtained previously in D 2 O [35].Most of the 13 C NMR signals were related to anomeric carbons of reducing and nonreducing sugars and were present in the  105-60 ppm region in the majority of samples.Due to the high spectral complexity, differences between honey types cannot be obtained without  multivariate techniques.In general, the NMR spectra of our honey samples could be classified into two major groups: polyfloral samples (with floral and honeydew (forest flower honeys) as subgroups) and unifloral honeys (such as rape, tilia, chestnut, and others).
The PCA score plots generated using PC3-PC4 ( 1 H NMR) and PC1-PC3 ( 13 C NMR) to visualize the separation of the polyfloral honeys are shown in Figure 2, which clearly suggests that the samples can be separated into two groups: honeydew honeys clusters are in the region of positive PC3 ( 1 H NMR) and negative PC1 ( 13 C NMR), respectively; floral honey samples are located in the negative values of PC3 ( 1 H NMR) and positive PC1 ( 13 C NMR) values.Furthermore, Figure 3 suggests that we could not only differentiate the two main polyfloral classes of honey but clusters from several unifloral honeys were also clearly separated from each other.It should be noted that the 13 C NMR spectra provided inferior discrimination power as 1 H NMR spectra.For example, the PCA scores of rape and sunflower honeys or tilia, sunflower and Robinia pseudoacacia honeys were mixed in the same cluster (Figure 3(b)).With 1 H NMR spectra even minor differences in botanical composition can be traced (e.g., rape and rape/clover honeys are occurring in two separate clusters).On both scatter plots, honey samples from coniferous (spruce, fir, and pine trees) were clearly distinguished from the other honey types.
Loadings plots allow to specify the variables (chemical shifts), which are responsible for the observed clustering for both data sets ( 1 H and 13 C NMR).Table 1 lists the most important buckets (signals) for different honey types obtained from the loadings plots.It was found that the signals of glucose and fructose play the key factor for differentiation, and this finding is in accordance with another NMR study of honeys [35].However, resonances of minor compounds also play a certain role such as quinoline alkaloids and kynurenic acid for chestnut honey [36,37] or unsaturated carboxylic acids for tilia honey [37].Therefore, the 1 H NMR honey profile can be used for the identification of chemical markers of different botanical origin.
Next, it is interesting to show the predictive power of the chemometric methods by classifying new samples.To do this, two data analysis methods (SIMCA and PLS-DA) were evaluated for predicting class membership of honey samples from the 1 H NMR spectra.The independent test set for the floral/honeydew honey model (honeydew) consisted of 20 randomly selected objects (10 floral, 10 honeydew honeys).For the unifloral honey model, mountain ( = 2), rape ( = 3), coniferous ( = 2), Robinia pseudoacacia ( = 3), and chestnut ( = 1) honeys were selected for the test data set.The rest of the available 1 H NMR spectra were included in the calibration data set.All samples from both test sets were correctly recognized by SIMCA method at the 10% significance level.A prediction ability of 95% was obtained by PLS-DA for the honeydew/flower honey model.Thus, our results have shown that 1 H NMR coupled with multivariate statistics is an efficient tool for the classification of the different botanical origins of honey samples.

Quantification Studies.
Besides the classification of botanical origin of honey samples, it would be advantageous to establish a NMR method for the quantification of main constituents in the honey matrix.As first evaluation, if a quantitative approach is at all possible from the NMR spectra, we measured 34 commercially available compounds that may be present in honey.Then, the spectra of standards were compared to the spectra of honey samples.For most of the substances studied, direct quantification with integration is not possible due to extensive spectra overlap.As an example, the spectra of four carbohydrates are shown in Figure 4.
Clearly, a large number of overlapped signals for all isomeric forms of sugars exist.Thus, for such compounds more advanced techniques, such as multivariate regression or curve deconvolution, are required for quantification.Moreover, the two main carbohydrates-glucose and fructose-have much higher peak intensities than other compounds and, therefore, obscure the rest of the signals.
However, we were able to find 13 metabolites for which at least one resolved unambiguous resonance could be identified.Selected 1 H NMR peaks (i.e., signals not overlapped or interfered by matrix) corresponding to each substance are shown in Table 2.The high correlation coefficients ( > 0.99) obtained for each calibration graph indicate a good linear response within the concentration range studied for each compound.As an example, Figure 5 shows the NMR peaks of the main carbohydrate fructose and formic acid in authentic honey samples in comparison with two exemplary reference spectra.We applied the aforementioned procedure to the identification and direct quantification of the selected substances in authentic honeys of different floral types ( = 20) (Table 3).Only in two cases direct quantification of malic acid was not possible due to spectral interferences.

Conclusions
NMR spectroscopy has already been used in honey analysis to determine its botanical and geographical origin.In the paper of Lolli et al., 71 Italian honey samples (Robinia, chestnut, citrus, eucalyptus, and polyfloral) were analyzed by 1 H NMR and heteronuclear multiple bond correlation (HMBC) spectroscopy [35].PCA and general discriminant analysis (GDA) were not able to group samples according to their botanical origin by using 1 H NMR data.Only with the use of 2D 1H-13C HMBC acceptable clustering occurred [35].In another article by this research group, HMBC spectroscopy in combination with GDA was used to detect 10%, 20%, and 40% adulteration of authentic honey by commercial sugar syrups [38]. 1 H NMR spectroscopy and multivariate analysis techniques have also been used to classify honey into two geographical groups (non-Corsican and Corsican samples) [30].96.2% correct classification obtained by cross-validation was obtained for partial least squares-genetic programming (PLS-GP) algorithm.It should be also noted that the sitespecific natural isotopic fraction NMR (SNIF-NMR) was not found to be successful for the characterization of geographical and botanical origins of honey [26].However, to be used in practice, it would be necessary to extend the domain of a Not detectable (0.03 g/kg (phthalic acid and malic acid), 0.02 g/kg (HMF and fumaric acid), 0.18 g/kg (pyruvic acid), 0.16 g/kg (L(+)-rhamnose), 0.04 (citric acid), and 0.01 g/kg (succinic acid)).b Overlapped signal, direct quantification is not possible.
application of the method for other unifloral honeys and to expand the database.Our study, which is the largest evaluation of honey samples by NMR so far, provides such an opportunity.We can conclude that our models can be used as a method to determine and monitor the botanical origin of honey samples.With regard to quantification, NMR was only used for determining several saccharides with 13 C NMR [27] or methylglyoxal and amino acids with 1 H NMR [39] in honey matrices.Clearly, we expanded the range of substances that can be analyzed with NMR spectroscopy without preceding separation; 1 H NMR is also suitable for quantification of several aliphatic and aromatic acids as well as aldehydes.
In conclusion, it should be noted that honey is a very complex matrix endowed with very specific physicochemical properties.This complexity makes the analysis of honey difficult in terms of its different properties.Often the determination of botanical origin is complicated because of the incomplete correlation between analytical parameters: sensory properties and botanical identity.
Our investigation has shown that 1 H NMR spectra of honeys in combination with appropriate multivariate statistics can provide qualitative information about the botanical origin and represent a good basis for the identification of marker compounds for the specific honey types.Quantitative information about a number of major components is also available from the same spectra without need for chromatographic separation.In combination with multivariate data analysis, NMR spectroscopy possesses the speed, simplicity, and low cost per analysis required for a screening technique.

Figure 5 :
Figure 5: 400 MHz 1 H NMR spectra of fructose (a) and formic acid (b) in standard solutions and honey samples.

Table 1 :
The most important variables (buckets) for the differentiation of different botanical origins of honey (chemical shifts are given in ppm).

Table 2 :
NMR integration regions and investigated linear concentration ranges.

Table 3 :
Results of the quantitative determination of substances by NMR (values are given in g/kg honey).