Professional Documents
Culture Documents
x
2011 Blackwell Verlag GmbH
wileyonlinelibrary.com
458 V . L e b o t , A . N d i a y e and R . M a l a p a
Santo Island, Vanuatu. Nine local varieties from different islands of for the measurement of all spectra over the wavelength range of 350–
Vanuatu and twelve introduced varieties originating from six different 2500 nm. On average, 6 g of homogenized flour was placed in an
countries were used as parents in polycross plots. Breeding lines were individual cell and compacted with a tea spoon to eliminate air voids
selected from populations obtained via open-pollination in different within the sample. Each spectrum was obtained by averaging three
cycles of an ongoing recurrent selection programme. Accessions were different cells (repetitions) per sample with 25 scans for each. A
chosen to represent the greatest morphological variation. All acces- reference reading (baseline) was taken when starting a session and
sions were planted at the same time in a common plot and were another every 30 min. All of the spectra were recorded in diffuse
harvested when mature, 3–4 months later. Their morphological reflectance as log(1/R) with respect to a LabsphereÕs Spectralon
characterization involved 14 standardized descriptors and their coded material reflectance standard (Labsphere Inc., North Sutton, NH,
modalities. These included vine pigmentation, petiole pigmentation, USA) that is a lambertian reflective PTFE (thermoplastic resin) with
mature leaf colour, immature leaf colour, abaxial leaf vein pigmenta- high overall reflectance. For each sample, corresponding to individual
tion, mature leaf shape, general outline of the leaf, leaf lobes type, accession, three sub-samples were scanned 25 times each and then
storage root shape, storage root predominant skin colour, secondary averaged. The resulting averaged spectrum was recorded for the
skin colour, predominant flesh colour, secondary flesh colour, distri- accession. Overall, 240 spectra were recorded and converted to
bution of secondary flesh colour (CIP, AVRDC, IBPGR 1991). absorbance using the INDICOTM software (ASD Inc.). To assess the
performance of the calibration, samples were separated in two sets: the
Sample preparation: For each of the 240 accessions, one full storage calibration and the prediction set. The prediction set was created by
root was peeled and cut. Approximately 0.5–1 kg of fresh weight was randomly selecting 50 accession numbers (approximately 20% of total
manually sliced into chips and oven dried at 60C for 48 h. DM 240 acc.), and the calibration set contained 190 samples.
samples were split into two sub-samples: one sub-sample was used for
chemical analysis and the other for NIRS. Samples of 200 g were sent Data analysis: Morphological descriptions and major constituent
to France for chemical analyses. Samples of approximately 50 g of data were subjected to multivariate analysis using XLSTAT (version
dried chips were milled into flour just after oven drying, and dried 6.02, 2009). Multivariate analysis (Principal Component) of the spectra
chips were ground in a stainless kitchen steel mill (SEB, Ecully, was conducted with GRAMS/AITM (version 8.0). The spectra and
France) prior to NIRS analysis in Vanuatu. reference data were mathematically modelled using PLSPlus/IQ
The potential of using NIR spectral fingerprints to differentiate and spectroscopy software (Thermo Electron Corporation, Marietta, OH,
classify individual roots and varieties was tested. Twelve varieties USA). Using the values obtained with chemical analyses as the analyte
planted in the field (1 · 1 m) the same day were harvested together. value, a separate calibration was made for each of the five major
Three clones were selected at random within lines of ten plants per constituents. Calibration of residual moisture was not attempted
variety. One full storage root was collected on three clones. Roots were because spectra were recorded in Vanuatu, just after oven drying the
washed, peeled, cut into slices and separated in two sub-samples. One samples, while residual moisture was measured in France on hygro-
was oven dried, and the other was frozen in a normal kitchen freezer to scopic dry raw material. Partial least-squares (PLS) regression tech-
compare the two processes. Samples were then freeze dried overnight nique was used to develop a predictive model of the near-infrared part
in a freeze drier Christ-Alpha 1-2 LD Plus (Martin Christ, Osterode, of the spectra (1000–2500 nm). The optimum number of PLS factors
Germany), and the resulting DM samples were ground into a fine used for prediction was determined by full cross-validation and
powder for NIRS analysis and spectra comparison between oven-dried prediction residual error sum of squares (PRESS). Additionally, light
and freeze-dried samples. scattering effects caused by particle size differences were corrected by
multiplicative scatter correction (MSC). The data were mean-centred
and the average spectrum calculated from all of the calibration spectra
Chemical analyses: Major constituents (starch, sugars, cellulose, N
and then subtracted from every calibration spectrum.
and ash) were analysed by Laboratoire dÕAnalyses Agricoles Teyssier,
As part of the model process, a principal component analysis (PCA)
Bourdeaux, France, according to AFNOR (Association Française, the was used to check for gross spectral outliers. The Mahalanobis
French standards association) and/or EEC methods (AFNOR: http://
distance of each spectrum to the mean spectrum of the group was
www.boutique.afnor.org). Following Norme Française (NF) V 18-109
calculated, and the removal of outliers was based on distance H > 3
for DM determination, samples were dried again to remove residual
from the average spectrum of the file. Spectra and concentration
moisture (measured as % of total dry weight), and the powder was outliers were removed, and PLS was run again until the highest r2cv
analysed on an oven-dried air basis. Moisture was therefore expressed
(determination coefficient for cross-validation) corresponding to the
as a measurement of the sample prior to drying. All measurements
smallest standard error of cross-validation (SECV) were obtained. At
were then expressed in %DM, and the data were adjusted by the
that point, factor loadings were used to determine which wavelengths
residual moisture following oven drying. Starch was quantified using were important to correlate with concentrations to narrow down the
Ewers protocol (NF ISO 10-520) corresponding to hydrolysis in HCl,
spectral region. For starch and sugars, the regions used were 1000–
filtration and polarimetric measurement (specific rotation angle used:
2400 nm while for cellulose, proteins and minerals, the region was
185.7). Total sugars were quantified through the colorimetric method 1200–2400 nm. The PLS analysis was then conducted again on this
of Luff Schoorl (CEE 98\54\CE). Crude cellulose (total fibres) was
new region to obtain for each constituent, equations with higher
measured by Weende method (NF V 03-040), which corresponds to
explanation of the total variability in the calibration values without
non-soluble organic residue obtained by sulphuric acid and alkaline
increasing the number of PLS factors used.
treatments. N content was calculated using the Kjeldahl method (NF Statistical parameters used to evaluate models performances
V 18-100). Estimation of total minerals content was obtained from
included the standard error of calibration (SEC), the determination
ashes produced at 550C (NF V 18-101). All analyses were performed
coefficient for cross-validation (r2cv), the SECV, the determination
in duplicate with accepted mean coefficient of variation of ±3% for
coefficient for prediction (r2pred) and the standard error of prediction
starch, sugars, cellulose, and residual moisture and ±2% for proteins (SEP). SEC and SEP were calculated using Excel spreadsheet by
(equivalent N), and ashes (minerals).
squaring the differences of the actual minus the predicted concentra-
tions for each sample in the calibration (SEC) and test (SEP) sets.
NIRS measurements and data pretreatment: Dry matter samples were These values were then summed, and the sum was divided by the
milled into flour, and granule size was homogenized using four sieves number of samples (n). The square root of this value was used for SEC
with decreasing diameters until granules passed through the 106 lm and SEP. SEC describes the calibration set, and SEP describes the test
sieve. An ASD LabSpecPro spectrophotometer from Analytical set composed of 50 samples not included in the calibration set. The
Spectral Devices Inc. (ASD Inc., Boulder, CO, USA) fitted with a ratio of performance to deviation (RPD = SD/SECV) was also used
ÔmuglightÕ or high intensity source probe (HISP) (ASD Inc.) was used to evaluate performances of the models (with SD as the standard
Phenotypic characterization of sweet potato 459
Results
0.5
Important variation was observed for all morphological traits.
Highest correlation coefficients were observed between vine
pigmentation and leaf petiole pigmentation (r = 0.783), gen- 0.25
F2 (21.74 %)
eral leaf outline and leaf lobes type (0.781), mature leaf shape Minerals
and leaf lobes type (0.693), abaxial leaf vein pigmentation and 0
immature leaf colour (0.501) and secondary flesh colour and Sugars Starch
3 2
138 55
209
53
154
2 27
127 128
34 95 161 194
126
147
79 3
97 36
33 185
89
188 37 193 21
141 181 129
1 5
32
155
143 35
150 190149
106
192 191 29
52206
115 197 31 96 22 73
174 145
118 117
102 151 76 20
157 142 1 136 43 92
6 14838
225
F2 (21.74 %)
80 68
61
186 87203 23 15
116 180 182
120 152 50105
134 459 130
39 183187
146 137 25 59
159 99 184
179 177
109 104 49195 16
75
207 164 229156 101
7416014 83
100 111 122
121 86 4 12113 51
114
110 77163 88 72
0 131 48
153 226 158 108
135 81 63
132 94
175 57 19 62 7
178
237 46 144 166 172 44
54 233 123 8465 140 91
169 112 235 66 47 13 202
56 133 78 17
67 205 204 71
168 170 107 98 10 189
11 60 70
239 119 139 176
64
199 227 125 208 85
234 40 165 69
240
–1 167
211
124
213 162
58 18
82
236
217 218 8
24216 21923142
238 214
103 212 210
222 41220
23030 93
215
223 228
200
–2 221
201
28 198
224
171 196 90
173
26
–3 232
–5 –4 –3 –2 –1 0 1 2 3 4
F1 (44.61 %)
Fig. 2: Principal component analysis of 240 accessions analysed for five major constituents (starch, sugars, cellulose, proteins and minerals)
Table 3: Statistical parameters of the calibration, validation and new calibration sets1
Validation
Calibration (n = 190) (n = 50) New calibration (n = 240)
Starch 68.6 6.6 2.05 12 12 0.82 2.37 1.48 0.71 3.13 2.11 8 0.82 2.49 2.27 2.65
Sugars 10.7 5.3 0.32 8 11 0.91 1.55 1.20 0.82 2.31 2.29 10 0.86 1.77 1.77 2.75
Cellulose 4.1 1.4 0.12 6 3 0.21 1.26 – – – – 6 0.46 1.26 1.17 1.58
Proteins 5.8 1.3 0.12 12 9 0.89 0.39 0.34 0.87 0.44 2.93 10 0.91 0.36 0.30 3.47
Minerals 3.5 0.8 0.07 13 10 0.74 0.32 0.27 0.56 0.73 1.1 9 0.72 0.38 0.10 2.34
1
Wavelengths region for starch and sugars is 1000–2400 nm and 1200–2400 nm for cellulose, proteins and minerals.
SEC, standard error of calibration; SECV, smallest standard error of cross-validation; SEP, standard error of prediction.
(2.37) and SEC (1.48) values are not close enough to indicate SECV, SEC and SEP values (0.39, 0.34 and 0.44, respectively)
robust fitting. When SECV and SEP values differ significantly, and a high r2pred of 0.87 indicating good and robust prediction
this could be an indication that too many samples were with 87% of confidence. The RPD value above 2.5 also
removed during the modelling process. However, the SEP indicates a very good potential of prediction. Deviations of
(3.13) is not too distant, and the r2pred of 0.71 indicates an single samples are visualized in a scatter plot between
acceptable estimation of the equation accuracy on the valida- measured and predicted proteins values (Fig. 5). Minerals
tion samples. Deviations of single samples are visualized in a have a poor relationship with NIRS and could be predicted
scatter plot between measured and predicted starch values of with only 56% of confidence and a very low RPD value (1.1).
the 50 acc. in the test set (no outliers were removed) (Fig. 3). In Cellulose could not be satisfactorily predicted, and a very poor
terms of predictive performance, the equations for starch could r2cv (0.21) was obtained, limiting further calculations.
be considered as good with RPD parameters above 2. Some The r2pred values of starch, sugars and proteins are high
authors, however, claim that a RPD value of at least 3 is enough to allow good estimates of their contents. However,
necessary for efficient NIR reflectance predictions (Williams determination coefficients for the prediction sets (r2pred) cannot
and Norris 2001). The total sugars model presents similar reflect the whole situation because the range of the 50
SECV (1.55) and SEC (1.20) values, but the SEP is much accessions values in the prediction test set affects the coefficient
higher (2.31) although the r2pred is of 0.82. The RPD value value. SEP is, therefore, a better overall indicator. RPD
above 2 indicates, however, a good predictive potential for this between 2 and 2.5 for the starch and sugars models allow
equation. Deviations of single samples are visualized in a approximate quantitative predictions to be made. Values
scatter plot between measured and predicted sugars values above 2.5 for proteins are considered to be good models.
(Fig. 4). Proteins (measured as N equivalent) produce similar The number of terms is relatively low if we consider a general
Phenotypic characterization of sweet potato 461
75
2.271, respectively), sugars (1.77 and 1.769) and proteins (0.36
and 0.30). The new model for proteins presented a RPD value
above 3, and such value indicates a very good predictive model
70
(Williams and Sobering 1993). RPD values for starch and
sugars were also above 2.5. These models will need to be
65
further tested on independent samples but present good
potential. Also, the selection of the first calibration set was
60 carried out by removing at random 50 samples for the test set.
The first calibration set has not covered the whole variation of
55 the data set. The second set gave better results, indicating that
50 55 60 65 70 75 80 85
a better sample selection might be helpful by selecting, for
Reference starch
example, on constituent concentration rather than a random
selection of numbers.
Fig. 3: Starch prediction comparison on independent test set (50 acc.)
Discussion
The correlation coefficients between morphological traits
25 R2 = 0.82 obtained in our study are similar to those reported from 310
sweet potato accessions in Argentina (Manifesto et al. 2010).
20 These results confirm that there are no significant correlations
NIR predicted sugars
When comparing the performance of the new calibration CIP, AVRDC, IBPGR, 1991: Descriptors for Sweet Potato, Interna-
models (with n = 240 acc.), the SECV values were, in most tional Board for Plant Genetic Resources, Rome, Italy.
cases, close enough to the SEP indicating fairly robust fitting, Devaux, M. F., D. Bertrand, and G. Martin, 1986: Discrimination of
and RPD values were also improved. Determination coeffi- bread-baking quality of wheats according to their variety by near-
infrared reflectance spectroscopy. Cereal Chem. 63, 151—154.
cients (r2pred) generally improve as the working range
Edelmann, A., J. Diewok, K. C. Schuster, and B. Lendl, 2001: Rapid
increases. Consequently, if more range is added in the same method for the discrimination of red wine cultivars based on mid-
model, then it could improve coefficient values. Additionally, infrared spectroscopy of phenolic wine extracts. J. Agric. Food
when different samples are added, a larger spectral diversity is Chem. 49, 1139—1145.
described, and therefore, some samples might actually be FAO, 2009: Annual Production Statistics, www.fao.org.
better spectrally described as the number of samples increases. Gibson, R. W., E. Byamukama, I. Mpembe, J. Kayongo, and R. O. M.
The protein content calibration is particularly interesting as Mwanga, 2008: Working with farmer groups in Uganda to develop
it can be further improved. Protein content is usually estimated new sweet potato cultivars: decentralisation and building on
by multiplying the total N content by a standard conversion traditional approaches. Euphytica 159, 217—228.
factor of 6.25. However, the nitrogen to protein ratio does vary Grüneberg, W. J., K. Manrique, D. Zhang, and M. Hermann, 2005:
Genotype x environment interactions for a diverse set of sweetpo-
according to the species considered and change with amino
tato clones evaluated across varying ecogeographic conditions in
acid content and mineral nitrogen and non-protein nitrogen. Peru. Crop Sci. 45, 2160—2171.
For the present study, we decided to present our results Haase, N. U., 2006: Rapid estimation of potato tuber quality by near-
measured as total N as proteins. In the future, it would be infrared spectroscopy. Starch/Stärke 58, 268—273.
interest to improve the calibration models on the real protein Hicks, C., M. R. Tuinstral, J. F. Pedersen, F. E. Dowell, and K. D.
content that vary according to amino acids. Kofoid, 2002: Genetic analysis of feed quality and seed weight of
In most sweet potato breeding programmes, mass selection sorghum inbred lines and hybrids using analytical methods and
results in the rapid accumulation of suitable genes but has to NIRS. Euphytica, 127, 31—40.
be complemented with efficient screening techniques. For Jiang, H. Y., Y. J. Zhu, L. M. Wei, J. R. Dai, T. M. Song, Y. L. Yan,
many traits, their accurate measurement is often the weakest and S. J. Chen, 2007: Analysis of protein, starch and oil content of
single intact kernels by near infrared reflectance spectroscopy
operation. A common difficulty is to satisfy the requirements
(NIRS) in maize (Zea mays L.). 2007. Plant Breed. 126, 492—497.
of both the industry and the fresh market. Obviously, NIRS Kim, S. W., S. R. Min, J. Kim, S. K. Park, T. I. Kim, and J. R. Liu,
could assist breeders in their choice and selection of the best 2009: Rapid discrimination of commercial strawberry cultivars using
genotypes, based on the chemical composition requested by Fourier transform infrared spectroscopy data combined by multi-
the market (high starch, sugars or protein contents). NIRS variate analysis. Plant Biotechnol. Rep. 3, 87—93.
could also be used to characterize the thousands of germplasm Komaki, K., K. Katayama, and S. Tamiya, 1998: Advancement of
accessions existing in the world and to classify them in sweet potato breeding for high starch content in Japan. Trop. Agric.
chemotypes relevant to breeding objectives. For example, for 75, 220—223.
starch extraction, fresh table consumption, desert types or for Loebenstein, G., and G. Thottappilly (eds), 2009: The Sweetpotato.
animal feed. Such physico-chemical characterization would be Springer Science+Business Media, USA, 522.
Lu, G., H. Huang, and D. Zhang, 2006: Application of near-infrared
useful to breeders. The models developed in the present study
spectroscopy to predict sweetpotato starch thermal properties and
show good accuracy, but it remains to be seen whether larger noodle quality. J. Zhejiang Univ. Sci. 7, 475—481.
sample sets will improve them to enable more precise predic- Manifesto, M. M., S. M. Costa Tàrtara, C. M. Arizio, M. A. Alvarez,
tion. Further work should concentrate on validating the results and N. R. Hompanera, 2010: Analysis of the morphological attributes
over different years. Other constituents such as amylose, of a sweetpotato collection. Ann. Appl. Biol. 157, 273—281.
carotenoids or individual sugars should be investigated as well. Munck, L., and B. Møller, 2005: Principal component analysis of near
infrared spectra as a tool of endosperm mutant characterization and
in barley breeding for quality. Czech. J. Gen. Plant Breed. 41,
Acknowledgements 89—95.
This research was financially supported by the Fond Français pour le Munck, L., B. Møller Jespersen, A. Rinnan, A. Fast Seefeldt, M.
Pacifique and represents a part of the Sweet Potato Project funded by Møller Engelsen, L. Nørgaard, and S. Balling Engelsen, 2010: A
French Aid to Vanuatu Ministry of Agriculture. physiochemical theory on the applicability of soft mathematical
models—experimentally interpreted. J. Chemom. 24, 481—485.
Rakszegi, M., D. Boros, C. Kuti, L. Làang, Z. Bed o, and P. R.
References Shewry, 2008: Composition and end-use quality of 150 wheat lines
Amar, S., H. C. Becker, and C. Möllers, 2009: Genetic variation in selected for the HEALTHGRAIN diversity screen. J. Agric. Food.
phytosterol content in winter rapeseed (Brassica napus L.) and Chem., 56, 9750—9757.
development of NIRS calibration equations. Plant Breed. 128, Skinner, D. Z., G. R. Bauchan, G. Auricht, and G. Hughes, 1999:
78—83. A method for the efficient management and utilization of large
Berardo, N., G. Mazzinelli, P. Valoti, P. Laganà, and R. Redaelli, germplasm collections. Crop Sci. 39, 1237—1242.
2009: Characterization of maize germplasm for the chemical Wang, H., 1982: The breeding of sweet potatoes for human consump-
composition of the grain. J. Agric. Food. Chem. 57, 2378—2384. tion. In R. L. Villareal, and T. D. Griggs (eds), Sweet Potato,
Bonierbale, M., W. Grüneberg, W. Amoros, G. Burgos, E. Salas, E. 297—312. Proceedings of the First Symposium, AVRDC, Taiwan.
Porras, and T. Zum Felde, 2009: Total and individual carotenoid Williams, P., 2003: Near-Infrared Technology: Getting the Best out of
profiles of Solanum phureja cultivated potatoes : II. Development Light. A short course in the practical implementation of near-
and application of near-infrared reflectance spectroscopy (NIRS) infrared spectroscopy for the user, vol. March 2003. PDK Projects,
calibrations for germplasm characterization. J. Food Compost. Inc., Nanaimo, Canada.
Anal. 22, 509—516. Williams, P. C., and K. H. Norris (eds), 2001: Near-Infrared
Bradbury, J. H., and W. D. Holloway, 1988: Chemistry of Tropical Technology in the Agricultural and Food Industries, 2nd edn.
Root Crops: Significance for Nutrition and Agriculture in the AACC, St Paul, MN, USA.
Pacific. ACIAR Monograph No. 6, Canberra.
Phenotypic characterization of sweet potato 463
Williams, P. C., and D. C. Sobering, 1993: Comparison of commer- Fuglie, and M. Hermann (eds), Sweetpotato Post-Harvest Research
cial near infrared transmittance and reflectance instruments for and Development in China, 160. CIP, ESAP Region, Bogor,
analysis of whole grains and seeds. J. Near Infrared Spectrosc. 1, Indonesia.
25—32. Zum Felde, T., A. Baumert, D. Strack, H. Becker, and C. Möllers,
Woolfe, J. A., 1992: Sweet Potato, an Untapped Food Resource. 2007: Genetic variation for sinapate ester content in winter rapeseed
Cambridge University Press, Cambridge, UK, 643. (Brassica napus L.) and development of NIRS calibration. Plant
Zhang, D. P., and X. Q. Li, 2004: Sweetpotato as animal feed: the Breed. 126, 281—296.
perspective of crop improvement for nutrition quality. In: K. O.