Skip to main content
Data in Brief logoLink to Data in Brief
. 2020 Jan 31;29:105211. doi: 10.1016/j.dib.2020.105211

Near infrared spectroscopy (NIRS) data analysis for a rapid and simultaneous prediction of feed nutritive parameters

Samadi a,∗∗, Sitti Wajizah a, Agus Arip Munawar b,c,
PMCID: PMC7015983  PMID: 32071986

Abstract

Presented paper described dataset on near infrared spectroscopy (NIRS) used as a rapid and robust method to predict and determine several nutritive parameters of animal feed simultaneously. Near spectra data were acquired and recorded in wavelength range from 1000 to 2500 nm with co-added of 64 scans per sample measurement. On the other hand, actual reference nutritive parameters: in vitro organic matter digestibility (IVOMD), in vitro dry matter digestibility (IVDMD), neutral detergent fibre (NDF) and acid detergent fibre (ADF) of animal feed were measured using proximate laboratory procedures. Near infrared datasets can be enhanced using several spectra correction methods to improve prediction accuracy and robustness. Animal feed nutritive parameters can be determined simultaneously and rapidly by establishing prediction models by means of principal component regression (PCR), partial least squares regression (PLSR) and other regression approaches.

Keywords: NIRS, Feed, Nutritive value, Prediction, Spectroscopy


Specifications Table

Subject Agricultural and Biological Sciences
Animal sciences
Specific subject area Spectroscopy, non-destructive test for animal feed quality evaluation
Type of data Table
Graph
Spectroscopic data
How data were acquired Spectral datasets of animal feed samples were acquired using a benchtop Fourier transform infrared spectroscopy (Thermo Nicolet Antaris II TM). Spectra data were recorded in form of absorbance spectrum in wavelength range from 1000 to 2500 nm with co-added of 32 scans with resolution windows of 0.2 nm. On the other hand, to obtain actual reference data of neutral detergent fibre (NDF) and acid detergent fibre (ADF), standard laboratory procedures as proposed by Ref. [1] is employed. Feed samples were boiled in neutral and acid detergent solutions for an hour in sequential. NDF and ADF data were expressed in percent. Meanwhile, in vitro organic matter digestibility (IVOMD) and in vitro dry matter digestibility (IVDMD) were determined by subtracting residues of dry matter and organic matter prior to fermentation, respectively. The in vitro incubation was carried out in three runs and two serum bottles represented for each run. Those four nutritive parameters were performed in triplicate and averaged.
Data format Raw
Analysed
Enhanced
Presented as .xls and .unsb file formats
Parameters for data collection Nutritive parameters of animal feed samples were in vitro organic matter digestibility (IVOMD), in vitro dry matter digestibility (IVDMD), neutral detergent fibre (NDF) and acid detergent fibre (ADF).
Description of data collection Near infrared spectroscopic data in form of absorbance spectrum were used to predict four mentioned animal feed nutritive parameters (IVOMD, IVDMD, NDF and ADF) simultaneously. Prediction data of these nutritive attributes were obtained by establishing models through calibration. Principal component regression (PCR) and partial least square regression (PLSR) were applied as methods in collecting prediction values. Prediction data were then quantified by means of cross validation during calibration phase.
Data source location Data were collected at the Department of Animal Husbandry, Bogor Agricultural University and Department Agricultural Engineering, Syiah Kuala University, Banda Aceh – Indonesia.
Data accessibility Dataset are available on this article and can be found in Mendeley repository data: https://data.mendeley.com/datasets/r6fjtx943d/1
or https://doi.org/10.17632/r6fjtx943d.1
Value of the Data
  • Spectral dataset of animal feed samples can be used to predict nutritive parameters derived from calibration models. It provides a rapid, non-destructive and simultaneous approach to determine nutritive attributes of biological objects like animal feed in this case.

  • Data were benefited in animal feed industries for quality inspection of their feed products. This dataset can also be re-used to develop prediction models for other nutritive parameters like starch, protein, pH and others.

  • Spectral dataset can be enhanced using several data pre-processing approaches and transferred onto established NIRS instrument.

  • Prediction performances may vary, depends on spectra enhancement and regression approaches to be applied during calibration and prediction models development.

1. Data

Near infrared spectral dataset of feed samples were acquired and recorded as absorbance spectrum in wavelength range from 1000 to 2500 nm (Fig. 1). Typically, the near infrared spectra data can be represented as a function of the energies (cm−1) or wavelengths (nm) of the electromagnetic radiation. Spectra data contains chemical properties and information that can be revealed through calibration by means of regression approaches. Beside relevant and important information, spectra data may also contain irrelevant information background known as noises due to light scattering [2]. These noises can interfere prediction accuracy and robustness resulted during calibration. Thus, in order to eliminate or minimize noises, spectra data can be corrected and enhanced using several pre-processing techniques such as spectra smoothing, normalization, multiplicative scatter correction (MSC), standard normal variate (SNV), orthogonal signal correction (OSC), spectra derivatives, de-trending, and combination among them [3]. The selection of spectra pre-processing method must be followed by the knowledge of the sample characteristics, sample measurement protocol, radiation interaction, and the requirements of the analytical problem [4].

Fig. 1.

Fig. 1

Typical near infrared absorbance spectrum of feed sample.

Nutritive parameters such as NDF, ADF, pH, starch content, moisture content, IVOMD, IVDMD and others are buried in the spectra pattern along near infrared region. Specific wavelengths are corresponded to certain nutritive attributes from which represents the amount of light absorbed, reflected or transmitted. Light absorption and scattering of the NIR radiation are two main phenomena affecting the featured spectrum [5]. Spectra data can also be transformed into its derivative as presented in Fig. 2 to observe spectra feature more details and detect potential noises along near infrared wavelength region. Spectra derivatives is a good way to correct these distortions which commonly employed a Savitzky-Golay derivative algorithm.

Fig. 2.

Fig. 2

Near infrared spectrum after first derivative (a) and second derivative (b).

The main aim of NIRS application is to establish models used to predict and determine nutritive or quality attributes of studied samples. In last few decades, multiple linear regression (MLR) was firstly applied with a few original variables previously transformed and selected to carry the relevant and important chemical properties information. Nowadays, other regression approaches like principal component regression (PCR) and partial least squares regression (PLSR) are usually employed [6].

These two regression methods were fitted like a glove in the emerging field of NIRS till now. Both methods seek to find best correlation between NIRS spectra data and respective nutritive or quality parameters such as NDF, ADF, IVOMD, and IVDMD in this case. Prediction models were developed by regressing spectra data (X variable) and actual nutritive attributes obtained by standard laboratory procedures (Y variable). Predicted results were then compared to the actual one, in order to judge the prediction performance as shown in Fig. 3.

Fig. 3.

Fig. 3

Prediction performance of NDF determination (a) and ADF (b).

Prediction models can be developed directly either using raw (un-treated) spectra data or enhanced (treated) spectra [7]. As mentioned previously, it is necessary to correct and pre-process the near infrared spectra data in order to achieve more accurate and robust prediction results. Table 1 showed a comparison of prediction performance between raw and corrected spectra data for feed nutritive parameters determination.

Table 1.

Comparison of prediction performance between raw and corrected spectra data using principal component regression (PCR) approach.

Nutritive Parameter Spectra data Statistical indicators
R2 r RMSE RPD
NDF Raw 0.757 0.870 3.665 1.627
Corrected 0.830 0.911 3.098 1.925
ADF Raw 0.831 0.912 0.538 1.782
Corrected 0.867 0.931 0.486 1.972

ADF: acid detergent fibre, NDF: neutral detergent fibre, R2: coefficient of determination, r: correlation coefficient, RMSE: the root mean square error, RPD: residual predictive deviation.

Prediction performance may vary depends on which spectra pre-processing algorithms and regression approach to be used. As presented in Table 2, prediction performances were varied among different spectra correction methods were in this data analysis using standard normal variate (SNV), baseline shift correction (BSC), and de-trending (DT) generally improved when the models are constructed using corrected spectra data.

Table 2.

Comparison among different spectra correction methods to the prediction performance of nutritive parameters using partial least squares regression (PLSR) approach.

Nutritive parameters Spectra data Statistical indicators
R2 r RMSE RPD
IVOMD Raw 0.590 0.770 2.510 1.599
SNV 0.810 0.922 1.710 2.347
BSC 0.750 0.861 1.960 2.048
DT 0.690 0.832 2.180 1.841
IVDMD Raw 0.610 0.780 2.032 2.284
SNV 0.860 0.936 1.201 3.911
BSC 0.720 0.861 1.651 3.706
DT 0.720 0.852 1.712 3.362

BSC: baseline shift correction, DT: de-trending, IVOMD: in vitro organic matter digestibility (IVOMD), IVDMD: in vitro dry matter digestibility, R2: coefficient of determination, r: correlation coefficient, RMSE: the root mean square error, RPD: residual predictive deviation. SNV: standard normal variate.

2. Experimental design, materials, and methods

2.1. Instrument setup

Near infrared spectra data of feed samples were acquired using a benchtop NIR instrument (Thermo Nicolet Antaris II TM). The instrument was controlled and configured under integrated software Thermo Integration® and Thermo Operation®. Specified tasks were performed by establishing workflow using Thermo Integration software. High resolution measurement with integrating sphere was chosen as a method for spectra acquisition [8].

For each spectra measurement, sample labelling was required automatically prior to acquisition in order to distinct feed samples respectively. Spectra data were acquired and recorded as absorbance spectrum in wavelength range from 1000 to 2500 nm and saved in three different file formats: Nicolet (.spa), Jcamp (.jdx) and comma separated value (.csv). Standard laboratory methods were also prepared for four mentioned nutritive parameters of feed samples.

2.2. Spectra data acquisition

Near spectra data were firstly acquired by means of NIR instrument for all feed samples, made from eight sources of agro-industrial residues by products (sago residues, coconut meal, soybean-ketchup by product, coffee pulp, cacao pod, sago tree, corncob, and rice brand). Around 33 g of a bulk of feed samples were placed centrally upon sample holder. Each bulk sample was hand placed manually right to the incoming hole (1 cm of diameter) of the light source to ensure direct contact and minimize noises due to light scattering. Absorbance spectrum in wavelength range from 1000 to 2500 nm were acquired with co-added of 64 scans. Sample was set to rotate during spectra acquisition to ensure uniformity.

2.3. IVOMD, IVDMD, NDF and ADF measurements

Once after spectra data acquisitions were completed, feed samples were directly analysed to determine actual nutritive parameters. For NDF and ADF measurement, feed samples were boiled in neutral detergent and acid detergent solutions for 1 h in sequential. The NDF value was determined without using α-amylase and sodium sulfite, whilst for ADF, both of them were used. Determination of NDF and ADF were based on neutral detergent insoluble CP (NDICP) and acid detergent insoluble CP (ADICP) contents respectively and expressed as exclusive of residual ash [9].

Moreover, in vitro rumen fermentation was performed to determine IVDMD and IVOMD. Briefly, rumen fluid was collected in the morning before feeding through a rumen fistulated cow. Rumen fluid was filtered with four layers of gauze before using [10]. A 125 ml serum bottle was prepared to fill in 0.75 sample and add 75 ml buffered rumen fluid with the ratio of rumen fluid buffer (1.4 v/v). Incubation was taken place in a water bath with the temperature of 39 °C for 48 hours. After incubation, supernatant obtained was analysed to determine pH its residue was further incubated with 75 ml pepsin-HCl 0.2 N solution for another 48 hours [1,10]. Feed nutritive attributes: IVOMD and IVDMD were determined by subtracting residues of organic matter (OM) and dry matter (DM) from initial prior to fermentation respectively. The in vitro incubation was conducted in three replicates and two serum bottles represented for each replicate. Descriptive statistics of actual measured IVOMD, IVDMD, NDF and ADF are shown in Table 3.

Table 3.

Descriptive statistics of actual measured nutritive parameters of feed samples.

Statistical indicators Nutritive parameters
IVOMD IVDMD NDF ADF
Mean 56.50 54.14 27.36 17.48
Max 64.58 60.36 49.77 18.95
Min 50.34 48.56 22.10 15.30
Range 14.24 11.80 27.67 3.65
Std. Deviation 4.01 3.34 5.96 0.96
Variance 16.11 11.12 35.54 0.92
RMS 56.63 54.24 27.97 17.51
Skewness 0.73 0.40 2.72 -0.20
Kurtosis -0.34 -0.60 8.34 -0.48
Median 55.85 53.92 25.78 17.25
Q1 54.15 52.09 24.21 16.97
Q3 57.39 55.55 27.61 18.10

ADF: acid detergent fibre, IVOMD: in vitro organic matter digestibility, IVDMD: in vitro dry matter digestibility, NDF: neutral detergent fibre, Q1: first quartile, Q3: third quartile.

2.4. Sample outlier detection

In many practices, NIRS users sometimes faced with the problems in selecting the most representative sample datasets for calibration, splitting a large dataset into subsets, aiming at calibration and validation, or identifying and detecting samples that are somehow considerably different from the majority of the remaining samples from which known as outlier(s) [4,11]. These outliers can be found in the sample datasets used for model construction and validation, or arise among new samples during the use of those models for independent prediction.

There are several methods that can be used to detect and remove outliers, one of the most common method is the Hoteling statistics (or t2) ellipse, used to define statistical boundaries assuming a normal distribution of scores of principal component analysis (PCA). Typically, outliers are identified as samples found outside or beyond the ellipse confidence limit, usually established at the level of 95% [12,13]. Another recommended method for outlier detection is the use of the Mahalanobis distance (leverage) and the spectral residual to detect outliers in the raw spectra datasets.

Once after the outliers have been detected and identified by any kind of those mentioned methods, they should not be simply removed from the datasets, but the reasons why the outliers were present must be verified [14,15]. This can help to increase the knowledge about the data set and provide information on how to improve its quality to achieve better model performance.

2.5. Infrared spectra data corrections

Before performing prediction model development, it is necessary to pre-process and correct spectra data in order to achieve more accurate and robust prediction results. Several correction methods were available and can be employed based on sample characteristics, spectra impact and other related knowledge that must be recognized prior to spectra corrections. In this study, we employ three spectra correction methods namely baseline shift correction (BSC), standard normal variate (SNV) and de-trending (DT). Those three spectra corrections were then compared and see the impact on the prediction performance on nutritive parameters prediction. In NIRS practices, spectra corrections sometimes can be combined to generate a better prediction model.

2.6. Prediction models

The important core of NIRS practices and applications is to construct and develop models used to predict desired nutritive or quality attributes of studied samples. These quality attributes can be predicted rapidly and simultaneously through a process called as calibration, by regressing NIR spectra data (X variables) and actual measured nutritive attributes (Y variables). Ideally, the sample set employed in the regression stage must be representative of the present and of future prediction samples. It means that all expected sources of variability must be considered in both, the calibration and validation sample datasets.

In most common NIRS practices, partial least squares regression (PLSR) is one of the most widely used as regression method in constructing prediction models. The PLSR method continue to be the workhorse for regression in NIRS applications. The original PLSR is a linear method which assuming a linear relationship of the modelled nutritive parameters or concentrations as a function of the infrared spectral variations. Weak nonlinearities may be solved by increasing the number of latent variables (LVs) included in the PLSR model [12,16]. Another user preference for creating and developing prediction models in NIRS is principal component regression (PCR). It is a similar multivariate regression method works based on PCA and multiple linear regression (MLR).

The prediction performances were evaluated by means of these following statistical parameters: the coefficient of correlation (r) and determination (R2) between predicted and measured nutritive parameters or quality attributes, prediction error which is defined as the root mean square error (RMSE) and the residual predictive deviation (RPD), defined as the ratio between standard deviation (SD) of the population's actual value of IVOMD, IVDMD, NDF and ADF, and the RMSE of predicted nutritive parameters [1,6]. Based on literatures, good model in NIRS should have coefficient of r and R2 above 0.75 and RPD index above 2.5 respectively [4,17,18]. The higher value of RPD, the greater probability of models to predict desired nutritive parameters or chemical concentrations of samples dataset accurately and robustly [11,19].

Acknowledgments

We sincere acknowledge to the LPPM of Universitas Syiah Kuala in Banda Aceh, Indonesia for funding supports through Hibah Penelitian Professor (PP) scheme 2019.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.dib.2020.105211.

Contributor Information

Samadi, Email: samadi177@unsyiah.ac.id.

Agus Arip Munawar, Email: aamunawar@unsyiah.ac.id.

Conflict of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A. Supplementary data

The following is the Supplementary data to this article:

Multimedia component 1
mmc1.xml (999B, xml)

References

  • 1.Belanche A., Weisbjerg M.R., Allison G.G., Newbold C.J., Moorby J.M. Measurement of rumen dry matter and neutral detergent fiber degradability of feeds by Fourier-transform infrared spectroscopy. J. Dairy Sci. 2014;97(4):2361–2375. doi: 10.3168/jds.2013-7491. [DOI] [PubMed] [Google Scholar]
  • 2.Munawar A.A., von Hörsten D., Wegener J.K., Pawelzik E., Mörlein D. Rapid and non-destructive prediction of mango quality attributes using Fourier transform near infrared spectroscopy and chemometrics. Eng. Agric. Environ. Food. 2016;9(3) [Google Scholar]
  • 3.Munawar A.A., Hörsten D.V., Mörlein D., Pawelzik E., Wegener J.K. “Rapid and non-destructive prediction of mango sweetness and acidity using near infrared spectroscopy,” in Lecture Notes in Informatics (LNI) Proc. Ser. Ges. Inf. 2013:211. [Google Scholar]
  • 4.Pasquini C. “Near infrared spectroscopy: a mature analytical technique with new perspectives – a review. Anal. Chim. Acta. 2018;1026:8–36. doi: 10.1016/j.aca.2018.04.004. [DOI] [PubMed] [Google Scholar]
  • 5.Suci R.H., Zulfahrizal Z., Munawar A.A. Applying LIBS-QCL spectrum coupled with principal component analysis to distinguish Gayo Arabica and robusta coffee. Int. J. Sci. Technol. Res. Technol. Res. 2019;8(10):919–924. [Google Scholar]
  • 6.Munawar A.A., Kusumiyati, Wahyuni D. Near infrared spectroscopic data for rapid and simultaneous prediction of quality attributes in intact mango fruits. Data Br. 2019;27:104789. doi: 10.1016/j.dib.2019.104789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Syahrul S., Yunus Y., Satriyo P., Munawar A.A. Applying infrared reflectance spectroscopy to predict water quality in Aceh river. Int. J. Sci. Technol. Res. 2019;8(10):969–972. [Google Scholar]
  • 8.Sudarjat Kusumiyati, Hasanuddin, Munawar A.A. IOP Conference Series. Earth and Environmental Science; 2019. Rapid and non-destructive detection of insect infestations on intact mango by means of near infrared spectroscopy Rapid and non-destructive detection of insect infestations on intact mango by means of near infrared spectroscopy. [Google Scholar]
  • 9.Samadi S. Wajizah, Munawar A.A. Rapid and simultaneous determination of feed nutritive values by means of near infrared spectroscopy. Trop. Anim. Sci. J. 2018;41(2) [Google Scholar]
  • 10.Gallo A., Bruschi S., Masoero F. Technical note: evaluation of a novel enzymatic method to predict in situ undigested neutral detergent fiber of forages and nonforage fibrous feeds. J. Dairy Sci. 2019;102(7):6235–6241. doi: 10.3168/jds.2019-16279. [DOI] [PubMed] [Google Scholar]
  • 11.Yunus Y., Devianti D., Satriyo P., Munawar A.A. IOP Conference Series. Earth and Environmental Science; 2019. Rapid prediction of soil quality indices using near infrared spectroscopy rapid prediction of soil quality indices using near infrared spectroscopy. [Google Scholar]
  • 12.Saputri Y., Yusriana Y., Munawar A.A. IOP Conference Series: Earth And Environmental Science. 2019. Infrared spectroscopic features of turmeric powder. [Google Scholar]
  • 13.Munawar A.A., Syah H., Yusmanizar . IOP Conference Series. Earth and Environmental Science; 2019. Fast and robust quality assessment of honeys using near infrared spectroscopy fast and robust quality assessment of honeys using near infrared spectroscopy. [Google Scholar]
  • 14.Agustina R., Munawar A.A. IOP Conference Series. Earth and Environmental Science; 2019. “Electro-Optic properties of dried pliek U Powder : local ingredients from Aceh electro-optic properties of dried pliek U powder : local ingredients from Aceh. [Google Scholar]
  • 15.Munawar A.A., Devianti D., Satriyo P., Yunus Y. Rapid and simultaneous prediction of soil quality attributes using near infrared technology. Int. J. Sci. Technol. Res. 2019;8(9):9–12. [Google Scholar]
  • 16.Iskandar C.D., Zainudin Z., Munawar A.A. IOP Conference Series. Earth and Environmental Science; 2019. Beef freezing optimization by means of planck model through simulation beef freezing optimization by means of planck model through simulation. [Google Scholar]
  • 17.Munawar A.A., Kusumiyati Hafidh, Hayati R., Wahyuni D. The application of near infrared technology as a rapid and non-destructive method to determine vitamin C content of intact mango fruit. INMATEH Agric. Eng. 2019;58(2):1–12. [Google Scholar]
  • 18.Nordey T., Joas J., Davrieux F., Chillet M., Léchaudel M. Robust NIRS models for non-destructive prediction of mango internal quality. Sci. Hortic. (Amst.) 2017;216:51–57. [Google Scholar]
  • 19.Darusman D., Zulfahizal Z., Yunus Y., Munawar A.A. “Soil quality assessment by near infrared Spectroscopy : predicting ph and soil organic carbon. Int. J. Sci. Technol. Res. 2019;8(10) [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.xml (999B, xml)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES