Skip to main content
PNAS Nexus logoLink to PNAS Nexus
. 2023 Mar 29;2(4):pgad111. doi: 10.1093/pnasnexus/pgad111

mHealth hyperspectral learning for instantaneous spatiospectral imaging of hemodynamics

Yuhyun Ji 1, Sang Mok Park 2, Semin Kwon 3, Jung Woo Leem 4, Vidhya Vijayakrishnan Nair 5, Yunjie Tong 6, Young L Kim 7,8,9,10,✉,b
Editor: Yannis Yortsos
PMCID: PMC10129064  PMID: 37113981

Abstract

Hyperspectral imaging acquires data in both the spatial and frequency domains to offer abundant physical or biological information. However, conventional hyperspectral imaging has intrinsic limitations of bulky instruments, slow data acquisition rate, and spatiospectral trade-off. Here we introduce hyperspectral learning for snapshot hyperspectral imaging in which sampled hyperspectral data in a small subarea are incorporated into a learning algorithm to recover the hypercube. Hyperspectral learning exploits the idea that a photograph is more than merely a picture and contains detailed spectral information. A small sampling of hyperspectral data enables spectrally informed learning to recover a hypercube from a red–green–blue (RGB) image without complete hyperspectral measurements. Hyperspectral learning is capable of recovering full spectroscopic resolution in the hypercube, comparable to high spectral resolutions of scientific spectrometers. Hyperspectral learning also enables ultrafast dynamic imaging, leveraging ultraslow video recording in an off-the-shelf smartphone, given that a video comprises a time series of multiple RGB images. To demonstrate its versatility, an experimental model of vascular development is used to extract hemodynamic parameters via statistical and deep learning approaches. Subsequently, the hemodynamics of peripheral microcirculation is assessed at an ultrafast temporal resolution up to a millisecond, using a conventional smartphone camera. This spectrally informed learning method is analogous to compressed sensing; however, it further allows for reliable hypercube recovery and key feature extractions with a transparent learning algorithm. This learning-powered snapshot hyperspectral imaging method yields high spectral and temporal resolutions and eliminates the spatiospectral trade-off, offering simple hardware requirements and potential applications of various machine learning techniques.

Keywords: informed deep learning, statistical learning, hyperspectral imaging, snapshot imaging, ultrafast imaging


Significance Statement.

The development of ultrafast hyperspectral imaging systems is an active area of research for a variety of applications from biomedical to defense domains. However, such hardware-focused approaches have been challenging mainly because of the intrinsic trade-off between spatial and spectral resolutions and the slow rates of data acquisition. Hyperspectral learning takes inspiration from the idea that a photograph is more than merely a picture and indeed contains rich spectral information. Hyperspectral learning enables the recovery of spectral information with high spectral and temporal resolutions from red–green–blue color values acquired using a conventional camera. The main advantages of hyperspectral learning include hardware simplicity, no trade-off between spatial and spectral resolutions, high temporal resolution, and machine learning augmentation for snapshot imaging.

Introduction

Hyperspectral (with a high spectral resolution of ∼10 nm) or multispectral (with several spectral bands of ∼50 nm) imaging systems acquire a hyperspectral image data set (hypercube)—a 3D data set of spectral intensity in spatial coordinates. Subsequently, both the spatial and spectral data are processed (1–3). Hyperspectral imaging technologies offer extensive physical and biological information in stationary or dynamic samples, ranging from microscopic settings to airborne remote sensing environments, for a variety of applications in geology, mineralogy, agriculture, environmental science, astronomy, forensic medicine, defense, security, and biomedicine (4–9). Notably, hyperspectral imaging technologies have been reinvigorated through recent advances in data-driven machine learning (2, 10–14). For example, deep learning approaches have enabled the effective processing of extremely large hypercube data for classical imaging tasks (11, 15–17) and allowed for the optimization of hypercube acquisition to achieve specific tasks and objectives (18–22). Data fusion of complementary images with high spectral or high spatial resolutions (23, 24) and neural networks of improving spatial resolutions (25) can overcome the intrinsic trade-off between spatial and spectral resolutions. However, conventional hyperspectral imaging systems still face the intrinsic limitations: bulky instruments, slow data acquisition rates, low detection efficacy (i.e. low signal-to-noise ratio), and motion artifacts (1, 2).

Typically, hyperspectral imaging systems rely on mechanical scanning elements either in the spectral or spatial domains. In particular, spectral scanning systems employ a number of narrow bandpass spectral filters or dispersive optical components, whereas point scanning and line scanning systems rely on mechanical translational components that require high precision (1, 2, 6, 7, 9). Thus, these scanning elements result in bulky instruments and yield suboptimal temporal resolutions. In particular, prolonged time of data acquisition time fundamentally limits dynamic imaging with a high temporal resolution. In this respect, the development of snapshot imaging technologies capable of acquiring a hypercube in a single-shot manner has been an active area of research (summarized in Table S1). The most common configuration used for snapshot imaging involves simultaneously capturing multiple images with different spectral bands using a large-area image sensor (26–32). In particular, large-area image sensor-based snapshot imaging is beneficial for reducing the acquisition time (2, 33, 34). Other snapshot imaging technologies employ dispersion patterns or coded apertures projecting irradiance mixed with spatial and spectral information to further enhance the light collection efficiency and readout rate (26, 27, 35–37). Subsequently, the modulated projection comprising spatial and spectral information is reconstructed into a hypercube by utilizing computational algorithms such as compressed (or compressive) sensing (32, 38, 39), Fourier transformation (40, 41), or deep learning (19, 20, 42–48).

However, previously developed hyperspectral imaging technologies with a snapshot ability face several limitations (1, 2, 9, 46, 49). First, typical snapshot systems are limited by the intrinsic trade-off that must be made between the spectral and spatial resolutions; an improvement in spatial resolution causes a deterioration in the number of spectral bands, thereby compromising the spectral resolution or the spatial resolution (or imaging area). Second, snapshot imaging systems are sensitive to light conditions and imaging configurations, thereby introducing significant errors in field applications. Third, the hyperspectral filter arrays, dispersion patterns, and coded apertures require high-precision fabrication or nanofabrication, including precision alignment of array components, optimized miniaturization, integration with pixel-level filters, and customized calibrations, all of which inhibit manufacturability. Consequently, the previous studies have generally been performed under laboratory settings (50, 51) or with stationary biological samples (7, 9, 52), thereby hampering the practical and widespread utilization.

To develop an alternative to instantaneous hyperspectral imaging, we take advantage of spectral learning (also known as spectral superresolution), which enables the recovery of spectral information from red–green–blue (RGB) values (tristimulus) acquired by a conventional trichromatic camera; a full reflectance spectrum in the visible range is computationally reconstructed from an RGB image. Several scientific communities, especially in the fields of color science, machine vision, and biomedical imaging, have addressed this ill-posed problem using different but related methods including compressed sensing, machine learning, and deep learning (26, 30, 53–67). Owing to its hardware simplicity, spectral learning can be performed by using a smartphone camera. In practice, dispersive optical components, such as spectrometers and bulky optical filters, are usually attached to mobile health (mHealth) sensing applications, potentially weakening user acceptance and hampering the practical translation from research to clinical practice (68–74). Conversely, spectral learning requires only a built-in camera, potentially offering hardware-independent mHealth tools using unmodified smartphones.

In this paper, we introduce a learning-based spatiospectral imaging method offering high spectral and temporal resolutions. The proposed spectral learning involves mapping from a sparse spectral space (i.e. RGB values) to a dense spectral space. Specifically, the spectral resolution is in a range of 0.5–1 nm, comparable to those of scientific spectrometers and spectrographs for biomedical or biochemical applications (thereby referred to as hyperspectral learning, compared with spectral learning). Hyperspectral learning also allows us to use a video frame rate with a high temporal resolution. First, we construct a customized dual-channel imaging setup coupled with a trichromatic camera (e.g. smartphone camera) and a spectrograph to simultaneously acquire an RGB image and subarea hyperspectral data. Second, we establish a simple statistical assumption to infer the entire field of view from a sampled subarea and recover a hypercube from incomplete measurements. Third, we establish two complementary machine learning frameworks based on statistical and deep learning, incorporating the domain knowledge of tissue optics into learning algorithms. Finally, we demonstrate reliable extractions of hemodynamic parameters from several different samples of tissue phantoms, chick embryos, and human conjunctiva; the results are validated through conventional hyperspectral imaging and functional near-infrared spectroscopy (fNIRS). Moreover, this hyperspectral learning method is applied to smartphone video recording to demonstrate the dynamic imaging of peripheral microcirculation and ultrafast imaging of oxygen depletion in tissue phantoms.

Results

Figure 1A illustrates the concept of hyperspectral learning for instantaneous spatiospectral imaging by significantly minimizing the number of necessary hyperspectral measurements. If hyperspectral data in a small yet representative subarea are available, a hyperspectral learning algorithm can be trained using the RGB and hyperspectral data in the subarea. This hyperspectral learning algorithm trained by the sampled (RGB and hyperspectral) data is applied to the entire image area, generating a hypercube without the need for a complete spectral or spatial scan. Thus, the key advantages of hyperspectral learning and hypercube recovery include the hardware simplicity offered by the use of conventional cameras, high temporal resolution if a video is used (e.g. slow-motion video recording on a smartphone), independence (no trade-off) between the spatial and spectral resolutions, and abundant spectral information for a variety of machine learning techniques. Locally sampled hyperspectral data can serve as prior information or physical constraints for incorporating domain-specific modeling into the learning algorithm, extracting critical features and parameters, and resulting in explainable and interpretable neural networks (64, 75–79).

Fig. 1.

Fig. 1.

Schematic illustration of hyperspectral learning for instantaneous imaging using a conventional trichromatic camera (i.e. three-color image sensor). A sampling of detailed spectral (hyperspectral) information in a small yet representative subarea enables hyperspectral learning to recover a hypercube in the entire image as well as to perform spectrally driven machine learning for physical or biological parameter extractions. A) Hyperspectral learning for hypercube recovery. Hyperspectral data in a small subarea are used to train a learning algorithm that takes the RGB values in each pixel as the input and returns a spectrum with a high spectral resolution (also known as spectral superresolution). Hyperspectral learning is not affected by an intrinsic trade-off between spatial and spectral resolutions, which often limits conventional snapshot hyperspectral imaging. B) Subarea sampling of hyperspectral data. A dual-channel spectrograph with a photometric slit (Materials and methods) simultaneously acquires an RGB image in the entire field of view and a hyperspectral line scan in a small subarea (i.e. the central line) in a single-shot manner. The sampled hyperspectral data serve as prior information or physical constraints for training the hyperspectral learning algorithm. The spectrally driven informed machine learning algorithm trained by the sampled hyperspectral data transforms the RGB image into a hyperspectral image data set (also known as a hypercube), which can further be used to extract physical or biological parameters.

To instantaneously sample hyperspectral data in a small subarea, a trichromatic camera (e.g. smartphone camera) is combined with a line-scan spectrograph (Fig. 1B). This configuration is commonly used in amateur telescopes that are equipped with astronomical spectroscopy systems (80). Specifically, a dual-channel spectrograph with a photometric slit simultaneously acquires an RGB image in the entire area and the hyperspectral data of a subarea (i.e. central line) in a single-shot manner (Fig. 1B and Materials and methods). The field of view is as small as 2.5 mm × 2 mm with a spatial resolution of 55 µm (Fig. S1). The sampled hyperspectral data have a spectral range of λ = 380–720 nm with a spectral resolution Δλ = 0.5 nm (Fig. S2 and Materials and methods). The dual-channel imaging setup provides sufficient training data (750–1,500 data points). This data set is randomly split into training (80%) and testing (20%) data sets for effectively training the hyperspectral learning algorithm to be applied to the entire area eventually. In addition, this imaging setup allows us to use a smartphone camera that can acquire videos at different frame rates. In particular, highly dynamic imaging is even possible with a high temporal resolution even up to 0.0005 s for 1,920 frames per second (fps), using commercially available smartphone models (at the time of publication).

Hyperspectral learning addresses an ill-posed problem, which is also known as spectral superresolution and hyperspectral reconstruction. The mathematical relationship between the RGB and hyperspectral intensity is described:

x3×1=S3×kyk×1+e3×1, (1)

where x denotes a 3 × 1 vector corresponding to three color values in the R, G, and B channels (x=[R,G,B]T), S represents a 3 × k matrix of the RGB spectral response of the three-color sensor the spectral response functions in the R, G, and B channels of the smartphone camera (also known as the sensitivity function of the camera) (81, 82), y is a k × 1 vector that has the spectral intensity (y=[I(λ1),I(λ2),,I(λk)]T) where λ is discretized in the visible range with a spectral interval of 1 nm (Materials and methods), and e symbolizes a 3 × 1 vector of the system noise. Basically, hyperspectral learning is to obtain a pseudoinverse of S3×k. Depending on the availability of training data and the desired spectral resolution, three machine learning approaches can be used for solving this underdetermined problem: l1 norm minimization (i.e. compressed sensing or sparsity regularization), l2 norm minimization (i.e. least squares regression), and deep learning (26, 30, 53–65). Among these approaches, statistical learning using fixed-design linear regression with polynomial expansions offers a highly stable inverse calculation that can transform RGB data to high-resolution spectral data owing to the nature of l2 norm minimization (Materials and methods) (55, 62, 81, 83, 84).

The key assumption for reliable hyperspectral learning is that a sampling distribution (i.e. RGB values of the sampled subarea) should follow the parent distribution (i.e. RGB values of the entire image area); the intensity distributions between the sampled subarea and the entire field of view of interest are statistically identical. Specifically, the probability distribution of the R, G, and B values in the subarea needs to conform to those in the entire area in terms of variability and shape. In addition, to reliably predict unknown hyperspectral output responses from RGB values outside the subarea, the hyperspectral learning algorithm should be applied within the same (minimum and maximum) range of sampled RGB values used to train the algorithm. In a similar manner to nonparametric tests with non-Gaussian distributions, quantile–quantile (QQ) plots can conveniently be used to assess if the two sets of data plausibly follow the same distribution within the same range. This assumption of interpolation offers an important advantage over conventional snapshot hyperspectral imaging. If these assumptions are valid, then the hyperspectral learning is not limited by the intrinsic trade-off between spatial and spectral resolutions.

Spectrally informed learning allows for the incorporation of physical and biological understanding of domain knowledge into learning algorithms. Typically, purely data-driven learning requires a large volume of training data and lacks explainable and interpretable learning (76, 78). Among the various snapshot imaging applications, we focus on extracting biological parameters or spectral signatures from a hypercube using the domain knowledge of tissue optics. In this perspective, light propagation in tissue can be explained by the theory of radiative transport and robust approximations (e.g. diffusion, Born, and empirical modeling) (85–90). Specifically, taking advantage of tissue reflectance spectral modeling (Materials and methods), we extract the key hemodynamic parameters: oxygenated hemoglobin (HbO2), deoxygenated hemoglobin (Hb), and oxygen saturation (sPO2), which are the fundamental determinants of oxygen transport to tissue associated with a variety of physiological changes, diseases, and disorders.

sPO2=HbO2Totalhemoglobin=HbO2HbO2+Hb. (2)

Notably, tissue optics serves as the cornerstone of biophotonics and biomedical optics to deepen our knowledge of light–tissue interactions and develop noninvasive optical diagnostic methods and devices (89, 91).

To demonstrate the versatility of hyperspectral learning and hypercube recovery, we formulate two complementary machine learning frameworks. In statistical learning (Fig. 2A), sampling of hyperspectral data in a subarea serves as a training data set for hyperspectral learning. The hypercube of the entire area is reconstructed using a statistical learning algorithm (Materials and methods). Fixed-design linear regression, featuring polynomial expansions, is utilized as the method of least squares (l2 norm minimization) (55, 62, 81, 83). A full spectroscopic resolution is achieved in a range of Δλ = 0.5–1 nm, highly comparable to those of scientific spectrometers or spectrographs. This level of spectral resolution is 10 times better than the typical resolution of hyperspectral imaging of Δλ = 10 nm. Subsequently, the recovered hypercube is fitted using a tissue reflectance spectral model to extract the hemodynamic parameters (Materials and methods) (85–90). In deep learning (Fig. 2B), hyperspectral data in a subarea are directly fed into the tissue reflectance spectral model to compute the hemodynamic parameters within the same subarea. The obtained data set serves to train a deep neural network that computes the hemodynamic parameters with RGB values (tristimulus) as an input. Specifically, the deep neural network is directly trained by using the hemodynamic parameters extracted from the tissue reflectance spectral model fed with the sampled data, thereby reducing the computational load. In both cases, separate training and validation data sets are employed to strengthen the learning algorithm for training: 80% of the data points among the sampled data (i.e. 600 data points out of 750) are randomly selected as a training data set and the remaining 20% (i.e. 150 data points) are blindly tested as a testing data set.

Fig. 2.

Fig. 2.

Flowcharts of two complementary machine learning frameworks for hypercube recovery and hemodynamic imaging and schematic illustration of deep learning networks. A) Statistical learning involving hyperspectral learning and tissue reflectance spectral modeling. Hyperspectral and RGB data in a subarea are used to train a statistical learning algorithm. The trained statistical learning algorithm is subsequently applied to the RGB data in the entire area to reconstruct the hypercube. Using tissue reflectance spectral modeling, the key hemodynamic parameters are extracted from the hypercube to generate the hemodynamic maps. B) Deep learning informed by the tissue reflectance spectral model. The hemodynamic parameter values (output) from the hyperspectral data in the subarea extracted via tissue reflectance spectral modeling of the corresponding RGB values (input) are used to train the deep learning network. The trained deep learning network is thereafter applied to the RGB data for the entire area to generate the hemodynamic maps. C) Illustration of the deep neural network that takes RGB values and returns key hemodynamic parameters (i.e. HbO2 and Hb). This network is trained by the RGB values and hemodynamic parameters in a subarea; HbO2 and Hb values are computed from the ground-truth hyperspectral data obtained with conventional pushbroom-type hyperspectral imaging system in the subarea via the tissue reflectance spectral model (Materials and methods). D) The first hidden layer is fully connected to 18 nodes (or neurons) to mimic the polynomial expansions of statistical learning. E) Representative spectral intensity differences in the ground-truth hyperspectral data obtained from a tissue phantom with oxygenated and deoxygenated hemoglobin. Inset: The ground-truth hyperspectral data when the sample is oxygenated and deoxygenated. F) Representative difference in the computed output values of the first hidden layer between two different RGB values of HbO2 and Hb from the same tissue phantom in E). The order of the nodes is assigned such that the rank of the output differences is the same as that of the wavelengths in the spectral intensity differences.

Importantly, deep learning informed by hyperspectral information is advantageous for designing explainable and interpretable neural networks. Among similar yet distinct terms, such as understandability and comprehensibility (77), spectrally informed deep learning enables transparency in the learning algorithm as it is understandable in a manner similar to statistical regression. The deep neural network model is designed to mimic the concept of polynomial expansions in statistical hyperspectral learning (Figs. 2C and S3 and Materials and methods). In particular, the first hidden layer, which is one of the important hyperparameters for building a neural network, is fully connected with a relatively large number of nodes (18 nodes) to mimic hyperspectral learning, which transforms RGB values to a spectral intensity profile with a high spectral resolution. After the network is trained, each node in the first hidden layer possesses a distinct weight representing hyperspectral information, such that the RGB values of a certain hemodynamic parameter (e.g. sPO2) generate the corresponding spectral feature, which is further propagated throughout the network (Fig. 2D). Figure 2E illustrates two representative cases of hyperspectral data measured from a tissue phantom by varying sPO2 between HbO2 and Hb (Fig. S3 and Supplementary Methods). The output values at different nodes in the first hidden layer can be understood based on the spectral intensity differences as a function of λ (Fig. 2E and F) that a scientific spectrometer or spectrograph can quantify. This direct spectral understanding should be differentiated from other conventional heat maps or saliency maps employed for visualizing or explaining features extracted through typical convolutional neural networks (92, 93). In particular, the differences in the computed output values of the first hidden layer between the two different RGB values of HbO2 and Hb in a tissue phantom (Supplementary Methods) resemble the spectral intensity differences between HbO2 and Hb measured from the same tissue phantom (Fig. 2E and F).

We comprehensively validate the hyperspectral learning-based instantaneous imaging method using an experimental model of vascular development. Chicken embryos serve as an excellent model system for imaging the development of blood vessel formation and hemodynamics (Fig. 3A) (94, 95). The key statistical assumptions between the sampled area (i.e. the central line) and the entire image area are tested by evaluating the probability distribution profiles of each RGB channel (Fig. 3B) and the QQ plots (Fig. 3C). The sampled data set (both hyperspectral and RGB data along the central line) is used to train a learning algorithm that has the input of RGB values and the output of hemodynamic parameters (i.e. HbO2 and Hb). The residuals between the ground-truth (acquired by a pushbroom-type hyperspectral imaging system in Materials and methods) and reconstructed hyperspectra are minimal, as shown with 95% CIs as a function of λ (Fig. 3D, E, and G and Materials and methods). The hemodynamic parameter extractions are further supported by the 95% CIs of residuals between the ground-truth and fitted hyperspectra obtained using the tissue reflectance spectral model (Fig. 3D, F, and H and Materials and methods). Figure 3G and H show that the reconstructed and fitted spectra are in excellent agreement with the ground-truth spectra. Moreover, the spectral angle mapper (Materials and methods) between the ground-truth and reconstructed hypercubes captures a high degree of similarity with an average value of 0.035 rad (Fig. 3I). The direct comparison (i.e. spectral angle mapper) between the ground-truth and fitted hypercubes also returns an average value of 0.046 rad (Fig. 3J), supporting the performance of hyperspectral learning and tissue reflectance spectral modeling.

Fig. 3.

Fig. 3.

Comprehensive evaluations of hyperspectral learning, hypercube recovery, and hemodynamic parameter extractions in an experimental vascular developmental model. A) Photograph (RGB image) of the chicken embryo on day 8. The vertical gray line represents a subarea for hyperspectral sampling. B) Probability distributions and histograms of R, G, and B values of the entire image area and the subarea. The black bins show the distributions of R, G, and B values of the entire image area. The black solid lines are displayed together with the black bins to visualize the trends of the data distributions of the entire image area. The colored bins depict the distributions of R, G, and B values of the subarea [i.e. the center line in A)] according to the RGB colors. The colored lines are illustrated together with the colored bins to visualize the trends in the data distribution of the subarea. C) QQ plots for distribution comparisons between the entire field of view and the subarea. The subarea [the center line in A)] ensures a similar intensity distribution as the intensity distributions of the entire area A) in each RGB channel. The black dotted straight diagonal line indicates that the two distributions are identical. D, F) Visualization of the ground-truth D), reconstructed E), and fitted F) spectra from the testing data (data size = 150) of the sampled data (data size = 750). A logarithmic scale of intensity is used for enhanced visualization. G) Comparison between the ground-truth and reconstructed spectra in the testing data set. The differences (residuals) between the ground-truth and reconstructed spectra are plotted with 95% CIs for each λ. H) Comparison between the ground-truth and fitted spectra in the testing data set. Hemodynamic parameters are extracted using the tissue reflectance spectral model (Materials and methods). The differences (residuals) between the ground-truth and fitted spectra are plotted with 95% CIs. I) Spectral angle mapping image between the ground-truth and reconstructed hypercubes. J) Spectral angle mapping image between the ground-truth and fitted hypercubes.

Figure 4 depicts the hemodynamic maps of Hb, HbO2, and sPO2 corresponding to a white leghorn chicken (Gallus gallus domesticus, Hy-Line W-36) embryo on day 8 (Fig. 4A–C). In this statistical learning framework, after recovering the hypercube for the entire area via a hyperspectral learning algorithm, the recovered hypercube is fitted by using the tissue reflectance spectral model (Fig. 2A and Materials and methods), thereby computing the hemodynamic parameters in a pixel-by-pixel manner. Figure 4D clearly demonstrates the high levels of sPO2 in the major blood vessel compared with capillary vessels. As a deep learning framework (Fig. 2B), we design a fully connected deep neural network that takes RGB values as the input and returns the hemodynamic parameters as the output (Figs. 2C and S3 and Materials and methods). As stated previously, the two unique design aspects are that the neural network encompasses the concept of polynomial expansions in hyperspectral learning and incorporates the output parameters extracted from the tissue reflectance spectral modeling (Fig. 2D and Materials and methods). As illustrated in Fig. 4E, the hemodynamic parameters are directly computed using the deep neural network. In particular, the hemodynamic maps obtained from both frameworks are in excellent agreement with the ground-truth hemodynamic maps generated via conventional pushbroom-type hyperspectral imaging, which are supported by high values of structural similarity index (Fig. 4F and Table S2). Interestingly, the conventional hemodynamic maps are noisier due to the motion artifact of the live sample, which results from the slow rate of data acquisition (data acquisition time = 45 min) and the mechanical scanning, as shown by the horizontal lines in Fig. 4F.

Fig. 4.

Fig. 4.

Hemodynamic maps of HbO2, Hb, and sPO2 in the experimental model of vascular development. A) Chick embryo in a petri dish as an experimental vascular developmental model. The image area for learning-based snapshot imaging with a high spatial resolution is marked by a box. B) RGB image in field of view [the box in A)] for hyperspectral learning-based snapshot imaging. C) RGB image generated by the conventional pushbroom-type hyperspectral imaging system. D) Statistically learning-based hemodynamic maps. Hyperspectral learning generates a hypercube that is processed to compute hemodynamic maps using tissue reflectance spectral modeling. E) Deep learning-based hemodynamic maps. Hemodynamic maps are directly generated using a deep neural network that takes the RGB values as the input and returns HbO2 and Hb values as the output. F) Reference hemodynamic maps generated by a conventional pushbroom-type hyperspectral imaging system for validation. The two learning-based hemodynamic maps D, E) are in excellent agreement with the ground-truth maps F). Statistical learning and deep learning-based hemodynamic maps are further assessed using the structural similarity index (Supplementary Methods) to show that they are qualitatively identical to the reference hemodynamic maps (Table S2).

Moreover, another advantage offered by learning-based snapshot imaging is that a video can be transformed into parameter maps after the learning algorithm is trained. Essentially, a video comprises a time series of multiple RGB images and each frame represents an RGB image. A video captured using a common trichromatic camera (e.g. a smartphone camera) can conveniently be processed using hyperspectral learning. Recently, smartphone models are capable of slow-motion video mode, featuring as high frame rates as 960–1,920 fps. Thus, hyperspectral learning enables unprecedentedly fast hyperspectral imaging with high temporal resolution (e.g. 0.0005–0.0010 s). To demonstrate ultrafast dynamic imaging, we take advantage of an off-the-shelf smartphone (Samsung Galaxy S21) to record a video at 960 fps (temporal resolution = 0.0010 s). Each frame representing an RGB image of the video is fed into the hyperspectral learning algorithm to compute the corresponding hemodynamic images. Then the hemodynamic images are restacked into a video format (Materials and methods and Supplementary Methods). Figure 5 shows time series hemodynamic maps of sPO2 changes during a selected short period (Videos S1 and S2 for the entire duration of 0.5 s) in a tissue phantom that has a rapid oxygen exchange process (Supplementary Methods and Fig. S3). The hemodynamic video with the millisecond resolution reveals rapid hemoglobin dispersion and oxygen diffusion, varying sPO2 between HbO2 and Hb. It should be noted that hyperspectral learning enables ultrafast dynamic imaging merely using a commercially available unmodified smartphone.

Fig. 5.

Fig. 5.

Ultrafast dynamic imaging of sPO2 changes via hyperspectral learning of smartphone's ultraslow video recording. A) Selected 15 frames (RGB images) from a video of the dynamic tissue phantom recorded by the smartphone's built-in ultraslow-motion mode (960 fps). B) Hemodynamic change images of sPO2 (ΔsPO2) compared with the initial image, processed using the corresponding RGB images with the deep learning algorithm. Hyperspectral learning allows an ultrafast data acquisition rate at a temporal resolution of 0.0010 s (Video S1 for the entire duration) using the off-the-shelf smartphone.

To further demonstrate dynamic imaging of peripheral microcirculation at a particular video frame rate, we record a video at 60 fps for a duration of 180 s (temporal resolution = 0.0167 s) (Video S3 and Figs. S5 and S6). As a model system for peripheral microcirculation in humans, we visualize spatiotemporal hemodynamic changes in the microvessels of the inner eyelid (i.e. the palpebral conjunctiva) (Fig. 6A and B). The inner eyelid is an easily accessible and highly vascularized peripheral tissue site that receives blood from the ophthalmic artery. Thus, the inner eyelid serves as a feasible sensing site for various diseases and disorders (96–98). Figure 6C and D show the peripheral hemodynamic maps of Hb, HbO2, and sPO2 obtained during the resting state of a healthy adult volunteer. Videos of hemodynamic maps computed from the smartphone video are generated using the two complementary machine learning frameworks of statistical learning and informed deep learning (Materials and methods and Videos S1–S3). In Fig. 6, the sPO2 maps reveal spatially complex patterns of perfusion in the inner eyelid, which are not evident in the photo (i.e. the RGB image). The statistical learning and deep learning-based hemodynamic maps of HbO2 and Hb are in excellent agreement with each other, thereby indicating the reliable extraction of hemodynamic parameters.

Fig. 6.

Fig. 6.

Representative peripheral hemodynamic maps of HbO2, Hb, and sPO2 extracted from a healthy adult in the resting state. The inner eyelid (i.e. palpebral conjunctiva) has high vasculature connected to the ophthalmic artery, serving as an easily accessible peripheral tissue site. A) Photograph of a healthy adult volunteer taking a picture with a smartphone while the inner eyelid pooled down. B) RGB image of the inner eyelid with high spatial resolution showing the field of view for hyperspectral learning-based snapshot imaging. Microvessels in the inner eyelid are clearly visible without the effects of skin pigments, which are easily accessible for imaging. C) Statistical learning-based snapshot hemodynamic maps of the inner eyelid. Hemodynamic maps are generated using tissue reflectance spectral modeling based on spectral information from the recovered hypercube. D) Deep learning-based snapshot hemodynamic maps of the inner eyelids. Hemodynamic maps are directly generated using the deep neural network, which returns HbO2 and Hb values as the output when the RGB values are employed as the input.

Furthermore, we validate the computed hemodynamic maps obtained via the smartphone video (Video S3) against the functional imaging signals obtained using a fNIRS system (Materials and methods and Supplementary Methods). In this in vivo imaging setting, simultaneously acquiring the concurrent hypercube from the same sensing site (i.e. inner eyelid in Fig. 7A) is practically unachievable using a conventional hyperspectral imaging system. In this study, the peripheral hemodynamic signals obtained from the smartphone video are compared with the concurrent fNIRS signals averaged from all brain regions (Fig. 7D) and the fingertip (Fig. 7F) of the same subject measured using an fNIRS neuroimaging system (Materials and methods and Supplementary Methods). We focus on the existence of the same low-frequency oscillations of HbO2 and Hb associated with nonneuronal circulatory signals. In particular, low-frequency oscillations of HbO2 and Hb in a frequency range of 0.01–0.1 Hz are commonly present in functional imaging signals (e.g. blood oxygen level-dependent functional magnetic resonance imaging and fNIRS) (99, 100). Specifically, a hemodynamic phase of approximately 180° between HbO2 and Hb changes (ΔHbO2 and ΔHb) during the resting state of healthy adults conveys autonomic physiological information, potentially serving as a biomarker of circulatory dysfunction.

Fig. 7.

Fig. 7.

Temporal changes in the eyelid's hemodynamic signals extracted from the smartphone video capture and concurrent resting-state fNIRS signals from the brain and another peripheral site (fingertip). A) Representative photographs of data acquired from the inner eyelid of a healthy participant. A smartphone camera can easily record a video of the inner eyelid. B, C) Temporal traces of statistical learning-based B) and deep learning-based C) hemodynamic signals of changes in HbO2 (ΔHbO2) and Hb (ΔHb) in the inner eyelid. A relatively large blood vessel is selected (Fig. S7) to analyze the hemodynamic parameters. A temporal resolution of 0.016 s is set by the smartphone video recording setting (Video S3). A vector representation of the time-averaged phase difference between the corresponding ΔHbO2 and ΔHb signals in the eyelid is displayed on the right-hand side of the temporal trace. D) Representative photographs of the data acquisition from the brain of the subject using fNIRS. The fNIRS signals from the brain of the same subject are also measured at the same time to be compared with the signal from the smartphone video. E) Temporal traces of concurrent fNIRS signals averaged from all brain regions of the same subject with a vector representation of the time-averaged phase difference between ΔHbO2 and ΔHb from all brain regions. F) Representative photographs of the data acquisition from the fingertip of the subject using fNIRS. The fNIRS signals from the fingertip of the same subject are also measured at the same time to be compared with the signal from the smartphone video. G) Temporal traces of concurrent fNIRS signals averaged from the fingertip of the same subject with a vector representation of the time-averaged phase difference between the corresponding ΔHbO2 and ΔHb in the fingertip. The phase difference extracted from the smartphone video B, C) is in excellent agreement with the functional imaging modality E, G).

We track hemodynamic signals in a relatively large vessel (Fig. S7) in terms of the changes in HbO2 and Hb (ΔHbO2 and ΔHb), which are computed via statistical learning as well as deep learning. Figure 7B and C characterize ΔHbO2 and ΔHb obtained after filtering out the heartbeat-related signals using a bandpass filter (0.01–0.1 Hz) (Materials and methods). Moreover, vector analyses reveal that the phase differences between ΔHbO2 and ΔHb over time under the resting state are 180.9° and 180.0° for statistical learning and deep learning, respectively (Fig. 7B and C). When the concurrent resting-state fNIRS signals from the cortical areas of the brain and another peripheral site (fingertip) from the same healthy subject are analyzed, the time-averaged phase differences between ΔHbO2 and ΔHb also return 184.3° and 184.8°, respectively (Fig. 7E and G). Particularly, these consistent phase differences corresponding to the brain and peripheral sites (i.e. inner eyelid and fingertip) indicate that low-frequency hemodynamic oscillations are nonneuronal and circulate to the different parts of the body. More importantly, these results validate the accurate and reliable extraction of hemodynamic parameters from smartphone video recordings.

Discussion

The two machine learning frameworks of statistical learning and informed deep learning are complementary. In statistical learning, l2 norm minimization (i.e. least squares regression) of hyperspectral learning enables stable and robust hypercube recovery with high spectral and temporal resolutions from an RGB image. This method is analogous to compressed sensing (i.e. l1 norm minimization) such that incomplete measurements can be used to recover an entire image. The recovered hypercube is an intermediate product to be processed to extract key features or classifications of interest, requiring an additional step. On the other hand, in deep learning, the sampling of hyperspectral data in a subset area facilitates the direct incorporation of tissue optics modeling into the algorithm, resulting in direct parameter extractions. In addition, this spectrally informed deep learning approach is advantageous for featuring a transparent network architecture, realizing explainable and interpretable learning. Moreover, this reported snapshot ability will offer ample opportunities for applying hyperspectral imaging to emerging areas where instantaneous data acquisitions have been the bottleneck.

Hyperspectral learning from an RGB image can potentially be enhanced by using the state-of-the-art three-color image sensors and high dynamic range images. First, conventional three-color sensors interpolate missing color information in selected pixel positions using a demosaicing algorithm, because each pixel records only one of the RGB colors in a format of a grid or checkerboard (also known as Bayer filter). Foveon X3 sensors capture all incident light (three colors) within the same pixel without having three separate color channels (101). An image sensor with a color router can directly detect the entire color content without using a filter to select the intended color channel (102, 103). Second, a higher color depth (also known as bit depth) in each pixel can further improve hyperspectral learning to avoid metamerism such that different spectral profiles result in the same RGB values. Because the color depth in each R, G, and B channel is 10 bit in our case, the number of different colors (combinations of RGB values) is 210 × 210 × 210 (= 1.07 billion) colors. In other words, it is extremely unlikely that metamerism occurs. The higher the color depth of an RGB image, the more colors the image can store, further reducing the possibility of metamerism in hyperspectral learning.

In conclusion, the main advantages of this snapshot imaging method are the independence between the spectral and spatial resolutions, the hardware simplicity that could enable the use of commercially available smartphones, and the versatility offered by different machine learning techniques. We further envision that spectrally informed hyperspectral imaging will lay the foundation for mHealth technologies to noninvasively access hemodynamic information from an easily accessible peripheral tissue site using off-the-shelf smartphones in resource-limited settings or at-home settings.

Materials and methods

Dual-channel imaging setup mounted with a smartphone camera

First, we constructed a dual-channel imaging spectrograph setup to acquire an RGB image in the entire field of view and hyperspectral data in a subarea; a line-scan hyperspectral system was mounted with a trichromatic camera (i.e. RGB camera and smartphone camera). This configuration was inspired by an image-guided hyperspectral line-scan system mounted on a telescope, which is a common configuration used in amateur astronomy (80). In this dual-channel setup, a photometric slit, which is a mirror with a slit opening (slit width = 23 μm), enabled the simultaneous acquisition of an RGB image and hyperspectral line-scan data. An image reflected by the photometric slit (mirror portion) was captured using a trichromatic (RGB) camera (GS3-U3-120S6C-C, FLIR). The light passing through the slit was dispersed by a diffraction grating (groove density = 150 mm−1) and captured using a monochrome camera (GS3-U3-120S6M-C, FLIR). A tungsten halogen light source (HL-2000-HP Light Source, Ocean Insight) was used as a broadband illumination source. Spectral calibration of the spectrograph was performed using a xenon calibration light source emitting multiple narrow peaks at specific wavelengths. An objective lens (4× and NA = 0.10) was mainly used to image the microvessels, and the field of view was as small as 2.5 mm × 2 mm with a spatial resolution of 55 µm (Fig. S1). The spectral resolution was determined via a helium–neon laser that emits monochromatic radiation with a narrow bandwidth at 543.5 nm. The spectral range of hyperspectral line-scan data was 380–720 nm, and the spectral resolution Δλ was 0.5 nm (Fig. S2). The typical sizes of the RGB image and hyperspectral data were 750 × 300 pixels and 750 × 1 pixels in the spatial domain, respectively. Second, we incorporated a smartphone (Samsung Galaxy S21) into the dual-channel system using a cube beamsplitter (Supplementary Methods and Fig. S8). The color depths (bit depth) were 10 bit (file format: DNG) for photos and 8 bit (MP4) for videos, respectively. The smartphone's video recording function (i.e. ultraslow-motion recording) was used to acquire videos at frame rates of 60 or 960 fps (Videos S1–S3, Supplementary Methods, and Fig. S9). Third, we employed a motorized linear stage (T-LSR150A, Zaber) to move the entire system to directly acquire the ground-truth hyperspectral image cube (hypercube) in the entire field of view in the same manner as the pushbroom-type hyperspectral imaging system. A mechanical linear scan step was performed at 10 μm. Mechanical scanning was performed only for the validation studies. The data acquisition was synchronized using a custom-built MATLAB interface.

Frontend data processing

We implemented frontend data processing for both hyperspectral and RGB imaging to factor out the spectral contributions from the illumination, system responses, and background ambient light. The measured spectral intensity IM(λ) reflected from the sample is expressed as a function of wavelength λ:

IM(λ)=L(λ)C(λ)S(λ)IR(λ), (3)

where L(λ) denotes the spectral shape of the illumination light source, C(λ) represents the spectral response of all the optical components in the system, S(λ) depicts the spectral response function (also known as the sensitivity function) of the image sensors, and IR(λ) symbolizes the intensity reflected in the sample (26, 30, 53–65). First, a white reflectance standard (SRS-99-010, Labsphere) was used to compensate for the spectral response of the system, including the illumination. The white reflectance standard possesses a reflectivity of 99% in the visible range, and its measured intensity can be expressed as:

IWhite(λ)=L(λ)C(λ)S(λ). (4)

Second, a black reflectance standard (SRS-02-010, Labsphere) with a reflectivity of 2% in the visible range was acquired to eliminate the ambient stray and background room light. After subtracting the black standard intensity IBlack from each measurement, the normalized intensity reflected from the sample can be obtained:

IR(λ)=IM(λ)IBlack(λ)IWhite(λ)IBlack(λ). (5)

Hyperspectral statistical learning

We established a statistical learning framework to reliably reconstruct a hyperspectrum from RGB values. Conceptually, hyperspectral learning solves a pseudoinverse of S3×k in Eq. (1) where S3×k is the spectral response function in the R, G, and B channels of the camera. As the method of least squares (l2 norm minimization), we took advantage of fixed-design linear regression with polynomial expansions to solve [S3×k]+ (55, 62, 81, 83). First, the sampled hyperspectral and RGB data (m = training data size) were used to change this underdetermined problem to an overdetermined problem such that X3×m and Yk×m were formed by adding x3×1 and yk×1 of the training data. The relationship in Eq. (1) is rewritten:

X3×m=S3×kYk×m. (6)

The relationship in Eq. (6) can be expressed in the following inverse format:

Yk×m=Tk×3X3×m, (7)

where Tk×3=[S3×k]+. Second, to incorporate the nonlinearity between RGB and hyperspectral data and improve the hyperspectral learning, we added fourth degree polynomial expansions by including a bias term such that X3×m was expanded to X^p×m with p = 34 (55, 62, 83):

X^p×m=[1,1,1,R,G,B,R2,G2,B2,RG,GB,RB,R3,G3,B3,RG2,RB2,GR2,GB2,BR2,BG2,RGB,R4,G4,B4,R3G,R3B,G3R,G3B,B3R,B3G,R2G2,R2B2,G2B2,R2GB,G2RB,B2RG]T. (8)

Subsequently, Eq. (7) becomes:

Yk×m=T^k×pX^p×m. (9)

Third, the inverse of the expanded matrix T^ can easily be solved via QR decomposition or the Moore–Penrose pseudoinverse (55, 62, 81–83). Finally, upon substituting the RGB values (x=[R,G,B]T) into Eq. (7), the corresponding hyperspectral intensity values (y=[I(λ1),I(λ2),,I(λk)]T) were obtained. Thus, T^ transforms RGB data to hyperspectral data.

Tissue reflectance spectral modeling

We make use of a tissue reflectance spectral model to extract key hemodynamic parameters from the ground-truth and recovered hyperspectral data. Light propagation in tissue can be modeled in accordance with the theory of radiative transport and robust approximations (e.g. diffusion, Born, and empirical modeling) (85–90). Specifically, we conducted parameter extractions using an extensively used empirical modeling method. The intensity reflected from a biological sample can be expressed as a function of λ in the visible range:

IR(λ)=[b1(λλ0)b2+b3(λλ0)4]×exp[b4×{b5×εHbO2(λ)+(1b5)×εHb(λ)}], (10)

where b1, b2, and b3 are associated with the scattering (Mie or Rayleigh) contributions at λ0 = 800 nm, εHbO2(λ) denotes the absorption coefficient of HbO2, εHb(λ) denotes the absorption coefficient of Hb, b4 is the hemoglobin concentration multiplied by the optical path length, and b5 is the blood sPO2. In the chicken embryo study, the lipid content of the egg yolk is high, as avian embryos rely on over 90% of the caloric requirement of fatty acids (104). Thus, an additional absorption coefficient term for lipid was added to Eq. (10) to account for this high lipid content (105). All fitting parameters were computed using the simplex search (Nelder–Mead) algorithm (106). The differences (residuals) between the original and fitted hyperspectra were plotted with 95% CIs as a function of λ (Figs. 3, S5, and S6).

Deep neural network designing and training

We designed a deep neural network considering the polynomial expansions of hyperspectral learning (Eq. (8)) and tissue reflectance spectral modeling (Eq. (10)). First, as an alternative to the sequence of hyperspectral learning and hemodynamic parameter extraction in statistical learning, a deep neural network was established that utilized RGB values as the input and returned the hemodynamic parameters of HbO2 and Hb (or total hemoglobin and sPO2) as the output. Hemodynamic parameters were calculated using the tissue reflectance spectral model (Eq. (10)). The two hemodynamic parameters as outputs were fed to the network for supervised learning, resulting in rapid convergence and high prediction accuracy. The neural network was trained and validated using 80% (600 data points) and 20% (150 data points) of locally sampled data.

Second, in the fully connected network, 18 nodes (neurons) in the first hidden layer are intended to have optimal selections of RGB values with different weights on the R, G, and B values and their combinations. It should be noted that the first hidden layer is conceptually understandable in a manner similar to the polynomial expansions in statistical hyperspectral learning (Eq. (8)), thereby offering a transparent neural network model:

N1,1=w1,RR+w1,GG+w1,BB,N1,2=w2,RR+w2,GG+w2,BB,N1,i=wi,RR+wi,GG+wi,BB, (11)

where R, G, and B are the input RGB values, N1,i indicates the ith node of the first hidden layer, and wi,RGB is a weight connecting the R, G, or B input nodes and the ith node of the first hidden layer, respectively. Overall, the network included the input layer with three nodes, four hidden layers, and the output layer (Fig. S3). Batch normalization, the softplus activation function (i.e. a smooth approximation to ReLU), and the ADAM optimization were employed for the network. The weight decay (factor for l2 regularization) was specified to be 10−5 to avoid overfitting. The initial learning rate was set to 0.01, but the learning rate was set to drop for a given number of epochs. The batch size was 20 and the value of the epoch was 15 (Fig. S3).

Spectral angle mapper

The spectral angle mapper directly compares two spectra (e.g. measured spectrum and reference spectrum) allowing for a geometric interpretation in a space with dimensionality equal to the number of discretized wavelengths (k). Specifically, the spectral angle mapper calculates an angle between the spectra in the k-dimensional space, quantifying spectral similarity in a pixel-by-pixel manner (107).

α=cos1(i=1kIA(λi)IB(λi)i=1kIA(λi)2i=1kIB(λi)2), (12)

where IA is the spectrum to be compared and IB is the reference spectrum.

fNIRS measurements

We utilized an fNIRS neuroimaging system (NIRx Medical Technologies) to concurrently measure hemodynamic signals from the brain and fingertips during the resting state of healthy adult volunteers. In brief, the head cap and finger clip had two sets of laser sources and detectors at 785 and 830 nm, respectively, and the separation distance between the laser and the detector was 30 mm. Six detection channels were used to cover all brain areas and two detection channels were employed for a fingertip. The sampling rate of fNIRS was 10 Hz. HbO2 and Hb signals were synchronized with the dual-channel imaging setup and smartphone camera. The fNIRS signals were processed for motion correction and spatial smoothing with a full width at half maximum of a 5-mm isotropic Gaussian kernel. A bandpass filter (0.01–0.1 Hz, third-order Butterworth) was used to extract the low-frequency signals of ΔHbO2 and ΔHb. The phase differences between the HbO2 and Hb signals (ΔHbO2 relative to ΔHb) were calculated after application of the bandpass filter (100).

Supplementary Material

pgad111_Supplementary_Data

Acknowledgments

We thank Norvin Bruns for helping with the imaging setup at the Weldon School of Biomedical Engineering at Purdue University and Darrin Karcher for providing the fertilized eggs (Gallus gallus domesticus, Hy-Line W-36) at the Purdue University Animal Sciences Research and Education Center poultry unit.

Contributor Information

Yuhyun Ji, Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA.

Sang Mok Park, Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA.

Semin Kwon, Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA.

Jung Woo Leem, Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA.

Vidhya Vijayakrishnan Nair, Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA.

Yunjie Tong, Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA.

Young L Kim, Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA; Purdue Institute for Cancer Research, Purdue University, West Lafayette, IN 47906, USA; Regenstrief Center for Healthcare Engineering, Purdue University, West Lafayette, IN 47907, USA; Purdue Quantum Science and Engineering Institute, Purdue University, West Lafayette, IN 47907, USA.

Ethics Approval Statement

This study was conducted with IRB approval (IRB-2022-1202) at Purdue University and informed consent was obtained from all participants.

Supplementary Material

Supplementary material is available at PNAS Nexus online.

Funding

This work was supported by the NIH Fogarty International Center (R21TW012486), the Technology Accelerator Challenge Prize from the NIH National Institute of Biomedical Imaging and Bioengineering, Veterans Affairs Merit Award (I01BX005816), the Ministry of Trade, Industry & Energy (MOTIE) in Korea (P0017306), and the Ralph W. and Grace M. Showalter Trust.

Author Contributions

Y.J. and Y.L.K. conceived the idea, developed the setup, and design the experimental design. Y.J., S.M.P., J.W.L., and V.V.N. worked on the experimental setup, imaging, and analyzing the data. Y.J., S.M.P., S.K., V.V.N., Y.T., and Y.L.K. participated in the analyses. Y.J. and Y.L.K. wrote the manuscript. Y.L.K. directed the overall research. All of the authors discussed the results and the manuscript.

Data Availability

Modified data and source codes are available at https://purr.purdue.edu/publications/4244/1 (Kim, Y. L. (2023). mHealth hyperspectral learning for hemodynamics. Purdue University Research Repository. doi:10.4231/VAFP-DW68). Data have been modified to protect potentially identifying or sensitive patient information.

References

  • 1. Yokoya  N, Grohnfeldt  C, Chanussot  J. 2017. Hyperspectral and multispectral data fusion: a comparative review of the recent literature. IEEE Geosci Remote Sens Mag. 5(2):29–56. [Google Scholar]
  • 2. Huang  LQ, Luo  RC, Liu  X, Hao  X. 2022. Spectral imaging with deep learning. Light Sci Appl. 11(1):61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Yang  ZY, Albrow-Owen  T, Cai  WW, Hasan  T. 2021. Miniaturization of optical spectrometers. Science. 371(480):6528. [DOI] [PubMed] [Google Scholar]
  • 4. Redding  B, Liew  SF, Sarma  R, Cao  H. 2013. Compact spectrometer based on a disordered photonic chip. Nat Photonics.  7(9):746–751. [Google Scholar]
  • 5. Bao  J, Bawendi  MG. 2015. A colloidal quantum dot spectrometer. Nature. 523(7558):67–70. [DOI] [PubMed] [Google Scholar]
  • 6. Ghamisi  P, et al.  2017. Advances in hyperspectral image and signal processing: a comprehensive overview of the state of the art. IEEE Geosci Remote Sens Mag. 5(4):37–78. [Google Scholar]
  • 7. Khan  MJ, Khan  HS, Yousaf  A, Khurshid  K, Abbas  A. 2018. Modern trends in hyperspectral image analysis: a review. IEEE Access. 6:14118–14129. [Google Scholar]
  • 8. Yang  ZY, et al.  2019. Single-nanowire spectrometers. Science. 365(1017):6457. [DOI] [PubMed] [Google Scholar]
  • 9. Jaiswal  G, Sharma  A, Yadav  SK. 2021. Critical insights into modern hyperspectral image applications through deep learning. Data Min Knowl Discov. 11(6):e1426. [Google Scholar]
  • 10. Reichstein  M, et al.  2019. Deep learning and process understanding for data-driven earth system science. Nature. 566(7743):195–204. [DOI] [PubMed] [Google Scholar]
  • 11. Manifold  B, Men  SQ, Hu  RQ, Fu  D. 2021. A versatile deep learning architecture for classification and label-free prediction of hyperspectral images. Nat Mach Intell. 3(4):306–315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Zhang  JG, et al.  2022. A survey on computational spectral reconstruction methods from RGB to hyperspectral imaging. Sci Rep.  12(1):11905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Tsagkatakis  G, et al.  2019. Survey of deep-learning approaches for remote sensing observation enhancement. Sensors. 19(18):3929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Hong  DF, et al.  2021. Interpretable hyperspectral artificial intelligence: when nonconvex modeling meets hyperspectral remote sensing. IEEE Geosci Remote Sens Mag. 9(2):52–87. [Google Scholar]
  • 15. Kirchner  SR, et al.  2018. Snapshot hyperspectral imaging (SHI) for revealing irreversible and heterogeneous plasmonic processes. J Phys Chem C. 122(12):6865–6875. [Google Scholar]
  • 16. Hadoux  X, et al.  2019. Non-invasive in vivo hyperspectral imaging of the retina for potential biomarker use in Alzheimer’s disease. Nat Commun.  10:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Yoon  J, et al.  2019. A clinically translatable hyperspectral endoscopy (HySE) system for imaging the gastrointestinal tract. Nat Commun.  10:1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Zhang  L, et al.  2020. Unsupervised adaptation learning for hyperspectral imagery super-resolution. IEEE/CVF Conf Comput Vis Pattern Recognit (CVPR). 3070–3079. [Google Scholar]
  • 19. Wang  LZ, Sun  C, Zhang  MQ, Fu  Y, Huang  H. 2020. DNU: deep non-local unrolling for computational spectral imaging. IEEE/CVF Conf Comput Vis Pattern Recognit (CVPR). 1658–1668. [Google Scholar]
  • 20. Zhang  WY, et al.  2021. Deeply learned broadband encoding stochastic hyperspectral imaging. Light Sci Appl. 10(1):108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Sun  YY, Zhang  JC, Liang  RG. 2022. pHSCNN: CNN-based hyperspectral recovery from a pair of RGB images. Opt Express.  30(14):24862–24873. [DOI] [PubMed] [Google Scholar]
  • 22. Du  X, et al.  2022. Label-free hyperspectral imaging and deep-learning prediction of retinal amyloid β-protein and phosphorylated tau. PNAS Nexus. 1(4):164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Wei  Q, Dobigeon  N, Tourneret  JY. 2015. Fast fusion of multi-band images based on solving a Sylvester equation. IEEE Trans Image Process. 24(11):4109–4121. [DOI] [PubMed] [Google Scholar]
  • 24. Kanatsoulis  CI, Fu  X, Sidiropoulos  ND, Ma  WK. 2018. Hyperspectral super-resolution: a coupled tensor factorization approach. IEEE Trans Signal Process. 66(24):6503–6517. [Google Scholar]
  • 25. Li  YS, Hu  J, Zhao  X, Xie  WY, Li  JJ. 2017. Hyperspectral image super-resolution using deep convolutional neural network. Neurocomputing. 266:29–41. [Google Scholar]
  • 26. Wang  Z, et al.  2019. Single-shot on-chip spectral sensors based on photonic crystal slabs. Nat Commun.  10:1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Wang  YF, Escuti  MJ, Kudenov  MW. 2019. Snapshot channeled imaging spectrometer using geometric phase holograms. Opt Express.  27(11):15444–15455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. He  QH, Wang  RK. 2019. Analysis of skin morphological features and real-time monitoring using snapshot hyperspectral imaging. Biomed Opt Express.  10(11):5625–5638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Mcclung  A, Samudrala  S, Torfeh  M, Mansouree  M, Arbabi  A. 2020. Snapshot spectral imaging with parallel metasystems. Sci Adv.  6(38):eabc7646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Altaqui  A, et al.  2021. Mantis shrimp-inspired organic photodetector for simultaneous hyperspectral and polarimetric imaging. Sci Adv.  7(10):eabe3196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Monakhova  K, Yanny  K, Aggarwal  N, Waller  L. 2020. Spectral DiffuserCam: lensless snapshot hyperspectral imaging with a spectral filter array. Optica. 7(10):1298–1307. [Google Scholar]
  • 32. Yako  M, et al.  2023. Video-rate hyperspectral camera based on a CMOS-compatible random array of Fabry–Pérot filters. Nat Photonics.  17(3):218–223. [Google Scholar]
  • 33. Bouchard  MB, Chen  BR, Burgess  SA, Hillman  EMC. 2009. Ultra-fast multispectral optical imaging of cortical oxygenation, blood flow, and intracellular calcium dynamics. Opt Express.  17(18):15670–15678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. He  Q, Wang  R. 2020. Hyperspectral imaging enabled by an unmodified smartphone for analyzing skin morphological features and monitoring hemodynamics. Biomed Opt Express.  11(2):895–910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Johnson  WR, Wilson  DW, Fink  W, Humayun  M, Bearman  G. 2007. Snapshot hyperspectral imaging in ophthalmology. J Biomed Opt.  12(1):014036. [DOI] [PubMed] [Google Scholar]
  • 36. Boniface  A, Gusachenko  I, Dholakia  K, Gigan  S. 2019. Rapid broadband characterization of scattering medium using hyperspectral imaging. Optica. 6(3):274–279. [Google Scholar]
  • 37. Mu  TK, et al.  2022. Snapshot hyperspectral imaging polarimetry with full spectropolarimetric resolution. Opt Lasers Eng.  148:106767. [Google Scholar]
  • 38. Wagadarikar  AA, Pitsianis  NP, Sun  XB, Brady  DJ. 2009. Video rate spectral imaging using a coded aperture snapshot spectral imager. Opt Express.  17(8):6368–6388. [DOI] [PubMed] [Google Scholar]
  • 39. French  R, Gigan  S, Muskens  OL. 2017. Speckle-based hyperspectral imaging combining multiple scattering and compressive sensing in nanowire mats. Opt Lett.  42(9):1820–1823. [DOI] [PubMed] [Google Scholar]
  • 40. Deng  C, et al.  2018. Snapshot hyperspectral imaging via spectral basis multiplexing in Fourier domain. Opt Express.  26(25):32509–32521. [DOI] [PubMed] [Google Scholar]
  • 41. Antipa  N, et al.  2018. Diffusercam: lensless single-exposure 3D imaging. Optica. 5(1):1–9. [Google Scholar]
  • 42. Wang  LZ, Zhang  T, Fu  Y, Huang  H. 2019. Hyperreconnet: joint coded aperture optimization and image reconstruction for compressive hyperspectral imaging. IEEE Trans Image Process. 28(5):2257–2270. [DOI] [PubMed] [Google Scholar]
  • 43. Toivonen  ME, Rajani  C, Klami  A. 2020. Snapshot hyperspectral imaging using wide dilation networks. Mach Vis Appl.  32(1):1–11. [Google Scholar]
  • 44. Bacca  J, Fonseca  Y, Arguello  H. 2021. Compressive spectral image reconstruction using deep prior and low-rank tensor representation. Appl Opt.  60(14):4197–4207. [DOI] [PubMed] [Google Scholar]
  • 45. Lin  JY, et al.  2018. Dual-modality endoscopic probe for tissue surface shape reconstruction and hyperspectral imaging enabled by deep neural networks. Med Image Anal.  48:162–176. [DOI] [PubMed] [Google Scholar]
  • 46. Brown  C, et al.  2021. Neural network-based on-chip spectroscopy using a scalable plasmonic encoder. ACS Nano. 15(4):6305–6315. [DOI] [PubMed] [Google Scholar]
  • 47. Kudenov  MW, Altaqui  A, Williams  C. 2022. Practical spectral photography II: snapshot spectral imaging using linear retarders and microgrid polarization cameras. Opt Express.  30(8):12337–12352. [DOI] [PubMed] [Google Scholar]
  • 48. Xie  H, et al.  2022. Dual camera snapshot hyperspectral imaging system via physics-informed learning. Opt Lasers Eng.  154:107023. [DOI] [PubMed] [Google Scholar]
  • 49. Xiong  J, et al.  2022. Dynamic brain spectrum acquired by a real-time ultraspectral imaging chip with reconfigurable metasurfaces. Optica. 9(5):461–468. [Google Scholar]
  • 50. Lu  GL, Fei  BW. 2014. Medical hyperspectral imaging: a review. J Biomed Opt.  19(1):010901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Pichette  J, et al.  2016. Intraoperative video-rate hemodynamic response assessment in human cortex using snapshot hyperspectral optical imaging. Neurophotonics. 3(4):1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Mori  M, et al.  2014. Intraoperative visualization of cerebral oxygenation using hyperspectral image data: a two-dimensional mapping method. Int J Comput Assist Radiol Surg.  9(6):1059–1072. [DOI] [PubMed] [Google Scholar]
  • 53. Chen  S, Liu  Q. 2012. Modified Wiener estimation of diffuse reflectance spectra from RGB values by the synthesis of new colors for tissue measurements. J Biomed Opt.  17(3):030501. [DOI] [PubMed] [Google Scholar]
  • 54. Yoshida  K, et al.  2015. Multispectral imaging of absorption and scattering properties of in vivo exposed rat brain using a digital red-green-blue camera. J Biomed Opt.  20(5):051026. [DOI] [PubMed] [Google Scholar]
  • 55. Yoon  SC, et al.  2015. Hyperspectral imaging using RGB color for foodborne pathogen detection. J Electron Imaging.  24(4):043008. [Google Scholar]
  • 56. Galliani  S, Lanaras  C, Marmanis  D, Baltsavias  E, Schindler  K. 2017. Learned spectral super-resolution. arXiv preprint. arXiv:1703.09470. [Google Scholar]
  • 57. Koundinya  S, et al.  2018. 2D-3D CNN based architectures for spectral reconstruction from RGB images. IEEE/CVF Conf Comput Vis Pattern Recognit Workshops (CVPRW). 957–964. [Google Scholar]
  • 58. Yan  YQ, Zhang  L, Li  J, Wei  W, Zhang  YN. 2018. Accurate spectral super-resolution from single RGB image using multi-scale CNN. Pattern Recognit Comput Vis. 11257:206–217. [Google Scholar]
  • 59. Stiebel  T, Koppers  S, Seltsam  P, Merhof  D. 2018. Reconstructing spectral images from RGB-images using a convolutional neural network. IEEE/CVF Conf Comput Vis Pattern Recognit Workshops (CVPRW). 1061–1066. [Google Scholar]
  • 60. Arad  B, et al.  2018. NTIRE 2018 challenge on spectral reconstruction from RGB images. IEEE/CVF Conf Comput Vis Pattern Recognit Workshops (CVPRW). 1042–1051. [Google Scholar]
  • 61. Kaya  B, Can  YB, Timofte  R. 2019. Towards spectral estimation from a single RGB image in the wild. IEEE/CVF Int Conf Comput Vis Workshops (ICCVW). 3546–3555. [Google Scholar]
  • 62. Park  SM, et al.  2020. Mhealth spectroscopy of blood hemoglobin with spectral super-resolution. Optica. 7(6):563–573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Zou  CZ, Wei  MH. 2021. Cluster-based deep convolutional networks for spectral reconstruction from RGB images. Neurocomputing. 464:342–351. [Google Scholar]
  • 64. Lin  YT, Finlayson  GD. 2020. Physically plausible spectral reconstruction from RGB images. IEEE/CVF Conf Comput Vis Pattern Recognit Workshops (CVPRW). 2257–2266. [Google Scholar]
  • 65. Kwak  Y, Park  SM, Ku  ZY, Urbas  A, Kim  YL. 2021. A pearl spectrometer. Nano Lett.  21(2):921–930. [DOI] [PubMed] [Google Scholar]
  • 66. Lin  YT, Finlayson  GD. 2021. On the optimization of regression-based spectral reconstruction. Sensors. 21(16):5586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Arad  B, et al.  2020. NTIRE 2020 challenge on spectral reconstruction from an RGB image. IEEE/CVF Conf Comput Vis Pattern Recognit Workshops (CVPRW). 1806–1822. [Google Scholar]
  • 68. Munos  B, et al.  2016. Mobile health: the power of wearables, sensors, and apps to transform clinical trials. Ann N Y Acad Sci.  Special Issue: Annals Reports  1375(1):3–18. [DOI] [PubMed] [Google Scholar]
  • 69. Wood  CS, et al.  2019. Taking connected mobile-health diagnostics of infectious diseases to the field. Nature. 566(7745):467–474. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Mathews  SC, et al.  2019. Digital health: a path to validation. npj Digital Med. 2:1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Hunt  B, Ruiz  AJ, Pogue  BW. 2021. Smartphone-based imaging systems for medical applications: a critical review. J Biomed Opt.  26(4):040902–040902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Hussain  I, Bowden  AK. 2021. Smartphone-based optical spectroscopic platforms for biomedical applications: a review. Biomed Opt Express.  12(4):1974–1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Hussain  I, et al.  2022. A multi-channel smartphone-based spectroscopic system for high-throughput biosensing in low-resource settings. Analyst. 147(13):3007–3016. [DOI] [PubMed] [Google Scholar]
  • 74. Sedhom  R, Mcshea  MJ, Cohen  AB, Webster  JA, Mathews  SC. 2021. Mobile app validation: a digital health scorecard approach. npj Digital Med. 4(1):111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Von Rueden  L, et al.  2021. Informed machine learning–A taxonomy and survey of integrating knowledge into learning systems. IEEE Trans Knowl Data Eng. 35(1):614–633. [Google Scholar]
  • 76. Iten  R, Metger  T, Wilming  H, Del Rio  L. 2020. Discovering physical concepts with neural networks. Phys Rev Lett. 124(1):010508. [DOI] [PubMed] [Google Scholar]
  • 77. Arrieta  AB, et al.  2020. Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI. Inf Fusion. 58:82–115. [Google Scholar]
  • 78. Karniadakis  GE, et al.  2021. Physics-informed machine learning. Nat Rev Phys. 3(6):422–440. [Google Scholar]
  • 79. Cuomo  S, et al.  2022. Scientific machine learning through physics-informed neural networks: where we are and what’s next. J Sci Comput.  92(3):88. [Google Scholar]
  • 80. Skopal  A, et al.  2014. Early evolution of the extraordinary Nova Delphini 2013 (V339 Del). Astron Astrophysics. 569:A112. [Google Scholar]
  • 81. Finlayson  G, Darrodi  MM, Mackiewicz  M. 2016. Rank-based camera spectral sensitivity estimation. J Opt Soc Am A. 33(4):589–599. [DOI] [PubMed] [Google Scholar]
  • 82. Ji  Y, Kwak  Y, Park  SM, Kim  YL. 2021. Compressive recovery of smartphone RGB spectral sensitivity functions. Opt Express.  29(8):11947–11961. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Finlayson  GD, Zhu  YT. 2019. Finding a colour filter to make a camera colorimetric by optimisation. Comput Color Imaging. 11418:53–62. [Google Scholar]
  • 84. Yoon  SC, et al.  2019. Hyperspectral image recovery using a color camera for detecting colonies of foodborne pathogens on agar plate. J Biosyst Eng. 44(3):169–185. [Google Scholar]
  • 85. Flock  ST, Patterson  MS, Wilson  BC, Wyman  DR. 1989. Monte Carlo modeling of light-propagation in highly scattering tissues. 1. Model predictions and comparison with diffusion-theory. IEEE Trans Biomed Eng. 36(12):1162–1168. [DOI] [PubMed] [Google Scholar]
  • 86. Finlay  JC, Foster  TH. 2004. Effect of pigment packaging on diffuse reflectance spectroscopy of samples containing red blood cells. Opt Lett.  29(9):965–967. [DOI] [PubMed] [Google Scholar]
  • 87. Amelink  A, Robinson  DJ, Sterenborg  HJCM. 2008. Confidence intervals on fit parameters derived from optical reflectance spectroscopy measurements. J Biomed Opt.  13(5):054044. [DOI] [PubMed] [Google Scholar]
  • 88. Rajaram  N, Gopal  A, Zhang  XJ, Tunnell  JW. 2010. Experimental validation of the effects of microvasculature pigment packaging on in vivo diffuse reflectance spectroscopy. Lasers Surg Med.  42(7):680–688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Boas  DA, Pitris  C, Ramanujam  N. 2011). Handbook of biomedical optics. Boca Raton: CRC Press. p. 787. [Google Scholar]
  • 90. Wang  LV, Wu  HI. 2012. Biomedical optics: principles and imaging. Hoboken, NJ: John Wiley & Sons. [Google Scholar]
  • 91. Vo-Dinh  T. 2014. Biomedical photonics handbook: biomedical diagnostics. Boca Raton, FL: CRC press. [Google Scholar]
  • 92. Simonyan  K, Vedaldi  A, Zisserman  A. 2013. Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv preprint. arXiv:1312.6034. [Google Scholar]
  • 93. Selvaraju  RR, et al.  2017. Grad-cam: visual explanations from deep networks via gradient-based localization. IEEE Int Conf Comput Vis. 618–626. [Google Scholar]
  • 94. Perry  MM. 1988. A complete culture system for the chick-embryo. Nature. 331(6151):70–72. [DOI] [PubMed] [Google Scholar]
  • 95. Yoon  J, Grigoroiu  A, Bohndiek  SE. 2020. A background correction method to compensate illumination variation in hyperspectral imaging. Plos One. 15(3):e0229502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. Abraham  E, Fink  SE, Markle  DR, Pinholster  G, Tsang  M. 1985. Continuous monitoring of tissue pH with a fiberoptic conjunctival sensor. Ann Emerg Med.  14(9):840–844. [DOI] [PubMed] [Google Scholar]
  • 97. Lin  JY, Fisher  DE. 2007. Melanocyte biology and skin pigmentation. Nature. 445(7130):843–850. [DOI] [PubMed] [Google Scholar]
  • 98. Khansari  MM, Tan  M, Karamian  P, Shahidi  M. 2018. Inter-visit variability of conjunctival microvascular hemodynamic measurements in healthy and diabetic retinopathy subjects. Microvasc Res.  118:7–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99. Watanabe  H, et al.  2017. Hemoglobin phase of oxygenation and deoxygenation in early brain development measured using fNIRS. Natl Acad Sci U S A. 114(9):E1737–E1744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Liang  ZH, et al.  2018. Symbolic time series analysis of fNIRS signals in brain development assessment. J Neural Eng.  15(6):066013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101. Zhang  J. 2017. Film-like images with super-fine details using Foveon X3 technology. Imaging Syst Appl. IM3E-4. [Google Scholar]
  • 102. Zhao  N, Catrysse  PB, Fan  S. 2021. Perfect RGB-IR color routers for sub-wavelength size CMOS image sensor pixels. Adv Photonics Res. 2(3):2000048. [Google Scholar]
  • 103. Zou  X, et al.  2022. Pixel-level Bayer-type colour router based on metasurfaces. Nat Commun.  13(1):1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104. Royle  NJ, Surai  PF, Mccartney  RJ, Speake  BK. 1999. Parental investment and egg yolk lipid composition in gulls. Funct Ecol.  13(3):298–306. [Google Scholar]
  • 105. Van Veen  RL, et al.  2005. Determination of visible near-IR absorption coefficients of mammalian fat using time- and spatially resolved diffuse reflectance and transmission spectroscopy. J Biomed Opt.  10(5):054004. [DOI] [PubMed] [Google Scholar]
  • 106. Lagarias  JC, Reeds  JA, Wright  MH, Wright  PE. 1998. Convergence properties of the Nelder-Mead simplex method in low dimensions. SIAM J Optim. 9(1):112–147. [Google Scholar]
  • 107. Kruse  FA, et al.  1993. The spectral image-processing system (sips)—interactive visualization and analysis of imaging spectrometer data. Remote Sens Environ.  44(2–3):145–163. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

pgad111_Supplementary_Data

Data Availability Statement

Modified data and source codes are available at https://purr.purdue.edu/publications/4244/1 (Kim, Y. L. (2023). mHealth hyperspectral learning for hemodynamics. Purdue University Research Repository. doi:10.4231/VAFP-DW68). Data have been modified to protect potentially identifying or sensitive patient information.


Articles from PNAS Nexus are provided here courtesy of Oxford University Press

RESOURCES