Skip to main content
Nature Communications logoLink to Nature Communications
. 2025 Oct 24;16:9409. doi: 10.1038/s41467-025-64184-6

Targeted high-resolution sensing of volatile organic compounds by covalent nanopore detection

Lauren E McGivern 1,#, Zhong Hui Lim 1,#, Yizhi Yuan 1, Zonghua Bo 1, Guangqi Wu 1, Hagan Bayley 1,, Yujia Qing 1,
PMCID: PMC12552616  PMID: 41136404

Abstract

Volatile organic compounds are choice analytes in a variety of contexts. For example, humans release over 4000 volatile organic compounds, many of which are diagnostic of life-threatening medical conditions. The analysis of combinations of a large number of potential analytes requires the application of costly, cumbersome technology. In this work, we show that covalent nanopore sensing can be used for the targeted detection of a reduced set of analytes in a mixture. In this case aldehydes, which constitute ~5% of human volatiles, can be selectively detected by using reversible thiol-aldehyde chemistry. Further, nanopore engineering permits high-resolution detection, which allows closely related aldehydes, including isomers, to be distinguished. Differential sensing of members of other chemical classes, such as mono alcohols, is also demonstrated by leveraging their enzymatic conversion into aldehydes. Our approach is compatible with the use of low-cost, portable, user-friendly diagnostic devices applicable to a wide variety of objectives, including pollutant monitoring, food and beverage testing and the quality control of pharmaceuticals, as well as disease diagnostics.

Subject terms: Single-molecule biophysics, Metabolomics, Nanopores


Volatile organic compounds are important analytes in medical diagnostics, with aldehydes forming a characteristic subclass. Here, the authors report that covalent nanopore sensing enables the targeted detection of aldehydes in mixtures through reversible thiol–aldehyde chemistry, offering a route to portable and low-cost analysis.

Introduction

Aldehydes are ubiquitously produced through chemical processes across various biological, environmental, and industrial contexts. These include lipid peroxidation in the human body1, incomplete combustion during wood burning2, and Strecker degradation during beer storage3. Single-molecule detection of diverse aldehydes against background chemicals therefore offers wide applications.

For example, small-molecule aldehydes are found in human breath or bodily fluids, alongside a spectrum of volatile organic compounds (VOCs) generated through metabolic processes1 or by the gut microbiome4. Metabolic alterations in specific conditions, such as cancers5, gastrointestinal diseases6, respiratory disorders7, and viral infections8, lead to distinct VOC profiles, which offer a non-invasive window into the body’s internal biochemical processes. The current gold standard for detection of small molecules is liquid or gas chromatography-mass spectrometry (LC/GC-MS)9, which generates a near-complete profile of all collected VOCs, but requires centralised labs that use expensive equipment and sophisticated analysis packages.

The targeted detection of aldehydes, comprising about 170 species among over 4000 VOCs of human origin10, presents an appealing approach to reduce the complexity of small-molecule fingerprints, while retaining significant diagnostic value. Aldehyde mixtures in breath or bodily fluids commonly contain linear or branched, unsaturated or saturated carbon chains, containing from 1 to 17 carbon atoms, as well as aromatic species such as benzaldehyde and its derivatives10. A ratiometric fingerprint of aldehydes holds promise for disease detection and health monitoring, provided that tests can be made rapid, simple and inexpensive. For example, the ethanal:pentanal:heptanal ratio in breath shifts from 112:1:1.5 in healthy individuals to 24:1.7:1 in patients with lung cancer11, and similarly, the hexanal:heptanal ratio in urine changes from 1.3:1 to 2.5:112. For COVID infection, the abundance of octanal and benzaldehyde relative to each other is a hallmark of recent infection8.

We have established single-molecule covalent sensing mediated by protein nanopores to detect analytes based on chemical reactivity13. The concept was first demonstrated with engineered α-hemolysin (αHL) pores1417 and later validated by others using alternative nanopores1820. Analyte molecules form covalent bonds reversibly or irreversibly with a sensing group on the internal surface of a pore, thereby generating characteristic changes in the ionic current flowing through the pore under a transmembrane potential. When examining mixed analytes, the current signatures (e.g., the current blockade amplitude and the noise during blockades) identify analytes, while the frequency of reversible events reveals the concentration of individual analytes21.

One general challenge is to identify sensing chemistry with suitable kinetics to ensure rapid and quantitative detection of analytes; the ideal lifetimes of covalent analyte-pore adducts and the intervals between sensing events are tens to hundreds of milliseconds, which can be reliably measured by electrical recording. This requirement has so far limited the practical application of covalent sensing. Another challenge is the rational engineering of nanopore sensors to resolve structures with minor variations, such as chain isomers that differ by the position of a methyl group. To date, this has largely relied on trial and error. A more systematic investigation is essential to guide the future development of covalent sensors.

In this work, we focus on the development of a single-molecule, real-time detection technology for rapid ratiometric profiling of aldehydes. We exploit hemithioacetal chemistry for the covalent sensing of aldehydes within a thiol-containing αHL nanopore, achieving rapid, high-resolution analysis by the formation of short-lived adducts that are identified by a machine learning algorithm. The diastereomeric adducts produce current signatures with differences that are accentuated by rational engineering of nanopores, affording information that discriminates between closely related molecular isomers. We extend the scope of our approach to mono alcohols by selectively converting them to aldehydes. Hence, we have developed a versatile means for the targeted detection of a subset of analytes present within a complex mixture of molecules.

Results

Covalent detection of aldehydes through reversible hemithioacetal formation

Stochastic sensing of aldehyde analytes was achieved using reversible thiol-aldehyde chemistry. To eliminate background reactions, notably imine formation and metal chelation, four mutations were introduced in wild-type (WT) αHL (i.e., AG = αHL-K8A-M113G-K131G-K147G) (Supplementary Fig. 1)22. A single-cysteine mutation was then introduced at position 115 (i.e., AG-T115C). Heteroheptameric nanopores containing one cysteine-bearing subunit, (AG)6(AG-T115C), were prepared and used for covalent detection (Fig. 1a, see Methods). (AG)6(AG-T115C) carried a single-channel current (IP) of −129 ± 3 pA at −50 mV (N = 80 pores) (recording buffer: 2 M KCl, 200 mM PIPES and 20 μM EDTA at pH 6.8).

Fig. 1. Single-molecule covalent sensing of aldehydes.

Fig. 1

a The engineered α-hemolysin (αHL) nanopore, (AG)6(AG-T115C), bears a single cysteine at position 115 on one of the seven subunits (boxed). b An aldehyde molecule was detected upon reaction with the cysteine thiol to form a hemithioacetal adduct. Aldehydes are hydrated to varying extents in aqueous solutions. Concentrations of free aldehyde were calculated by using the hydration constant Khyd determined by 1H NMR. c Single-channel recordings at −50 mV (trans) with 0.0, 3.0, 5.8, 8.5 or 11 mM propanal (trans) in 2 M KCl, 200 mM PIPES and 20 μM EDTA at pH 6.8. Signals were low-pass filtered at 10 kHz and sampled at 50 kHz. Traces were further filtered at 100 Hz for display. Current levels corresponded to the hemithioacetal adducts formed (level 2, orange) and the unoccupied nanopore (level 1, blue). d Rates of adduct formation (von) and dissociation (voff), recorded with individual pores, plotted against total propanal concentration showing bimolecular kinetics for the forward association step (level 1 to 2), and unimolecular kinetics for the reverse dissociation step (level 2 to 1). Source data are provided as a Source Data file.

The introduction of an aldehyde in the trans compartment (Fig. 1a) produced reversible current blockades, which were attributed to the formation of hemithioacetal adducts (Fig. 1b, c and Supplementary Figs. 210, see Supplementary Section 2). Residual currents (Ires) after hemithioacetal formation are given as percentages of the open pore current (Ires% = Ires/IP × 100%) (Supplementary Table 1). For example, for propanal, Ires% = 98.7 ± 0.1% (N = 3 pores, >300 events). Aldehydes exist in both hydrated and non-hydrated forms in aqueous solution23. The rates of hemithioacetal formation (von) were expressed in terms of the total aldehyde concentration and were consistent with bimolecular kinetics (i.e., for propanal, von = kon[propanal]tot, where kon is the observed rate constant of adduct formation). The rates of hemithioacetal dissociation (voff) were independent of propanal concentration, consistent with a unimolecular step (i.e., voff = koff, where koff is the rate constant of adduct dissociation) (Fig. 1d and Supplementary Figs. 29). Both the association and dissociation reactions were pH-dependent and single-channel recordings were performed at pH 6.8. At this pH value, frequent events occurred at µM-mM analyte concentrations (e.g., ~500 events within 10 min for 3 mM butanal), and the mean lifetimes of the hemithioacetal adducts were long enough to allow accurate aldehyde identification from Ires% values (e.g., ~130 ms for butanal).

In total, 10 different aldehydes were characterised with the (AG)6(AG-T115C) nanopore (Fig. 2a and Supplementary Table 1), ranging from straight-chain to branched-chain to aromatic aldehydes. We demonstrated single CH2 resolution in distinguishing straight-chain aldehydes (Fig. 2b and Supplementary Table 1). From ethanal to octanal, each additional CH2 group reduced the Ires% value by ~0.7%. Interestingly, our approach was able to distinguish between diastereomeric hemithioacetal adducts with opposite chirality at the Cα position for both heptanal and octanal (i.e., ∆Ires% = 0.29 ± 0.03% for heptanal (N = 3 pores, >100 events for each diastereomer); 0.39 ± 0.05% for octanal (N = 2 pores, >40 events for each diastereomer)). We arbitrarily assigned adducts with larger Ires% as diastereomers A and those with smaller Ires% as diastereomers B (i.e., Ires%,A > Ires%,B). For shorter straight-chain aldehydes, diastereomeric adducts were not clearly separated by (AG)6(AG-T115C) with ∆Ires% < 0.2%. Aromatic aldehydes produced higher Ires% values than saturated aliphatic aldehydes with similar masses (i.e., Ires% = 97.5 ± 0.1% for benzaldehyde (N = 3 pores, >300 events) and 96.1 ± 0.1% for hexanal (N = 3 pores, >400 events); Ires% = 96.6 ± 0.1% for phenylacetaldehyde (N = 4 pores, >400 events) and 95.7 ± 0.1% and 95.4 ± 0.1% for heptanal (N = 3 pores, >100 events for each diastereomer)). We speculate that the open-chain aldehydes extend away from the protein wall, creating larger steric blockades than the corresponding cyclic aromatic aldehydes. Branched-chain aldehydes and their straight-chain isomers produced similar Ires% values (i.e., Ires% = 98.1 ± 0.1% for 2-methylpropanal (N = 5 pores, >500 events) and 97.9 ± 0.1% for butanal (N = 4 pores, >400 events)), with ∆Ires% ~ 0.2% for 2-methylpropanal and butanal simultaneously recorded with the same pore (N = 1 pore, >600 events).

Fig. 2. Thiol-aldehyde chemistry within the (AG)6(AG-T115C) nanopore.

Fig. 2

a Aldehydes reacted with the cysteine at position 115 to generate hemithioacetal adducts reversibly. Diastereomeric adducts were resolved for heptanal and octanal. Diastereomer A was arbitrarily assigned with a higher Ires% than diastereomer B. Recording conditions: −50 mV (trans) with 2 M KCl, 200 mM PIPES and 20 μM EDTA at pH 6.8. Aldehyde concentrations (trans): 5.8 mM ethanal, 5.8 mM propanal, 6.1 mM 2-methylpropanal, 5.8 mM butanal, 5.7 mM benzaldehyde, 5.5 mM pentanal, 5.5 mM phenylacetaldehyde, 2.7 mM hexanal, 1.5 mM heptanal, 1.3 mM octanal. Signals were low-pass filtered at 10 kHz and sampled at 50 kHz. Traces were further filtered at 100 Hz for display. b Ires% values for the hemithioacetal adducts. Error bars are standard deviations of Ires% values determined with >3 different nanopores (Except for octanal which is with 2 different nanopores). c Rate constants for adduct formation corrected for hydration (konʹ) and dissociation (koff). Errors are standard deviations across 3 different nanopores (Except for octanal which is for 2 different nanopores). Corrected rate constants for adduct formation were obtained from observed rate constants (kon) by using the equation: konʹ = kon × (1 + Khyd). Rates of adduct formation for octanal were not reported due to the poor solubility of octanal in recording solutions, precluding accurate determinations. Source data are provided as a Source Data file.

Simultaneous detection of aldehydes facilitates ratiometric profiling

To enable quantitative aldehyde detection, kinetic analysis of the formation and dissociation of hemithioacetal adducts was carried out. The solubility of each aldehyde in the recording solution was measured by NMR in the presence of internal standards (5 mM calcium formate and 5 mM maleic acid). Aldehydes formed hydrates to varying extents in aqueous solutions, and the hydrated forms were unreactive towards thiols and thus not detected by single-channel recording (Fig. 1b)24. We determined the hydration equilibrium constants (Khyd) by 1H NMR under electrical recording conditions (Supplementary Tables 2, 3 and Supplementary Fig. 11) and derived the corrected rate constants of adduct formation (konʹ) by using the equations:

von=konaldehydetot=konaldehydeald 1
Khyd=[aldehyde]hyd/[aldehyde]ald=[aldehyde]tot/[aldehyde]ald1 2
kon=kon×1+Khyd 3

von, rate of hemithioacetal formation; kon, observed rate constant of hemithioacetal formation; konʹ, corrected rate constant of hemithioacetal formation; Khyd, hydration equilibrium constant; [aldehyde]tot, total concentration of aldehyde; [aldehyde]hyd, concentration of aldehyde in the hydrate form; [aldehyde]ald, concentration of free aldehyde.

For the bimolecular formation of hemithioacetal adducts, konʹ gradually decreased as chain length increased, from ~0.5 mM‒1s‒1 for propanal to ~0.3 mM‒1s‒1 for heptanal (Fig. 2c). While the energy difference is small, these observations could reflect the steric hindrance for the thiolate to approach the carbonyl group along the Bürgi-Dunitz angle, caused by the conformationally labile alkyl chains25. As rates of adduct dissociation were independent of aldehyde concentration, no correction for hydration was required. For the unimolecular dissociation of hemithioacetal adducts formed with straight-chain aldehydes, values of koff remained within an order of magnitude, gradually decreasing, as the chain length increased, from 9.4 s‒1 for ethanal to ~5 s‒1 for hexanal, heptanal and octanal (Fig. 2c). This could be attributed to the positive inductive effect of the increasing alkyl chain length.

Simultaneous detection of multiple aldehydes with a single nanopore was demonstrated with a mixture of 7 straight-chain aldehydes (i.e., ethanal to octanal) (Fig. 3a). The consistent current signatures for individual aldehydes and the clear separation between them enabled automated assignment of events by using machine learning (see Supplementary Section 5). Almost 1000 individual events per aldehyde were collected from separate traces of each analyte (Supplementary Fig. 12). A stratified random split was performed, leaving 30% of the events as the test set. Three features were extracted to characterise each event: Ires%, event duration, and the root-mean-squared noise of the event (see Supplementary Section 5). The random forest model achieved the highest accuracy of 98% on both the training and test sets, which was calculated as the fraction of correctly classified events in the dataset, with manual labels taken as the ground truth (Supplementary Fig. 13).

Fig. 3. Single-molecule profiling of aldehyde mixtures.

Fig. 3

a Simultaneous detection of 7 straight-chain aldehydes at the single-molecule level. Total aldehyde concentrations (trans): 0.7 mM ethanal, 0.7 mM propanal, 0.7 mM butanal, 0.5 mM pentanal, 0.6 mM hexanal, 0.5 mM heptanal, and 0.3 mM octanal. b Ratiometric detection of mixtures of butanal and pentanal at varying ratios. Ratios of aldehydes were altered mid-experiment by incrementally replacing the analyte solution in the trans chamber with a butanal stock solution (see Supplementary Section 6). Recording conditions: −50 mV (trans) with 2 M KCl, 200 mM PIPES and 20 μM EDTA at pH 6.8. Signals were low-pass filtered at 10 kHz and sampled at 50 kHz. Traces were further filtered at 100 Hz for display. c Measured total concentrations of pentanal and butanal were in good agreement with expected total concentrations (i.e., <10% difference). Errors are standard deviations of concentrations determined with 3 different nanopores. Source data are provided as a Source Data file.

Additionally, we demonstrated ratiometric measurements of aldehydes in mixtures with pentanal and butanal, for which the detection frequencies reflected their varied concentration ratios (Fig. 3b, c, see Supplementary Section 6). Within <10 minutes of single-channel recording, ratios of adduct formation events could be used to accurately determine the concentrations of pentanal and butanal (i.e., <10% difference between measured and expected concentrations) (Fig. 3c, see Supplementary Section 6). For disease diagnosis and product quality control applications, a more practical, and equally informative approach, is to directly use the ratios of adduct formation rates as fingerprints of relative concentrations of aldehydes in samples.

Rational nanopore engineering for diastereomer and structural isomer resolution

Challenged by the similar current signatures observed with aldehyde structural isomers (e.g., butanal and 2-methylpropanal) with the (AG)6(AG-T115C) nanopore, rational nanopore engineering was undertaken. We also hypothesised that current level resolution of diastereomeric hemithioacetal adducts would provide an additional layer of information to aid structural isomer resolution. To this end, we designed two additional nanopores in which the reactive cysteine residue was positioned within a narrower, and hence more sensitive, region of the nanopore β barrel (Fig. 4a). In the (MK)6(MK-T115C) nanopore (MK = αHL-K8A-K131G-K147G), methionine residues were reintroduced at position 113 to reduce the internal diameter near the sensing site (Supplementary Fig. 14). In the (AG)6(AG-G137C) nanopore, the cysteine residue was moved to position 137, which was in close proximity to the narrowest region of the nanopore bearing an AG background (Supplementary Fig. 14). Three asparagine-to-alanine mutations around the sensing site were further introduced to reduce steric hindrance and promote diastereomeric interactions of hemithioacetal adducts with the local protein environment, leading to the (AG)6(AG-G137C-Ala3) nanopore (AG-G137C-Ala3 = AG-N121A-N123A-G137C-N139A) (Fig. 4a).

Fig. 4. Diastereomer discrimination in engineered nanopores facilitates structural isomer resolution.

Fig. 4

a Rational engineering of nanopores to enhance diastereomer discrimination through two parallel strategies: Left: narrowing the pore diameter around the sensing group (yellow); Right: moving the sensing group to the narrowest internal site. Mutations are highlighted relative to the WT background in all subunits (blue) or in the cysteine-bearing subunit (red). b Within the (MK)6(MK-T115C) nanopore, diastereomeric adducts were resolved for all straight-chain aldehydes tested. Diastereomer A was arbitrarily assigned with a higher Ires% than diastereomer B. c Within the (AG)6(AG-T115C) nanopore, diastereomeric adducts were resolved for only heptanal and octanal. d Within the (AG)6(AG-G137C) nanopore, diastereomeric adducts were resolved for all straight-chain aldehydes tested. e Good current level separation was achieved across all diastereomeric adducts in the (MK)6(MK-T115C) nanopore. f A pair of chain isomers, butanal and 2-methylpropanal, were distinguished by diastereomers B in the (AG)6(AG-G137C) nanopore or by diastereomers A in the (AG)6(AG-G137C-Ala3) nanopore. g Propanal, butanal, 2-methylpropanal, and pentanal were detected and distinguished within a single (MK)6(MK-T115C) pore. Butanal and 2-methylpropanal were distinguished by the diastereomer A of 2-methylpropanal and the diastereomer B of butanal. Recording conditions: −50 mV (trans) with 2 M KCl, 200 mM PIPES and 20 μM EDTA at pH 6.8. Signals were low-pass filtered at 10 kHz and sampled at 50 kHz. Traces were further filtered at 50 Hz for display. Source data are provided as a Source Data file.

The Ires% of 7 different straight-chain aldehydes (i.e., ethanal to octanal) were characterised with the (MK)6(MK-T115C) and (AG)6(AG-G137C) nanopores (Fig. 4b–d and Supplementary Tables 4, 5). Improved diastereomer discrimination was observed in both nanopores: diastereomeric hemithioacetal adducts from ethanal to octanal were easily separable in both nanopores: 0.3% <∆Ires% < 1.0%, in the (MK)6(MK-T115C) nanopore (Fig. 4b, e and Supplementary Table 4) and 0.4% <∆Ires% < 1.6%, in the (AG)6(AG-G137C) nanopore (Fig. 4d and Supplementary Table 5). Good current level separation was achieved across all diastereomeric adducts in the (MK)6(MK-T115C) nanopore (Fig. 4e), whereas some overlap was seen in the (AG)6(AG-G137C) nanopore (e.g., Ires% = 95.9 ± 0.1% for diastereomer B of butanal and 95.8 ± 0.1% for diastereomer A of pentanal).

Current level resolution of diastereomeric hemithioacetal adducts allowed for structural isomer resolution. In the (AG)6(AG-G137C) nanopore, butanal and 2-methylpropanal could be distinguished based on diastereomers B (i.e., ΔIres% = 0.25 ± 0.02% for diastereomers B of butanal and 2-methylpropanal, N = 3 pores, >35 events for each aldehyde) (Fig. 4f). Further, within the (AG)6(AG-G137C-Ala3) nanopore, diastereomer pairs were better resolved for both butanal and 2-methylpropanal; distinct separation between diastereomers A enabled clear resolution of these chain isomers (i.e., ΔIres% = 0.42 ± 0.03% for diastereomers A of butanal and 2-methylpropanal, N = 4 pores, >60 events for each aldehyde) (Fig. 4f). In the (MK)6(MK-T115C) nanopore, butanal and 2-methylpropanal could be distinguished by using diastereomer A of 2-methylpropanal and diastereomer B for butanal (i.e., ΔIres% = 0.43 ± 0.03% for diastereomer B of butanal and diastereomer A of 2-methylpropanal, N = 2 pores, >50 events for each aldehyde). As a proof of concept, simultaneous detection of propanal, butanal, 2-methylpropanal and pentanal was demonstrated in the (MK)6(MK-T115C) nanopore (Fig. 4g). As the current level blockades for pentanal and propanal do not overlap with those of butanal and 2-methylpropanal, all four aldehydes could be differentiated from Ires% alone.

Enzyme-assisted differential sensing of alcohols and aldehydes

Nanopore covalent sensing is inherently selective for a targeted class of analytes. For example, in a mixture of alcohols (1-pentanol and 1-hexanol) and aldehydes (propanal and butanal), only the aldehydes produced events within the (AG)6(AG-T115C) nanopore sensor (Fig. 5, Left). After treatment with an engineered alcohol oxidase26, additional aldehyde species—pentanal and hexanal—were identified (Fig. 5, Right), revealing the presence of alcohols in the original sample. This simple functional group conversion step, coupled to nanopore sensing, therefore allowed for differential single-molecule detection of alcohols and aldehydes in a mixture—a powerful strategy worth further development.

Fig. 5. Enzyme-facilitated indirect alcohol detection.

Fig. 5

The (AG)6(AG-T115C) nanopore selectively detected aldehydes in an aldehyde-alcohol mixture (Left). After treatment with an engineered alcohol oxidase (AcCO6), additional aldehyde species—pentanal and hexanal—were observed (Right), which indicated the presence of the corresponding alcohols in the original sample. Recording conditions: −50 mV (trans) with 2 M KCl, 190 mM PIPES and 20 μM EDTA at pH 6.8. Signals were low-pass filtered at 10 kHz and sampled at 50 kHz. Traces were further filtered at 100 Hz for display.

Discussion

The present work exploited thiol-aldehyde chemistry, a previously unexplored dynamic covalent chemistry within protein nanopores, to target the sensing of aldehydes, a key class of chemicals ubiquitous in daily life. We achieved single-molecule identification of 10 straight-chain, branched-chain, and aromatic aldehydes, which represent potential biomarkers for lung cancer5, Crohn’s disease6, SARS-CoV-28, as well as being atmospheric pollutants2, and beverage impurities3. Our approach resolved straight-chain aldehydes that differed by single CH2 groups, members of hemithioacetal diastereomeric pairs, and structural isomers—challenges often faced by conventional small-molecule detection methods. Compared to traditional LC/GC-MS methods, pre-derivatisation of aldehydes (e.g., with hydrazine-based chromophores) was not required27. Additionally, we demonstrated rational engineering of the protein environment surrounding the covalent sensing site, achieving improved signal separation for diastereomeric adducts by promoting interactions with the pore interior. Rational engineering of the sensing region thus holds promise for improving small-molecule covalent detection with a given nanopore scaffold.

As a proof of concept, we established ratiometric profiles of mixed aldehydes at mM concentrations with >400 events collected in 10 min with a single nanopore. Given that aldehydic VOCs ranging from propanal to nonanal are exhaled at 3-300 pM concentrations28, in a commercialised device, pre-concentration steps would be coupled to nanopore detection. For example, 3-300 µM concentrations of aldehydes can theoretically be obtained by pre-concentrating 5 L of breath volatiles into 5 μL of solvent by using solid-phase microextraction (SPME)29,30. Detection of analytes in nanolitre droplets (i.e., 300 nL) was previously demonstrated in a microfluidics set-up, and could also be integrated here31. Further, selective enrichment of aldehydes has been achieved using SPME devices with fibre coatings selective for aldehydes28,32,33—this would simultaneously eliminate potentially interfering chemical classes (e.g., disulfides) that might hinder disease diagnosis. For longer chain aldehydes with poor solubilities (i.e., <1 µM solubilities), the concentration of VOCs into solvents such as water/ethanol solutions might be considered, although this would require alternative bilayer systems with greater tolerance to organic solvents. Improved detection frequency might also be achieved by using nanopore engineering, e.g., mutants with multiple covalent sensing sites14 or mutagenesis of proximate residues to improve cysteine thiol reactivity34. In addition, nanopore sensing devices containing arrays of pores would enhance throughput (e.g., the MinION device contains up to 512 active pores per flow cell)35. Based on our findings, we envision an accessible nanopore platform for rapid aldehyde detection, paving the way for applications in disease diagnosis, environmental monitoring, and the quality control of food and beverages.

Moving forward, the detection of other biologically relevant chemical classes can be explored by capitalising on the aldehyde-sensing system developed here. As central metabolites, aldehydes are generated by a wide range of enzymes from various functional groups, including carboxylic acids, primary alcohols, and primary amines36. Many of these convertible chemical classes are also potential disease biomarkers. For example, 1-pentanol, detected as 1-pentanal in this work, has been found in the exhaled breath of lung cancer patients but not healthy individuals5. While we have demonstrated mono alcohol sensing after enzymatic conversion to aldehydes, the detection of other chemical classes will require the application of means for the direct chemical conversion to reactive molecules or the identification of suitable enzymes for conversion, particularly with respect to substrate scope and catalytic efficiency. In the long term, to leverage thiol-aldehyde sensing chemistry, we envision a versatile sensing workflow that employs a suite of reagents for conversion to aldehydes, followed by rapid single-molecule detection in low-cost, portable, user-friendly devices.

Methods

Preparation of plasmids

Construction of plasmids encoding mutant αHL monomers, together with the primers required, are described in the Supplementary Information. Plasmids are available from the corresponding authors upon request.

Preparation of nanopore sensors

Nanopore heteroheptamers were prepared according to the procedure below, which is a modification of a method previously reported37.

αHL monomers were prepared with an E. coli in vitro transcription and translation (IVTT) system (E. coli T7 S30 Extract System for Circular DNA, Cat #L1130, Promega). Prior to use, the T7 S30 extract provided in the kit was treated with 1 µL rifampicin (1 µg/mL, final concentration) to suppress transcription by E. coli RNA polymerase. A standard reaction comprised: DNA plasmid mixture ( < 4 µg, plasmids encoding cysteine-free and cysteine-containing subunits were in an 8:1 ratio), amino acid mixture without methionine (5 µL, as supplied in the kit), S30 premix without amino acids (20 µL, as supplied in the kit), [35S]methionine (2 µL, 1200 Ci/mmol, 15 mCi/mL, MP Biomedicals), rabbit red blood cell membranes (2 µL, ~ 1 mg protein/mL), and T7 S30 extract for circular DNA (15 µL, as supplied in the kit). The reaction mixture was incubated at 37 °C for 2 h.

Next, MBSA buffer (1 mL; 3-morpholinopropane-1-sulfonic acid (10 mM), NaCl (150 mM), bovine serum albumin (1 mg/mL), pH 7.4) was added to the reaction mixture. The mixture was centrifuged at 25,000 x g for 10 min at 4 °C. The supernatant was then removed, and the pellet solubilised at room temperature with 2× Laemmli sample buffer (25 µL) which contains 10% 2-mercaptoethanol. Before loading the samples, a 5.5% SDS/PAGE gel was pre-run with 1× Tris-Glycine SDS running buffer containing dithiothreitol (2 mM) and sodium thioglycolate (1 mM). The resuspended pellets were then loaded onto the gel and electrophoresed at +70 V overnight (13 h).

αHL heptamers containing different numbers of mutant subunits were separated in the gel based on their electrophoretic mobilities which were determined by the number of octa-aspartate (D8) tails present (i.e., each cysteine-bearing mutant subunit contained a D8 tail). Hence, the top band corresponded to homoheptamers bearing no octa-aspartate tail (i.e., (AG)7 or (MK)7), the second band corresponded to heteroheptamers bearing a single octa-aspartate tail (i.e., (mutant-D8)1(AG)6 or (mutant-D8)1(MK)6) and so on, with consecutive bands having an aspartate-tail-free subunit replaced with a mutant subunit bearing an octa-aspartate tail. In this work, the second band from the top contained the desired protein pore with a single cysteine residue.

To extract the protein pores, the gel was dried under vacuum onto Whatman 3MM filter paper for 5 h at 50 °C. The dried gel was then exposed to photographic film (Kodak Bio Max MR autoradiography film) for 8 h and the developed film was used to locate the target protein bands in the gel. The desired protein bands were excised and rehydrated in TE buffer (300 μL; Tris·HCl (10 mM), DTT (0.5 mM), EDTA (1 mM), pH 8.0) for 1 h at room temperature. The paper was then removed, and the gel was crushed with a plastic pestle. The resulting suspension was filtered through a 0.2 µm hydrophilic membrane filter (Proteus Mini Clarification Spin Column, Generon). The filtrate was stored in 10 µL aliquots at −80 °C.

Single-channel electrical recordings

Single-channel recordings were carried out in a planar bilayer apparatus as previously described38. A single αHL pore was allowed to insert into the bilayer. Aldehyde substrates were introduced from the trans compartment. Experiments were conducted using recording buffer containing 2 M KCl, 200 mM PIPES and 20 μM EDTA titrated to pH 6.8. Aldehyde solutions were prepared with recording buffer. Single-channel recordings were conducted with a coverslip placed atop the recording chamber.

Ionic currents were recorded by using a patch clamp amplifier (Axopatch 200B, Molecular Devices), and filtered with a low-pass Bessel filter (80 dB/decade) with a corner frequency of 10 kHz. Signals were digitised with a Digidata 1320 A digitizer (Molecular Devices) at an acquisition frequency of 50 kHz. The current traces were processed with Clampfit 10.7 (Molecular Devices). Current traces were idealised by using Clampfit 10.7 (Molecular Devices). The idealised data were analysed with QuB 2.0 software (www.qub.buffalo.edu)39. Dwell time analysis and rate constant determinations were performed by using the maximum interval likelihood algorithm of QuB40.

Arthrobacter cholorphenolicus choline oxidase (AcCO6) preparation

The plasmid pET28a-AcCO6 encoding an engineered Arthrobacter cholorphenolicus choline oxidase (AcCO6) was a kind gift from Nicholas J. Turner26. The plasmid was transformed in E. coli BL21(DE3) competent cells for expression, which were cultured in LB media (10 mL) containing 50 μg/mL kanamycin at 37 °C, 220 RPM overnight. The overnight culture was transferred to autoinduction media (1 L, LB based, Formedium, #AIMLB0110) containing 50 μg/mL kanamycin and grown at 220 RPM for 16 hours at 20 °C. Cells were harvested by centrifugation at 4000 x g for 20 min at 4 °C. The pellet was resuspended in 50 mL pre-chilled Buffer A (100 mM KPi pH 7.8, 300 mM KCl, 20 mM imidazole) supplemented with 250 U nucleases (Pierce™ Universal Nuclease for Cell Lysis, #88700). Cells were lysed by sonication (10 s pulse, 10 s pause, 3 min duration) on ice, followed by centrifugation at 40,000 x g for 30 min at 4 °C. The supernatant was filtered through a 0.45 μm syringe filter and applied to a 5 mL HisTrapTM FF column (Buffer A: 100 mM KPi pH 7.8, 300 mM KCl, 20 mM imidazole; Buffer B: 100 mM KPi pH 7.8, 300 mM KCl, 1000 mM imidazole). Fractions containing His-tagged AcCO6 were pooled and dialysed against a buffer containing 100 mM KPi pH 7.8 at 4 °C overnight. The His-tagged AcCO6 was concentrated, aliquoted, flash frozen with liquid N2, and stored at −80 °C.

Differential sensing of aldehydes and alcohols

The control aldehyde-alcohol mixture was prepared as follows: propanal, butanal, 1-pentanol, 1-hexanol (5 mM each) were dissolved in oxygen-saturated KPi buffer (100 mM, pH 7.8). The treated mixture was prepared as follows: AcCO6 (0.1 mg/mL for each mM of alcohol) was added to the control aldehyde-alcohol mixture and shaken overnight at 300 RPM at 30 °C. After incubation, both the control and treated mixtures were adjusted to pH 6.8, and diluted 4-fold with buffer (2.66 M KCl, 200 mM PIPES at pH 6.8, 20 μM EDTA) (i.e., final concentration of 2 M KCl).

Either the control (100 μL) or treated mixture (100 μL) was introduced into the trans chamber. As the control mixture was not treated with AcCO6, only propanal and butanal could be detected. Single-channel recordings with the treated mixture revealed additional events with conductance levels corresponding to hemithioacetal adducts derived from pentanal and hexanal. This confirmed the oxidation of alcohols to aldehydes after AcCO6 treatment.

Note added in proof: During the production of this paper, we reported complementary findings41 on the diastereoselectivity of reactions between a nanoreactor cysteine and aromatic aldehydes. These results further confirm that protein nanopores can discriminate hemithioacetal diastereomers at the single-molecule level.

Supplementary information

Source data

Source Data (32.7KB, xlsx)

Acknowledgements

This research was supported by the Bill & Melinda Gates Foundation, a European Research Council Advanced Grant (SYNTISU), and a European Research Council Starting Grant (NANOPRO) under the UK Research and Innovation (UKRI) Guarantee (EP/Z000351/1). Y.Y. is supported by an EPSRC Postdoctoral Fellowship (UKRI939). G.W. thanks the support of UKRI for the Horizon Europe Guarantee funding of a Marie Skłodowska-Curie Actions Postdoctoral Fellowship (EP/Z001366/1). We thank Prof. Nicholas J. Turner from the University of Manchester for providing the AcCO6-encoding plasmid. We thank Steven Barry at the Physical and Theoretical Chemistry Laboratory, University of Oxford, for fabricating the recording chambers.

Author contributions

H.B. and Y.Q. conceived and supervised the project. L.E.M. and Z.H.L. prepared the proteins and conducted the single-channel recording experiments. Z.B. prepared the script for machine learning and performed the molecular dynamics simulations. Y.Y. prepared the AcCO6 enzyme. G.W. carried out parallel sensing experiments. L.E.M., Z.H.L., Z.B. and Y.Q. performed the data analysis. L.E.M., Z.H.L., Y.Y., Z.B., H.B. and Y.Q. wrote the manuscript.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Data availability

Data supporting the findings of this study are shown in the main text, Supplementary Information and Source data. Data needed to reproduce the machine learning results are available in the repository: 10.5281/zenodo.16949339. Source data are provided with this paper.

Code availability

Codes needed to reproduce the machine learning results are available in the repository: 10.5281/zenodo.16949339.

Competing interests

H.B. is the founder of, a consultant for and a shareholder of Oxford Nanopore Technologies, a company engaged in the development of nanopore sensing and sequencing technologies. L.E.M., Z.H.L., H.B. and Y.Q. have filed patents describing the engineered nanopores and their applications in small-molecule covalent sensing. Z.B. and Y.Y. are listed as contributors to the IP. The remaining authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Lauren E. McGivern, Zhong Hui Lim.

Contributor Information

Hagan Bayley, Email: hagan.bayley@chem.ox.ac.uk.

Yujia Qing, Email: yujia.qing@chem.ox.ac.uk.

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-025-64184-6.

References

  • 1.Sutaria, S. R. et al. Lipid peroxidation produces a diverse mixture of saturated and unsaturated aldehydes in exhaled breath that can serve as biomarkers of lung cancer—a review. Metabolites12, 561 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Schauer, J. J., Kleeman, M. J., Cass, G. R. & Simoneit, B. R. T. Measurement of emissions from air pollution sources. 3. C1-C29 organic compounds from fireplace combustion of wood. Environ. Sci. Technol.35, 1716–1728 (2001). [DOI] [PubMed] [Google Scholar]
  • 3.Moreira, M. T. G., Pereira, P. R., Aquino, A., Conte-Junior, C. A. & Paschoalin, V. M. F. Aldehyde accumulation in aged alcoholic beer: addressing acetaldehyde impacts on upper aerodigestive tract cancer risks. Int. J. Mol. Sci.23, 14147 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Smolinska, A. et al. Volatile metabolites in breath strongly correlate with gut microbiome in CD patients. Anal. Chim. Acta1025, 1–11 (2018). [DOI] [PubMed] [Google Scholar]
  • 5.Rudnicka, J., Kowalkowski, T. & Buszewski, B. Searching for selected VOCs in human breath samples as potential markers of lung cancer. Lung Cancer135, 123–129 (2019). [DOI] [PubMed] [Google Scholar]
  • 6.Hicks, L. C. et al. Analysis of exhaled breath volatile organic compounds in inflammatory Bowel disease: a pilot study. J. Crohns Colitis9, 731–737 (2015). [DOI] [PubMed] [Google Scholar]
  • 7.Ibrahim, W. et al. Visualization of exhaled breath metabolites reveals distinct diagnostic signatures for acute cardiorespiratory breathlessness. Sci. Transl. Med.14, eabl5849 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ibrahim, W. et al. Diagnosis of COVID-19 by exhaled breath analysis using gas chromatography–mass spectrometry. ERJ Open Res.7, 00139-2021 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Su, R. et al. Mass spectrometry for breath analysis. Trends Anal. Chem.158, 116823 (2023). [Google Scholar]
  • 10.Drabińska, N. et al. A literature survey of all volatiles from healthy human breath and bodily fluids: the human volatilome. J. Breath. Res.15, 034001 (2021). [DOI] [PubMed] [Google Scholar]
  • 11.Fuchs, P., Loeseken, C., Schubert, J. K. & Miekisch, W. Breath gas aldehydes as biomarkers of lung cancer. Int. J. Cancer126, 2663–2670 (2010). [DOI] [PubMed] [Google Scholar]
  • 12.Guadagni, R. et al. Solid-phase microextraction–gas chromatography–mass spectrometry method validation for the determination of endogenous substances: urinary hexanal and heptanal as lung tumor biomarkers. Anal. Chim. Acta701, 29–36 (2011). [DOI] [PubMed] [Google Scholar]
  • 13.Bayley, H., Luchian, T., Shin, S.-H. & Steffensen, M. B. Single-molecule covalent chemistry in a protein nanoreactor. In Single Molecules and Nanotechnology (eds Rigler, R. & Vogel, H.) (Springer, 2008).
  • 14.Wu, H.-C. & Bayley, H. Single-molecule detection of nitrogen mustards by covalent reaction within a protein nanopore. J. Am. Chem. Soc.130, 6813–6819 (2008). [DOI] [PubMed] [Google Scholar]
  • 15.Boersma, A. J. & Bayley, H. Continuous stochastic detection of amino acid enantiomers with a protein nanopore. Angew. Chem. Int. Ed.51, 9606–9609 (2012). [DOI] [PubMed] [Google Scholar]
  • 16.Boersma, A. J., Brain, K. L. & Bayley, H. Real-time stochastic detection of multiple neurotransmitters with a protein nanopore. ACS Nano6, 5304–5308 (2012). [DOI] [PubMed] [Google Scholar]
  • 17.Ramsay, W. J. & Bayley, H. Single-molecule determination of the isomers of D-glucose and D-fructose that bind to boronic acids. Angew. Chem. Int. Ed.57, 2841–2845 (2018). [DOI] [PubMed] [Google Scholar]
  • 18.Liu, Y. et al. Nanopore identification of alditol epimers and their application in rapid analysis of alditol-containing drinks and healthcare products. J. Am. Chem. Soc.144, 13717–13728 (2022). [DOI] [PubMed] [Google Scholar]
  • 19.Fan, P. et al. Nanopore analysis of Cis-diols in fruits. Nat. Commun.15, 1969 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang, K. et al. Unambiguous discrimination of all 20 proteinogenic amino acids and their modifications by nanopore. Nat. Methods21, 92–101 (2024). [DOI] [PubMed] [Google Scholar]
  • 21.Zhang, M. et al. Real-time detection of 20 amino acids and discrimination of pathologically relevant peptides with functionalized nanopore. Nat. Methods21, 609–618 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Choi, L. S., Mach, T. & Bayley, H. Rates and stoichiometries of metal ion probes of cysteine residues within ion channels. Biophys. J.105, 356–364 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hilal, S. H., Bornander, L. L. & Carreira, L. A. Hydration equilibrium constants of aldehydes, ketones and quinazolines. QSAR Comb. Sci.24, 631–638 (2005). [Google Scholar]
  • 24.Lienhard, G. E. & Jencks, W. P. Thiol addition to the carbonyl group. Equilibria and kinetics. J. Am. Chem. Soc.88, 3982–3995 (1966). [DOI] [PubMed] [Google Scholar]
  • 25.Daw, G., Regan, A. C., Watt, C. I. F. & Wood, E. Steric effects and mechanism in the formation of hemi-acetals from aliphatic aldehydes. J. Phys. Org. Chem.26, 1048–1057 (2013). [Google Scholar]
  • 26.Heath, R. S. et al. An engineered alcohol oxidase for the oxidation of primary alcohols. ChemBioChem20, 276–281 (2019). [DOI] [PubMed] [Google Scholar]
  • 27.El-Maghrabey, M. H. et al. Comprehensive greenness evaluation of the reported chromatographic methods for aldehydes determination as clinical biomarkers and food quality indicators. Trends Anal. Chem.171, 117548 (2024). [Google Scholar]
  • 28.Poli, D. et al. Determination of aldehydes in exhaled breath of patients with lung cancer by means of on-fiber-derivatisation SPME–GC/MS. J. Chromatogr. B878, 2643–2651 (2010). [DOI] [PubMed] [Google Scholar]
  • 29.Hamidi, S. Recent advances in solid-phase extraction as a platform for sample preparation in biomarker assay. Crit. Rev. Anal. Chem.53, 199–210 (2023). [DOI] [PubMed] [Google Scholar]
  • 30.Pleil, J. D., Wallace, M. A. G., Davis, M. D. & Matty, C. M. The physics of human breathing: flow, timing, volume, and pressure parameters for normal, on-demand, and ventilator respiration. J. Breath. Res.15, 042002 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Czekalska, M. A. et al. A droplet microfluidic system for sequential generation of lipid bilayers and transmembrane electrical recordings. Lab Chip15, 541–548 (2015). [DOI] [PubMed] [Google Scholar]
  • 32.Yu, L.-Q. et al. A gate-opening controlled metal-organic framework for selective solid-phase microextraction of aldehydes from exhaled breath of lung cancer patients. Microchim. Acta185, 307 (2018). [DOI] [PubMed] [Google Scholar]
  • 33.Noreña-Caro, D. & Álvarez-Láinez, M. Functionalization of polyacrylonitrile nanofibers with β-cyclodextrin for the capture of formaldehyde. Mater. Des.95, 632–640 (2016). [Google Scholar]
  • 34.Bo, Z., Lim, Z. H., Duarte, F., Bayley, H. & Qing, Y. Mobile molecules: reactivity profiling guides faster movement on a cysteine track. Angew. Chem. Int. Ed.62, e2023008 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dreamer, D., Akeson, M. & Branton, D. Three decades of nanopore sequencing. Nat. Biotechnol.34, 518–524 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Schober, L. et al. Enzymatic reactions towards aldehydes: an overview. Flavour Fragr. J.38, 221–242 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Braha, O. et al. Designed protein pores as components for biosensors. Chem. Biol.4, 497–505 (1997). [DOI] [PubMed] [Google Scholar]
  • 38.Montal, M. & Mueller, P. Formation of bimolecular membranes from lipid monolayers and a study of their electrical properties. Proc. Natl. Acad. Sci. USA69, 3561–3566 (1972). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Nicolai, C. & Sachs, F. Solving ion channel kinetics with the QuB software. Biophys. Rev. Lett.08, 191–211 (2013). [Google Scholar]
  • 40.Qin, F., Auerbach, A. & Sachs, F. Estimating single-channel kinetic parameters from idealized patch-clamp data containing missed events. Biophys. J.70, 264–280 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Lim, Z. H., Bo, Z., Armstrong, E., Bayley, H. & Qing, Y. Unexpected diastereoselective chemistry on a 2D protein surface. Chem. 102717, 10.1016/j.chempr.2025.102717 (2025).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Source Data (32.7KB, xlsx)

Data Availability Statement

Data supporting the findings of this study are shown in the main text, Supplementary Information and Source data. Data needed to reproduce the machine learning results are available in the repository: 10.5281/zenodo.16949339. Source data are provided with this paper.

Codes needed to reproduce the machine learning results are available in the repository: 10.5281/zenodo.16949339.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES