Skip to main content
PLOS One logoLink to PLOS One
. 2021 Oct 28;16(10):e0259155. doi: 10.1371/journal.pone.0259155

Design of effective value calculation model for dynamic dataflow of infrared gas online monitoring

Dong Xiao 1,2,*, Lu Huang 3, Mohamed Keita 1,4, Hailun He 5, Dayong Chen 1, Jin Li 6,*
Editor: Pasquale Avino7
PMCID: PMC8553131  PMID: 34710194

Abstract

The development of “CC30A CH4-CO2 combined analyzer” with infrared gas sensor as the core detection device can be widely used in online gas component analysis. In data analysis, the maximum value and arithmetic mean of the sensor data for each test period are not effective value. The characteristics of the dynamic data are: (1) Each DAW completes one test for one parameter, there is a unique effective value; (2) In test state, the fluctuation of the sensor value gradually decreases when approaching to the end of the test. An effective value calculation model was designed using the method of dimensionality reduction of dynamic data. The model was based on the distribution characteristics of the process data, and consists of 4 key steps: (1) Identify the Data Analysis Window (DAW) and build DAW dataset; (2) Calculate the value of optimal DAW dataset segmentation and build DAW subdataset; (3) Calculate the arithmetic mean (Mc) and count the amount of data in each subdataset (Fc), and build the optimal segmentation statistical set; (4) Effective value calculation and error evaluation. Calculation result with 50 sets of monitor data conformed that the EVC model for dynamic data of gas online monitoring meets the requirements of experimental accuracy requirements and test error. This method can be independently calculated without relying on the feedback information of the monitoring device, and it has positive significance for using the algorithm to reduce the hardware design complexity.

Introduction

CH4 and CO2 are the key products in the anaerobic fermentation of coal or biomass, and the trend analysis of gas componsition changes is the key factor for the control of anaerobic digestion. Gas composition online analysis technology has gradually diversified with the improvement of the accuracy of gas sensors [1]. Especially using infrared CH4-CO2 gas sensor instead of gas chromatography (GC) to achieve low-cost online gas components detection and analysis [2, 3]. At the same time, the optimization of data analysis methods (such as profile monitoring technology [4, 5] and ensemble model [6]) and the improvement of computing capabilities of single-chip microcomputer, has further improved the reliability and accuracy of online monitoring [7]. When the sensor is working, it analyzes the target gas concentration in its gas chamber in real time, and continuously send the value to microcomputer [8], forming a data flow which is named dynamic data flow. However, in order to avoid the mutual interference of different gas samples and ensure the test accuracy, the analyzer will flush gas chamber with N2 or air after complete each test [9]. Meanwhile, the response time of sensor further enhanced the delay effect of test result [10]. For sensors with quick response time, such as noncontact thermopile [11], pressure difference sensor [12], hall sensor [13], etc., the raw sensor data is correlated with the measured parameter and identified effective data, when the data collection period is longer than the sensor response period. And for the sensor with slow response time (such as infrared CH4 sensors [14], the response time longer than 20 S), the effective data will be mixed with process data when the data collection frequency is shorter than the sensor response period is. Infrared CH4-CO2 sensor is a typical case. These two factors cause the gas composition to be in dynamic changes all the time in sensor gas chamber, and effective data and process data mixes in dynamic data flow. More than 80% of the sensor data is process data, which is invalid data, when data collection frequency is set to 1 data/S.

In the gas composition online analysis technology, the tube bundle system is widely used in coal mines [15, 16], anaerobic digestion systems [17, 18], and fermentation industries [19]. This system uses one set/group of gas sensors to perform gas concentration tests from multiple monitoring points in patrol read mode [16]. And it suitable to test the atmosphere where the gas composition is relatively stable and the gas concentration changes slowly. CH4 and CO2 yield monitoring of coal biogasification experiment are a typical case of tube bundle system utilization [20]. Affected by the characteristics of the biodegradation of organic compounds in coal, the experiment cycle is usually more than 200 days, or even one year [21, 22]. When setting the test frequency for each experiment sample to 8 hours, one system can serve more than 60 samples. Gas analysis measures 80 experiment samples in a patrol model, and outputs one set of data every second (take the IR-EK2 infrared CH4-CO2 sensor evaluation kit as an example). The total amount of data can reach 1.66 billion for one standard coal biogasification experiment, if all the data would be recorded. However, only 1.4% of the data is effective data. The storage of a large amount of process data is not only waste storage resources, but also waste computing resources for data analysis. Meanwhile, limited by the computing power of the ARM7 processor [23], it is necessary to design an algorithm that occupies less computing resources to ensure that the processor has enough computing power to process concurrent instructions. Therefore, the dynamic dataflow analysis and effective value calculation would be important for simplifying data processing and improving the utilization of computing resources.

Methods

Gas chromatographic analysis of gas composition

1.00mL of gas samples were collected from every experiment sample in each test cycle. And the CH4 and CO2 contents were analyzed by gas chromatography (GC) (7890A, Agilent, America). The gas composition tested by GC was defined as GC value.

The N2 (carrier gas) flow rate was set at 1.0 mL/min. The injection port was maintained at 150°C, the oven temperature was 25°C, and the thermal conductivity detector (TCD) was operated at 200°C [24]. The retention time for methane was 3.76 minutes, and for carbon dioxide was 5.0 minutes. Calibration standards consisting of 40% CH4, 20% CO2, 10% H2 and 30% N2 were injected to generate the calibration plot.

Gas composition online monitoring

The gas composition analysis flow data was monitored using CC30A CH4-CO2 combined analyzer (Jundong, China). The gas chamber was 5.0 mL. The analysis period was set for 8 hours, one injection volume was set to 90 mL, the flow rate was set to 120 mL/min, and the dehydrator temperature was set to -25°C. The gas sample time was set to 45 S, and the test time was set to 90 S. After one test was completed, the sensor was flushed with N2 for 60 S and with dried air for 165 S. N2 and air flow rate was set to 100 mL/min. The CC30A sensitivities of CH4 and CO2 were 2000 ppm, and the resolution was 500 ppm for both gases.

Effective value analysis model design for gas online monitoring

Dynamic data characteristics of gas online test

In the coal biogasification experiment, “CC30A CH4-CO2 combined analyzer (abbreviated as CC30A)” was used to test the gas composition of each sample every 8 hours. When CC30A was in preparation state before the gas test, the gas chamber was filled with air, and the sensor data of CH4 and CO2 below 0.50%vol. This data was identified as background value.

The sensor data in test state, 90 mL gas sample was slowly injected into the gas chamber, and CH4 and CO2 concentration in the gas chamber was gradually increased. The gas sample injection was lasted for 45 S, the gas composition in the gas chamber gradually consistent with the sample (Fig 1. StageⅠ), and the slope of curve were gradually decreased. When gas sample injection process was over, the gas was sealed in gas chamber for 90 S.

Fig 1. Gas concentration curve in one monitoring window.

Fig 1

The test process was divided into three stages based on the test method and gas concentration change factors. Ct is the threshold value.

When the sensor data was tending to be stable and the test result was tending to be consistent with the GC value (Fig 1. Stage Ⅱ). After the test state was completed, the gas chamber was (why would be) flushed with N2 for 60 seconds (Fig 1. Stage Ⅲ). At the beginning of stage Ⅲ, affected by the check valve, which set at the air outlet of gas chamber, in the initial stage of N2 flushing, the short-term pressure increase in the chamber caused the sensor value to increase first and then decrease (Fig 1, stage Ⅲ). Stage Ⅰ to stage Ⅲ constitute a data analysis window (DAW). According to the CC30A CH4-CO2 combined analyzer design, in the later stage of stage Ⅱ, the sensor data tended to be stable and fluctuated around the effective value. Therefore, according to the data distribution characteristics in a monitoring window, the effective value is equal to the sensor value with the highest frequency. Because CC30A completes the gas concentration of CH4 and CO2 simultaneously, two gas test values have the same data fluctuation. To simplify the calculation of effective value, monitoring window should be identified firstly in dynamic data.

The data in Fig 1 as an example, the DAW was established based on CH4 test results, and the threshold value (Ct) was defined as 0.50% (background value). When the dynamic data was larger than the Ct, and data fluctuation conformed to stages Ⅰ- Ⅲ, this data set was defined as one Data Analysis Window (DAW). The effective value calculation model was designed based on the window.

According to the CH4 sensor data distribution in Fig 1, compare the character difference of data in stage Ⅰ/ Ⅲ and stage Ⅰ, it was: (1) The absolute value of the difference between adjacent data was larger in stage Ⅰand Ⅲ; (2) If divided the CH4 test value from Ct to the maximum into several equal parts, the amount of data in the interval of effective value was the largest. Therefore, two concepts were defined in the calculation model design: (1) Data Numerical Distribution (DND): the absolute value of the difference between adjacent data in one monitoring window data set; (2) Data Frequency Distribution of Subdataset (DFDS): the sensor value from Ct to the maximum in one DAW was divided into several subdataset. DFDS is the amount of data in every subdataset if the data was simplified according to certain hexadecimal, the data in the DAW would be divided into groups. There should be a group of dataset with the highest statistical value, and the effective value should be in this group. By this data analysis method, the effective value could be identified with the data value domain segmentation (DVDS) and statistical frequency (SF) analysis.

Effective value calculation model design of dynamic dataflow

Effective value calculation was based on a data set in one DAW. The characteristics of every dataset were: (1) each DAW completes one test for one parameter, there is a unique effective value; (2) in test state, the fluctuation of the sensor value gradually decreases when approaching the end of the test. Therefore, the frequency of each sensor data in stage Ⅰand Ⅲ was less than 2; meanwhile, the frequency of some sensor data in stage Ⅱwas higher than 2. The effective value calculation model was designed as follow steps:

(1) Identify the DAW in the dynamic dataflow.

One gas concentration test cycle was 360 S. The CC30A sensitivities of CH4 and CO2 were 2000 ppm, and the resolutions were 500 ppm for both gases. Threshold value (Ct) was been used to identify the test data and value was set to 0.50%. The first sensor data greater than Ct was defined as the DAW initial data. The time of each DAW was marked by the time of the DAW initial data plus 135 S. In order to prevent misjudgment caused by abnormal data fluctuations, it was required that 100 sensor data before the DAW initial data should less than Ct and the data volume of DAW data set no less than 150.

(2) Optimal DAW dataset segmentation.

In the second step, a DAW dataset was divided from Ct to the maximum data (Cmax) into n equal parts (n DAW subdatasets, and n is calculated by Eq 1) (Fig 2). The value of segmentation was integer multiple of CC30A resolution (Si), and the multiple was limited 1–20 (example: when the multiple value was 2, the segmentation value was recorded as S2). The smaller value of i, the larger value of n, and the greater number of subdatasets. The value of i start from 1, and count the amount of data in each subdataset (SNb) under the Si until the SNb has a unique maximum. Through a large amount of data analysis, the SNb-max should be greater than 6 in order to reduce the effective value error.

Fig 2. Monitoring window data set segmentation and effective segmentation statistical set calculation method.

Fig 2

Fig 3 as an example, when the segmentation value of i was 1, the SNb-max of two subdatasets was 3 (Fig 3A-1); when the value of i was 2, there was only one SNb-max = 4 (Fig 3B-1). Until the value of i was 10, there was only one SNb-max = 7 (Fig 3C-1) Therefore, the optimal value of DAW dataset segmentation was S10.

Fig 3. Value frequency and data dispersion changes with difference value domain segmentation.

Fig 3

Fig A shows the variation of the value frequency of different segmentation values with a histogram. Fig B shows the data dispersion for different segmentation values. The smaller the dispersion, the better the relative continuity of the values in the data set.

(3) The optimal segmentation statistical set building.

Based on DAW dataset segmentation, statistical calculation of the average value of each subdataset and DFDS values.

Step 1: Build DAW dataset. After the DAW has been established, removed the time stamp and reordered the sensor data in the window (V1 ……Vm) (Step 1 in Fig 2).

Step 2: Calculate the value of optimal DAW dataset segmentation. And divide the DAW dataset into n equal parts, build DAW subdataset (SNb, b = 1 to n).

n=CmaxSi (1)

where: n: the number of DAW subdataset;

Cmax: the largest value in DAW;

Si: segmentation value

Step 3: calculate the arithmetic mean (Mc) and count the amount of data in each subdataset (Fc). If there is no data in a subdataset, it is judged as an invalid subset and recorded as empty. The empty subset would not record into Optimal Segmentation Statistical Set (OSS).

Mc=i=bminbmaxVi(bmaxbmin)+1 (2)

Where: Mc: the c data in Optimal Segmentation Statistical Set;

Vi: a data in a DAW dataset;

bmax is the max Vi number in a DAW subdataset;

bmin is the min Vi number in a DAW subdataset;

Step 4: Build the optimal segmentation statistical set. There are two parameters in each OSS subset: ① the arithmetic mean of SNb, ② number of data in SNb.

(4) Effective value calculation.

Effective value is one data in optimal segmentation statistical set. It needs meet two conditions at the same time: ① it has the unique maximum SN; ② the effective value is no the largest OSS data. Therefore, the effective value can be found by the query method. A review calculation was introduced in this model to ensure the reliability of effective value. The principle was that the curve slope where the effective value was located tends to 0. The value dispersion (VD) was introduced, and it could analyze the proportional relationship between the adjacent data of effective value and optimal DAW dataset segmentation (Eq 3). Because the fluctuation of sensor data at the end of test was the smallest, the VD of effective value should close to 1. Taking the data of Fig 1 as an example, the VD was large in the stage Ⅰand Ⅲ and small in the Stage Ⅱ, especially for the data close to the effective value (Fig 3A-2, 3B-2 and 3C-2).

VDb=(OSSb+1OSSb1)2S0(b11) (3)

Where: VDb: the value dispersion of the b data;

OSSb: the b data in optimal segmentation statistical set;

S0: the optimal DAW dataset segmentation.

Validation of the effective value calculation model

The effective value calculation (EVC) model was verified based on 50 sets of CH4 and CO2 data which obtained in the coal biogasification experiment. The calculation process was performed according to the data acquisition time. And the error of each parameter was calculated based on GC value. The allowable error was set to≤2.5% (Fig 4).

Fig 4. The absolute error and relative error statistics of effective value analysis model calculation results.

Fig 4

Fig A and B are the effective value analysis model calculation result with absolute error for 50 sets of CH4 and CO2 data. Fig C and D show the statistical results of the data in graphs A and B respectively with 0.5% relative error interval.

Calculation result conformed that the EVC model for dynamic data of gas online monitoring less than allowable error. The relative error statistics data identified that the relative error for more than 65% data was less than 1% for both CH4 and CO2 concentration data.

Results and discussion

This model was designed for the key algorithm in CC30A CH4-CO2 combined analyzer system development. In the CC30A system, infrared CH4-CO2 sensors were used as the core analysis unit. The system needs to be physical separated between calculation unit and analysis unit, including power ground and signal transmission. To solve this problem, only one optical coupler was used in system design to establish one signal isolation path for the two units. This simplified design improved the anti-interference ability of the system, but created a problem which was how to make the computing unit fast and complete the result analysis with low calculate resource utilization. This algorithm design was based on the analysis steps and sensor characteristics of CC30A CH4-CO2 combined analyzer, and clarifies the reason and law of the fluctuation of the infrared gas component sensor data. It was to allow the computing unit to automatically lock the data analysis window and complete the effective value calculation according to data changes in dynamic data flow.

The model uses the data dimensionality reduction method to extract the data distribution characteristics. The dimensionality reduction calculation instead of curve slope change analysis or curve fitting, and completes the calculation of the effective value by the conditional judgment and mean calculation within 8 data. This algorithm calculation is mainly divided into three steps (Fig 5). The calculation of the DAW subdataset and the OSS subset completed the dimensionality reduction calculation of the original data. It is the core algorithm of the EVC model. The model was verified based on 50 sets of CH4 and CO2 data. Error analysis confirmed that the EVC model for dynamic data of gas online monitoring meets the requirements of experimental accuracy requirements and test error.

Fig 5. Effective Value Calculation (EVC) model calculation flow chart.

Fig 5

Conclusion

Real-time and accurate acquisition the concentration of CH4 and CO2 in anaerobic fermentation is key indicators for monitoring the fermentation system. The design goal of the CC30A CH4-CO2 combined analyzer is to realize low-cost, automated, real-time online analysis of key gas components. Limited by the computing power of the ARM7 processor, the effective value calculation (EVC) model has been designed. The advantage of this algorithm is that only a few simple judgments and statistics are needed to replace complex algorithms to extract the target data from the data flow. According to the principle of model operation, the model can be well utilized to dynamic data flow operations with the following characteristics. (1) the effective value is related to the data distribution characteristics, and is not the maximum or average value in dataset; (2) the calculation is independent and complete according to the fluctuation of the data, and does not rely on any peripheral devices signals. The design of the EVC model enables the calculations independently, and it has positive significance for using the algorithm to reduce the hardware design complexity.

Supporting information

S1 Dataset. Original origin data of Fig 1.

(OPJU)

S2 Dataset. Original origin data of Fig 3.

(OPJU)

S3 Dataset. Original origin data of Fig 4.

(OPJU)

S4 Dataset. Raw data in paper analysis.

(RAR)

S1 Definition

(DOCX)

Acknowledgments

The authors’ acknowledge the contributions of the following companies for allowing access to coal samples and other information used in this paper: Sihe mining, J&D Technology Company.

Abbreviations

GC

Gas Chromatography

EVC

Effective Value Calculation Model

TCD

Thermal Conductivity Detector

DND

Data Numerical Distribution

DFDS

Data Frequency Distribution of Subdataset

Si

segmentation value

DAW

Data Analysis Window

Ct

Threshold Value

SNb

the amount of data in number b subdataset

OSS

Optimal Segmentation Statistical Set

VD

value dispersion

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

This work was supported by The Open Research Project of State Key Laboratory of Coal Resources and Safe Mining (grant number SKLCRSM17KFA08: DX), The Independent Research Project of State Key Laboratory of Coal Resources and Safe Mining, CUMT (grant number SKLCRSM19X0012: DX), The Key Research and Development Project of Xuzhou Science and Technology (grant number KC20137: JL). D.X. and J.L. wrote and review this paper.

References

  • 1.Murkin JM, Arango M. Near-infrared spectroscopy as an index of brain and tissue oxygenation. British Journal of Anaesthesia. 2009;103: 3–13. doi: 10.1093/bja/aep299 [DOI] [PubMed] [Google Scholar]
  • 2.Ward AJ, Bruni E, Lykkegaard MK, Feilberg A, Adamsen APS, Jensen AP, et al. Real time monitoring of a biogas digester with gas chromatography, near-infrared spectroscopy, and membrane-inlet mass spectrometry. Bioresource Technology. 2011;102: 4098–4103. doi: 10.1016/j.biortech.2010.12.052 [DOI] [PubMed] [Google Scholar]
  • 3.Lu H, Iseley T, Behbahani S, Fu L. Leakage detection techniques for oil and gas pipelines: State-of-the-art. Tunnelling and Underground Space Technology. 2020. doi: 10.1016/j.tust.2019.103249 [DOI] [Google Scholar]
  • 4.Qiu P, Zou C, Wang Z. Nonparametric profile monitoring by mixed effects modeling. Technometrics. 2010;52: 265–277. doi: 10.1198/TECH.2010.08188 [DOI] [Google Scholar]
  • 5.Qiu P, Xiang D. Univariate dynamic screening system: An approach for identifying individuals with irregular longitudinal behavior. Technometrics. 2014;56: 248–260. doi: 10.1080/00401706.2013.822423 [DOI] [Google Scholar]
  • 6.Lu H, Iseley T, Matthews J, Liao W, Azimi M. An ensemble model based on relevance vector machine and multi-objective salp swarm algorithm for predicting burst pressure of corroded pipelines. Journal of Petroleum Science and Engineering. 2021;203. doi: 10.1016/j.petrol.2021.108585 [DOI] [Google Scholar]
  • 7.Xu Z-D, Yang Y, Miao A-N. Dynamic Analysis and Parameter Optimization of Pipelines with Multidimensional Vibration Isolation and Mitigation Device. Journal of Pipeline Systems Engineering and Practice. 2021;12. doi: 10.1061/(asce)ps.1949-1204.0000504 [DOI] [Google Scholar]
  • 8.Zhang K, Luo W, Wang T, Yang J, Yuan Y, Zhang Z, et al. Gas chamber and thermal isolation structure simulation for an integrated NDIR gas sensor. 15th IEEE International Conference on Nano/Micro Engineered and Molecular System, NEMS 2020. 2020. doi: 10.1109/NEMS50311.2020.9265577 [DOI] [Google Scholar]
  • 9.Maffei N, Kuriakose AK. A solid-state potentiometric sensor for hydrogen detection in air. Sensors and Actuators, B: Chemical. 2004;98. doi: 10.1016/j.snb.2003.09.024 [DOI] [Google Scholar]
  • 10.Schmidt M, Linke P, Esser D. Recent development in IR sensor technology for monitoring subsea methane discharge. Marine Technology Society Journal. 2013;47. doi: 10.4031/MTSJ.47.3.8 [DOI] [Google Scholar]
  • 11.Schmidt W. New manufacturing technology improves thermopile sensors. Laser Focus World. 1995;31: 77–79. [Google Scholar]
  • 12.Price C, Parker D, Nester C. Validity and repeatability of three in-shoe pressure measurement systems. Gait and Posture. 2016;46: 69–74. doi: 10.1016/j.gaitpost.2016.01.026 [DOI] [PubMed] [Google Scholar]
  • 13.Paun MA, Sallese JM, Kayal M. Hall effect sensors design, integration and behavior analysis. Journal of Sensor and Actuator Networks. 2013;2: 85–97. doi: 10.3390/jsan2010085 [DOI] [Google Scholar]
  • 14.Dinh TV, Choi IY, Son YS, Kim JC. A review on non-dispersive infrared gas sensors: Improvement of sensor detection limit and interference correction. Sensors and Actuators, B: Chemical. 2016;231: 529–538. doi: 10.1016/j.snb.2016.03.040 [DOI] [Google Scholar]
  • 15.Karl Zipf R, Marchewka W, Mohamed K, Addis J, Karnack F. Tube bundle system: For monitoring of coal mine atmosphere. Mining Engineering. 2013;65: 57. [PMC free article] [PubMed] [Google Scholar]
  • 16.Liang Y, Zhang J, Wang L, Luo H, Ren T. Forecasting spontaneous combustion of coal in underground coal mines by index gases: A review. Journal of Loss Prevention in the Process Industries. 2019;57: 208–222. doi: 10.1016/j.jlp.2018.12.003 [DOI] [Google Scholar]
  • 17.Li L, Peng X, Wang X, Wu D. Anaerobic digestion of food waste: A review focusing on process stability. Bioresource Technology. 2018. doi: 10.1016/j.biortech.2017.07.012 [DOI] [PubMed] [Google Scholar]
  • 18.Ardolino F, Parrillo F, Arena U. Biowaste-to-biomethane or biowaste-to-energy? An LCA study on anaerobic digestion of organic waste. Journal of Cleaner Production. 2018;174. doi: 10.1016/j.jclepro.2017.10.320 [DOI] [Google Scholar]
  • 19.Omar B, El-Gammal M, Abou-Shanab R, Fotidis IA, Angelidaki I, Zhang Y. Biogas upgrading and biochemical production from gas fermentation: Impact of microbial community and gas composition. Bioresource Technology. 2019;286. doi: 10.1016/j.biortech.2019.121413 [DOI] [PubMed] [Google Scholar]
  • 20.Choi YC, Li XY, Park TJ, Kim JH, Lee JG. Numerical study on the coal gasification characteristics in an entrained flow coal gasifier. Fuel. 2001;80: 2193–2201. doi: 10.1016/S0016-2361(01)00101-6 [DOI] [Google Scholar]
  • 21.Xiao D, Peng SP, Wang EY. Fermentation enhancement of methanogenic archaea consortia from an Illinois basin coalbed via DOL emulsion nutrition. PLoS ONE. 2015;10: e0124386. doi: 10.1371/journal.pone.0124386 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Xiao D, Peng SP, Wang BY, Yan XX. Anthracite bio-degradation by methanogenic consortia in Qinshui basin. International Journal of Coal Geology. 2013;116–117: 46–52. doi: 10.1016/j.coal.2013.06.008 [DOI] [Google Scholar]
  • 23.Jeong GY, Park JS. Design of 32-bit RISC processor and efficient verification. Proceedings—KORUS 2003: 7th Korea-Russia International Symposium on Science and Technology. 2003.
  • 24.Stra̧poć D, Picardal FW, Turich C, Schaperdoth I, Macalady JL, Lipp JS, et al. Methane-producing microbial community in a coal bed of the Illinois Basin. Applied and Environmental Microbiology. 2008;74: 2424–2432. doi: 10.1128/AEM.02341-07 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Jamie Males

28 Aug 2021

PONE-D-20-38017

Design of Effective Value Calculation Model for Dynamic Dataflow of Infrared Gas Online Monitoring

PLOS ONE

Dear Dr. Xiao,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please pay particular attention to carefully addressing the points raised by Reviewer 1. Regarding the references suggested by Reviewer 2, please only include these if you feel they are relevant to the context of your study.

Please submit your revised manuscript by Oct 10 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Jamie Males

Staff Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The paper is well written and interesting. I recommend accepting it after a minor revision.

1. Please note the standardization of writing. For example, there should be no colon after "results and discussion".

2. Literature review is insufficient. I suggest that the author read the following papers and add them to the literature review:

(1) Lu, H., Iseley, T., Matthews, J., Liao, W., & Azimi, M. (2021). An ensemble model based on relevance vector machine and multi-objective salp swarm algorithm for predicting burst pressure of corroded pipelines. Journal of Petroleum Science and Engineering, 203, 108585.

(2) Xu, Z. D., Yang, Y., & Miao, A. N. (2021). Dynamic Analysis and Parameter Optimization of Pipelines with Multidimensional Vibration Isolation and Mitigation Device. Journal of Pipeline Systems Engineering and Practice, 12(1), 04020058.

(3) Lu, H., Iseley, T., Behbahani, S., & Fu, L. (2020). Leakage detection techniques for oil and gas pipelines: State-of-the-art. Tunnelling and Underground Space Technology, 98, 103249.

Reviewer #2: This study attempts to explain design of effective value calculation model for dynamic data-flow of infrared gas online monitoring. The study develops an effective value calculation model using the method of dimensionality reduction of dynamic data. The model was based on the distribution characteristics of the process data, and consisted of four steps. This research aids in improving knowledge on gas monitoring and should be considered for publication after addressing the following suggestions / comments:

1. The introduction section of the study should include the background of the study and the concept of dynamic data flow analysis for clarity on the subject. This is necessary to offer an understanding of the topic and scope of the study.

2. The results and discussion section should be enhanced further. In fact, it primarily offers a summary and repetition of information already appearing in both introduction and methods section. Therefore, this section needs to be revised to offer a discussion of the study results.

3. The study lacks conclusion section. This is an integral part of any research and need to be added.

4. Additionally, the following style and grammatical errors should be addressed:

i. Check the in-text references and make sure they are according to the author guideline. Avoid any mixing two different referencing styles. For example, in line 71 there is a superscript 9. Is this for the foot notes or a typo?

ii. Ensure that you use correct symbols all through the paper. Example, check line 103

iii. Follow the provided author guidelines while numbering and formatting your equations

iv. Kindly proof read your document before submission to ensure that all sentences are well structured.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Joseph Githiria

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PONE-D-20-38017 review comments.docx

PLoS One. 2021 Oct 28;16(10):e0259155. doi: 10.1371/journal.pone.0259155.r002

Author response to Decision Letter 0


13 Sep 2021

REVIEWER #1:

The paper is well written and interesting. I recommend accepting it after a minor revision.

Q1. Please note the standardization of writing. For example, there should be no colon after "results and discussion".

Answer:wither another reviewer's suggestion, a “conclusion” chapter was added after "results and discussion".

The chapter 6“6. Declarations” has been modified as “agreements” according to the PLOS ONE specification.

Q2. Literature review is insufficient. I suggest that the author read the following papers and add them to the literature review:

(1) Lu, H., Iseley, T., Matthews, J., Liao, W., & Azimi, M. (2021). An ensemble model based on relevance vector machine and multi-objective salp swarm algorithm for predicting burst pressure of corroded pipelines. Journal of Petroleum Science and Engineering, 203, 108585.

(2) Xu, Z. D., Yang, Y., & Miao, A. N. (2021). Dynamic Analysis and Parameter Optimization of Pipelines with Multidimensional Vibration Isolation and Mitigation Device. Journal of Pipeline Systems Engineering and Practice, 12(1), 04020058.

(3) Lu, H., Iseley, T., Behbahani, S., & Fu, L. (2020). Leakage detection techniques for oil and gas pipelines: State-of-the-art. Tunnelling and Underground Space Technology, 98, 103249.

Answer: These three articles are very interesting and meaningful. They have been added in references.

REVIEWER #2:

This study attempts to explain design of effective value calculation model for dynamic data-flow of infrared gas online monitoring. The study develops an effective value calculation model using the method of dimensionality reduction of dynamic data. The model was based on the distribution characteristics of the process data, and consisted of four steps. This research aids in improving knowledge on gas monitoring and should be considered for publication after addressing the following suggestions / comments:

1. The introduction section of the study should include the background of the study and the concept of dynamic data flow analysis for clarity on the subject. This is necessary to offer an understanding of the topic and scope of the study.

Answer: the introduction section has been rewritten, and add background of this study (as follows). The definition of dynamic data flow has been added in “Definition” and “Introduction” section.

CH4 and CO2 are the key products in the anaerobic fermentation of coal or biomass, and the trend analysis of gas componsition changes is the key factor for the control of anaerobic digestion. Gas composition online analysis technology has gradually diversified with the improvement of the accuracy of gas sensors[1]. For example, using infrared CH4-CO2 gas sensor instead of gas chromatography (GC) to achieve low-cost online gas components detection and analysis[2,3]. At the same time, the optimization of data analysis methods (such as profile monitoring technology[4,5] and ensemble model[6]) and the improvement of computing capabilities of single-chip microcomputer, has further improved the reliability and accuracy of online monitoring. When the sensor is working, it analyzes the target gas concentration in its gas chamber in real time, and continuously send the value to microcomputer, forming a data flow which is named dynamic data flow. However, taking the CC30A CH4-CO2 combined analyzer (Jundong, China) as an example, in order to avoid the mutual interference of the gas samples from each channel and ensure the test accuracy, the analyzer will flush gas chamber with N2 after complete each test. Meanwhile, the response time of sensor further enhanced the delay effect of test result. For sensors with quick response time (such as noncontact thermopile[7], pressure difference sensor [8], hall sensor[9], etc.), the raw sensor data is correlated with the measured parameter and identified effective data, when the data collection period is longer than the sensor response period. And for the sensor with slow response time (such as infrared CH4 sensors[10], the response time longer than 20 S), the effective data will be mixed with process data when the data collection frequency is much shorter than the sensor response period is. Infrared CH4-CO2 sensor is a typical case. These two factors cause the gas composition to be in dynamic changes all the time in sensor gas chamber, and effective data and process data mixes in dynamic data flow. More than 80 % of the sensor data is process data, which is invalid data, when data collection frequency is set to 1 data/S.

Q2. The results and discussion section should be enhanced further. In fact, it primarily offers a summary and repetition of information already appearing in both introduction and methods section. Therefore, this section needs to be revised to offer a discussion of the study results.

Answer:the result and discussion have been rewritten, and add a new figure (Figure 5) has been added.

This model was designed for the key algorithm in CC30A CH4-CO2 combined analyzer system development. In the CC30A system, infrared CH4-CO2 sensors were used as the core analysis unit. The system needs to be physical seperated between calculation unit and analysis unit, including power ground and signal transmission. To solve this problem, only one optical coupler was used in system design to establish one signal isolation path for the two units. This simplified design improved the anti-interference ability of the system, but created a problem which was how to make the computing unit fast and complete the result analysis with low calculate resource utilization. This algorithm design was based on the analysis steps and sensor characteristics of CC30A CH4-CO2 combined analyzer, and clarifies the reason and law of the fluctuation of the infrared gas component sensor data. It was to allow the computing unit to automatically lock the data analysis window and complete the effective value calculation according to data changes in dynamic data flow.

The model uses the data dimensionality reduction method to extract the data distribution characteristics. The dimensionality reduction calculation instead of curve slope change analysis or curve fitting, and completes the calculation of the effective value by the conditional judgment and mean calculation within 8 data. This algorithm calculation is mainly divided into three steps (Figure 5) . The calculation of the DAW subdataset and the OSS subset completed the dimensionality reduction calculation of the original data. It is the core algorithm of the EVC model. The model was verified based on 50 sets of CH4 and CO2 data. Error analysis confirmed that the EVC model for dynamic data of gas online monitoring meets the requirements of experimental accuracy requirements and test error.

Figure 5 here

Figure 5 effective value calculation (EVC) model calculation flow chart.

Q3. The study lacks conclusion section. This is an integral part of any research and need to be added.

Answer:This suggestion is very important, the “Conclusion” section has been added.

6. Conclusion

Real-time and accurate acquisition the concentration of CH4 and CO2 in anaerobic fermentation is key indicators for monitoring the fermentation system. The design goal of the CC30A CH4-CO2 combined analyzer is to realize low-cost, automated, real-time online analysis of key gas components. Limited by the computing power of the ARM7 processor, the effective value calculation (EVC) model has been designed. The advantage of this algorithm is that only a few simple judgments and statistics are needed to replace complex algorithms to extract the target data from the data flow. According to the principle of model operation, the model can be well utilized to dynamic data flow operations with the following characteristics. (1) the effective value is related to the data distribution characteristics, and is not the maximum or average value in dataset; (2) the calculation is independent and complete according to the fluctuation of the data, and does not rely on any peripheral devices signals. The design of the EVC model enables the calculations independently, and it has positive significance for using the algorithm to reduce the hardware design complexity.

Q4. Additionally, the following style and grammatical errors should be addressed:

i. Check the in-text references and make sure they are according to the author guideline. Avoid any mixing two different referencing styles. For example, in line 71 there is a superscript 9. Is this for the foot notes or a typo?

Answer: mistakes of referencing styles have been revised.

ii. Ensure that you use correct symbols all through the paper. Example, check line 103

Answer: symbols mistakes have been revised in paper.

iii. Follow the provided author guidelines while numbering and formatting your equations

Answer: all the equations and numbering have been re-edit.

iv. Kindly proof read your document before submission to ensure that all sentences are well structured.

Answer:The grammar and presentation of the thesis have been modified by native English students.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Pasquale Avino

14 Oct 2021

Design of Effective Value Calculation Model for Dynamic Dataflow of Infrared Gas Online Monitoring

PONE-D-20-38017R1

Dear Dr. Xiao,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Pasquale Avino, Ph.D.

Academic Editor

PLOS ONE

Acceptance letter

Pasquale Avino

19 Oct 2021

PONE-D-20-38017R1

Design of Effective Value Calculation Model for Dynamic Dataflow of Infrared Gas Online Monitoring

Dear Dr. Xiao:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Pasquale Avino

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Dataset. Original origin data of Fig 1.

    (OPJU)

    S2 Dataset. Original origin data of Fig 3.

    (OPJU)

    S3 Dataset. Original origin data of Fig 4.

    (OPJU)

    S4 Dataset. Raw data in paper analysis.

    (RAR)

    S1 Definition

    (DOCX)

    Attachment

    Submitted filename: PONE-D-20-38017 review comments.docx

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES