Abstract
New protocols related to internet-of-things applications may introduce previously unnoticed measurement effects in reverberation chambers due to the narrowband nature of these protocols. Such technologies also require less loading to meet the coherence-bandwidth conditions, which may lead to higher variations, hence uncertainties, across the channel. In this work, we extend a previous study of uncertainty in NB-IoT and CAT-M1 device measurements in reverberation chambers by providing, for the first time, a comprehensive uncertainty analysis of the components related to the reference and DUT measurements. By use of a significance test, we show that certain components of uncertainty become more dominant for such narrowband protocols, and cannot be considered as negligible, as in current standardized test methods. We show that the uncertainty, if not accounted for by using the extended formulation, will be greatly overestimated and could lead to non-compliance to standards.
Keywords: CAT-M1, Cellular Telecommunications, Chamber transfer function, Internet of Things, NB-IoT, Reverberation chamber, Uncertainty, Wireless System
I. INTRODUCTION
The use of internet-of-things (IoT) or machine-to-machine (M2M) applications is gaining popularity to meet demands such as improved indoor coverage, increased reconfigurability, and mobility, that are required for 5G and beyond [1, 2]. Many of these devices will work in the FR1, or sub-6 GHz, bands using protocols such as narrowband IoT (NB-IoT) and CAT-M1 (or LTE-M) [1–3].
The performance of these cellular devices is often studied with over-the-air (OTA) tests by metrics such as Total Isotropic Sensitivity (TIS) and Total Radiated Power (TRP) [4–9]. These tests can be carried out either in an anechoic chamber (AC) or a reverberation chamber (RC). An RC is a large metal cavity, with one or more mode-stirring mechanisms to produce, on average, a uniform distribution of the fields, and can often produce faster, lower-cost, or more flexibly configurable measurements than an AC [4]. This makes an RC an excellent candidate for testing IoT devices when directional information is not required.
RCs have been researched extensively and were shown to be suitable for TIS measurements on earlier-generation protocols, such as W-CDMA (4 MHz channel bandwidth) [4–9]. However, for NB-IoT we expect additional challenges due to the narrowband nature of this protocol (180 kHz channel bandwidth). Traditionally, to provide accurate results, a wideband RC reference measurement is averaged over frequency in post processing to match the bandwidth of the modulated signal. Such frequency averaging has the added benefit of resulting in a low-uncertainty estimate of the chamber loss. When averaging the frequency response over a narrow bandwidth, the uncertainty estimate is more sensitive to peaks and nulls in the RC’s frequency response for the mode-stirring samples and may increase uncertainty.
Multiple works have studied uncertainty effects in loaded RCs for wireless-device testing for wideband protocols [10–14], but little research has been published on uncertainty in loaded RCs for narrowband protocols [15]. As [10, 13] show, for the wideband (4 MHz channel bandwidth) protocols, the uncertainty budget contains many contributing components, but generally, the biggest contributor is the chamber lack of spatial uniformity due to loading. This component can be estimated by measuring the standard deviation between independent realizations of the stepped mode-stirring sequence [10]. This method, as is advised in current standardized test methods, deems differences within an independent realization as negligible. In a previous work, we showed with preliminary results that larger variations occur within such an independent realization due to the low averaging bandwidths of narrowband protocols, as compared to wideband protocols, and that they cannot be considered as negligible [16].
In this paper, we extend the previous work by providing a comprehensive uncertainty analysis, where we include all components of uncertainty discussed in [17, 18] to obtain the total expanded uncertainty. We also show a more extensive chamber characterization, and more extensive results for uncertainty and the significance test over multiple bands, where we show that a formulation that takes both the uncertainty between independent realizations of a given mode-stirring sequence and within an independent realization into account should be used, as compared to a formulation that only takes the between uncertainty into account. Using the latter, the user may greatly overestimate the uncertainty of the measurement system, as we will show. We base the majority of our analyses on the Test Plan for Wireless Large-Form-Factor Device Over-The-Air Performance [17] by the CTIA, an organization which provides test plans for wireless-device OTA testing and is planning on providing such a test plan for NB-IoT. This work aims to aid in that goal.
In Section II, we introduce the current standardized procedure for performing TIS measurements in reverberation chambers. In Section III, we describe the theory of the significance test, where we show with measurement results that the formulation used in current standardized wideband test methods should be changed for NB-IoT. In Section IV, we show the expanded uncertainties using that formulation, which are 1.26 dB and 1.14 dB for an NB-IoT and CAT-M1 device, respectively, operating in the Cellular Band 2. The work is concluded in Section V.
II. TIS MEASUREMENT PROCEDURE
TIS is a measure of the minimum received power that a device can accept without incurring an unacceptably low throughput or an unacceptably high error rate for a certain protocol. An illustration of a typical RC setup for a TIS measurement is shown in Fig. 1. The measurement procedure is as follows: A wireless link is established between a base-station emulator (BSE) and a device under test (DUT), where the BSE transmits a signal at decreasing power levels at the downlink frequency, and measures the DUT’s reported throughput or error rate at the uplink frequency. Per the CTIA test plan [17], TIS measurements are performed using data throughput as the measurement metric. The TIS for the NB-IoT and CAT-M1 protocols corresponds to the minimum downlink power required to provide a data throughput rate greater than or equal to 95% of the maximum throughput of the reference measurement channel. We measure the BSE power for a high value of starting power and as long as the throughput is higher than this threshold, we step the power down until the throughput drops below the threshold to obtain a minimum power for each mode-stirring sample. This process is repeated for every sample in the stepped mode-stirring sequence, and then averaged over all mode-stirring samples to obtain TIS [17].
Fig. 1.:

Illustration of the RC setup for TIS, including a turntable for position stirring, which is needed in loaded chamber measurements. The DUT is replaced by a second antenna for the reference measurement of chamber loss.
Usually, we need to load the chamber by adding RF absorbers to flatten the RC’s frequency response allowing us to keep the communication link between the BSE and the DUT while measuring TIS. This is due to the fact that, in an unloaded chamber, the frequency selectivity is usually too high for the DUT’s equalizers. Loading increases frequency correlation and reduces spatial uniformity, which may increase uncertainty if not compensated for using position stirring with, for example, a turntable as shown in Fig. 1 [19]. The amount of loading necessary can be determined from the coherence bandwidth (CBW), defined as the average bandwidth over which the frequency samples have a minimum specified level of correlation [19]. In general, the CBW needs to be wider than the channel bandwidth to maintain the link [17].
In the CTIA Test Plan for Wireless Large-Form-Factor Device Over-the-Air Performance [17], TIS is calculated from
| (1) |
where PTIS is the total isotropic sensitivity in W and the total efficiency of the measurement antenna (see Fig. 1). Gcable is the cable loss between the measurement antenna and the BSE, PBSE(m) is the minimum received power measured by the BSE at the threshold throughput in W for mode-stirring sample m, 〈·〉M is an ensemble average over the total number of mode-stirring samples M. Gref is the chamber transfer function given by [17, 19]
| (2) |
where is the total efficiency of the reference antenna (not shown in Fig. 1) and 〈·〉F is an ensemble average over F frequencies across the channel bandwidth. Gref is frequency averaged over the same bandwidth as the DUT channel being measured. The uncertainty in all metrics introduced in (1) and (2) should be taken into account in a comprehensive uncertainty analysis, as we will discuss in Section IV.
III. SIGNIFICANCE TEST
In this section, we perform a significance test to determine what formulation should be used to estimate uncertainty in both the reference and the DUT measurements.
A). Theory
The current CTIA formulation for reverberation-chamber-induced uncertainty is based on the concept that the lack of spatial uniformity is the dominant component of uncertainty, since chambers are typically loaded for the widest channel bandwidth to be tested, which is often 4 MHz [5, 17, 19]. This type of uncertainty is calculated from the variation between different independent realizations of the stepped mode-stirring sequence, which assumes uncertainty due to variations within an independent realization to be negligible. However, for a narrow channel bandwidth, such as that of NB-IoT, this is not the case. To illustrate this, we perform a significance test as described in detail in [10] to determine which uncertainty should be used:
Only the variation between independent realizations (lack of spatial uniformity) of the mode-stirring sequence is dominant. This is the current CTIA formulation.
Both the variation due to the number of samples within the mode-stirring sequence and the variation between independent realizations of the mode-stirring sequence are included. This is the formulation we propose for NB-IoT and CAT-M1 measurements.
The ‘significance’ is determined in an F-test [10, 20] and is defined as the ratio between the variance in the between and within samples. The significance is compared to a threshold derived from a 95th-percentile of an F-distribution, with NB − 1 and NB(NW − 1) degrees of freedom, respectively. The 95th-percentile corresponds to 95 % probability of the lower cumulative distribution function of the F-distribution, NB is the number of independent realizations and NW is the number of samples within an independent realization [10, 20]. If the calculated significance value is above the specified threshold, between differences dominate (Formulation 1) and the following formulation may be used to estimate the uncertainty of the reference measurement [10]:
| (3) |
If within and between differences are both significant (Formulation 2), the formulation that should be used is given by [10]:
| (4) |
Note that Formulation 2 yields a lower uncertainty-estimate due to the number of samples and degrees of freedom in the denominator [20]. Physically, this means that when the lack of spatial uniformity dominates the uncertainty, the uncertainty can be significantly higher unless the stirring sequence includes a large number of spatial-stirring samples [10]. In the comprehensive uncertainty analysis, both the uncertainty in the reference measurement and the DUT measurement are taken into account, where , since typically NB = 1 for the DUT [17]. This is because a test lab typically performs a single measurement of each device. Next, we discuss the measurement setting for estimating GRef, such that we can calculate the significance, and the uncertainty using both formulations. We applied the significance test to these measurements with the results described in Section III-C.
B). Measurement Setup and Mode-Stirring Sequence
Measurements were carried out in a 4.6 m × 3.1 m × 2.8 m RC at the National Institute of Standards and Technology (NIST), as shown in Fig. 2, which has one paddle as a mode-stirring mechanism and a turntable and height translation for position stirring. A vector network analyzer (VNA) was used in all measurements, with an IF BW setting of 1 kHz, a source power of −8 dBm and a 1 kHz frequency spacing. We focus on 3 different sub-bands of the Cellular NB-IoT Band 2, each with a 10 MHz bandwidth, centered at 1930 MHz, 1960 MHz and 1990 MHz. All results are shown for the frequency-averaging bandwidths of both narrowband protocols NB-IoT (180 kHz) and CAT-M1 (1.4 MHz), and one of 2 MHz, to study a more wideband protocol. We averaged all transmission-coefficient results over these three bandwidths, with which we computed GRef and the significance. To investigate the effects of loading, we used one measurement setup with “light loading” (two absorbers) and one with “heavy loading” (eight absorbers), resulting in coherence-bandwidth values of 1.5 MHz and 3.3 MHz, respectively. We calculated the coherence bandwidth with a threshold of 0.5 [14, 17]. The measurement setup with eight absorbers is shown in Fig. 2. The unloaded CBW is on the order of 500 kHz, which is larger than the channel bandwidth of NB-IoT. However, for this large chamber, we always introduce some loading to minimize the potential for a large amount of constructive interference damaging the DUTs. Even a small amount of RF absorber dampens the modes sufficiently to prevent such damage. We used two low-loss broadband antennas for the reference measurement, where the calibration reference plane was specified at the connectors of the antennas using an N-type electronic calibration module. We obtained the GRef estimate from a transmission-coefficient measurement between two antennas (as discussed in [19]), where the second antenna was replaced with the DUT for the DUT measurement.
Fig. 2.:

RC setup to measure Gref for eight absorbers. The chamber contains one vertical paddle for mode stirring, and a turntable with height translation for position stirring.
By subsetting all of the mode-stirring samples, we acquired 6 independent realizations (IRs) (NB = 6), each containing 120 stepped mode-stirring samples (NW = 120) within the mode-stirring sequence obtained from 8 paddle and 15 turntable angles with 45 °and 24 °angle spacing, respectively, as shown in Table 1. IR1-3 and IR4-6 were measured at antenna heights of 0.3 m and 1.3 m, respectively, where IRs with the same height have different paddle-angle offsets, as shown in Table 1. To confirm low correlation between samples, we performed a linear autocorrelation test of the data within each of the independent realizations and a Pearsons’ cross-correlation test of the data between all independent realizations. For both cases we show the worst-case scenario, which is, according to the data, a heavily loaded case. The within correlation is shown in Fig. 3, which shows the normalized correlation value versus lag shifted copies of the entire sequence with itself. The peak correlation value of 1 at 0 sample lag occurs because the exact same two arrays are being compared. Lag shifting the sequence over by one sample with itself, in either direction, drops the correlation value to below the 0.3 threshold [17, 21], as shown in Fig. 3, verifying independent samples. The correlation between independent realizations for Band 1 is shown in Table 2. A few cases slightly exceed the 0.3 threshold. These cases are underlined in Table 2. However, since the loading is much higher than required for NB-IoT and CAT-M1, this is not expected to influence the final results significantly. In the CBW = 1.5 MHz case, which we use in the final uncertainty budget, all correlations between independent realizations are below the threshold.
Table 1.
Mode-stirring sequence for each independent realization (IR).
| IR | Height | Paddle angles | Turntable Angles | |
|---|---|---|---|---|
| Angles | Offset | |||
| 1–3 | 0.3 m | 8 | 0°, 15°, 30° | 15 |
| 4–6 | 1.3 m | 8 | 0°, 15°, 30° | 15 |
Fig. 3.:

Correlation of within samples using linear autocorrelation for Band 1, for NB-IoT (a) and CAT-M1 (b), with a worst-case CBW = 3.3 MHz.
Table 2.
Worst-case correlation between independent realizations using Pearson’s cross correlation for CBW = 3.3 MHz.
| IR | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| 1 | 1 | 0.27 | 0.32 | 0.08 | 0.18 | 0.11 |
| 2 | x | 1 | 0.32 | 0.18 | 0.21 | 0.19 |
| 3 | x | x | 1 | 0.26 | 0.20 | 0.13 |
| 4 | x | x | x | 1 | 0.33 | 0.25 |
| 5 | x | x | x | x | 1 | 0.38 |
| 6 | x | x | x | x | x | 1 |
C). Results
Using the significance test, we calculated the percentage of the band over which Formulation 2, (4) should be used. These results are shown in Table 3 for the three bands, the two absorber cases and the three averaging bandwidths. The results show that Formulation 2 holds for the majority of the band, in both the the NB-IoT (180 kHz) and CAT-M1 (1.4 MHz) bandwidths, in contrast to the current standardized methods which used Formulation 1, (3) [17].
Table 3.
Percent of frequencies where Formulation 2 should be used to calculate uncertainty, with exceptions to the general trend underlined or in italics.
| Band | CBW | 180 kHz | 1.4 MHz | 2 MHz |
|---|---|---|---|---|
| 1930 MHz | 1.5 MHz | 97.5 % | 85.2 % | 72.9 % |
| 3.3 MHz | 80.2 % | 70.1 % | 62.4 % | |
| 1960 MHz | 1.5 MHz | 85.2 % | 82.3 % | 84.5 % |
| 3.3 MHz | 91.1 % | 100.0 % | 100.0 % | |
| 1990 MHz | 1.5 MHz | 98.3 % | 95.9 % | 100.0 % |
| 3.3 MHz | 88.7 % | 83.2 % | 75.7 % |
For the majority of the results, the between significance increases for a higher CBW, as loading reduces spatial uniformity. This yields larger differences between independent realizations of the mode-stirring sequence and increases the between uncertainty. There are three exceptions in the band centered at 1960 MHz which are marked in italics in Table 3. These can be attributed to high variations in the significance in combination with a narrow bandwidth, as we will show. In most cases, we also observe an increase in the between significance for higher averaging bandwidths. This is due to the fact that the within differences reduce significantly due to the reduction in peaks and nulls in the frequency response when averaging. Four exceptions, due to the reduced number of points by averaging as we will show, are underlined in Table 3.
As the band centered at 1960 MHz has the most exceptions, we show this case in Fig. 4. This figure shows the significance for this band for two absorber cases and three averaging bandwidths, with the alpha-percentile (95 %) of the F-distribution, as discussed in Section III-C. Note that the trend of the significance changes for each averaging bandwidth, as it is based on the ratio between the variance of within and between samples, which both use GRef averaged over the channel bandwidth. Since both variances change, the ratio between them, hence the F-statistic, changes too. Fig. 4 shows that a high variation of the significance metric can occur over frequency. Due to the narrow bandwidth used, an exception may occur where the significance metric lies below the threshold for a higher percentage of the band in a higher loading case, as compared to a lower one, since they were different setups.
Fig. 4.:

The result of the significance test for the band centered at 1.96 GHz, for two absorber cases, and three averaging bandwidths. The majority of the results lie below the alpha percentile (95 %), implying a definition that takes both the within and between uncertainty into account should be used.
The underlined exceptions in Table 3 can be attributed to a loss of samples at the edges of the band for increasing averaging bandwidths due to the running-average technique used, as shown in Fig. 4. If peaks in significance that are above the threshold occur at the edge of the band, these will be averaged out, resulting in a lower percentage of between significance in Table 3. This is specifically the case in the band centered at 1960 MHz, where a peak at the lower edge of the band exceeding the threshold is averaged out for the 1.4 MHz and 2 MHz averaging bandwidths, resulting in two exceptions in Table 3. For all cases, these exceptions only showed when peaks in significance occur close to the edges of the band. Note that these exceptions do not influence the outcome of which formulation should be used. In general, for all NB-IoT and CAT-M1 cases, between differences do not dominate, and the within uncertainty should be taken into account by using Formulation 2 (4). Next, we use this formulation for a comprehensive uncertainty analysis and we show the effects of this choice on the measurement uncertainty.
IV. UNCERTAINTY ANALYSIS
In this section, we first analyze results of the uncertainty in the GRef measurement, using the previously defined formulation. Then, we include other uncertainty components as well, where we present a comprehensive uncertainty analysis.
A). Combined Uncertainty
Using Formulation 2, (4), we can calculate and . Using the root-sum-of-squares (RSS) technique, we can calculate the combined uncertainty of those, uCombined, normalized to Gref using [10, 17]
| (5) |
The results for all bands are shown in Fig. 5. In the current standard, the user selects the highest value of uncertainty, computed over all frequencies within the band of interest, since, as shown in Fig. 5, uncertainty estimates can change over frequency. Table 4 shows this value for all cases, calculated using both Formulation 1 and 2. It can be clearly seen that Formulation 1, (3), overestimates the uncertainty in all cases. Several other effects related to the loading and averaging bandwidth can be observed in the combined uncertainty results.
Fig. 5.:

The normalized combined uncertainty for three bands centered at 1930 MHz (a), 1960 MHz (b) and 1990 MHz (c), calculated with (5) using Formulation 2.
Table 4.
Maximum combined uncertainty in dB, calculated using both Formulation 1 (F1) and 2 (F2). These values do not include a coverage factor.
| Band (MHz) | CBW (MHz) | 180 kHz | 1.4 MHz | 2 MHz | |||
|---|---|---|---|---|---|---|---|
| F1 | F2 | F1 | F2 | F1 | F2 | ||
| 1930 | 1.5 | 0.60 | 0.43 | 0.51 | 0.35 | 0.48 | 0.31 |
| 3.3 | 0.85 | 0.45 | 0.79 | 0.41 | 0.73 | 0.38 | |
| 1960 | 1.5 | 0.76 | 0.45 | 0.58 | 0.37 | 0.50 | 0.33 |
| 3.3 | 0.73 | 0.43 | 0.56 | 0.39 | 0.43 | 0.37 | |
| 1990 | 1.5 | 0.59 | 0.43 | 0.48 | 0.35 | 0.40 | 0.32 |
| 3.3 | 0.68 | 0.42 | 0.62 | 0.38 | 0.57 | 0.36 | |
First, the uncertainty reduces for higher averaging bandwidths, as expected since it reduces within uncertainty. Second, the maximum uncertainty for the NB-IoT averaging bandwidth is very similar for both loading cases (note that the black curves in Fig. 5 overlay), which is generally not the case in wideband measurements. In Fig. 5(b) and (c), the maximum uncertainty for the NB-IoT bandwidth is even higher for a lower-loading case. Even with Formulation 1 (see Table 4), the uncertainty does not always increase for increased loading. This all implies that the within uncertainty is more dominant than the between uncertainty, and that this loading has relatively little effect on this uncertainty for the NB-IoT bandwidth. A third effect is that the uncertainty is higher for the CAT-M1 and 2 MHz averaging bandwidths in the eight-absorber case, as compared to the two-absorber case. This is as expected, since the within differences become less significant for higher averaging bandwidths, while the between differences become more significant for higher-loading cases (see Table 3).
B). Comprehensive Uncertainty Analysis
In this subsection, we estimate the uncertainty in the whole measurement, taking into account all metrics in (1). Table 5 shows a summary of all the components of uncertainty related to the measurement, split into two groups. The groups contain contributions to the uncertainty in the reference measurement and the DUT measurement. In this analysis, we used the NB-IoT and CAT-M1 bandwidth results, where CBW = 1.5 MHz, as this is wider than the channel bandwidth of interest, while it does not excessively load the chamber. We based our analysis on the components discussed in [17].
Table 5.
Comprehensive uncertainty budget for NB-IoT and CAT-M1. Expanded with a 1.96 coverage factor.
| Uncertainty Contribution | Std. Unc. (dB) |
|---|---|
| Contributions in the DUT measurement part | |
| Mismatch (BSE - Measurement Antenna) [22] | < 0.01 |
| BSE output level (stability) [5] | 0.18 |
| Cable factor: Measurement Antenna [22] | < 0.01 |
| Insertion loss: Measurement Antenna cable [17] | < 0.01 |
| Sensitivity search step size [22] | 0.15 |
| Temperature variation (Calculated 3K) [22] | 0.14 |
| Miscellaneous uncertainty [22] | 0.10 |
| Chamber lack of spatial uniformity (NB-IoT) | 0.42 |
| Chamber lack of spatial uniformity (CAT-M1) | 0.34 |
| Frequency resolution for TIS Measurement [17] | 0.05 |
| Contributions in the reference measurement part | |
| Mismatch (VNA - Reference Antenna) [22] | < 0.01 |
| Mismatch (VNA - Measurement antenna) [22] | < 0.01 |
| VNA absolute level & level stability [23] | 0.30 |
| Insertion loss: Calibrated Ref. Antenna cable [22] | < 0.01 |
| Insertion loss: Measurement Antenna Cable [17] | < 0.01 |
| Chamber lack of spatial uniformity (NB-IoT) | 0.18 |
| Chamber lack of spatial uniformity (CAT-M1) | 0.14 |
| Antenna: Radiation efficiency Reference Antenna | 0.17 |
| Total expanded Uncertainty | Expanded with coverage factor |
| NB-IoT (180 kHz) | 1.26 |
| CAT-M1 (1.4 MHz) | 1.14 |
In the contributions to the DUT measurement, we calculated the mismatch between the BSE and the measurement antenna, and the temperature variation in the system using equations provided in [22]. In the calculation for temperature variation, we used a variation of ±3 K, assuming worst-case values as presented in [22, 24]. Fixed worst-case standard uncertainties were used for the cable factor, insertion loss, the sensitivity search step size, miscellaneous uncertainty and the frequency resolution for the TIS measurement. The BSE output level (stability) was extracted from the manufacturer data sheet. In the reference measurement, we extracted the VNA absolute level and level stability from an earlier work that used a similar setup. The uncertainties in the impedance mismatch and cable measurements were calculated using [22], which were, in this case, negligible. We therefore state them as being < 0.01 dB. It should be noted that these are not always negligible. In this measurement setup, the uncertainty due to the cable movements is not considered in the reference measurement, since they are calibrated out. The uncertainty from moving cables due to the movement of the turntable was found negligible due to the use of a rotary joint. We calculated the uncertainty of the radiation efficiency of the reference antenna using [25].
In [17], one component of uncertainty is the chamber “lack of spatial uniformity”, which is calculated using Formulation 1, which only uses between uncertainty. We used Formulation 2, that also includes within uncertainty, so this component does not contain only uncertainty due to a lack of spatial uniformity. Since we measured multiple bands, we used the maximum uncertainty derived from all bands. It should be noted that the maximum uncertainty value did not vary more than 0.03 dB between the bands. The same holds for the chamber “lack of spatial uniformity” component of uncertainty in the contribution in the reference measurement part. The uncertainty is lower here, since NB = 6 for the reference measurement, while NB = 1 for the DUT measurement.
We estimated the total expanded uncertainty by using an RSS technique on the uncertainties in dB from both groups, according to [17]. We assume all of the components of uncertainty to be uncorrelated and Gaussian distributed here. To cover the uncertainty due to a limited number of samples, we multiplied the result with a coverage factor of 1.96 to obtain a 95 % confidence interval [20]. The total expanded uncertainties for NB-IoT and CAT-M1 are 1.26 dB and 1.14 dB, respectively. For both protocols, the uncertainty lies below the maximum allowed uncertainty for TIS, which is 2.3 dB (2.0 dB for TRP) [22]. It should be noted that, if the formulation taking only between uncertainty into account was used, these values would be 1.75 dB and 1.45 dB, respectively, which overestimates the uncertainty significantly. If another uncertainty component turns out to be higher than anticipated, this could lead to non-compliance with the standard. This shows the importance of taking both the within and between uncertainty into account.
V. CONCLUSION
In this paper, we presented for the first time a comprehensive uncertainty analysis of NB-IoT and CAT-M1 measurements of TIS in a reverberation chamber. We performed a significance test and analyzed the results using three different NB-IoT bands and multiple coherence-bandwidth cases. Using the outcome of the test, we showed that a formulation that takes both within and between uncertainty into account should be used to calculate the uncertainty in the reference and DUT measurements, as compared to current standardized test methods, which only use the between uncertainty. This is due to the narrowband nature of these protocols, which greatly increases the uncertainty within an independent realization. This type of uncertainty has been considered as negligible, up until now. For the results shown here, use of the between formulation will overestimate the total expanded uncertainty by approximately 0.5 dB for NB-IoT and CAT-M1. This could lead to non-compliance to the standard and is therefore critical to be taken into account by engineers.
ACKNOWLEDGEMENTS
The authors would like to thank Sara Catteau of Bluetest and Benjamin Jamroz of NIST for their thorough reviews of this manuscript.
Biographies

Anouk Hubrechsen received the B.Sc. and M.Sc. degrees in electrical engineering from the Eindhoven University of Technology, Eindhoven, The Netherlands, in 2017 and 2019, respectively, where she is currently pursuing the Ph.D. degree.
She was a Guest Researcher with the National Institute of Standards and Technology, Boulder, CO, USA, in 2019, where she was involved in reverberation-chamber metrology. She is currently involved in the AMICABLE Project, researching interference effects in cable bundles. Ms. Hubrechsen received the regional and district Zonta Women in Technology awards in 2019. She is currently vice-chair of IEEE Benelux Women in Engineering.

Kate A. Remley (S’92-M’99-SM’06-F’13) was born in Ann Arbor, MI. She received the Ph.D. degree in Electrical and Computer Engineering from Oregon State University, Corvallis, in 1999.
From 1983 to 1992, she was a Broadcast Engineer in Eugene, OR, serving as Chief Engineer of an AM/FM broadcast station from 1989–1991. In 1999, she joined the RF Technology Division of the National Institute of Standards and Technology (NIST), Boulder, CO, as an Electronics Engineer. She is currently the Leader of the Metrology for Wireless Systems Project at NIST, where her research activities include development of calibrated measurements for microwave and millimeter-wave wireless systems and standardized over-the-air test methods for the wireless industry.
Dr. Remley is a Fellow of the IEEE and was the recipient of the Department of Commerce Bronze and Silver Medals, an ARFTG Best Paper Award, the NIST Schlichter Award, and is a member of the Oregon State University Academy of Distinguished Engineers. She was the Chair of the MTT-11 Technical Committee on Microwave Measurements (2008–2010), the Editor-in-Chief of IEEE Microwave Magazine (2009–2011), and Chair of the MTT Fellow Evaluating Committee (2017–2018). She was a Distinguished Lecturer for the IEEE Electromagnetic Compatibility Society (2016–2017).

Robert Jones received dual B.S. degrees in electrical and mechanical engineering from the Colorado School of Mines in 2019, where he is currently pursuing his masters degree. Since 2017, he has been a student researcher at the National Institute of Standards and Technology (NIST), conducting experiments with loaded reverberation chambers.
His current research interests are in computational electromagnetics, antenna design, and loaded reverberation chamber metrology.

Robert. D. Horansky Robert D. Horansky received the B.A. degree in chemistry and the Ph.D. degree in physics from the University of Colorado, Boulder, CO, USA, in 1999 and 2005, respectively. His thesis work focused on low-noise dielectric measurements on novel materials in molecular electronics. Since 2005, he has been with the National Institute of Standards and Technology (NIST), Boulder, CO, USA, where he started out developing the highest resolving power energy dispersive sensor to date. He then went on to develop metrology techniques for single photon sensors in nuclear radiation and optical power measurements. In 2015, he joined the Metrology for Wireless Systems Project in the Communications Technology Laboratory, NIST developing calibrations and traceability for millimeter-wave wireless systems and reverberation-chamber measurements for cellular applications. He is the Secretary of the IEEE P1765 Standards Working Group on Uncertainty for EVM.

Vincent T. Neylon is currently working on a B.A. in Biology at the University of Colorado at Denver with the goal to get a M.S. in bioengineering. He was an associate researcher at the National Institute of Standards and Technology (NIST), Boulder, CO, USA, from 2016 to 2020 where he researched methods of measurement in wireless and IoT technologies. He is now focusing on finishing his undergrad before seeking other opportunities.

Sander Bronckers received the M.Sc. degree (cum laude) in electrical engineering from Eindhoven University of Technology (TU/e), The Netherlands, in 2015. In 2018, he was a visiting researcher at NIST (Boulder, Colorado) on antenna measurements in reverberation chambers. He obtained the Ph.D. degree (cum laude) in electrical engineering in 2019, within the electromagnetics group at TU/e, on design and measurement techniques for next-generation integrated antennas. He is currently an assistant professor on Metrology for Antennas and Wireless Systems at TU/e. His research interests include antenna measurements in reverberation and anechoic chambers, channel sounding and emulation, and RF material characterization.
REFERENCES
- [1].GSMA, “Mobile IoT in the 5G future: NB-IoT and LTE-M in the context of 5G,” April 2018.
- [2].3GPP TS 36.101, “Evolved universal terrestrial radio access (EUTRA); user equipment (UE) radio transmission and reception (release 16),” December 2019.
- [3].3GPP, “TS 38.101–1: NR; user equipment (ue) radio transmission and reception; part 1: Range 1 standalone (release 16),” September 2020.
- [4].Horansky RD and Remley KA, “Flexibility in over-the-air testing of receiver sensitivity with reverberation chambers,” IET Microwaves, Antennas Propagation, vol. 13, no. 15, pp. 2590–2597, 2019. [Google Scholar]
- [5].Aan Den Toorn J, Remley KA, Holloway CL, Ladbury JM, and Wang C, “Proximity-effect test for lossy wireless-device measurements in reverberation chambers,” IET Science, Measurement Technology, vol. 9, no. 5, pp. 540–546, 2015. [Google Scholar]
- [6].Orlenius C, Kildal P, and Poilasne G, “Measurements of total isotropic sensitivity and average fading sensitivity of CDMA phones in reverberation chamber,” in 2005 IEEE Antennas and Propagation Society International Symposium, vol. 1A, 2005, pp. 409–412 Vol. 1A. [Google Scholar]
- [7].Chen X, Tang J, Li T, Zhu S, Ren Y, Zhang Z, and Zhang A, “Reverberation chambers for over-the-air tests: An overview of two decades of research,” IEEE Access, vol. 6, pp. 49 129–49 143, 2018. [Google Scholar]
- [8].Kildal P, Orlenius C, and Carlsson J, “OTA testing in multi-path of antennas and wireless devices with MIMO and OFDM,” Proceedings of the IEEE, vol. 100, no. 7, pp. 2145–2157, 2012. [Google Scholar]
- [9].Delangre O, De Doncker P, Horlin F, Lienard M, and Degauque P, “Reverberation chamber environment for testing communication systems: Applications to OFDM and SC-FDE,” in 2008 IEEE 68th Vehicular Technology Conference, 2008, pp. 1–5. [Google Scholar]
- [10].Remley KA, Wang CJ, Williams DF, aan den Toorn JJ, and Holloway CL, “A significance test for reverberation-chamber measurement uncertainty in total radiated power of wireless devices,” IEEE Transactions on Electromagnetic Compatibility, vol. 58, no. 1, pp. 207–219, February 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Chen X, Kildal P, and Lai S, “Estimation of average rician k-factor and average mode bandwidth in loaded reverberation chamber,” IEEE Antennas and Wireless Propagation Letters, vol. 10, pp. 1437–1440, 2011. [Google Scholar]
- [12].Garcia-Fernandez MA, Sanchez-Heredia JD, Martinez-Gonzalez AM, Sanchez-Hernandez DA, and Valenzuela-Valdes JF, “Advances in mode-stirred reverberation chambers for wireless communication performance evaluation,” IEEE Communications Magazine, vol. 49, no. 7, pp. 140–147, 2011. [Google Scholar]
- [13].Kildal P, Chen X, Orlenius C, Franzen M, and Patane CSL, “Characterization of reverberation chambers for ota measurements of wireless devices: Physical formulations of channel matrix and new uncertainty formula,” IEEE Transactions on Antennas and Propagation, vol. 60, no. 8, pp. 3875–3891, 2012. [Google Scholar]
- [14].Chen X, Kildal P, Orlenius C, and Carlsson J, “Channel sounding of loaded reverberation chamber for over-the-air testing of wireless devices: Coherence bandwidth versus average mode bandwidth and delay spread,” IEEE Antennas and Wireless Propagation Letters, vol. 8, pp. 678–681, 2009. [Google Scholar]
- [15].Luo J, Mendivil E, and Christopher M, “Over-the-air performance evaluation of NB-IoT in reverberation chamber and anechoic chamber,” in 2018 AMTA Proceedings, Nov 2018, pp. 1–3. [Google Scholar]
- [16].Hubrechsen A, Neylon VT, Remley KA, Jones RD, Horansky RD, and Bronckers LA, “A preliminary study on uncertainty of NB-IoT measurements in reverberation chambers,” in 2020 50th European Microwave Conference (EuMC), 2021, pp. 1051–1054. [Google Scholar]
- [17].CTIA Certification, “Test plan for wireless large-form-factor device over-the-air performance, version 1.2.1,” February 2019.
- [18].3GPP TR 37.977, “Verification of radiated multi-antenna reception performance of user equipments (release 12),” 2013.
- [19].Remley KA, Dortmans J, Weldon C, Horansky RD, Meurs TB, Wang C, Williams DF, Holloway CL, and Wilson PF, “Configuring and verifying reverberation chambers for testing cellular wireless devices,” IEEE Transactions on Electromagnetic Compatibility, vol. 58, no. 3, pp. 661–672, June 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Joint Committee for Guides in Metrology, “Evaluation of Measurement Data—Guide to the Expression of Uncertainty in Measurement.” Sept. 2008. [Online]. Available: http://www.bipm.org/en/publications/guides/gum.html
- [21].Pirkl RJ, Remley KA, and Patane CSL, “Reverberation chamber measurement correlation,” IEEE Transactions on Electromagnetic Compatibility, vol. 54, no. 3, pp. 533–545, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].CTIA Certification, “Test plan for wireless over-the-air performance: Method of measurement for radiated RF power and receiver performance, version 3.6.2,” May 2017.
- [23].van de Beek S, Remley KA, Holloway CL, Ladbury JM, and Leferink F, “Characterizing large-form-factor devices in a reverberation chamber,” in 2013 International Symposium on Electromagnetic Compatibility, 2013, pp. 375–380. [Google Scholar]
- [24].ETSI, “TR 100 028–2 V1.3.1: Electromagnetic compatibility and radio spectrum matters (ERM); uncertainties in the measurement of mobile radio equipment characteristics part 2,” March 2001.
- [25].Bronckers LA, Remley KA, Jamroz BF, Roc’h A, and Bart Smolders A, “Uncertainty in reverberation-chamber antenna-efficiency measurements in the presence of a phantom,” IEEE Transactions on Antennas and Propagation, vol. 68, no. 6, pp. 4904–4915, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
