Frequency-Domain Models for Nonlinear Microwave Devices Based on Large-Signal Measurements

Jeffrey A Jargon; Donald C DeGroot; K C Gupta

doi:10.6028/jres.109.029

2004 Aug 1;109(4):407–427. doi: 10.6028/jres.109.029

PMCID: PMC4847587 PMID: 27366621

Abstract

In this paper, we introduce nonlinear large-signal scattering ( $S$ ) parameters, a new type of frequency-domain mapping that relates incident and reflected signals. We present a general form of nonlinear large-signal $S$ -parameters and show that they reduce to classic $S$ -parameters in the absence of nonlinearities. Nonlinear large-signal impedance ( $Z$ ) and admittance ( $D$ ) parameters are also introduced, and equations relating the different representations are derived. We illustrate how nonlinear large-signal $S$ -parameters can be used as a tool in the design process of a nonlinear circuit, specifically a single-diode 1 GHz frequency-doubler. For the case where a nonlinear model is not readily available, we develop a method of extracting nonlinear large-signal $S$ -parameters from artificial neural network models trained with multiple measurements made by a nonlinear vector network analyzer equipped with two sources. Finally, nonlinear large-signal $S$ -parameters are compared to another form of nonlinear mapping, known as nonlinear scattering functions. The nonlinear large-signal $S$ -parameters are shown to be more general.

Keywords: frequency-domain, large-signal, measurement, microwave, model, network analyzer, nonlinear, scattering parameter

1. Introduction

Vector network analyzers (VNAs) are one of the most versatile instruments available for RF and microwave measurements. They are used to measure complex scattering parameters (S-parameters) of linear devices or circuits. RF engineers use them to verify their designs, confirm proper performance, and diagnose failures. A VNA works by exciting a linear device under test (DUT) with a series of sine wave signals, one frequency at a time, and detecting the response of the DUT at its signal ports. Since the DUT is linear, the input and output signal frequencies are the same as the source; these signals can be described by complex numbers that account for the signals’ amplitudes and phases. The input-output relationships are described by ratios of complex numbers, known as S-parameters. For a two-port network, four S-parameters completely describe the behavior of a linear DUT when excited by a sine wave at a particular frequency. Although the measurement of S-parameters by VNAs is invaluable to the microwave designer for modeling and measuring linear circuits, these measurements are oftentimes inadequate for nonlinear circuits operating at large-signal conditions, since nonlinearities transfer energy from the stimulus frequency to products at new frequencies. Thus, conventional linear network analysis, which relies on the assumption of superposition, must be replaced by a more general type of analysis, which we refer to as nonlinear network analysis.

Nonlinear network analysis involves characterizing a nonlinear device under realistic, large-signal operating conditions. To do this, complex traveling waves (rather than ratios) are measured at the ports of a DUT not only at the stimulus frequency (or frequencies), but also at other frequencies where energy may be created. Assuming the input signals are sine-waves and the DUT exhibits neither sub-harmonic nor chaotic behavior, the input and output signals will be combinations of sine-wave signals, caused by the nonlinearity of the DUT in conjunction with impedance mismatches between the measuring system and the DUT. If a single excitation frequency is present, new frequency components will appear at harmonics of the excitation frequency, and if multiple excitation frequencies are present, new frequency components will appear at the intermodulation products as well as at harmonics of each of the excitation frequencies. In practice, there will be a limited number of significant harmonics and intermodulation products. The set of frequencies at which energy is present and must be measured is known as the frequency grid.
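The frequency grid described above can be enumerated with a short sketch. The truncation rule used here (keep mixing products whose order, the sum of the absolute integer coefficients, does not exceed `max_order`) is an assumption made for illustration, as are the function name `frequency_grid` and the example tone frequencies; the text only states that a limited number of products is significant in practice.

```python
from itertools import product


def frequency_grid(tones_hz, max_order):
    """Positive mixing products sum(k_i * f_i) with 0 < sum(|k_i|) <= max_order."""
    grid = set()
    coefficient_range = range(-max_order, max_order + 1)
    for ks in product(coefficient_range, repeat=len(tones_hz)):
        order = sum(abs(k) for k in ks)
        if 0 < order <= max_order:
            freq = sum(k * f for k, f in zip(ks, tones_hz))
            if freq > 0:  # keep one copy of each +/- pair and drop DC
                grid.add(freq)
    return sorted(grid)


# Integer hertz values keep the set membership exact.
single_tone = frequency_grid([1_000_000_000], 3)
two_tone = frequency_grid([1_000_000_000, 1_010_000_000], 3)

print(single_tone)               # harmonics of a single 1 GHz tone
print(len(two_tone), two_tone)   # harmonics plus intermodulation products
```

With a single tone the grid contains only harmonics; with two closely spaced tones it also contains the difference frequency and the third-order products that fall next to the tones.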

Instruments known as nonlinear vector network analyzers (NVNAs) are capable of providing accurate waveform vectors by acquiring and correcting the magnitude and phase relationships between the fundamental and harmonic components in the periodic signals [1–5]. An NVNA excites a nonlinear DUT with one or more sine wave signals and detects the response of the DUT at its signal ports. Assuming the DUT does not exhibit any sub-harmonic or chaotic behavior, the input and output signals will be combinations of sine wave signals due to the nonlinearity of the DUT in conjunction with mismatches between the system and the DUT. With these facts in mind, the major difference between a linear VNA and an NVNA is that a VNA measures ratios between input and output waves one frequency at a time, while an NVNA measures the actual input and output waves simultaneously over a broad band of frequencies.

Even though S-parameters cannot adequately represent nonlinear circuits, some type of parameters relating incident and reflected signals is beneficial so that designers can “see” application-specific engineering figures of merit similar to those they are accustomed to. In the first part of this paper, we propose definitions of such ratios, which we refer to as nonlinear large-signal scattering ( $S$ ) parameters. We also introduce nonlinear large-signal impedance ( $Z$ ) and admittance ( $D$ ) parameters, and present equations relating the different representations. Next, we make two simplifications when considering the cases of a one-port network with a single-tone excitation and a two-port network with a single-tone excitation.

For existing nonlinear models, we can readily generate nonlinear large-signal $S$ -parameters by performing a harmonic balance simulation. For devices with no model available, we can extract these parameters from artificial neural network (ANN) models that are trained with multiple frequency-domain measurements made on a nonlinear DUT with an NVNA. To illustrate applications and generation of nonlinear large-signal $S$ -parameters, we present two examples. First, we illustrate how nonlinear large-signal $S$ -parameters can be used as a tool in the process of designing a simple nonlinear circuit, specifically a single-diode 1 GHz frequency-doubler circuit. Second, we describe a method for generating nonlinear large-signal $S$ -parameters based upon ANN models trained on frequency-domain data measured using an NVNA. We compare a diode circuit model, generated using this method, to a harmonic balance simulation of a commercial device model.

Finally, we compare our nonlinear large-signal $S$ -parameters to another form of nonlinear mapping, known as nonlinear scattering functions [6–7]. Specifically, we show that the two formulations are not equivalent. Nonlinear large-signal $S$ -parameters are more general than the nonlinear scattering functions, which are useful in approximating a specific class of nonlinearity in a more compact form.

2. Nonlinear Large-Signal Scattering Parameters

In this section, we introduce the concept of nonlinear large-signal scattering parameters. Like commonly used linear S-parameters, nonlinear large-signal scattering ( $S$ ) parameters can also be expressed as ratios of incident and reflected wave variables. However, unlike linear S-parameters, nonlinear large-signal $S$ -parameters depend upon the signal magnitude and must account for the harmonic content of the input and output signals since energy can be transferred to other frequencies in a nonlinear device.

After presenting the general form of nonlinear large-signal $S$ -parameters, we also introduce nonlinear large-signal impedance ( $Z$ ) and admittance ( $D$ ) parameters, and present equations for relating the different representations. Next, we make two simplifications in which we consider the cases of a one-port network with a single-tone excitation and a two-port network with a single-tone excitation.

2.1 General Form

Consider an N-port network. Normalized wave variables a_jl and b_jl at the jth port and lth harmonic are proportional to the incoming and outgoing waves, respectively, and may be defined in terms of the voltages associated with these waves as follows:

a_{j l} = \frac{V_{j l}^{+}}{\sqrt{Z_{o j}}}; b_{j l} = \frac{V_{j l}^{-}}{\sqrt{Z_{o j}}},

(1)

where $V_{j l}^{+}$ and $V_{j l}^{-}$ represent voltages associated with the incoming and outgoing waves in the transmission lines connected to the jth port and containing frequencies of the lth harmonic; Z_oj represents the characteristic impedance of the line at the jth port.

The nonlinear large-signal scattering matrix $S$ of the network expresses the relationship between a’s and b’s at various ports and harmonics through the matrix equation

b = S a,

(2)

where b and a are (N×M)-element column vectors. Here N refers to the number of ports and M refers to the number of harmonics being considered. Matrix $S$ is an (N×M)²-element square matrix. We assume all a’s and b’s are phase referenced to a₁₁ to enforce time invariance [8].

As an example, consider a two-port network with 3 harmonics; Eq. (2) then becomes

[\begin{matrix} {\bar{b}}_{1} \\ {\bar{b}}_{2} \end{matrix}] = [\begin{matrix} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{matrix}] [\begin{matrix} {\bar{a}}_{1} \\ {\bar{a}}_{2} \end{matrix}],

(3)

where

[S_{i j}] = [\begin{matrix} S_{i j 11} & S_{i j 12} & S_{i j 13} \\ S_{i j 21} & S_{i j 22} & S_{i j 23} \\ S_{i j 31} & S_{i j 32} & S_{i j 33} \end{matrix}] .

(4)

For each nonlinear large-signal scattering parameter $S_{i j k l}$ the index i refers to the port number of the b wave, the index j refers to the port number of the a wave, k is the harmonic index of the b wave, and l is the harmonic index of the a wave. The vectors ${\bar{a}}_{j}$ and ${\bar{b}}_{i}$ are (M=3)-element vectors given by

{\bar{a}}_{j} = [\begin{matrix} a_{j 1} \\ a_{j 2} \\ a_{j 3} \end{matrix}]; {\bar{b}}_{i} = [\begin{matrix} b_{i 1} \\ b_{i 2} \\ b_{i 3} \end{matrix}] .

(5)

Equation (3) can be expanded as follows

[\begin{array}{l} b_{11} \\ b_{12} \\ b_{13} \\ b_{21} \\ b_{22} \\ b_{23} \end{array}] = [\begin{array}{l} S_{1111} & S_{1112} & S_{1113} & S_{1211} & S_{1212} & S_{1213} \\ S_{1121} & S_{1122} & S_{1123} & S_{1221} & S_{1222} & S_{1223} \\ S_{1131} & S_{1132} & S_{1133} & S_{1231} & S_{1232} & S_{1233} \\ S_{2111} & S_{2112} & S_{2113} & S_{2211} & S_{2212} & S_{2213} \\ S_{2121} & S_{2122} & S_{2123} & S_{2221} & S_{2222} & S_{2223} \\ S_{2131} & S_{2132} & S_{2133} & S_{2231} & S_{2232} & S_{2233} \end{array}] [\begin{array}{l} a_{11} \\ a_{12} \\ a_{13} \\ a_{21} \\ a_{22} \\ a_{23} \end{array}] .

(6)

Note that in each of the four sub-matrices, the diagonal elements contain the same-frequency scattering parameters, the upper right elements contain the frequency down-conversion scattering parameters, and the lower left elements contain the frequency up-conversion scattering parameters. If the device under consideration contains no nonlinearities (i.e., no power is transferred to other frequencies), then Eq. (6) reduces to

[\begin{matrix} b_{11} \\ b_{12} \\ b_{13} \\ b_{21} \\ b_{22} \\ b_{23} \end{matrix}] = [\begin{matrix} S_{1111} & 0 & 0 & S_{1211} & 0 & 0 \\ 0 & S_{1122} & 0 & 0 & S_{1222} & 0 \\ 0 & 0 & S_{1133} & 0 & 0 & S_{1233} \\ S_{2111} & 0 & 0 & S_{2211} & 0 & 0 \\ 0 & S_{2122} & 0 & 0 & S_{2222} & 0 \\ 0 & 0 & S_{2133} & 0 & 0 & S_{2233} \end{matrix}] [\begin{matrix} a_{11} \\ a_{12} \\ a_{13} \\ a_{21} \\ a_{22} \\ a_{23} \end{matrix}]

(7)

which is the matrix representation for the well-known linear S-parameters involving three excitation frequencies.
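The index convention of Eqs. (3) to (7) can be made concrete with a small sketch. It assumes the stacking order of Eq. (6), with ports as the outer index and harmonics as the inner index; the helper names (`flat_index`, `element`, `conversion_type`, `is_linear`) are hypothetical and not part of the original text.

```python
def flat_index(port, harmonic, n_harmonics):
    """Zero-based position of (port, harmonic) in the stacked vectors of Eq. (6)."""
    return (port - 1) * n_harmonics + (harmonic - 1)


def element(matrix, i, j, k, l, n_harmonics):
    """Return S_ijkl: b wave at port i, harmonic k; a wave at port j, harmonic l."""
    return matrix[flat_index(i, k, n_harmonics)][flat_index(j, l, n_harmonics)]


def conversion_type(k, l):
    """Classify an element by output harmonic k and input harmonic l."""
    if k == l:
        return "same-frequency"
    return "up-conversion" if k > l else "down-conversion"


def is_linear(matrix, n_ports, n_harmonics, tol=1e-12):
    """True when every frequency-converting element (k != l) vanishes, as in Eq. (7)."""
    ports = range(1, n_ports + 1)
    harmonics = range(1, n_harmonics + 1)
    return all(
        abs(element(matrix, i, j, k, l, n_harmonics)) <= tol
        for i in ports for j in ports
        for k in harmonics for l in harmonics
        if k != l
    )


# Label matrix: the entry for S_ijkl holds the integer "ijkl" so the layout can be read off.
N_PORTS, N_HARMONICS = 2, 3
size = N_PORTS * N_HARMONICS
labels = [[0] * size for _ in range(size)]
for i in range(1, N_PORTS + 1):
    for j in range(1, N_PORTS + 1):
        for k in range(1, N_HARMONICS + 1):
            for l in range(1, N_HARMONICS + 1):
                row = flat_index(i, k, N_HARMONICS)
                col = flat_index(j, l, N_HARMONICS)
                labels[row][col] = 1000 * i + 100 * j + 10 * k + l

for row in labels:
    print(row)
```

Printing `labels` reproduces the subscript pattern of the matrix in Eq. (6), which is a quick way to confirm that a stored array follows the same ordering before any element is interpreted.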

2.2 Nonlinear Large-Signal Impedance Parameters

Rather than expressing the relationship between a’s and b’s in terms of a nonlinear large-signal scattering matrix $S$ , we can alternatively express the relationship between voltages (V’s) and currents (I’s) in terms of a nonlinear large-signal impedance matrix $Z$ , as follows

V = Z I,

(8)

where V and I are (N×M)-element column vectors. Once again N refers to the number of ports and M refers to the number of harmonics being considered. $Z$ is an (N×M)²-element square matrix.

For a two-port network with 3 harmonics, Eq. (8) becomes

[\begin{matrix} {\bar{V}}_{1} \\ {\bar{V}}_{2} \end{matrix}] = [\begin{matrix} [Z_{11}] & [Z_{12}] \\ [Z_{21}] & [Z_{22}] \end{matrix}] [\begin{matrix} {\bar{I}}_{1} \\ {\bar{I}}_{2} \end{matrix}],

(9)

where

[Z_{i j}] = [\begin{matrix} Z_{i j 11} & Z_{i j 12} & Z_{i j 13} \\ Z_{i j 21} & Z_{i j 22} & Z_{i j 23} \\ Z_{i j 31} & Z_{i j 32} & Z_{i j 33} \end{matrix}] .

(10)

For each nonlinear large-signal impedance parameter $Z_{i j k l}$ , the index i refers to the port number of the voltage V, the index j refers to the port number of the current I, k is the harmonic index of V, and l is the harmonic index of I. The vectors ${\bar{V}}_{i}$ and ${\bar{I}}_{j}$ are (M=3)-element vectors given by

{\bar{V}}_{i} = [\begin{matrix} V_{i 1} \\ V_{i 2} \\ V_{i 3} \end{matrix}]; {\bar{I}}_{j} = [\begin{matrix} I_{j 1} \\ I_{j 2} \\ I_{j 3} \end{matrix}] .

(11)

Equation (9) can be expanded to

[\begin{array}{l} V_{11} \\ V_{12} \\ V_{13} \\ V_{21} \\ V_{22} \\ V_{23} \end{array}] = [\begin{array}{l} Z_{1111} & Z_{1112} & Z_{1113} & Z_{1211} & Z_{1212} & Z_{1213} \\ Z_{1121} & Z_{1122} & Z_{1123} & Z_{1221} & Z_{1222} & Z_{1223} \\ Z_{1131} & Z_{1132} & Z_{1133} & Z_{1231} & Z_{1232} & Z_{1233} \\ Z_{2111} & Z_{2112} & Z_{2113} & Z_{2211} & Z_{2212} & Z_{2213} \\ Z_{2121} & Z_{2122} & Z_{2123} & Z_{2221} & Z_{2222} & Z_{2223} \\ Z_{2131} & Z_{2132} & Z_{2133} & Z_{2231} & Z_{2232} & Z_{2233} \end{array}] [\begin{array}{l} I_{11} \\ I_{12} \\ I_{13} \\ I_{21} \\ I_{22} \\ I_{23} \end{array}]

(12)

2.3 Relating $S$ and $Z$ Matrices

The $S$ and $Z$ matrices can be expressed in terms of one another, if we know how a and b relate to V and I. From Eq. (1), we can express V_ik in terms of a_ik and b_ik as follows:

V_{i k} = V_{i k}^{+} + V_{i k}^{-} = \sqrt{Z_{o i}} (a_{i k} + b_{i k}),

(13)

where the subscripts refer to the ith port and the kth harmonic. We can similarly express I_jl as

I_{j l} = I_{j l}^{+} + I_{j l}^{-} = \frac{1}{Z_{o j}} (V_{j l}^{+} - V_{j l}^{-}) = \frac{1}{\sqrt{Z_{o j}}} (a_{j l} - b_{j l}),

(14)

where the subscripts refer to the jth port and the lth harmonic.
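Equations (13) and (14) apply to one port and one harmonic at a time, so they can be exercised with scalar complex numbers. The sketch below assumes a real, positive characteristic impedance; the function names and the numerical values are hypothetical.

```python
import math


def waves_to_vi(a, b, z0):
    """Eqs. (13) and (14): V = sqrt(Z0) (a + b), I = (a - b) / sqrt(Z0)."""
    root = math.sqrt(z0)
    return root * (a + b), (a - b) / root


def vi_to_waves(v, i, z0):
    """Inverse of Eqs. (13) and (14), obtained by adding and subtracting them."""
    root = math.sqrt(z0)
    return (v + z0 * i) / (2 * root), (v - z0 * i) / (2 * root)


# Hypothetical wave values at one port and one harmonic.
a, b, z0 = 0.20 + 0.05j, 0.03 - 0.08j, 50.0
v, i = waves_to_vi(a, b, z0)
a_back, b_back = vi_to_waves(v, i, z0)

# With this normalization, Re(V I*) equals |a|^2 - |b|^2.
net = (v * i.conjugate()).real
print(abs(a_back - a), abs(b_back - b), net, abs(a) ** 2 - abs(b) ** 2)
```

The last line illustrates why the square-root normalization in Eq. (1) is convenient: the difference of the squared wave magnitudes equals the real part of the voltage times the conjugate current at the port.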

For simplicity, we will assume for now that the network under consideration consists of two ports. Later, we can easily generalize the equations relating the $S$ and $Z$ matrices for any N-port network. If we allow the two transmission lines or waveguides connecting the two ports to have different characteristic impedances, $Z_{o 1}$ and $Z_{o 2}$ , Eq. (14) can be expressed in matrix form as

[\begin{matrix} {\bar{I}}_{1} \\ {\bar{I}}_{2} \end{matrix}] = [\begin{matrix} [U] / Z_{o 1} & [0] \\ [0] & [U] / Z_{o 2} \end{matrix}] ([\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] - [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}]),

(15)

where [U] is the identity matrix. Equation (9) can be expressed as

[\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] + [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = [\begin{matrix} [Z_{11}] & [Z_{12}] \\ [Z_{21}] & [Z_{22}] \end{matrix}] [\begin{matrix} {\bar{I}}_{1} \\ {\bar{I}}_{2} \end{matrix}] .

(16)

Combining Eqs. (15) and (16) gives

\begin{array}{l} [\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] + [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = \\ [\begin{matrix} [Z_{11}] & [Z_{12}] \\ [Z_{21}] & [Z_{22}] \end{matrix}] [\begin{matrix} [U] / Z_{o 1} & [0] \\ [0] & [U] / Z_{o 2} \end{matrix}] ([\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] - [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}]) \end{array}

(17)

[\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] + [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = [\begin{matrix} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{matrix}] ([\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] - [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}]),

(18)

where

[\begin{matrix} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{matrix}] = [\begin{matrix} [Z_{11}] & [Z_{12}] \\ [Z_{21}] & [Z_{22}] \end{matrix}] [\begin{matrix} [U] / Z_{o 1} & [0] \\ [0] & [U] / Z_{o 2} \end{matrix}]

(19)

is the normalized impedance matrix. Equation (18) can be rewritten as

\begin{array}{l} ([\begin{matrix} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{matrix}] + [\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}]) [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = \\ ([\begin{matrix} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{matrix}] - [\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}]) [\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] \end{array}

(20)

and Eq. (3) can be rewritten as

\begin{array}{l} [\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}] [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = \\ [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] [\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}] [\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] . \end{array}

(21)

Combining Eqs. (20) and (21) allows us to solve for $S$ in terms of $Z$ :

\begin{array}{l} [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] = [\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}] {([\begin{array}{l} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{array}] + [\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}])}^{- 1} \\ ([\begin{array}{l} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{array}] - [\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}]) {[\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}]}^{- 1} . \end{array}

(22)

If $Z_{o 1} = Z_{o 2}$ , Eq. (22) reduces to

\begin{array}{r} [\begin{matrix} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{matrix}] = {([\begin{matrix} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{matrix}] + [\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}])}^{- 1} \\ ([\begin{matrix} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{matrix}] - [\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}]) . \end{array}

(23)

Alternatively, we can combine Eqs. (20) and (21) to solve for $Z$ in terms of $S$ :

\begin{matrix} [\begin{array}{l} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{array}] = ([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] + {[\begin{matrix} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{matrix}]}^{- 1} [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] [\begin{matrix} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{matrix}]) \\ {([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] - {[\begin{matrix} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{matrix}]}^{- 1} [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] [\begin{matrix} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{matrix}])}^{- 1} . \end{matrix}

(24)

If $Z_{o 1} = Z_{o 2}$ , Eq. (24) reduces to

\begin{array}{r} [\begin{matrix} [{Z^{'}}_{11}] & [{Z^{'}}_{12}] \\ [{Z^{'}}_{21}] & [{Z^{'}}_{22}] \end{matrix}] = ([\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}] + [\begin{matrix} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{matrix}]) \\ {([\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}] - [\begin{matrix} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{matrix}])}^{- 1} . \end{array}

(25)
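The conversions of Eqs. (19), (22), and (24) can be sketched numerically. The version below uses only the Python standard library, so it carries its own small Gauss-Jordan inverse; the function names are hypothetical, the characteristic impedances are assumed real and positive, and the diagonal matrices of the equations are applied as element-wise row and column scalings, which is equivalent for diagonal factors.

```python
import math


def matmul(A, B):
    """Product of two dense matrices stored as lists of rows."""
    return [[sum(A[r][m] * B[m][c] for m in range(len(B)))
             for c in range(len(B[0]))] for r in range(len(A))]


def inverse(A):
    """Gauss-Jordan inverse with partial pivoting (small dense matrices only)."""
    n = len(A)
    aug = [list(map(complex, row)) + [complex(r == c) for c in range(n)]
           for r, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-14:
            raise ValueError("matrix is singular to working precision")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def per_row_z0(z0_ports, n_harmonics):
    """Repeat each port's Z0 for all of its harmonics, matching the order of Eq. (6)."""
    return [z for z in z0_ports for _ in range(n_harmonics)]


def z_to_s(Z, z0):
    """Eq. (22); z0[c] is the characteristic impedance belonging to row/column c."""
    n = len(Z)
    Zn = [[Z[r][c] / z0[c] for c in range(n)] for r in range(n)]         # Eq. (19)
    plus = [[Zn[r][c] + (r == c) for c in range(n)] for r in range(n)]   # Z' + U
    minus = [[Zn[r][c] - (r == c) for c in range(n)] for r in range(n)]  # Z' - U
    core = matmul(inverse(plus), minus)
    # Outer factors diag(1/sqrt(Z0)) and its inverse, applied element-wise.
    return [[core[r][c] * math.sqrt(z0[c] / z0[r]) for c in range(n)] for r in range(n)]


def s_to_z(S, z0):
    """Eq. (24), followed by undoing the normalization of Eq. (19)."""
    n = len(S)
    T = [[S[r][c] * math.sqrt(z0[r] / z0[c]) for c in range(n)] for r in range(n)]
    plus = [[(r == c) + T[r][c] for c in range(n)] for r in range(n)]    # U + T
    minus = [[(r == c) - T[r][c] for c in range(n)] for r in range(n)]   # U - T
    Zn = matmul(plus, inverse(minus))
    return [[Zn[r][c] * z0[c] for c in range(n)] for r in range(n)]


# One port, one harmonic: a 75 ohm load on a 50 ohm line.
print(z_to_s([[75.0]], [50.0])[0][0])

# Two ports, two harmonics, unequal line impedances (hypothetical values).
z0 = per_row_z0([50.0, 75.0], 2)
Z = [[60 + 5j, 2 - 1j, 10 + 0j, 1j],
     [3 + 0j, 40 - 8j, 0.5j, 12 + 0j],
     [9 + 1j, 0j, 80 + 2j, 4 - 2j],
     [1 - 1j, 11 + 0j, 5j, 30 + 6j]]
Z_back = s_to_z(z_to_s(Z, z0), z0)
print(max(abs(Z_back[r][c] - Z[r][c]) for r in range(4) for c in range(4)))
```

A round trip through both functions should return the starting matrix to within rounding error, which is a convenient check that the row and column scalings have been applied on the correct sides.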

2.4 Nonlinear Large-Signal Admittance Parameters

We can also express the relationship between voltages (V’s) and currents (I’s) in terms of a nonlinear large-signal admittance matrix $D$ , as follows

I = D V,

(26)

where $D$ is an (N×M)²-element square matrix. For a two-port network with three harmonics, for example, Eq. (26) becomes

[\begin{matrix} {\bar{I}}_{1} \\ {\bar{I}}_{2} \end{matrix}] = [\begin{array}{l} [D_{11}] & [D_{12}] \\ [D_{21}] & [D_{22}] \end{array}] [\begin{matrix} {\bar{V}}_{1} \\ {\bar{V}}_{2} \end{matrix}],

(27)

where

[D_{i j}] = [\begin{array}{l} D_{i j 11} & D_{i j 12} & D_{i j 13} \\ D_{i j 21} & D_{i j 22} & D_{i j 23} \\ D_{i j 31} & D_{i j 32} & D_{i j 33} \end{array}] .

(28)

For each nonlinear large-signal admittance parameter $D_{i j k l}$ , the index i refers to the port number of the current I, the index j refers to the port number of the voltage V, k is the harmonic index of I, and l is the harmonic index of V. The vectors ${\bar{V}}_{j}$ and ${\bar{I}}_{i}$ are, once again, (M=3)-element vectors, defined in Eq. (11). Equation (27) can be expanded as follows

[\begin{array}{l} I_{11} \\ I_{12} \\ I_{13} \\ I_{21} \\ I_{22} \\ I_{23} \end{array}] = [\begin{array}{l} D_{1111} & D_{1112} & D_{1113} & D_{1211} & D_{1212} & D_{1213} \\ D_{1121} & D_{1122} & D_{1123} & D_{1221} & D_{1222} & D_{1223} \\ D_{1131} & D_{1132} & D_{1133} & D_{1231} & D_{1232} & D_{1233} \\ D_{2111} & D_{2112} & D_{2113} & D_{2211} & D_{2212} & D_{2213} \\ D_{2121} & D_{2122} & D_{2123} & D_{2221} & D_{2222} & D_{2223} \\ D_{2131} & D_{2132} & D_{2133} & D_{2231} & D_{2232} & D_{2233} \end{array}] [\begin{array}{l} V_{11} \\ V_{12} \\ V_{13} \\ V_{21} \\ V_{22} \\ V_{23} \end{array}] .

(29)

2.5 Relating $S$ and $D$ Matrices

The $S$ and $D$ matrices can also be expressed in terms of one another, using Eqs. (13) and (14) which show how a and b relate to V and I.

For simplicity, we will again assume the network under consideration consists of two ports. If we allow the two transmission lines or waveguides connecting the two ports to have different characteristic impedances $Z_{o 1}$ and $Z_{o 2}$ , Eq. (14) can be expressed in matrix form as

[\begin{matrix} {\bar{I}}_{1} \\ {\bar{I}}_{2} \end{matrix}] = [\begin{matrix} [U] / Z_{o 1} & [0] \\ [0] & [U] / Z_{o 2} \end{matrix}] ([\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] - [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}]),

(30)

where [U] is the identity matrix. Equation (27) can be expressed as

[\begin{matrix} {\bar{I}}_{1} \\ {\bar{I}}_{2} \end{matrix}] = [\begin{array}{l} [D_{11}] & [D_{12}] \\ [D_{21}] & [D_{22}] \end{array}] ([\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] + [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}]) .

(31)

Combining Eqs. (30) and (31) gives

[\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] - [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = {[\begin{matrix} [U] / Z_{o 1} & [0] \\ [0] & [U] / Z_{o 2} \end{matrix}]}^{- 1} [\begin{array}{l} [D_{11}] & [D_{12}] \\ [D_{21}] & [D_{22}] \end{array}] ([\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] + [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}])

(32)

[\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] - [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = [\begin{array}{l} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{array}] ([\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] + [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}]),

(33)

where

[\begin{array}{l} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{array}] = {[\begin{matrix} [U] / Z_{o 1} & [0] \\ [0] & [U] / Z_{o 2} \end{matrix}]}^{- 1} [\begin{array}{l} [D_{11}] & [D_{12}] \\ [D_{21}] & [D_{22}] \end{array}]

(34)

is the normalized admittance matrix. Equation (33) can be rewritten as

([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] + [\begin{array}{l} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{array}]) [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = ([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] - [\begin{array}{l} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{array}]) [\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}]

(35)

and Eq. (3) can be rewritten as

[\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}] [\begin{matrix} {\bar{V}}_{1}^{-} \\ {\bar{V}}_{2}^{-} \end{matrix}] = [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] [\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}] [\begin{matrix} {\bar{V}}_{1}^{+} \\ {\bar{V}}_{2}^{+} \end{matrix}] .

(36)

Combining Eqs. (35) and (36) allows us to solve for $S$ in terms of $D$ :

\begin{matrix} [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] = [\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}] {([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] + [\begin{array}{l} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{array}])}^{- 1} \\ ([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] - [\begin{array}{l} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{array}]) {[\begin{matrix} [U] / \sqrt{Z_{o 1}} & [0] \\ [0] & [U] / \sqrt{Z_{o 2}} \end{matrix}]}^{- 1} . \end{matrix}

(37)

If $Z_{o 1} = Z_{o 2}$ , Eq. (37) reduces to

\begin{array}{r} [\begin{matrix} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{matrix}] = {([\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}] + [\begin{matrix} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{matrix}])}^{- 1} \\ ([\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}] - [\begin{matrix} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{matrix}]) \end{array}

(38)

Alternatively, we can combine Eqs. (35) and (36) to solve for $D$ in terms of $S$ :

\begin{matrix} [\begin{array}{l} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{array}] = ([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] - {[\begin{array}{l} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{array}]}^{- 1} [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] [\begin{array}{l} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{array}]) \\ {([\begin{array}{l} [U] & [0] \\ [0] & [U] \end{array}] + {[\begin{array}{l} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{array}]}^{- 1} [\begin{array}{l} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{array}] [\begin{array}{l} \frac{[U]}{\sqrt{Z_{o 1}}} & [0] \\ [0] & \frac{[U]}{\sqrt{Z_{o 2}}} \end{array}])}^{- 1} \end{matrix}

(39)

If $Z_{o 1} = Z_{o 2}$ , Eq. (39) reduces to

\begin{array}{r} [\begin{matrix} [{D^{'}}_{11}] & [{D^{'}}_{12}] \\ [{D^{'}}_{21}] & [{D^{'}}_{22}] \end{matrix}] = ([\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}] - [\begin{matrix} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{matrix}]) \\ {([\begin{matrix} [U] & [0] \\ [0] & [U] \end{matrix}] + [\begin{matrix} [S_{11}] & [S_{12}] \\ [S_{21}] & [S_{22}] \end{matrix}])}^{- 1} \end{array}

(40)
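As a consistency check between the impedance and admittance forms, the following short derivation shows that Eqs. (22) and (37) describe the same $S$ . It assumes that $Z$ is invertible, so that Eqs. (8) and (26) give $D = Z^{-1}$ ; the shorthand $\Lambda$ for the block-diagonal matrix of $[U] / Z_{o 1}$ and $[U] / Z_{o 2}$ is introduced here only for brevity and is not part of the original notation.

```latex
% Shorthand (assumption): \Lambda = diag([U]/Z_{o1}, [U]/Z_{o2}), so Z' = Z \Lambda by Eq. (19)
% and D' = \Lambda^{-1} D by Eq. (34). Assumption: Z is invertible, hence D = Z^{-1}.
D' = \Lambda^{-1} D = \Lambda^{-1} Z^{-1} = (Z \Lambda)^{-1} = (Z')^{-1}

(Z' + U)^{-1} (Z' - U)
  = [ (D')^{-1} (U + D') ]^{-1} (D')^{-1} (U - D')
  = (U + D')^{-1} D' (D')^{-1} (U - D')
  = (U + D')^{-1} (U - D')
```

The central factor of Eq. (22) therefore equals the central factor of Eq. (37), and since the outer diagonal factors are identical in both equations, the two expressions for $S$ agree. The same argument shows that Eqs. (23) and (38) agree when the characteristic impedances are equal.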

2.6 One-Port Network With Single-Tone Excitation

For a one-port network with a single-tone excitation at the fundamental frequency, we can extract a reflection coefficient given by

S_{11 k 1} = \left. \frac{| b_{1 k} | ∠ (ϕ_{b_{1 k}} - k ϕ_{a_{11}})}{| a_{11} |} \right|_{a_{1 m} = 0 \text{ for all } m (m \neq 1)} .

(41)

The limitation imposed on the equation is that all incident waves other than $a_{11}$ equal zero. Instead of simply taking the ratio of $b_{1 k}$ to $a_{11}$ , we reference the phase of $b_{1 k}$ to that of $a_{11}$ . To do this, we must subtract k times the phase of $a_{11}$ from the phase of $b_{1 k}$ [8].
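The effect of the phase referencing in Eq. (41) can be illustrated with a short sketch. It assumes that a time delay rotates the phasor at the kth harmonic by k times the rotation of the fundamental; the function names and the wave values are hypothetical.

```python
import cmath
import math


def phase_referenced_s(b_k, a_11, k):
    """Eq. (41): magnitude |b_k| / |a_11| at angle phase(b_k) - k * phase(a_11)."""
    magnitude = abs(b_k) / abs(a_11)
    angle = cmath.phase(b_k) - k * cmath.phase(a_11)
    return cmath.rect(magnitude, angle)


def time_shift(phasor, harmonic, theta):
    """Delay the waveform by theta radians of the fundamental period."""
    return phasor * cmath.exp(-1j * harmonic * theta)


# Hypothetical measurement: fundamental incident wave and second-harmonic reflected wave.
a_11 = cmath.rect(0.8, math.radians(30.0))
b_12 = cmath.rect(0.1, math.radians(100.0))

s_ref = phase_referenced_s(b_12, a_11, 2)

# The same physical signals observed with a different time origin.
theta = 0.7
a_11_shifted = time_shift(a_11, 1, theta)
b_12_shifted = time_shift(b_12, 2, theta)
s_ref_shifted = phase_referenced_s(b_12_shifted, a_11_shifted, 2)

print(abs(s_ref - s_ref_shifted))                    # unchanged by the time shift
print(abs(b_12 / a_11 - b_12_shifted / a_11_shifted))  # the plain ratio changes
```

The phase-referenced quantity is the same for both time origins, while the plain complex ratio of a second-harmonic wave to a fundamental wave is not, which is the reason for subtracting k times the phase of the incident wave.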

For a one-port network with a single-tone excitation at the fundamental frequency, we can show that the equation relating $S$ and $Z$ reduces to the same well-known equation for the linear case if we assume that no energy is redistributed into the form of frequency down-conversion. To illustrate this, we will consider only M=3 harmonics, for the sake of simplicity. Equation (6) reduces to

[\begin{matrix} b_{11} \\ b_{12} \\ b_{13} \end{matrix}] = [\begin{array}{l} S_{1111} & S_{1112} & S_{1113} \\ S_{1121} & S_{1122} & S_{1123} \\ S_{1131} & S_{1132} & S_{1133} \end{array}] [\begin{matrix} a_{11} \\ 0 \\ 0 \end{matrix}],

(42)

for a one-port network with a single-tone excitation a₁₁. This matrix can be rewritten as a set of three equations:

b_{11} = S_{1111} a_{11}; b_{12} = S_{1121} a_{11}; b_{13} = S_{1131} a_{11} .

(43)

Likewise, Eq. (12) reduces to

[\begin{matrix} V_{11} \\ V_{12} \\ V_{13} \end{matrix}] = [\begin{array}{l} Z_{1111} & Z_{1112} & Z_{1113} \\ Z_{1121} & Z_{1122} & Z_{1123} \\ Z_{1131} & Z_{1132} & Z_{1133} \end{array}] [\begin{matrix} I_{11} \\ I_{12} \\ I_{13} \end{matrix}],

(44)

where the voltage V₁₁ at the first harmonic can be expressed as

V_{11} = Z_{1111} I_{11} + Z_{1112} I_{12} + Z_{1113} I_{13} .

(45)

From Eqs. (13) and (14), we know that

\begin{array}{l} V_{11} = \sqrt{Z_{o 1}} (a_{11} + b_{11}), \\ I_{11} = \frac{1}{\sqrt{Z_{o 1}}} (a_{11} - b_{11}), \\ I_{12} = \frac{1}{\sqrt{Z_{o 1}}} (a_{12} - b_{12}) = - \frac{b_{12}}{\sqrt{Z_{o 1}}}, \\ I_{13} = \frac{1}{\sqrt{Z_{o 1}}} (a_{13} - b_{13}) = - \frac{b_{13}}{\sqrt{Z_{o 1}}} . \end{array}

(46)

Combining Eqs. (45) and (46) gives

\sqrt{Z_{o 1}} (a_{11} + b_{11}) = \frac{1}{\sqrt{Z_{o 1}}} [Z_{1111} (a_{11} - b_{11}) - Z_{1112} b_{12} - Z_{1113} b_{13}] .

(47)

Substituting Eq. (43) into Eq. (47) and solving for $Z_{1111}$ gives

Z_{1111} = \frac{Z_{o 1} (1 + S_{1111}) + Z_{1112} S_{1121} + Z_{1113} S_{1131}}{(1 - S_{1111})} .

(48)

If no energy is redistributed into the form of frequency down-conversion (i.e., $Z_{1112} = Z_{1113} = 0$ ), then Eq. (48) reduces to the same equation as in the linear case:

Z_{11} = Z_{o 1} \frac{(1 + S_{11})}{(1 - S_{11})} .

(49)

A similar derivation can be performed to show that

D_{1111} = \frac{(1 - S_{1111}) / Z_{o 1} - D_{1112} S_{1121} - D_{1113} S_{1131}}{(1 + S_{1111})} .

(50)

Once again, if no energy is transferred to frequency down-conversion (i.e., $D_{1112} = D_{1113} = 0$ ), then Eq. (50) reduces to the same equation as in the linear case:

Y_{11} = \frac{1}{Z_{11}} = \frac{1}{Z_{o 1}} \frac{(1 - S_{11})}{(1 + S_{11})} .

(51)
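Equations (48) and (50) can be checked numerically against Eq. (45) and its admittance counterpart. The sketch below uses hypothetical parameter values chosen only to exercise the formulas, and the function names `z1111` and `d1111` are illustrative.

```python
import math


def z1111(z0, s1111, s1121, s1131, z1112=0.0, z1113=0.0):
    """Eq. (48)."""
    return (z0 * (1 + s1111) + z1112 * s1121 + z1113 * s1131) / (1 - s1111)


def d1111(z0, s1111, s1121, s1131, d1112=0.0, d1113=0.0):
    """Eq. (50)."""
    return ((1 - s1111) / z0 - d1112 * s1121 - d1113 * s1131) / (1 + s1111)


# Hypothetical values for a one-port device driven only by a_11.
z0, a11 = 50.0, 1.0 + 0.0j
s1111, s1121, s1131 = 0.3 - 0.2j, 0.05 + 0.02j, 0.01j
z1112, z1113 = 4.0 + 1.0j, -2.0j
d1112, d1113 = 0.002 - 0.001j, 0.0005j

b11, b12, b13 = s1111 * a11, s1121 * a11, s1131 * a11      # Eq. (43)
root = math.sqrt(z0)
v11, v12, v13 = root * (a11 + b11), root * b12, root * b13  # Eq. (13) with a_12 = a_13 = 0
i11, i12, i13 = (a11 - b11) / root, -b12 / root, -b13 / root  # Eq. (46)

z_val = z1111(z0, s1111, s1121, s1131, z1112, z1113)
d_val = d1111(z0, s1111, s1121, s1131, d1112, d1113)

# Residuals of V_11 = Z_1111 I_11 + ... (Eq. 45) and of the analogous first row of Eq. (29).
z_residual = v11 - (z_val * i11 + z1112 * i12 + z1113 * i13)
d_residual = i11 - (d_val * v11 + d1112 * v12 + d1113 * v13)
print(abs(z_residual), abs(d_residual))

# With no down-conversion terms the results match Eqs. (49) and (51).
print(z1111(50.0, 0.2, 0.0, 0.0), d1111(50.0, 0.2, 0.0, 0.0))
```

Both residuals should vanish to rounding error, and setting the down-conversion terms to zero recovers the familiar linear one-port relations.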

2.7 Two-Port Network With Single-Tone Excitation

For a two-port network excited at port 1 by a single-tone excitation at the fundamental frequency, we can extract an input reflection coefficient given by

S_{11 k 1} = \left. \frac{| b_{1 k} | ∠ (ϕ_{b_{1 k}} - k ϕ_{a_{11}})}{| a_{11} |} \right|_{a_{m n} = 0 \text{ for all } m, n [(m \neq 1) \land (n \neq 1)]} .

(52)

As with Eq. (41), instead of simply taking the ratio of $b_{1 k}$ to $a_{11}$ , we phase reference to $a_{11}$ . To do this, we must subtract k times the phase of $a_{11}$ from the phase of $b_{1 k}$ . The limitation once again imposed on the equation is that all incident waves other than $a_{11}$ equal zero.

Another valuable parameter, the forward transmission coefficient, is similarly extracted as follows

S_{21 k 1} = \left. \frac{| b_{2 k} | ∠ (ϕ_{b_{2 k}} - k ϕ_{a_{11}})}{| a_{11} |} \right|_{a_{m n} = 0 \text{ for all } m, n [(m \neq 1) \land (n \neq 1)]} .

(53)

This parameter provides a value of the gain or loss through a device either at the fundamental frequency or converted to a higher harmonic frequency.
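As a rough illustration of how the forward transmission coefficient might be tabulated per harmonic, the sketch below computes its magnitude in decibels and its referenced phase. The phase is referenced to the incident wave at port 1, following the description given for the input reflection coefficient. The wave values are hypothetical and are not measurements from this work, and the decibel conversion assumes both ports share the same wave normalization.

```python
import cmath
import math


def s21k1(b_2k, a_11, k):
    """Forward transmission coefficient with phase referenced to a_11."""
    magnitude = abs(b_2k) / abs(a_11)
    angle = cmath.phase(b_2k) - k * cmath.phase(a_11)
    return cmath.rect(magnitude, angle)


def gain_db(s):
    """Magnitude of a wave ratio in decibels."""
    return 20.0 * math.log10(abs(s))


# Hypothetical output waves of a frequency doubler, keyed by harmonic index k.
a_11 = cmath.rect(1.0, math.radians(20.0))
b_2 = {
    1: cmath.rect(0.05, math.radians(-30.0)),
    2: cmath.rect(0.25, math.radians(-60.0)),
    3: cmath.rect(0.02, math.radians(95.0)),
}

for k, b in sorted(b_2.items()):
    s = s21k1(b, a_11, k)
    print(k, round(gain_db(s), 2), round(math.degrees(cmath.phase(s)), 1))
```

For a doubler, the entry at k = 2 is the figure of interest, while the entries at k = 1 and k = 3 indicate how much fundamental and third-harmonic signal leaks to the output.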

In addition to the previous two parameters, given in Eqs. (52) and (53), an output reflection coefficient can also be useful when trying to determine the output matching network. If a nonlinear DUT is operating under its normal drive condition (a₁₁ at some constant signal level), and a second source, excited by a small-signal tone at frequency f_k, is placed at port 2 of the DUT, one of the equations in the matrix defined by Eq. (6) reduces to

b_{2 k} = S_{21 k 1} a_{11} + S_{22 k k} a_{2 k} .

(54)

If we solve Eq. (54) for $S_{22 k k}$ , we obtain

S_{22 k k} = \frac{b_{2 k}}{a_{2 k}} - \frac{S_{21 k 1} a_{11}}{a_{2 k}} .

(55)

In Eq. (55), the output reflection coefficient $S_{22 k k}$ obviously cannot be determined by simply taking the ratio of b₂_k to a₂_k, since the ratio also depends on a₁₁ through $S_{21 k 1}$ . When a₂_k is small, we can generate another signal ∆a₂_k that is offset slightly from the frequency of interest f_k by ∆f_k. Eq. (54) then becomes

b_{2 k} + \Delta b_{2 k} = S_{21 k 1} a_{11} + S_{22 k k} (a_{2 k} + \Delta a_{2 k}),

(56)

where ∆a₂_k << a₂_k and $S_{22 k k}$ remains constant over this frequency range. Subtracting Eq. (54) from Eq. (56) gives

\Delta b_{2 k} = S_{22 k k} \Delta a_{2 k},

(57)

which does not depend on $S_{21 k 1}$ . If we solve Eq. (57) for $S_{22 k k}$ , we obtain

S_{22 k k} = \left. \frac{\Delta b_{2 k}}{\Delta a_{2 k}} \right|_{\text{large } a_{11}, \; \text{small } \Delta a_{2 k}} .

(58)

Equation (58) is a quasi-linear approximation of the output reflection coefficient under normal operating conditions, and is consistent with the definition of “Hot S₂₂,” which has been used to measure the degree of mismatch at the output port of a power amplifier at its excitation frequency.
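
The difference between the plain ratio of Eq. (55) and the differential ratio of Eq. (58) can be seen in a small numerical sketch. The device coefficients below are hypothetical, and the frequency-offset tone is idealized as a small perturbation of a₂_k at the same frequency, with $S_{22 k k}$ held constant.

```python
# Hypothetical device values; in a measurement these are unknown.
S_21K1 = 0.33 * complex(0.8, 0.6)     # large-signal transmission term
S_22KK = complex(0.10, -0.05)         # output reflection to recover
A_11 = 1.0                            # large-signal drive at port 1


def b_2k(a_2k):
    """Eq. (54): wave leaving port 2 at harmonic k."""
    return S_21K1 * A_11 + S_22KK * a_2k


a_2k = 0.01 + 0.0j                    # small-signal tone at port 2
delta_a = 0.0001j                     # perturbation with |delta_a| << |a_2k|

# Plain ratio: dominated by the S_21k1 * a_11 term, so it is not S_22kk.
naive = b_2k(a_2k) / a_2k
# Differential ratio, Eq. (58): the S_21k1 * a_11 term cancels.
hot_s22 = (b_2k(a_2k + delta_a) - b_2k(a_2k)) / delta_a

print(abs(naive), hot_s22)
```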

2.8 Summary of Sec. 2

In this section, we presented the general form of nonlinear large-signal $S$ -parameters. Unlike linear S-parameters, nonlinear large-signal $S$ -parameters depend upon the signal magnitude and must take into account the harmonic content of the input and output signals, since energy can be transferred to other frequencies in a nonlinear device. We also introduced nonlinear large-signal impedance ( $Z$ ) and admittance ( $D$ ) parameters, and presented equations for relating the different representations. Next, we made two simplifications, considering the cases of a one-port network with a single-tone excitation and a two-port network with a single-tone excitation. For the one-port case with a single-tone excitation at the fundamental frequency, we showed that the equation relating $S$ and $Z$ reduces to the same well-known equation for the linear case if we assume that no energy is transferred to frequency down-conversion. For the two-port case excited at port 1 by a single-tone excitation at the fundamental frequency, we extracted an input reflection coefficient $S_{11 k 1}$ , a forward transmission coefficient $S_{21 k 1}$ , and a quasi-linear output reflection coefficient $S_{22 k k}$ .

3. Using Nonlinear Large-Signal $S$ -Parameters to Design a Diode Frequency-Doubler Circuit With a Harmonic-Balance Simulator

Resistive frequency doublers operate on the principle that a sinusoidal waveform is distorted by the nonlinear I/V characteristic of a Schottky-barrier diode [9]. This distortion causes power to be generated at higher-harmonic frequencies. The design of such doublers involves separating the input and output signals by filters and determining the optimum input and output matching circuits, as illustrated in Fig. 1.

Fig. 1 — Block diagram of a single-diode resistive doubler.

Although single-diode resistive doublers are not very efficient (analysis predicts a conversion loss of at least 9 dB [10]), we chose this circuit because it is simple enough to clearly illustrate how nonlinear large-signal $S$ -parameters can be used as a design tool.

In the following sections, we describe the various steps involved in designing a single-diode 1 GHz frequency-doubler circuit. Since we are using a simulator, we can force the stimulus to consist of only a₁₁, with all other a_mn terms equal to zero, where m and n are positive integers such that m ≠ 1 and n ≠ 1. (In practice, this condition can never be completely realized in a measurement environment.) With only an a₁₁ component present, we need to consider only three parameters: $S_{11 k 1}$ [Eq. (52)], a measure of the large-signal input match at the kth harmonic; $S_{21 k 1}$ [Eq. (53)], a measure of the large-signal conversion loss or gain at the kth harmonic; and the quasi-linear $S_{2222}$ [Eq. (58)], which is used to determine the output matching network at the second harmonic. Figure 2 illustrates the setups required for determining these parameters. Determining $S_{2222}$ requires a second source at port 2 at a frequency slightly offset from ω₂.

Fig. 2 — Nonlinear large-signal $S$ -parameters used to characterize a two-port device excited by a single-tone signal at port 1.

In the first step, we perform a simulation on the diode alone and use $S_{2121}$ to determine the optimum bias condition for converting power from the fundamental frequency to the second harmonic. Second, we add filtering networks to separate the input and output signals, and verify their proper performance by looking at $S_{2111}$ and $S_{1121}$ . Third, we make use of $S_{1111}$ to determine the input matching network. Fourth, with the input matching network in place, we place a second source at port 2 and find the quasi-linear value of $S_{2222}$ , which allows us to determine the output matching network. Fifth, we use the optimization feature of the simulator to minimize $S_{1111}$ by varying the line lengths of the input and output matching circuits. And finally, sixth, we add 4 GHz and 6 GHz filters at the output (and re-determine the proper input and output matching circuits) in order to reduce the values of $S_{2141}$ and $S_{2161}$ , which in turn increases the value of $S_{2121}$ and cleans up the output waveform.

3.1 Diode Only

In this example, we use a compact model to simulate a commercial Schottky-barrier diode. The model includes a series resistance R_s of 14 Ω, a junction capacitance at zero voltage C_j₀ of 0.08 pF, and a reverse saturation current I_s of 3 ×10⁻¹⁰ A.

First, we perform a harmonic-balance simulation on the diode, sweeping the bias voltage to determine which condition gives the highest value of $S_{2121}$ for a₁₁ = 1.0 V. Note that in all simulations we set the generator impedance Z_G and the load impedance Z_L to 50 Ω. After sweeping the voltage, we determine that the optimum forward bias is +0.48 V.

3.2 Diode With 1 GHz and 2 GHz Filters

With a stimulus of a₁₁ = 1.0 V and a forward bias of +0.48 V, we add filtering networks to separate the input and output signals. On the input side, we place a 2 GHz, λ/4 (λ/8 at 1 GHz) open-circuited stub. This creates an RF short at 2 GHz, preventing the output power generated in the diode from traveling backward. On the output side, we place a 1 GHz, λ/4 open-circuited stub. This creates an RF short at 1 GHz, preventing any signal at 1 GHz from traveling forward.
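
The filtering action of the stubs follows from the input impedance of a lossless open-circuited line, $-j Z_{0} \cot(\beta l)$. The Python sketch below evaluates it at 1 GHz and 2 GHz for both stubs. The 50 Ω stub impedance and the helper names are assumptions made for illustration; the text does not state the characteristic impedance of the stubs.

```python
import math


def open_stub_impedance(z0, electrical_length_rad):
    """Input impedance of a lossless open-circuited stub: -j * Z0 * cot(beta * l)."""
    return -1j * z0 / math.tan(electrical_length_rad)


def stub_length_rad(design_freq_hz, eval_freq_hz):
    """Electrical length at eval_freq of a stub that is a quarter wave at design_freq."""
    return (math.pi / 2) * eval_freq_hz / design_freq_hz


Z0 = 50.0  # assumed stub characteristic impedance
# Input-side stub: quarter wave at 2 GHz, so one eighth of a wavelength at 1 GHz.
z_in_stub_at_2ghz = open_stub_impedance(Z0, stub_length_rad(2e9, 2e9))
z_in_stub_at_1ghz = open_stub_impedance(Z0, stub_length_rad(2e9, 1e9))
# Output-side stub: quarter wave at 1 GHz, so half a wavelength at 2 GHz.
z_out_stub_at_1ghz = open_stub_impedance(Z0, stub_length_rad(1e9, 1e9))
z_out_stub_at_2ghz = open_stub_impedance(Z0, stub_length_rad(1e9, 2e9))

print(abs(z_in_stub_at_2ghz), z_in_stub_at_1ghz)
print(abs(z_out_stub_at_1ghz), abs(z_out_stub_at_2ghz))
```

Under these assumptions each stub is an RF short at its design frequency. The output stub is nearly invisible at 2 GHz, while the input stub adds a capacitive shunt reactance at 1 GHz that the input matching network has to absorb.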

Table 1 lists the simulated values for $S_{1111} - S_{1161}, S_{2111} - S_{2161}, G_{2}$ and $G_{2} / G$ for each of the design stages, where $G$ is the expanded power gain and $G_{2}$ is the expanded power gain confined to the second harmonic, as defined in [11]. With the 1 GHz and 2 GHz filters in place, we see that the value of $| S_{1121} |$ decreases from 0.170 to 1.3 × 10⁻⁵, the value of $| S_{2111} |$ decreases from 0.536 to 3.3 × 10⁻⁵, and $G_{2}$ increases from −14.16 dB to −9.73 dB.

Table 1.

Simulated values for $S_{1111} - S_{1161}, S_{2111} - S_{2161}, G_{2}$ and $G_{2} / G$ for each of the design stages of the diode frequency doubler

Quantity	Diode only	Diode w/ 1, 2 GHz filters	Diode w/ 1, 2 GHz filters, input match	Diode w/ 1, 2 GHz filters, input & output match	Diode w/ 1, 2 GHz filters, input & output match opt.	Diode w/ 1, 2, 4, 6 GHz filters, input & output match opt.
$\| S_{1111} \|$	0.464	0.569	9.4×10⁻²	8.7×10⁻²	6.0×10⁻³	2.1×10⁻⁴
$\| S_{1121} \|$	0.170	1.3×10⁻⁵	8.8×10⁻⁶	8.0×10⁻⁶	9.5×10⁻⁶	9.9×10⁻⁶
$\| S_{1131} \|$	3.2×10⁻²	4.9×10⁻³	4.0×10⁻³	1.4×10⁻²	1.1×10⁻²	2.2×10⁻²
$\| S_{1141} \|$	2.4×10⁻²	3.5×10⁻²	3.7×10⁻²	2.4×10⁻²	2.8×10⁻²	5.1×10⁻²
$\| S_{1151} \|$	1.7×10⁻²	1.1×10⁻²	1.1×10⁻²	1.9×10⁻³	2.3×10⁻³	2.5×10⁻³
$\| S_{1161} \|$	3.9×10⁻³	1.0×10⁻⁶	1.0×10⁻⁶	9.7×10⁻⁷	1.1×10⁻⁶	2.0×10⁻⁶
$\| S_{2111} \|$	0.536	3.3×10⁻⁵	4.0×10⁻⁵	4.0×10⁻⁵	4.0×10⁻⁵	5.0×10⁻⁵
$\| S_{2121} \|$	0.170	0.268	0.326	0.328	0.331	0.332
$\| S_{2131} \|$	3.2×10⁻²	3.5×10⁻⁷	3.3×10⁻⁷	1.5×10⁻⁶	1.1×10⁻⁶	1.7×10⁻⁷
$\| S_{2141} \|$	2.4×10⁻²	3.5×10⁻²	4.5×10⁻²	4.1×10⁻²	4.0×10⁻²	1.4×10⁻⁶
$\| S_{2151} \|$	1.7×10⁻²	7.6×10⁻⁷	1.1×10⁻⁶	2.5×10⁻⁶	2.3×10⁻⁶	3.0×10⁻⁶
$\| S_{2161} \|$	3.9×10⁻³	2.0×10⁻²	2.5×10⁻²	2.6×10⁻²	2.9×10⁻²	2.7×10⁻⁶
$G_{2}$ (dB)	−14.16	−9.73	−9.69	−9.65	−9.60	−9.56
$G_{2} / G$	0.091	0.978	0.976	0.979	0.978	0.999
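
The formal definitions of $G$ and $G_{2}$ are given in [11] and are not reproduced here. As a rough cross-check of Table 1, the Python sketch below assumes that $G$ is the total power delivered to the load over the first six harmonics divided by the net power entering port 1, and that $G_{2}$ keeps only the second-harmonic output term. Under that assumed reading, the tabulated magnitudes for two of the design stages give values close to the $G_{2}$ and $G_{2} / G$ rows.

```python
import math

# Magnitudes from Table 1 for k = 1..6: (|S_11k1|, |S_21k1|) for two design stages.
TABLE_1 = {
    "diode only": (
        [0.464, 0.170, 3.2e-2, 2.4e-2, 1.7e-2, 3.9e-3],
        [0.536, 0.170, 3.2e-2, 2.4e-2, 1.7e-2, 3.9e-3],
    ),
    "1, 2 GHz filters, input & output match opt.": (
        [6.0e-3, 9.5e-6, 1.1e-2, 2.8e-2, 2.3e-3, 1.1e-6],
        [4.0e-5, 0.331, 1.1e-6, 4.0e-2, 2.3e-6, 2.9e-2],
    ),
}


def gains(s11, s21):
    """Assumed definitions: delivered output power over net input power."""
    net_input = 1.0 - sum(m * m for m in s11)      # incident minus all reflected power
    g_total = sum(m * m for m in s21) / net_input  # all six output harmonics
    g_second = s21[1] ** 2 / net_input             # second harmonic only
    return 10 * math.log10(g_second), g_second / g_total


results = {stage: gains(*columns) for stage, columns in TABLE_1.items()}
for stage, (g2_db, ratio) in results.items():
    print(f"{stage}: G2 = {g2_db:.2f} dB, G2/G = {ratio:.3f}")
```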

3.3 Diode With 1 GHz and 2 GHz Filters and Input Matching

Once the filters are placed in the circuit, we make use of the complex-valued $S_{1111}$ to design the input matching network with the well-known single, open-circuited stub technique. This is possible, assuming that no energy is transferred to frequency down-conversion, as discussed in Sec. 2.6. We see in Table 1 that $| S_{1111} |$ reduces from 0.569 without the input matching network to 9.4 × 10⁻² with the input matching network in place. Likewise, $G_{2}$ increases from −9.73 dB to −9.69 dB.
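
For readers who want to reproduce the single-stub step, the sketch below solves the textbook shunt open-stub problem for a given complex reflection coefficient. It is a generic illustration, not the simulator procedure used here. It assumes a passive load with magnitude below one, lossless lines that all share the reference impedance, and a stub placed on the source side of a series line. The phase of $S_{1111}$ is hypothetical, since Table 1 lists only its magnitude.

```python
import cmath
import math


def single_open_stub_match(gamma_load):
    """Return (line_length, stub_length) in radians of electrical length.

    A line of the first length sits between the load and a shunt open-circuited
    stub of the second length. Admittances are normalized to the line impedance.
    """
    y = (1 - gamma_load) / (1 + gamma_load)      # normalized load admittance
    g, b = y.real, y.imag
    if abs(y - 1) < 1e-12:
        return 0.0, 0.0                           # already matched
    quad = b * b + g * g - g
    if abs(quad) < 1e-12:
        t = (1 - g) / (2 * b)
    else:
        # Root of t^2 (b^2 + g^2 - g) - 2 b t + (1 - g) = 0, which sets Re(y_in) = 1.
        t = (b + math.sqrt(g * (b * b + (1 - g) ** 2))) / quad
    y_in = (y + 1j * t) / (1 + 1j * y * t)        # admittance seen at the stub plane
    stub_susceptance = -y_in.imag                 # the stub must cancel Im(y_in)
    line_length = math.atan(t) % math.pi
    stub_length = math.atan(stub_susceptance) % math.pi
    return line_length, stub_length


# Hypothetical complex value: only |S_1111| = 0.569 is tabulated, the phase is assumed.
s_1111 = cmath.rect(0.569, math.radians(-40.0))
line_len, stub_len = single_open_stub_match(s_1111)

# Check the reflection coefficient seen looking into the matched network.
y_load = (1 - s_1111) / (1 + s_1111)
t_line = math.tan(line_len)
y_total = (y_load + 1j * t_line) / (1 + 1j * y_load * t_line) + 1j * math.tan(stub_len)
gamma_matched = (1 - y_total) / (1 + y_total)
print(math.degrees(line_len), math.degrees(stub_len), abs(gamma_matched))
```

The quadratic has a second root (the minus sign in front of the square root) that gives an alternative line and stub pair; the sketch returns only one of the two solutions.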

3.4 Diode With 1 GHz and 2 GHz Filters, Plus Input and Output Matching

Whereas our input matching network is designed for 1 GHz, our output matching network must be designed for 2 GHz. While the circuit is operating under its normal drive condition (a₁₁ = 1.0 V and a forward bias of +0.48 V) we place a second source at port 2, excited by a small-signal tone (Δa₂₂ = 0.01 V) at a frequency offset of 10 kHz from the desired 2 GHz, to give us the quasi-linear value of $S_{2222}$ , which allows us to determine the output matching network. We make use of $S_{2222}$ to design the output matching network with the well-known single, open-circuited stub technique. We see in Table 1 that with the output matching network in place, the value of $| S_{2121} |$ is only marginally increased from 0.326 to 0.328. This is because the value of $S_{2222}$ is relatively low, which means the output is already almost matched to 50 Ω. We also note that $G_{2}$ increases from −9.69 dB to −9.65 dB.

3.5 Diode With 1 GHz and 2 GHz Filters, Plus Optimized Input and Output Matching

With the filters and matching networks in place, we use the optimization feature of the simulator to minimize $S_{1111}$ by varying the lengths of the lines in the input and output matching circuits. Doing this decreases the value of $| S_{1111} |$ from 8.7 × 10⁻² to 6.0 × 10⁻³ while increasing the value of $| S_{2121} |$ from 0.328 to 0.331 and $G_{2}$ from −9.65 dB to −9.60 dB.

3.6 Diode With (1, 2, 4, and 6) GHz Filters, Plus Optimized Input and Output Matching

From Table 1, we see that at the output port, $| S_{2111} |, | S_{2131} |$ , and $| S_{2151} |$ all have values less than or equal to 4.0 × 10⁻⁵, but $| S_{2141} |$ and $| S_{2161} |$ have noticeably higher values (at least 2.9 × 10⁻²).

In order to clean up the output waveform, we add 4 GHz and 6 GHz filters, in the form of λ/4 open-circuited stubs, at the output. With these filters placed in the circuit, we re-determine the proper input and output matching conditions. After optimizing the circuit once again, the value of $| S_{2141} |$ decreases from 4.0 × 10⁻² to 1.4 × 10⁻⁶ and the value of $| S_{2161} |$ decreases from 2.9 × 10⁻² to 2.7 × 10⁻⁶. The addition of these filters, in turn, slightly increases $| S_{2121} |$ from 0.331 to 0.332 and $G_{2}$ from −9.60 dB to −9.56 dB. At this final design stage, the overall power gain $G$ is nearly equal to $G_{2}$, that is, −9.56 dB, since the ratio $G_{2} / G = 0.999$ . The semi-empirical analysis of [10] predicts a maximum gain of −9 dB. Figure 3 illustrates the final design of the single-diode resistive doubler circuit, and Fig. 4 shows the time-domain plots of a₁ and b₂ for the final design of the simulated 1 GHz frequency-doubler circuit.

Fig. 3 — Final design of the single-diode resistive frequency doubler. Electrical lengths shown are all at 1 GHz.

Fig. 4 — Time-domain plots of a₁ and b₂ for the simulated 1 GHz frequency-doubler circuit.

3.7 Summary of Sec. 3

We illustrated how nonlinear large-signal $S$ -parameters can be used as a tool in the design process of a single-diode 1 GHz frequency-doubler. Specifically, we used $S_{1111}$ to determine the input matching network, $S_{2222}$ to determine the output matching network, and $S_{11 k 1}, S_{21 k 1}$ (for k = 1 to 6), and $G_{2}$ to quantify the performance of the circuit at each stage.

By the final stage of the design, we had created a doubler with an overall power gain of −9.56 dB, not far from the maximum possible predicted value of −9 dB.

4. Determining Nonlinear Large-Signal $S$ -Parameters from Artificial Neural Network Models Trained With Measurement Data

Although nonlinear large-signal $S$ -parameters can be easily determined for an existing model in a commercial harmonic-balance simulator by forcing all a’s other than a₁₁ to zero, they cannot be determined directly from measurements. With currently available NVNAs, the nonlinear DUT, in conjunction with the impedance mismatches and harmonics from the system, makes it impossible to set all a’s other than a₁₁ (assuming port 1 excitation) to zero. In order to overcome this obstacle, we propose a method [12] that makes use of multiple measurements of a DUT using a second source with isolators, as shown in Fig. 5. This measurement set-up is similar to that introduced by Verspecht et al. [6–7] to generate “nonlinear scattering functions.” As a side note, we compare and contrast the “nonlinear scattering functions” with our definitions of nonlinear large-signal scattering parameters in the Appendix.

Fig. 5 — Block diagram of a nonlinear vector network analyzer equipped with a second source and isolators.

4.1 Methodology

To illustrate our technique of generating nonlinear large-signal $S$ -parameters, let us consider the case where a DUT is excited at port 1 by a single-tone signal at frequency f₁ and signal level |a₁₁|. Utilizing a second source, we take multiple measurements of a nonlinear circuit for different values of a_mn [(m≠1)∧(n≠1)]. We then use these data to develop an artificial neural network (ANN) model that maps values of a’s to b’s, as shown in Fig. 6. Once the ANN model is trained and verified, the nonlinear large-signal $S$ -parameters are obtained by interpolating b’s from the measured results for nonzero values of a_mn [(m≠1)∧(n≠1)] to the desired values for a_mn [(m≠1)∧(n≠1)] equal to zero, as shown in Fig. 7. Alternatively, other conditions may be called for, where $a_{m n} \neq 0$ depending on the desired application-specific figure of merit.

Fig. 6 — An ANN model that maps real and imaginary values of a’s to b’s for different real and imaginary values of *a_mn* [(m≠1)∧(n≠1)]

Fig. 7 — An ANN model that interpolates b’s from the measured results for nonzero values of *a_mn* [(m≠1)∧(n≠1)] to the desired values for *a_mn* [(m≠1)∧(n≠1)] equal to zero. Outputs of the ANN model yield values of $S_{11 k 1}$ .
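
The idea of fitting a model to measurements taken with nonzero unwanted waves and then evaluating it at zero can be shown with a deliberately simplified stand-in. The sketch below replaces the ANN with a complex linear least-squares fit of b₁₁ against a single unwanted wave a₂₁, and uses synthetic data from a hypothetical device. It illustrates the extraction step only, not the ANN method itself.

```python
import cmath
import random

rng = random.Random(0)

# Synthetic "device" with hypothetical coefficients, linear in the unwanted wave a_21.
A_11 = 0.178
TRUE_S_1111 = complex(0.30, -0.20)
TRUE_COUPLING = complex(0.45, 0.10)


def measure_b11(a_21):
    return TRUE_S_1111 * A_11 + TRUE_COUPLING * a_21


# Second source: roughly constant amplitude, random phase.
a_21_samples = [
    cmath.rect(0.178 * rng.uniform(0.95, 1.05), rng.uniform(-cmath.pi, cmath.pi))
    for _ in range(500)
]
b_11_samples = [measure_b11(a) for a in a_21_samples]

# Complex least-squares fit of b_11 = c0 + c1 * a_21.
n = len(a_21_samples)
a_mean = sum(a_21_samples) / n
b_mean = sum(b_11_samples) / n
c1 = sum((a - a_mean).conjugate() * (b - b_mean)
         for a, b in zip(a_21_samples, b_11_samples))
c1 /= sum(abs(a - a_mean) ** 2 for a in a_21_samples)
c0 = b_mean - c1 * a_mean              # fitted model evaluated at a_21 = 0

s_1111_extracted = c0 / A_11           # a_11 is real here, so no phase reference is needed
print(s_1111_extracted)
```

A real device responds to many unwanted waves at once and not necessarily linearly, which is the motivation for using a more flexible model such as an ANN.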

One popular type of ANN architecture, which is used in our work, is a feed-forward, three-layer perceptron structure (MLP3) consisting of an input layer, a hidden layer, and an output layer [13]. The hidden layer allows for complex models of input-output relationships. ANNs learn relationships among sets of input-output data that are characteristic of the device or system under consideration. After the input vectors are presented to the input neurons and output vectors are computed, the ANN outputs are compared to the desired outputs and errors are calculated. Error derivatives are then calculated and summed for each weight until all of the training sets have been presented to the network. The error derivatives are used to update the weights for the neurons, and training continues until the errors become no greater than prescribed values. In our study, we have utilized software developed by Zhang et al. [14] to construct our ANN models.
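
The training loop described above can be sketched in plain Python. This is a minimal illustration with a tanh hidden layer, a single linear output, and plain batch gradient descent; it is not the software of [14] and it does not use the conjugate gradient training reported in Sec. 4.2. The function name, network size, learning rate, and toy data are all assumptions.

```python
import math
import random


def train_mlp3(samples, hidden=4, epochs=500, lr=0.05, seed=0):
    """Batch gradient descent on a three-layer perceptron (tanh hidden, linear output).

    samples is a list of (inputs, target) pairs with a scalar target.
    Returns (predict, loss_history).
    """
    rng = random.Random(seed)
    n_in = len(samples[0][0])
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(hidden)]
    b1 = [0.0] * hidden
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]
    b2 = 0.0

    def forward(x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(w1, b1)]
        return h, sum(w * hj for w, hj in zip(w2, h)) + b2

    history = []
    for _ in range(epochs):
        g_w1 = [[0.0] * n_in for _ in range(hidden)]
        g_b1 = [0.0] * hidden
        g_w2 = [0.0] * hidden
        g_b2 = 0.0
        loss = 0.0
        for x, target in samples:          # sum error derivatives over all training sets
            h, out = forward(x)
            err = out - target
            loss += 0.5 * err * err
            g_b2 += err
            for j in range(hidden):
                g_w2[j] += err * h[j]
                back = err * w2[j] * (1.0 - h[j] ** 2)
                g_b1[j] += back
                for i in range(n_in):
                    g_w1[j][i] += back * x[i]
        history.append(loss / len(samples))
        scale = lr / len(samples)          # one weight update per pass through the data
        for j in range(hidden):
            w2[j] -= scale * g_w2[j]
            b1[j] -= scale * g_b1[j]
            for i in range(n_in):
                w1[j][i] -= scale * g_w1[j][i]
        b2 -= scale * g_b2

    return (lambda x: forward(x)[1]), history


# Toy data (hypothetical): a smooth two-input function standing in for measured waves.
data_rng = random.Random(1)
data = []
for _ in range(40):
    x = [data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)]
    data.append((x, 0.5 * x[0] - 0.3 * x[1] + 0.2 * x[0] * x[1]))

predict, history = train_mlp3(data)
print(history[0], history[-1])
```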

To test our method of generating nonlinear large-signal $S$ -parameters, we fabricated a wafer-level test circuit using a Schottky diode in a series configuration, as shown in Fig. 8. The two-port diode circuit was fabricated on an alumina substrate by bonding a beam-lead diode package to the gold metalization layer with silver epoxy. The diode was located in the middle of the coplanar waveguide (CPW) transmission lines, with short lines connecting the diode to probe pads at both ports. We measured the test circuit on an NVNA using an on-wafer VNA line-reflect-reflect-match (LRRM) calibration, along with signal amplitude and phase calibrations. This process places the reference plane at the tips of the wafer probes used to connect with the CPW leads.

Fig. 8 — Schottky diode in a series configuration located in the middle of a CPW transmission line. (White area is metal.)

For all measurements, the first source, located at port 1, used a sine-wave excitation of frequency 900 MHz and magnitude |a₁₁|≈0.178 V (−5 dBm in a 50 Ω environment) at the probe tips. The second source was connected to port 2 and used a sine-wave excitation of frequency 900 MHz and |a₂₁|≈0.178 V. The diode was forward-biased to +0.2 V through the probe tips. In order to obtain the nonlinear large-signal $S$ -parameters, $S_{11 k 1}$ and $S_{21 k 1}$ , the excitation from source 1 was held constant, while the phase of source 2 was randomly changed for 500 different measurements that varied slightly in magnitude. Figure 9 plots the resulting measurements of a₂₁ in the complex plane. The nonlinearities in the test circuit, along with impedance mismatches, created other input components at higher harmonics, as shown in Figs. 10–13 for the second and third harmonics (a₁₂, a₁₃, a₂₂, and a₂₃). These variations in a_ij allowed us to create an ANN model that could be used to interpolate b’s from the measured results for nonzero values of a_mn [(m≠1)∧(n≠1)], as shown in Figs. 14 and 15 for b₁₁ and b₂₁, to the desired values for a_mn [(m≠1)∧(n≠1)] equal to zero, or alternatively another desired device condition.
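
The quoted equivalence between −5 dBm and 0.178 V can be checked with a short calculation. The sketch below assumes that |a₁₁| is expressed as the peak voltage amplitude of a sine wave in a 50 Ω system, which is the reading that reproduces the quoted number; the function name is hypothetical.

```python
import math


def dbm_to_peak_volts(power_dbm, resistance_ohm=50.0):
    """Peak voltage amplitude of a sine wave delivering power_dbm into resistance_ohm."""
    power_watts = 1e-3 * 10 ** (power_dbm / 10)
    return math.sqrt(2 * resistance_ohm * power_watts)


a_11_magnitude = dbm_to_peak_volts(-5.0)
print(round(a_11_magnitude, 3))
```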

Fig. 9 — Five hundred measurements of a₂₁ in the complex plane with the excitation from source 1 held constant and the output from source 2 set to random phases with constant amplitude.

Fig. 10 — Five hundred measurements of a₁₂ in the complex plane with the excitation from source 1 held constant and the output from source 2 set to random phases with constant amplitude.

Fig. 11 — Five hundred measurements of a₁₃ in the complex plane with the excitation from source 1 held constant and the output from source 2 set to random phases with constant amplitude.

Fig. 12 — Five hundred measurements of a₂₂ in the complex plane with the excitation from source 1 held constant and the output from source 2 set to random phases with constant amplitude.

Fig. 13 — Five hundred measurements of a₂₃ in the complex plane with the excitation from source 1 held constant and the output from source 2 set to random phases with constant amplitude.

Fig. 14 — Five hundred measurements of b₁₁ in the complex plane with the excitation from source 1 held constant and the output from source 2 set to random phases with constant amplitude.

Fig. 15 — Five hundred measurements of b₂₁ in the complex plane with the excitation from source 1 held constant and the output from source 2 set to random phases with constant amplitude.

4.2 Sensitivity Analysis of ANN Models

Data from the 500 measurements were used to develop two ANN models, one for mapping values from the first five harmonics of a₁ and a₂ (a₁₁, a₁₂, …, a₁₅, a₂₁, a₂₂, …, a₂₅) to the first five harmonics of b₁ (b₁₁, b₁₂, …, b₁₅), and the other for mapping values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₂ (b₂₁, b₂₂, …, b₂₅). We performed a sensitivity analysis to determine how many training points, testing points, and hidden neurons are required to adequately train the two ANN models. Tables 2–4 summarize the results for the first model, where we map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₁, and Tables 5–7 summarize the results for the second model, where we map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₂.

Table 2.

Average testing errors and correlation coefficients as functions of the number of hidden neurons for ANN models trained to map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₁. All models were developed using 250 training points and verified using 250 testing points

Hidden neurons	Average testing error (%)	Correlation Coefficient
1	16.86	0.94814
2	10.84	0.98896
4	4.56	0.99715
6	1.66	0.99971
8	1.15	0.99989
10	1.08	0.99991
12	0.80	0.99996
14	0.72	0.99997
16	0.72	0.99997
18	0.84	0.99996
20	0.70	0.99997

Table 3.

Average testing errors and correlation coefficients as functions of the number of training points for ANN models trained to map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₁. All models were developed using 14 hidden neurons and verified using 250 testing points

Training points	Average testing error (%)	Correlation Coefficient
5	20.10	0.96764
10	9.01	0.99556
25	3.64	0.99891
50	1.91	0.99979
125	0.95	0.99995
250	0.72	0.99997

Table 4.

Average testing errors and correlation coefficients as functions of the number of testing points for ANN models trained to map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₁. All models were developed using 250 training points and 14 hidden neurons

Testing points	Average testing error (%)	Correlation Coefficient
5	0.80	0.99998
10	0.74	0.99997
25	0.68	0.99998
50	0.68	0.99998
125	0.72	0.99997
250	0.72	0.99997

Table 5.

Average testing errors and correlation coefficients as functions of the number of hidden neurons for ANN models trained to map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₂. All models were developed using 250 training points and verified using 250 testing points

Hidden neurons	Average testing error (%)	Correlation Coefficient
1	17.88	0.74320
2	13.22	0.91161
4	6.48	0.96659
6	2.04	0.99893
8	1.43	0.99951
10	0.90	0.99985
12	0.82	0.99989
14	0.78	0.99989
16	0.73	0.99992
18	0.78	0.99988
20	0.99	0.99983

Table 6.

Average testing errors and correlation coefficients as functions of the number of training points for ANN models trained to map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₂. All models were developed using 14 hidden neurons and verified using 250 testing points

Training points	Average testing error (%)	Correlation Coefficient
5	27.08	0.50237
10	12.99	0.91962
25	3.72	0.99628
50	1.75	0.99940
125	1.09	0.99978
250	0.78	0.99989

Table 7.

Average testing errors and correlation coefficients as functions of the number of testing points for ANN models trained to map values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₂. All models were developed using 250 training points and 14 hidden neurons

Testing points	Average testing error (%)	Correlation Coefficient
5	0.87	0.99995
10	0.84	0.99993
25	0.81	0.99988
50	0.80	0.99989
125	0.81	0.99988
250	0.78	0.99989

First, we varied the number of hidden neurons from 1 to 20. All other parameters were held constant. Specifically, the 500 measurement points were divided into 250 training points and 250 testing points, and we used the conjugate gradient method for training. Table 2 lists the average testing errors and correlation coefficients for the models that map a₁ and a₂ to b₁, and Table 5 lists the average testing errors and correlation coefficients for the models that map a₁ and a₂ to b₂. Both mappings show similar trends. The average testing errors decreased with increasing numbers of hidden neurons until around 14 or 16, where the errors were minimized. For more than 16 hidden neurons, the trend reversed and the errors appeared to start increasing again. Figure 16 plots the average testing errors as a function of the number of hidden neurons for both mappings.

Fig. 16 — Average testing errors as functions of the number of hidden neurons for ANN models trained to map a₁ and a₂ to b₁ and a₁ and a₂ to b₂. The models were developed using 250 training points and verified using 250 testing points.

Next, we varied the number of training points from 5 to 250. All other parameters were held constant. The number of hidden neurons was set to 14 since we found that to be an ideal number from the previous analysis, and 250 testing points were used for verification. Table 3 lists the average testing errors and correlation coefficients for the models that map a₁ and a₂ to b₁, and Table 6 lists the average testing errors and correlation coefficients for the models that map a₁ and a₂ to b₂. Once again, both mappings showed similar trends. The average testing errors decreased for an increasing number of training points. However, as more and more training points were added, diminishing returns on the testing errors were evident. Figure 17 plots the average testing errors as a function of the number of training points for both mappings.

Fig. 17 — Average testing errors as functions of the number of training points for ANN models trained to map a₁ and a₂ to b₁ and a₁ and a₂ to b₂. The models were developed using 14 hidden neurons and verified using 250 testing points.

Finally, we varied the number of testing points from 5 to 250. All other parameters were held constant. The number of hidden neurons was once again set to 14, and the same 250 training points were used for model development. Table 4 lists the average testing errors and correlation coefficients for the models that map a₁ and a₂ to b₁, and Table 7 lists the average testing errors and correlation coefficients for the models that map a₁ and a₂ to b₂. Both mappings showed that the average testing errors varied little with the number of testing points. Figure 18 plots the average testing errors as a function of the number of testing points for both mappings.

Fig. 18 — Average testing errors as functions of the number of testing points for ANN models trained to map a₁ and a₂ to b₁ and a₁ and a₂ to b₂. The models were developed using 14 hidden neurons and 250 training points.

4.3 Results and Comparison for Sec. 4

Based on the results of our sensitivity analysis, we decided to use 250 training points and 250 testing points to train and verify the two ANN models. We chose to use 14 hidden neurons for mapping values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₁ and 16 hidden neurons for mapping values from the first five harmonics of a₁ and a₂ to the first five harmonics of b₂. The testing error was 0.72 % for the b₁ model and 0.73 % for the b₂ model, with respective correlation coefficients of 0.99997 and 0.99992.

After the ANN models were developed, the nonlinear large-signal $S$ -parameters, $S_{11 k 1}$ and $S_{21 k 1}$ (k = 1, 2, …, 5), were obtained by interpolating b₁_k and b₂_k from measured results for nonzero values of a₁₂, a₁₃, …, a₁₅ and a₂₁, a₂₂, …, a₂₅ to the desired values for a₁₂, a₁₃, …, a₁₅ and a₂₁, a₂₂, …, a₂₅ equal to zero. Figure 19 shows the interpolated value of b₁₁ ( $= S_{1111} \cdot a_{11}$ ) when a₁₂, a₁₃, …, a₁₅ and a₂₁, a₂₂, …, a₂₅ were set equal to zero, and Fig. 20 shows the interpolated value of b₂₁ ( $= S_{2111} \cdot a_{11}$ ) when a₁₂, a₁₃, …, a₁₅ and a₂₁, a₂₂, …, a₂₅ were set equal to zero.

Fig. 19 — The 250 measurements of b₁₁ used for training (circles). Values of $S_{1111} \cdot a_{11}$ were determined from the measurement-based ANN model (square) and the harmonic balance simulation using a compact model (triangle).

Fig. 20 — The 250 measurements of b₂₁ used for training (circles). Values of $S_{2111} \cdot a_{11}$ were determined from the measurement-based ANN model (square) and the harmonic balance simulation using a compact model (triangle).

We compared our results to a compact model provided by the manufacturer and simulated in commercial harmonic-balance software to get an independent check on our methodology. Our comparison was accomplished by providing the simulator with the identical biasing conditions on the diode and a stimulus of the same magnitude used in the measurements for a₁₁ and setting all other a’s to zero. Providing the simulated circuit with a₁₁ of the same magnitude as the measurement should give the same values of b₁_k and b₂_k as the interpolated values of b₁_k ( $= S_{11 k 1} \cdot a_{11}$ ) and b₂_k ( $= S_{21 k 1} \cdot a_{11}$ ) determined by the ANN models when a₁₂, a₁₃, …, a₁₅ and a₂₁, a₂₂, …, a₂₅ are set equal to zero. Figures 19 and 20 show that the simulated values b₁₁ and b₂₁ agree with those determined from the measurement-based ANN models.

Table 8 lists the differences between the measurement-based ANN results and the compact-model simulation.

Table 8.

Differences between the measurement-based, ANN-modeled results and the compact model simulated in commercial harmonic-balance software

Quantity	Difference (%)	Difference (dBV)
$S_{1111}$	3.38	–44.5
$S_{1121}$	1.23	–53.3
$S_{1131}$	3.29	–44.8
$S_{1141}$	0.40	–63.1
$S_{1151}$	1.67	–50.6
$S_{2111}$	3.95	–43.2
$S_{2121}$	7.15	–38.0
$S_{2131}$	5.93	–39.6
$S_{2141}$	0.72	–57.9
$S_{2151}$	0.85	–56.5
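
Table 8 does not state how its two columns are related. One reading that reproduces the dBV column to within about 0.2 dB is that each percentage is a fraction of |a₁₁| ≈ 0.178 V, expressed as a voltage. The Python sketch below checks this assumed relation against the tabulated values; it is an inference from the numbers, not a definition taken from the text.

```python
import math

A_11_VOLTS = 0.178  # |a_11| at the probe tips, from Sec. 4.1

# (quantity, difference in percent, difference in dBV) as listed in Table 8.
TABLE_8 = [
    ("S_1111", 3.38, -44.5), ("S_1121", 1.23, -53.3), ("S_1131", 3.29, -44.8),
    ("S_1141", 0.40, -63.1), ("S_1151", 1.67, -50.6), ("S_2111", 3.95, -43.2),
    ("S_2121", 7.15, -38.0), ("S_2131", 5.93, -39.6), ("S_2141", 0.72, -57.9),
    ("S_2151", 0.85, -56.5),
]


def percent_to_dbv(percent, reference_volts=A_11_VOLTS):
    """Assumed relation: the percentage is a fraction of |a_11|, expressed in volts."""
    return 20 * math.log10(percent / 100 * reference_volts)


deviations = [abs(percent_to_dbv(pct) - dbv) for _, pct, dbv in TABLE_8]
print(max(deviations))
```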

4.4 Summary of Sec. 4

We described a method of extracting nonlinear large-signal $S$ -parameters, using an NVNA equipped with isolators and a second source. First, we showed how multiple measurements of a nonlinear circuit could be used to train artificial neural networks. Then, we extracted the desired $S$ -parameters by interpolating the ANN models for all a’s equal to zero other than a₁₁. We checked our approach by comparing our results to a compact model simulated in commercial harmonic-balance software, and showed that the two methods agree well.

We also performed a sensitivity analysis on the ANN models, and discovered the following: (1) The average testing error decreases for an increasing number of training points. However, as more and more training points are added, diminishing returns on the testing errors are evident. (2) As the number of hidden neurons is increased, the average testing error decreases until around 14 hidden neurons, at which point more hidden neurons have no benefit and can actually lead to increases in testing error. (3) The number of testing points does not drastically affect the testing error. In fact, no more than 25 testing points are needed for the models tested.

5. Overall Summary

In this paper, we introduced nonlinear large-signal scattering parameters representing a new type of frequency-domain mapping that relates incident and reflected signals. Unlike classical S-parameters, nonlinear large-signal $S$ -parameters take harmonic content into account and depend on the signal magnitudes. First, we presented a general form of nonlinear large-signal $S$ -parameters and showed that they reduce to classic S-parameters in the absence of nonlinearities. We also introduced nonlinear large-signal impedance ( $Z$ ) and admittance ( $D$ ) parameters, and presented equations that relate the different representations. Next, we considered two simplified cases of a one-port network and a two-port network, each with a single-tone excitation. For the one-port network, we showed that the equation relating $S$ and $Z$ reduces to the same well-known equation for the linear case, assuming no power is transferred in the form of frequency down-conversion. For the two-port case, we extracted input reflection coefficients and forward transmission coefficients, which can be useful for designing circuits such as amplifiers and frequency multipliers. In addition, we derived a quasi-linear approximation of the output reflection coefficient under normal operating conditions. These three two-port parameters allow a designer to “see” application-specific engineering figures of merit that are similar to what he or she is accustomed to in the linear world.

Next, we illustrated how nonlinear large-signal $S$ -parameters can be used as a tool in the design process of a single-diode 1 GHz frequency-doubler. Specifically, we used $S_{1111}$ to determine the input matching network, $S_{2222}$ to determine the output matching network, and $S_{11 k 1}, S_{21 k 1}$ (for k = 1 to 6), and $G_{2}$ to quantify the performance of the circuit at each stage. By the final stage of the design, we had created a doubler with an overall power gain of −9.56 dB, a value not far from the maximum possible predicted value of −9 dB.

For the case where a nonlinear model is not readily available, we described a method of extracting nonlinear large-signal $S$ -parameters using an NVNA equipped with isolators and a second source. First, we showed how multiple measurements of a nonlinear circuit could be used to train artificial neural networks. Then, we extracted the desired $S$ -parameters by interpolating the ANN models for all a’s equal to zero other than a₁₁. We checked our approach by comparing our results to a compact model simulated in commercial harmonic-balance software, and showed that the two methods agree well. We also performed a sensitivity analysis on the ANN models and found the following: (1) The average testing error decreases as the number of training points increases, although with diminishing returns as more and more training points are added. (2) As the number of hidden neurons is increased, the average testing error decreases until around 14 hidden neurons, at which point additional hidden neurons provide no benefit and can actually increase the testing error. (3) The number of testing points does not drastically affect the testing error. In fact, no more than 25 testing points are needed for the models tested.

Acknowledgments

The authors thank Dominique Schreurs for her assistance with the measurements discussed in Sec. 4 and for her helpful suggestions regarding the preparation of this manuscript, and Alessandro Cidronali for his valuable interactions.

Biography

About the authors: Jeffrey A. Jargon has been with the Electromagnetics Division, NIST Electronics and Electrical Engineering Laboratory, Boulder, CO, since 1990. His current research interests include calibration techniques for nonlinear vector network analyzers and artificial neural network modeling of passive and active devices.

K.C. Gupta has been a Professor at the University of Colorado since 1983. Presently, he is also the Associate Director for the NSF I/UCR Center for Advanced Manufacturing and Packaging of Microwave, Optical and Digital Electronics (CAMPmode) at the University of Colorado; and a Guest Researcher with the RF Technology Group of National Institute of Standards and Technology at Boulder. Dr. Gupta’s current research interests are in the area of computer-aided design techniques (including ANN applications) for microwave and millimeter-wave integrated circuits, nonlinear characterization and modeling, RF MEMS, and reconfigurable antennas.

Donald C. DeGroot is currently the Project Leader with the NIST Nonlinear Device Characterization Project in the Electromagnetics Division. His present research activities include development of large-signal broadband measurement and calibration techniques for the development and validation of nonlinear circuits. Concurrently, Don is also Professor Adjoint of Electrical and Computer Engineering at the University of Colorado at Boulder. The National Institute of Standards and Technology is an agency of the Technology Administration, U.S. Department of Commerce.

6. Appendix A. Comparing Nonlinear Large-Signal $S$ -Parameters With Nonlinear Scattering Functions

Here, we compare the nonlinear large-signal $S$ -parameters, introduced in this paper, to another form of nonlinear mapping, known as nonlinear scattering functions, introduced by Verspecht [6–7].

For a two-port nonlinear device, excited by a single-tone signal, and assuming all harmonic signals are relatively small compared to the fundamental signals, Verspecht defines nonlinear scattering functions as

b_{k p} = F_{k p} + \sum_{\begin{array}{l} i = 1, 2 \\ j = 2, \dots, M \end{array}} G_{k p i j} Re (a_{i j}) + \sum_{\begin{array}{l} i = 1, 2 \\ j = 2, \dots, M \end{array}} H_{k p i j} Im (a_{i j}),

(59)

where a_ij and b_kp represent the wave variables proportional to the incoming and outgoing waves, respectively, and M refers to the number of harmonics being taken into account. F_kp, G_kpij, and H_kpij are functions of the fundamental components Re(a₁₁), Re(a₂₁), and Im(a₂₁). The imaginary component of a₁₁ is omitted, with the assumption that the wave variables are phase referenced such that the phase of a₁₁ is set to zero. F_kp, G_kpij, and H_kpij are assumed complex constants for a given bias and fundamental drive condition. Note that these three terms do not depend upon the higher harmonic signal levels. With the a_ij wave variables split into real and imaginary components, G_kpij and H_kpij serve to map a_ij circles centered at zero to b_kp ellipses with variable axes also centered at zero, as shown in Fig. 21. The F_kp terms translate the ellipses about the complex plane.

Fig. 21 — *G_kpij* and *H_kpij* serve to map *a_ij* circles centered at zero to *b_kp* ellipses with variable axes also centered at zero, neglecting *F_kp* for illustrative purposes.
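The circle-to-ellipse mapping described above can be checked numerically. The following is a minimal sketch, not part of the original work: it treats a single G_kpij, H_kpij pair as a real 2×2 matrix acting on (Re(a_ij), Im(a_ij)) and reads the ellipse semi-axes from its singular values. The function names and the numerical values of G and H are hypothetical and chosen only for illustration.

```python
import numpy as np

def scattering_ellipse_axes(G, H):
    """Semi-axes of the b-plane ellipse traced by G*Re(a) + H*Im(a) for |a| = 1."""
    # Real 2x2 matrix taking (Re a, Im a) to (Re b, Im b).
    M = np.array([[G.real, H.real],
                  [G.imag, H.imag]])
    # The singular values of M are the semi-major and semi-minor axes.
    return np.linalg.svd(M, compute_uv=False)

def trace_b(F, G, H, radius=1.0, n=360):
    """Sample b = F + G*Re(a) + H*Im(a) for a on a circle of the given radius."""
    theta = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    a = radius * np.exp(1j * theta)
    return F + G * a.real + H * a.imag

# Hypothetical coefficients (assumed values, not taken from the paper).
G = 0.30 + 0.10j
H = -0.05 + 0.20j

b = trace_b(0.0, G, H)                     # F set to zero, as in Fig. 21
major, minor = scattering_ellipse_axes(G, H)
print("semi-axes from SVD:   ", major, minor)
print("max/min |b| on circle:", np.abs(b).max(), np.abs(b).min())
```

With F set to zero, the largest and smallest sampled |b| should agree with the two singular values to within the sampling resolution; a nonzero F only translates the ellipse.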

For illustrative purposes, let us consider b₁₁, taking into account the first three harmonics. Doing this, Eq. (59) reduces to

b_{11} = F_{11} + \sum_{\begin{array}{l} i = 1, 2 \\ j = 2, 3 \end{array}} G_{11 i j} Re (a_{i j}) + \sum_{\begin{array}{l} i = 1, 2 \\ j = 2, 3 \end{array}} H_{11 i j} Im (a_{i j})

(60)

\begin{matrix} b_{11} = F_{11} + G_{1112} Re (a_{12}) + H_{1112} Im (a_{12}) \\ + G_{1113} Re (a_{13}) + H_{1113} Im (a_{13}) \\ + G_{1122} Re (a_{22}) + H_{1122} Im (a_{22}) \\ + G_{1123} Re (a_{23}) + H_{1123} Im (a_{23}) . \end{matrix}

(61)

If we now consider the nonlinear large-signal $S$ -parameter representation for b₁₁, once again assuming a two-port network and taking into account the first three harmonics, we have

b_{11} = \sum_{\begin{array}{l} j = 1, 2 \\ l = 1, 2, 3 \end{array}} S_{1 j 1 l} a_{j l}

(62)

\begin{array}{l} b_{11} = S_{1111} a_{11} + S_{1112} a_{12} + S_{1113} a_{13} \\ + S_{1211} a_{21} + S_{1212} a_{22} + S_{1213} a_{23} . \end{array}

(63)

Here, $S_{i j k l}$ are functions of all of the harmonics, not just the fundamental terms. So for any change in any a_jl, a new set of $S_{i j k l}$ must be determined. Separating the real and imaginary components of the a’s, we can express Eq. (63) as

\begin{array}{l} b_{11} = S_{1111} Re (a_{11}) + S_{1112} [Re (a_{12}) + j Im (a_{12})] \\ + S_{1113} [Re (a_{13}) + j Im (a_{13})] + S_{1211} [Re (a_{21}) + j Im (a_{21})] \\ + S_{1212} [Re (a_{22}) + j Im (a_{22})] + S_{1213} [Re (a_{23}) + j Im (a_{23})] . \end{array}

(64)

Once again, the imaginary component of a₁₁ is omitted, with the phase reference such that the phase of a₁₁ is set to zero.

We can now equate the nonlinear large-signal $S$ -parameters of Eq. (64) to the nonlinear scattering functions of Eq. (61), with the understanding that this is generally valid only for the special case in which the nonlinear large-signal $S$ -parameters are constant for a given bias and fundamental drive level, as F_kp, G_kpij, and H_kpij are defined to be. Normally, however, the nonlinear large-signal $S$ -parameters depend upon the higher harmonics as well as on the bias and fundamental drive level. The implication of this special case will be discussed shortly, after Eqs. (61) and (64) are equated. Equating the corresponding real and imaginary components of the a wave variables in Eqs. (61) and (64) gives

F_{11} = S_{1111} Re (a_{11}) + S_{1211} a_{21} .

(65)

Additionally,

S_{1112} = G_{1112}; j S_{1112} = H_{1112},

(66)

S_{1113} = G_{1113}; j S_{1113} = H_{1113},

(67)

S_{1212} = G_{1122}; j S_{1212} = H_{1122},

(68)

and

S_{1213} = G_{1123}; j S_{1213} = H_{1123} .

(69)

Equations (66)–(69) imply

G_{1112} = - j H_{1112}; G_{1113} = - j H_{1113}; G_{1122} = - j H_{1122}; G_{1123} = - j H_{1123},

(70)

which means

Re (G_{k p i j}) = Im (H_{k p i j}); Re (H_{k p i j}) = - Im (G_{k p i j}) .

(71)
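The step from Eq. (70) to Eq. (71) can be written out explicitly. In the sketch below, H_r and H_i are shorthand introduced here (not notation from the paper) for the real and imaginary parts of H_kpij.

```latex
\begin{aligned}
H_{kpij} &= H_r + j H_i, \\
G_{kpij} &= -j H_{kpij} = -j (H_r + j H_i) = H_i - j H_r, \\
\mathrm{Re}(G_{kpij}) &= H_i = \mathrm{Im}(H_{kpij}), \qquad
\mathrm{Im}(G_{kpij}) = -H_r = -\mathrm{Re}(H_{kpij}).
\end{aligned}
```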

Equation (71) satisfies the conditions of the Cauchy-Riemann equations [15],

\frac{\partial [Re (b_{k p})]}{\partial [Re (a_{i j})]} = \frac{\partial [Im (b_{k p})]}{\partial [Im (a_{i j})]}; \frac{\partial [Re (b_{k p})]}{\partial [Im (a_{i j})]} = - \frac{\partial [Im (b_{k p})]}{\partial [Re (a_{i j})]},

(72)

which implies b_kp must be an analytic function of a_ij. A complex-valued function is said to be analytic on an open set W if it has a derivative at every point of W. This is generally true only when b_kp is a linear function of a_ij. Thus, equating the nonlinear large-signal $S$ -parameters with the nonlinear scattering functions is generally valid only in the small-signal, linear case.
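To see how Eq. (71) connects to Eq. (72), consider a single a_ij term of Eq. (59) with F_kp, G_kpij, and H_kpij held constant. The shorthand x = Re(a_ij) and y = Im(a_ij) is introduced here for brevity and is not notation from the paper.

```latex
\begin{aligned}
b_{kp} &= F_{kp} + G_{kpij}\, x + H_{kpij}\, y, \\
\frac{\partial\, \mathrm{Re}(b_{kp})}{\partial x} &= \mathrm{Re}(G_{kpij}), \qquad
\frac{\partial\, \mathrm{Im}(b_{kp})}{\partial y} = \mathrm{Im}(H_{kpij}), \\
\frac{\partial\, \mathrm{Re}(b_{kp})}{\partial y} &= \mathrm{Re}(H_{kpij}), \qquad
\frac{\partial\, \mathrm{Im}(b_{kp})}{\partial x} = \mathrm{Im}(G_{kpij}).
\end{aligned}
```

Substituting these derivatives into Eq. (72) reproduces Eq. (71). Under that condition H_kpij = j G_kpij, so that b_kp = F_kp + G_kpij (x + j y) = F_kp + G_kpij a_ij, which is linear in a_ij for this term.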

As we mentioned earlier, Eqs. (65)–(70) are generally valid only in the special case in which the nonlinear large-signal $S$ -parameters are constant for a given bias and fundamental drive level, as F_kp, G_kpij, and H_kpij are defined to be. Since this is not generally true, the formulations for nonlinear large-signal $S$ -parameters and nonlinear scattering functions are not equivalent.

We can draw a few important conclusions, however, after attempting to equate the two formulations. First, if G_kpij and H_kpij are allowed to be functions of higher harmonics, then only one of them, either G_kpij or H_kpij, or equivalently $S_{i j k l}$ , is required, since Eq. (70) shows that they are not independent. Second, if the nonlinear large-signal $S$ -parameters are complex constants for a given bias and fundamental drive level and are not functions of the higher harmonics, the parameters have the limitation that they cannot map circles into ellipses, but rather can only map circles into circles, as shown in Fig. 22. This is because $S_{i j k l}$ is a single complex constant rather than a pair of independent complex constants such as G_kpij and H_kpij. Thus, if $S_{i j k l}$ is not dependent upon higher harmonics, it acts like a linear S-parameter.

Fig. 22 — If $S_{i j k l}$ is a complex constant for a given bias and fundamental drive level, it has the limitation that it can only map circles into circles.
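The circle-to-circle limitation can be illustrated with a short numerical sketch, offered under stated assumptions rather than as the authors' procedure. Multiplication by a complex constant S corresponds, per Eqs. (66)–(69), to the pair G = S and H = jS; the equivalent real 2×2 matrix is a rotation scaled by |S|, so both of its singular values equal |S|. The helper name and the value of S are hypothetical.

```python
import numpy as np

def as_real_matrix(S):
    """Real 2x2 matrix for a -> S*a acting on (Re a, Im a); columns are G = S and H = j*S."""
    return np.array([[S.real, -S.imag],
                     [S.imag,  S.real]])

# Hypothetical constant S-parameter (assumed value, not taken from the paper).
S = 0.25 - 0.15j

# Both singular values equal |S|: the image of a circle has equal semi-axes.
axes = np.linalg.svd(as_real_matrix(S), compute_uv=False)

# Direct check: map the unit circle and inspect the radius of the image.
theta = np.linspace(0.0, 2.0 * np.pi, 360, endpoint=False)
b = S * np.exp(1j * theta)
radii = np.abs(b)

print("semi-axes:", axes)
print("|S|:", abs(S), " radius spread:", radii.max() - radii.min())
```

Because the two semi-axes coincide, a constant S can only scale and rotate the a_jl circle; producing an ellipse or any other shape requires S to vary with the higher harmonics.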

We have shown above that the two formulations are not equivalent. Nonlinear large-signal $S$ -parameters are more general than the nonlinear scattering functions, which are useful in approximating a specific class of nonlinearity in a more compact form. Nonlinear large-signal $S$ -parameters have the advantage of being able to map circles into any arbitrary shape, rather than being limited to ellipses.

Contributor Information

Jeffrey A. Jargon, Email: jargon@boulder.nist.gov.

Donald C. DeGroot, Email: degroot@boulder.nist.gov.

7. References

1. Sipila M, Lehtinen K, Porra V. High-frequency periodic time-domain waveform measurement system. IEEE Trans Microwave Theory Tech. 1988;36:1397–1405.
2. Lott U. Measurement of magnitude and phase of harmonics generated in nonlinear microwave two-ports. IEEE Trans Microwave Theory Tech. 1989;37:1506–1511.
3. Kompa G, Van Raay F. Error-corrected large-signal waveform measurement system combining network analyzer and sampling oscilloscope capabilities. IEEE Trans Microwave Theory Tech. 1990;38:358–365.
4. Verspecht J, Debie P, Barel A, Martens L. Accurate on wafer measurement of phase and amplitude of the spectral components of incident and scattered voltage waves at the signal ports of a nonlinear microwave device. 1995 IEEE MTT-S Int Microwave Symp Dig. 1995 May:1029–1032.
5. Verspecht J. Doctoral Dissertation. Vrije Universiteit Brussel; Belgium: 1995. Calibration of a measurement system for high-frequency nonlinear devices.
6. Verspecht J, Schreurs D, Barel A, Neuwelaers B. Black box modeling of hard nonlinear behavior in the frequency domain. IEEE MTT-S Int Microwave Symp Dig. 1996 Jun:1735–1738.
7. Verspecht J, Van Esch P. Accurately characterizing hard nonlinear behavior of microwave components with the nonlinear network measurement system: introducing ‘nonlinear scattering functions’; Proceedings of the 5th International Workshop on Integrated Nonlinear Microwave and Millimeterwave Circuits; Duisburg, Germany. Oct. 1998; pp. 17–26.
8. Jargon JA, DeGroot DC, Gupta KC, Cidronali A. Calculating ratios of harmonically related, complex signals with application to nonlinear large-signal scattering parameters; 60th ARFTG Conference Digest; Washington, DC. Dec. 2002; pp. 113–122.
9. Faber MT, Chramiec J, Adamski ME. Microwave and millimeter-wave diode frequency multipliers. Artech House; Boston, London: 1995.
10. Maas SA. The rf and microwave circuit design cookbook. Artech House; Boston, London: 1998.
11. Jargon JA, Gupta KC, Cidronali A, DeGroot DC. Expanding definitions of gain by taking harmonic content into account. Int J RF Microwave CAE. 2003;5:357–369.
12. Jargon JA, Gupta KC, Schreurs D, DeGroot DC. Developing frequency-domain models for nonlinear circuits based on large-signal measurements; URSI XXVIIth General Assembly; Maastricht, the Netherlands. Aug. 2002; CD-ROM.
13. Zhang QJ, Gupta KC. Neural networks for RF and microwave design. Artech House; Boston, London: 2000.
14. Zhang QJ, his neural network research team. NeuroModeler, ver. 1.2. Department of Electronics, Carleton University; Ottawa, Canada: 1999.
15. Fisher SD. Complex variables. Brooks/Cole Publishing Company; Monterey: 1986.

