Secure Outsourced Matrix Computation and Application to Neural Networks

Xiaoqian Jiang; Miran Kim; Kristin Lauter; Yongsoo Song

doi:10.1145/3243734.3243837

. Author manuscript; available in PMC: 2019 Aug 11.

Published in final edited form as: Conf Comput Commun Secur. 2018 Oct;2018:1209–1222. doi: 10.1145/3243734.3243837

Secure Outsourced Matrix Computation and Application to Neural Networks

Xiaoqian Jiang ¹, Miran Kim ², Kristin Lauter ³, Yongsoo Song ⁴

PMCID: PMC6689419 NIHMSID: NIHMS1043787 PMID: 31404438

Abstract

Homomorphic Encryption (HE) is a powerful cryptographic primitive to address privacy and security issues in outsourcing computation on sensitive data to an untrusted computation environment. Comparing to secure Multi-Party Computation (MPC), HE has advantages in supporting non-interactive operations and saving on communication costs. However, it has not come up with an optimal solution for modern learning frameworks, partially due to a lack of efficient matrix computation mechanisms.

In this work, we present a practical solution to encrypt a matrix homomorphically and perform arithmetic operations on encrypted matrices. Our solution includes a novel matrix encoding method and an efficient evaluation strategy for basic matrix operations such as addition, multiplication, and transposition. We also explain how to encrypt more than one matrix in a single ciphertext, yielding better amortized performance.

Our solution is generic in the sense that it can be applied to most of the existing HE schemes. It also achieves reasonable performance for practical use; for example, our implementation takes 9.21 seconds to multiply two encrypted square matrices of order 64 and 2.56 seconds to transpose a square matrix of order 64.

Our secure matrix computation mechanism has a wide applicability to our new framework E2DM, which stands for encrypted data and encrypted model. To the best of our knowledge, this is the first work that supports secure evaluation of the prediction phase based on both encrypted data and encrypted model, whereas previous work only supported applying a plain model to encrypted data. As a benchmark, we report an experimental result to classify handwritten images using convolutional neural networks (CNN). Our implementation on the MNIST dataset takes 28.59 seconds to compute ten likelihoods of 64 input images simultaneously, yielding an amortized rate of 0.45 seconds per image.

Keywords: Homomorphic encryption, matrix computation, machine learning, neural networks

1. INTRODUCTION

Homomorphic Encryption (HE) is an encryption scheme that allows for operations on encrypted inputs so that the decrypted result matches the outcome for the corresponding operations on the plaintext. This property makes it very attractive for secure outsourcing tasks, including financial model evaluation and genetic testing, which can ensure the privacy and security of data communication, storage, and computation [3, 46]. In biomedicine, it is extremely attractive due to the privacy concerns about patients’ sensitive data [28, 47]. Recently deep neural network based models have been demonstrated to achieve great success in a number of health care applications [36], and a natural question is whether we can outsource such learned models to a third party and evaluate new samples in a secure manner?

There are several different scenarios depending on who owns the data and who provides the model. Assuming a few different roles including data owners (e.g. hospital, institution or individuals), cloud computing service providers (e.g. Amazon, Google, or Microsoft), and machine learning model providers (e.g. researchers and companies), we can imagine the following situations: (1) the data owner trains a model and makes it available on a computing service provider to be used to make predictions on encrypted inputs from other data owners; (2) model providers encrypt their trained classifier models and upload them to a cloud service provider to make predictions on encrypted inputs from various data owners; and (3) a cloud service provider trains a model on encrypted data from some data owners and uses the encrypted trained model to make predictions on new encrypted inputs. The first scenario has been previously studied in CryptoNets [24] and subsequent follow-up work [7, 10]. The second scenario was considered by Makri et al. [35] based on Multi-Party Computation (MPC) using polynomial kernel support vector machine classification. However, the second and third scenarios with an HE system have not been studied yet. In particular, classification tasks for these scenarios rely heavily on efficiency of secure matrix computation on encrypted inputs.

1.1. Our Contribution

In this paper, we introduce a generic method to perform arithmetic operations on encrypted matrices using an HE system. Our solution requires O(d) homomorphic operations to compute a product of two encrypted matrices of size d ×d, compared to O(d²) of the previous best method. We extend basic matrix arithmetic to some advanced operations: transposition and rectangular matrix multiplication. We also describe how to encrypt multiple matrices in a single ciphertext, yielding a better amortized performance per matrix.

We apply our matrix computation mechanism to a new framework E2DM, which takes encrypted data and encrypted machine learning model to make predictions securely. This is the first HE-based solution that can be applied to the prediction phase of the second and third scenarios described above. As a benchmark of this framework, we implemented an evaluation of convolutional neural networks (CNN) model on the MNIST dataset [33] to compute ten likelihoods of handwritten images.

1.2. Technical Details

After Gentry’s first construction of a fully HE scheme [22], there have been several attempts to improve efficiency and flexibility of HE systems. For example, the ciphertext packing technique allows multiple values to be encrypted in a single ciphertext, thereby performing parallel computation on encrypted vectors in a Single Instruction Multiple Data (SIMD) manner. In the current literature, most of practical HE schemes [8, 9, 14, 19] support their own packing methods to achieve better amortized complexity of homomorphic operations. Besides component-wise addition and multiplication on plaintext vectors, these schemes provide additional functionalities such as scalar multiplication and slot rotation. In particular, permutations on plaintext slots enable us to interact with values located in different plaintext slots.

A naive solution for secure multiplication between two matrices of size d ×d is to use d² distinct ciphertexts to represent each input matrix (one ciphertext per one matrix entry) and apply pure SIMD operations (addition and multiplication) on encrypted vectors. This method consumes one level for homomorphic multiplication, but it takes O(d³) multiplications. Another approach is to consider a matrix multiplication as a series of matrix-vector multiplications. Halevi and Shoup [25] introduced a matrix encoding method based on its diagonal decomposition, putting the matrix in diagonal order and mapping each of them to a single ciphertext. So it requires d ciphertexts to represent the matrix and the matrix-vector multiplication can be computed using O(d) rotations and multiplications. Therefore, the matrix multiplication takes O(d²) complexity and has a depth of a single multiplication.

We propose an efficient method to perform matrix operations by combining HE-friendly operations on packed ciphertexts such as SIMD arithmetics, scalar multiplication, and slot rotation. We first define a simple encoding map that identifies an arbitrary matrix of size d ×d with a vector of dimension n = d² having the same entries. Let ʘ denote the component-wise product between matrices. Then matrix multiplication can be expressed as $A \cdot B = \sum_{i = 0}^{d - 1} A_{i} ⊙ B_{i}$ for some matrices A_i (resp. B_i) obtained from A (resp. B) by taking specific permutations. Figure 1 describes this equality for the case of d = 3. We remark that the initial matrix A₀ (resp. B₀) can be computed with O(d) rotations, and that for any 1 ≤ i < d the permuted matrix A_i (resp. B_i) can be obtained by O(1) rotations from the initial matrix. Thus the total computational complexity is bounded by O(d) rotations and multiplications. We refer to Table 1 for comparison of our method with prior work in terms of the number of input ciphertexts for a single matrix, complexity, and the required depth for implementation. We denote homomorphic multiplication and constant multiplication by Mult and CMult, respectively.

Figure 1: — Our matrix multiplication algorithm with d = 3

Table 1:

Comparison of secure d-dimensional matrix multiplication algorithms

Methodology	Number of ciphertexts	Complexity	Required depth
Naive method	d²	O(d³)	1 Mult
Halevi-Shoup [25]	d	O(d²)	1 Mult
Ours	1	O(d)	1 Mult + 2 CMult

Open in a new tab

Our basic solution is based on the assumption that a ciphertext can encrypt d² plaintext slots, but it can be extended to support matrix computation of an arbitrary size. When a ciphertext has more than d² plaintext slots, for example, we can encrypt multiple matrices in a single ciphertext and carry out matrix operations in parallel. On the other hand, if a matrix is too large to be encoded into one ciphertext, one can partition it into several sub-matrices and encrypt them individually. An arithmetic operation over large matrices can be expressed using block-wise operations, and the computation on the sub-matrices can be securely done using our basic matrix algorithms. We will use this approach to evaluate an encrypted neural networks model on encrypted data.

Our implementation is publicly available at https://github.com/K-miran/HEMat. It is based on an HE scheme of Cheon et al. [14], which is optimized in computation over the real numbers. For example, it took 9.21 seconds to securely compute the product of two matrices of size 64 × 64 and 2.56 seconds to transpose a single matrix of size 64 × 64. For the evaluation of an encrypted CNN model, we adapted a similar network topology to CryptoNets: one convolution layer and two fully connected (FC) layers with square activation function. This model is obtained from the keras library [15] by training 60,000 images of the MNIST dataset, and used for the classification of handwriting images of size 28 × 28. It took 28.59 seconds to compute ten likelihoods of encrypted 64 input images using the encrypted CNN model, yielding an amortized rate of 0.45 seconds per image. This model achieves a prediction accuracy of 98.1% on the test set.

2. PRELIMINARIES

The binary logarithm will be simply denoted by log(·). We denote vectors in bold, e.g. a, and every vector in this paper is a row vector. For a d₁ × d matrix A₁ and a d₂ × d matrix A₂, (A₁; A₂) denotes the (d₁ + d₂) × d matrix obtained by concatenating two matrices in a vertical direction. If two matrices A₁ and A₂ have the same number of rows, (A₁|A₂) denotes a matrix formed by horizontal concatenation. We let λ denote the security parameter throughout the paper: all known valid attacks against the cryptographic scheme under scope should take Ω(2^λ) bit operations.

2.1. Homomorphic Encryption

HE is a cryptographic primitive that allows us to compute on encrypted data without decryption and generate an encrypted result which matches that of operations on plaintext [6, 9, 14, 19]. So it enables us to securely outsource computation to a public cloud. This technology has great potentials in many real-world applications such as statistical testing, machine learning, and neural networks [24, 30, 31, 40].

Let $M$ and $C$ denote the spaces of plaintexts and ciphertexts, respectively. An HE scheme Π = (KeyGen, Enc, Dec, Eval) is a quadruple of algorithms that proceeds as follows:

KeyGen(1^λ). Given the security parameter λ, this algorithm outputs a public key pk, a public evaluation key evk and a secret key sk.
Enc_pk(m). Using the public key pk, the encryption algorithm encrypts a message $m \in M$ into a ciphertext $ct \in C$ .
Dec_sk(ct). For the secret key sk and a ciphertext ct, the decryption algorithm returns a message $m \in M$ .
Eval_evk(f; ct₁, … , ct_k). Using the evaluation key evk, for a circuit $f : M^{k} \to M$ and a tuple of ciphertexts (ct₁, … , ct_k), the evaluation algorithm outputs a ciphertext $c t^{'} \in C$ .

An HE scheme Π is called correct if the following statements are satisfied with an overwhelming probability:

Dec_sk(ct) = m for any $m \in M$ and ct ← Enc_pk(m).
Dec_sk(ct′) = f (m₁, … , m_k) with an overwhelming probability if ct′ ← Eval_evk(f ; ct₁… , ct_k) for an arithmetic circuit $f : M^{k} \to M$ and for some ciphertexts ${ct}_{1}, \dots, {ct}_{k} \in C$ such that Dec_sk(ct_i) = m_i.

An HE system can securely evaluate an arithmetic circuit f consisting of addition and multiplication gates. Throughout this paper, we denote by Add(ct₁, ct₂) and Mult_evk(ct₁, ct₂) the homomorphic addition and multiplication between two ciphertexts ct₁ and ct₂, respectively. In addition, we let CMult_evk(ct; u) denote the multiplication of ct with a scalar $u \in M$ For simplicity, we will omit the subscript of the algorithms when it is clear from the context.

2.2. Ciphertext Packing Technique

Ciphertext packing technique allows us to encrypt multiple values into a single ciphertext and perform computation in a SIMD manner. After Smart and Vercauteren [45] first introduced a packing technique based on polynomial-CRT, it has been one of the most important features of HE systems. This method represents a native plaintext space $M$ as a set of n-dimensional vectors in $R^{n}$ over a ring $R$ using appropriate encoding/decoding methods (each factor is called a plaintext slot). One can encode and encrypt an element of $R^{n}$ into a ciphertext, and perform component-wise arithmetic operations over the plaintext slots at once. It enables us to reduce the expansion rate and parallelize the computation, thus achieving better performance in terms of amortized space and time complexity.

However, the ciphertext packing technique has a limitation that it is not easy to handle a circuit with some inputs in different plaintext slots. To overcome this problem, there have been proposed some methods to move data in the slots over encryption. For example, some HE schemes [14, 23] based on the ring learning with errors (RLWE) assumption exploit the structure of Galois group to implement the rotation operation on plaintext slots. That is, such HE schemes include the rotation algorithm, denoted by Rot (ct; ℓ), which transforms an encryption ct of $m = (m_{0}, \dots, m_{n - 1}) \in M = R^{n}$ into an encryption of ρ(m; ℓ) := (m_ℓ, … ,m_n−1, m₀, … ,m_ℓ−1) Note that ℓ can be either positive or negative, and a rotation by (−ℓ) is the same as a rotation by (n − ℓ).

2.3. Linear Transformations

Halevi and Shoup [25] introduced a method to evaluate an arbitrary linear transformation on encrypted vectors. In general, an arbitrary linear transformation $L : R^{n} \to R^{n}$ over plaintext vectors can be represented as L : m ↦ U · m for some matrix $U \in R^{n \times n}$ We can express the matrix-vector multiplication by combining rotation and constant multiplication operations.

Specifically, for 0 ≤ ℓ < n, we define the ℓ-th diagonal vector of U by $u_{l} = (U_{0}, l, U_{1, l + 1}, \dots, U_{n - l - 1, n - 1}, U_{n - l}_{, 0}, \dots, U_{n - 1, l - 1}) \in R^{n}$ . Then we have

U \cdot m = \sum_{0 \leq l < n} (u_{l} ⊙ ρ (m; l))

(1)

where ʘ denotes the component-wise multiplication between vectors. Given a matrix $U \in R^{n \times n}$ and an encryption ct of the vector m, Algorithm 1 describes how to compute a ciphertext of the desired vector U · m.

As shown in Algorithm 1, the computational cost of matrix-vector multiplication is about n additions, constant multiplications, and rotations. Note that rotation operation needs to perform a key-switching operation and thus is comparably expensive than the other two operations. So we can say that the complexity is asymptotically O(n) rotations. It can be reduced when the number of nonzero diagonal vectors of U is relatively small.

3. SECURE MATRIX MULTIPLICATION VIA HOMOMORPHIC ENCRYPTION

In this section, we propose a simple encoding method to convert a matrix into a plaintext vector in a SIMD environment. Based on this encoding method, we devise an efficient algorithm to carry out basic matrix operations over encryption.

3.1. Permutations for Matrix Multiplication

We propose an HE-friendly expression of the matrix multiplication operation. For a d × d square matrix A = (A_i,j)_{0≤i, j<d}, we first define useful permutations σ, τ, ϕ, and ψ on the set $R^{d \times d}$ . For simplicity, we identify $ℤ \cap [0, d)$ as a representative of $ℤ_{d}$ and write [i]_d to denote the reduction of an integer i modulo d into that interval. All the indexes will be considered as integers modulo d.

σ(A)_{i, j} = A_{i, i+j}.
τ(A)_{i, j} = A_{i+j, j}.
ϕ(A)_{i, j} = A_{i, j+1}.
ψ(A)_{i, j} = A_{i+1, j}.

Note that ϕ and ψ represent the column and row shifting functions, respectively. Then for two square matrices A and B of order d, we can express their matrix product AB as follows:

A \cdot B = \sum_{k = 0}^{d - 1} (ϕ^{k} \circ σ (A)) ⊙ (ψ^{k} \circ τ (B)),

(2)

where ʘ denotes the component-wise multiplication between matrices. The correctness is shown in the following equality by computing the matrix component of the index (i, j):

\sum_{k = 0}^{d - 1} {(ϕ^{k} \circ σ (A))}_{i, j} \cdot {(ψ^{k} \circ τ (B))}_{i, j} = \sum_{k = 0}^{d - 1} σ {(A)}_{i, j + k} \cdot τ {(B)}_{i + k, j} = \sum_{k = 0}^{d - 1} A_{i, i + j + k} \cdot B_{i + j + k, j} = \sum_{k = 0}^{d - 1} A_{i, k} \cdot B_{k, j} = {(A \cdot B)}_{i, j} .

Since Equation (2) consists of permutations on matrix entries and the Hadamard multiplication operations, we can efficiently evaluate it using an HE system with ciphertext packing method.

3.2. Matrix Encoding Method

We propose a row ordering encoding map to transform a vector of dimension n = d² into a matrix in $R^{d \times d}$ For a vector a =(a_k)_0≤k<n, we define the encoding map $ι : R^{n} \to R^{d \times d}$ by

l : a \mapsto A = {(a_{d \cdot i + j})}_{0 \leq i, j < d},

i.e., a is the concatenation of row vectors of A. It is clear that ι(·) is an isomorphism between additive groups, which implies that matrix addition can be securely computed using homomorphic addition in a SIMD manner. In addition, one can perform multiplication by scalars by adapting a constant multiplication of an HE scheme. Throughout this paper, we identify two spaces $R^{n}$ and $R^{d \times d}$ with respect to the ι(·), so a ciphertext will be called an encryption of A if it encrypts the plaintext vector a = ι⁻¹(A).

3.3. Matrix Multiplication on Packed Ciphertexts

An arbitrary permutation operation on $R^{d \times d}$ can be understood as a linear transformation $L : R^{n} \to R^{n}$ such that n = d². In general, its matrix representation $U \in {0, 1}^{n \times n} \subseteq R^{n \times n}$ has n number of nonzero diagonal vectors. So if we directly evaluate the permutations A ↦ ϕ^k ◦ σ(A) and B ↦ ψ^k ◦ τ(B) for 1 ≤ k < d, each of them requires O(d²) homomorphic operations and thus the total complexity is O(d³). We provide an efficient algorithm to perform the matrix multiplication on packed ciphertexts by combining Equation (2) and our matrix encoding map.

3.3.1. Tweaks of Permutations.

We focus on the following four permutations σ, τ, ϕ, and ψ described above. We let U^σ, U^τ, V, and W denote the matrix representations corresponding to these permutations, respectively. Firstly, the matrix representations U^σ and U^τ of σ and τ are expressed as follows:

U_{d \cdot i + j, l}^{σ} = {\begin{array}{l} 1 & if l = d \cdot i + {[i + j]}_{d}; \\ 0 & otherwise; \end{array}

U_{d \cdot i + j, l}^{τ} = {\begin{array}{l} 1 & if l = d \cdot {[i + j]}_{d} + j; \\ 0 & otherwise, \end{array}

for 0 ≤ i, j < d and 0 ≤ ℓ < d². Similarly, for 1 ≤ k < d, the matrix representations of ϕ^k and ψ^k can be computed as follows:

V_{d \cdot i + j, l}^{k} = {\begin{array}{l} 1 & if l = d \cdot i + {[j + k]}_{d}; \\ 0 & otherwise; \end{array}

W_{d \cdot i + j, l}^{k} = {\begin{array}{l} 1 & if l = d \cdot {[i + k]}_{d} + j; \\ 0 & otherwise, \end{array}

for 0 ≤ i, j < d and 0 ≤ ℓ < d²

As described in Equation (1), we employ the diagonal decomposition of the matrix representations for multiplications with encrypted vectors. Let us count the number of diagonal vectors to estimate the complexity. We use the same notation u_ℓ to write the ℓ-th diagonal vector of a matrix U. For simplicity, we identify $u_{d^{2} - l}$ with u_−ℓ. The matrix U^σ has exactly (2d − 1) number of nonzero diagonal vectors, denoted by $u_{k}^{σ}$ for $k \in ℤ \cap (- d, d)$ . The ℓ-th diagonal vector of U^τ is nonzero if and only if ℓ is divisible by the integer d, so U^τ has d nonzero diagonal vectors. For any 1 ≤ k < d, the matrix V^k has two nonzero diagonal vectors v_k and v_k-d. Similarly, the matrix W^k has the only nonzero diagonal vector w_d·k. Therefore, homomorphic evaluations of the permutations σ andτ require O(d) rotations while it takes O(1) rotations to compute ψ^k or ϕ^k for any 1 ≤ k < d.

3.3.2. Homomorphic Matrix Multiplication.

Suppose that we are given two ciphertexts ct.A and ct.B that encrypt matrices A and B of size d × d, respectively. In the following, we describe an efficient evaluation strategy for homomorphic matrix multiplication.

Step 1–1: This step performs the linear transformation U^σ on the input ciphertext ct.A. As mentioned above, the matrixU^σ is a sparse matrix with (2d − 1) number of nonzero diagonal vectors $u_{k}^{σ}$ for $k \in ℤ \cap (- d, d)$ so we can represent the linear transformation as

U^{σ} \cdot a = \sum_{- d < k < d} (u_{k}^{σ} ⊙ ρ (a; k))

(3)

where $a = l^{- 1} (A) \in R^{n}$ is the vector representation of A. If k ≥ 0, the k-th diagonal vector is computed by

u_{k}^{σ} [l] = {\begin{array}{l} 1 & if 0 \leq l - d \cdot k < (d - k); \\ 0 & otherwise, \end{array}

where $u_{k}^{σ} [l]$ denotes the ℓ-th component of $u_{k}^{σ}$ In the other cases k < 0, it is computed by

u_{k}^{σ} [l] = {\begin{array}{l} 1 & if - k \leq l - (d + k) \cdot d < d; \\ 0 & otherwise. \end{array}

Then Equation (3) can be securely computed as

\sum_{- d < k < d} CMult (Rot (ct . A; k); u_{k}^{σ}),

resulting the encryption of the plaintext vector U^σ a, denoted by ct.A⁽⁰⁾. Thus, the computational cost is about 2d additions, constant multiplications, and rotations.

Step 1–2: This step is to evaluate the linear transformation U^τ on the input ciphertext ct.B. As described above, the matrix U^τ has d nonzero diagonal vectors so we can express this matrix-vector multiplication as

U^{τ} \cdot b = \sum_{0 \leq k < d} (u_{d \cdot k}^{τ} ⊙ ρ (b; d \cdot k)),

(4)

where b = ι⁻¹(B) and $u_{d \cdot k}^{τ}$ is the (d · k)-th diagonal vector of the matrix U^τ. We note that for any 0 ≤ k < d, the vector $u_{d \cdot k}^{τ}$ contains one in the (k + d · i)-th component for 0 ≤ i < d and zeros in all the other entries. Then Equation (4) can be securely computed as

\sum_{0 \leq k < d} CMult (Rot (ct . B; d \cdot k); u_{d \cdot k}^{τ}),

resulting the encryption of the plaintext vector U^τ · b, denoted by ct.B⁽⁰⁾. The complexity of this procedure is roughly half of the Step 1–1: d additions, constant multiplications, and rotations.

Step 2: This step securely computes the column and row shifting operations of σ(A) and τ(B), respectively. For 1 ≤ k < d, the column shifting matrix V^k has two nonzero diagonal vectors v_k and v_k−d that are computed by

v_{k} [l] = {\begin{array}{l} 1 & if 0 \leq {[l]}_{d} < (d - k); \\ 0 & otherwise; \end{array}

v_{k - d} [l] = {\begin{array}{l} 1 & if (d - k) \leq {[l]}_{d} < d; \\ 0 & otherwise. \end{array}

Then we get an encryption ct.A^(k) of the matrix ϕ^k ◦ σ (A) by adding two ciphertexts CMult(Rot(ct.A⁽⁰⁾;k)v_k); and CMult(Rot(ct.A⁽⁰⁾; k−d); v_k−d) In the case of the row shifting permutation, the corresponding matrixW^k has exactly one nonzero diagonal vector w_d·k whose entries are all one. Thus we can obtain an encryption of the matrix ψ^k ◦ τ(B) by computing ct.B^(k) ← Rot(ct.B⁽⁰⁾;d · k) The computational cost of this procedure is about d additions, 2d constant multiplications, and 3d rotations.

Step 3: This step computes the Hadamard multiplication between the ciphertexts ct.A^(k) and ct.B^(k)for 0 ≤ k < d, and finally aggregates all the resulting ciphertexts. As a result, we get an encryption ct.AB of the matrix AB. The running time of this step is d homomorphic multiplications and additions.

In summary, we can perform the homomorphic matrix multiplication operation as described in Algorithm 2.

3.3.

3.3.3. Further Improvements.

This implementation of matrix multiplication takes about 5d additions, 5d constant multiplications, 6d rotations, and d multiplications. The complexity of Steps 1–1 and 1–2 can be reduced by applying the idea of baby-step/giant-step algorithm. Given an integer k ∈ (−d,d), we can write $k = \sqrt{d} \cdot i + j$ for some $- \sqrt{d} < i < \sqrt{d}$ and $0 \leq j < \sqrt{d}$ . It follows from [26,·27] that Equation (3) can be expressed as

U^{σ} \cdot a = \sum_{\begin{array}{l} - \sqrt{d} < i < \sqrt{d} \\ 0 \leq j < \sqrt{d} \end{array}} (u_{\sqrt{d} \cdot i + j}^{σ} ⊙ ρ (a; \sqrt{d} \cdot i + j)) = \sum_{- \sqrt{d} < i < \sqrt{d}} ρ (\sum_{0 \leq j < \sqrt{d}} a_{i, j}; \sqrt{d} \cdot i)

where $a_{i, j} = ρ (u_{\sqrt{d} \cdot i + j}^{σ}; - \sqrt{d} \cdot i) ⊙ ρ (a; j)$ . We first compute encryptions of baby-step rotations ρ(a; j) for $0 \leq j < \sqrt{d}$ _. We use them to compute the ciphertexts of a_{i, j}’s using only constant multiplications. After that, we perform $\sqrt{d}$ additions, $\sqrt{d}$ constant multiplications, and a single rotation for each i. In total, Step 1–1 can be homomorphically evaluated with 2d additions, 2d constant multiplications, and $3 \sqrt{d}$ rotations. Step 1–2 can be computed in a similar way using d additions, d constant multiplications, and $2 \sqrt{d}$ rotations.

On the other hand, we can further reduce the number of constant multiplications in Step 2 by leveraging two-input multiplexers. The sum of ρ(v_k; −k) and ρ(v_k−d;d − k) generates a plaintext vector that has 1’s in all the slots, which implies that

CMult (Rot (ct . A^{(0)}; k - d); v_{k - d}) = Rot (CMult (ct . A^{(0)}; ρ (v_{k - d}; d - k)); k - d) = Rot (ct . A^{(0)} - CMult (ct . A^{(0)}; ρ (v_{k}, - k)); k - d) .

For each 1 ≤ k < d, we compute CMult(ct.A⁽⁰⁾; ρ(v_k,−k)). Then, using the fact that CMult(Rot(ct.A⁽⁰⁾;k);v_k) = Rot(CMult(ct.A⁽⁰⁾;ρ(v_k, −k)); k), we obtain the desired ciphertext ct.A^(k) with addition and rotation operations.

Table 2 summarizes the complexity and the required depth of each step of Algorithm 2 with the proposed optimization techniques.

Table 2:

Complexity and required depth of Algorithm 2

Step	Add	CMult	Rot	Mult	Depth
1–1	2d	2d	$3 \sqrt{d}$	-	1 CMult
1–2	d	d	$2 \sqrt{d}$	-	1 CMult
2	2d	d	3d	-	1 CMult
3	d	-	-	d	1 Mult
Total	6d	4d	$3 d + 5 \sqrt{d}$	d	1 Mult + 2 CMult

Open in a new tab

4. ADVANCED HOMOMORPHIC MATRIX COMPUTATIONS

In this section, we introduce a method to transpose a matrix over an HE system. We also present a faster rectangular matrix multiplication by employing the ideas from Algorithm 2. We can further extend our algorithms to parallel matrix computation without additional cost.

4.1. Matrix Transposition on Packed Ciphertexts

Let U^t be the matrix representation of the transpose map A ↦ A^t on $R^{d \times d} ≅ R^{n}$ For 0 ≤ i, j < d, its entries are given by

U_{d \cdot i + j, k}^{t} = {\begin{array}{l} 1 & if k = d \cdot j + i; \\ 0 & otherwise. \end{array}

The k-th diagonal vector of U^t is nonzero if and only if k = (d − 1)·i for some $i \in ℤ \cap (- d, d)$ , so the matrix U^t is a sparse matrix with (2d − 1) nonzero diagonal vectors. We can represent this linear transformation as

U^{t} \cdot a = \sum_{- d < i < d} (t_{(d - 1) \cdot i} ⊙ ρ (a; (d - 1) \cdot i))

where t_(d−1)·i denotes the nonzero diagonal vector of U^t. The ℓ-th component of the vector t_(d−1)·i is computed by

t_{(d - 1) \cdot i} [l] = {\begin{array}{l} 1 & if l - i = (d + 1) \cdot j, 0 \leq j < d - i; \\ 0 & otherwise, \end{array}

if i ≥ 0, or

t_{(d - 1) \cdot i} [l] = {\begin{array}{l} 1 & if l + i = (d + 1) \cdot j, 0 \leq j < d + i; \\ 0 & otherwise, \end{array}

if i < 0. The total computational cost is about 2d rotations and the baby-step/giant-step approach can be used to reduce the complexity; the number of automorphism can be reduced down to $3 \sqrt{d}$ _.

4.2. Rectangular Matrix Multiplication

In this section, we design an efficient algorithm for rectangular matrix multiplication such as $R^{l \times d} \times R^{d \times d} \to R^{l \times d} or R^{d \times d} \times R^{d \times l} \to R^{d \times l}$ For convenience, let us consider the former case that A has a smaller number of rows than columns (i.e., ℓ < d). A naive solution is to generate a square matrix by padding zeros in the bottom of the matrix A and perform the homomorphic matrix multiplication algorithm in Section 3.3, resulting in running time of O(d) rotations and multiplications. However, we can further optimize the complexity by manipulating its matrix multiplication representation using a special property of permutations described in Section 3.1.

4.2.1. Refinements of Rectangular Matrix Multiplication.

Suppose that we are given an ℓ × d matrix A and a d × d matrix B such that ℓ divides d. Since σ and ϕ are defined as row-wise operations, the restrictions to the rectangular matrix A are well-defined permutations on A. By abuse of notation, we use the same symbols σ and ϕ to denote the restrictions. We also use C_{ℓ₁:ℓ₂} to denote the (ℓ₂ − ℓ₁) ×d submatrix of C formed by extracting from ℓ₁-th row to the (ℓ₂ − 1)-th row of C. Then their matrix product AB has shape ℓ × d and it can be expressed as follows:

A \cdot B = \sum_{0 \leq k < d} (ϕ^{k} \circ σ (A)) ⊙ ({(ψ^{k} \circ τ (B))}_{0 : l}) = \sum_{0 \leq i < l} \sum_{0 \leq j < d / l} (ϕ^{j \cdot l + i} \circ σ (A)) ⊙ ({(ψ^{j \cdot l + i} \circ τ (B))}_{0 : l}) .

Our key observation is the following lemma, which gives an idea of a faster rectangular matrix multiplication algorithm.

Lemma 4.1. Two permutations σ and ϕ are commutative. In general, we have ϕ^k ◦ σ = σ ◦ ϕ^k for k > 0. Similarly, we obtain ψ^k ◦ τ = τ ◦ ψ^k for k > 0.

Now let us define a d × d matrix Ā containing (d/ℓ)copies of A in a vertical direction (i.e., Ā = (A; …;A)). Lemma 4.1 implies that

{(ϕ^{i} \circ σ (\bar{A}))}_{j \cdot l : (j + 1) \cdot l} = ϕ^{i} \circ (σ {(\bar{A})}_{j \cdot l : (j + 1) \cdot l}) = ϕ^{i} \circ σ \circ ϕ^{j \cdot l} (A) = ϕ^{j \cdot l + i} \circ σ (A) .

Similarly, using the commutative property of τ and ψ, it follows

{(ψ^{i} \circ τ (B))}_{j \cdot l : (j + 1) \cdot l} = {(ψ^{j \cdot l + i} \circ τ (B))}_{0 : l} .

Therefore, the matrix product AB is written as follows:

A \cdot B = \sum_{0 \leq j < d / l} {(\sum_{0 \leq i < l} (ϕ^{i} \circ σ (\bar{A})) ⊙ (ψ^{i} \circ τ (B)))}_{j \cdot l : (j + 1) \cdot l}

4.2.2. Homomorphic Rectangular Matrix Multiplication.

Suppose that we are given two ciphertexts ct.Ā and ct.B that encrypt matrices Ā and B, respectively. We first apply the baby-step/giant-step algorithm to generate the encryptions of σ(Ā) and τ (B) as in Section 3.3.3. Next, we can securely compute $\sum_{i = 0}^{l - 1} (ϕ^{i} \circ σ (\bar{A})) ⊙ (ψ^{i} \circ τ (B))$ in a similar way to Algorithm 2, say the output is ct.ĀB Finally, we perform aggregation and rotation operations to get the final result: $\sum_{j = 0}^{d / l - 1} Rot (ct . \bar{A} B; j \cdot l \cdot d)$ . This step can be evaluated using a repeated doubling approach, yielding a running time of log (d/ℓ) additions and rotations. See Algorithm 3 for an explicit description of homomorphic rectangular matrix multiplication.

4.2.

Table 3 summarizes the total complexity of Algorithm 3. Even though we need additional computation for Step 4, we can reduce the complexities of Step 2 and 3 to O(ℓ) rotations and ℓ multiplications, respectively. We also note that the final output ct.AB encrypts a d ×d matrix containing (d/ℓ) copies of the desired matrix product AB in a vertical direction.

Table 3:

Complexity of Algorithm 3

Step	Add	CMult	Rot	Mult
1	3d	3d	$5 \sqrt{d}$	-
2	ℓ	2ℓ	3ℓ	-
3	ℓ	-	-	ℓ
4	log(d/ℓ)	-	log(d/ℓ)
Total	3d + 2ℓ + log(d/ℓ)	3d + 2ℓ	$3 l + 5 \sqrt{d} + log (d / l)$	ℓ

Open in a new tab

This resulting ciphertext has the same form as a rectangular input matrix of Algorithm 3, so it can be reusable for further matrix computation without additional cost.

4.3. Parallel Matrix Computation

Throughout Sections 3 and 4, we have identified the message space $M = R^{n}$ with the set of matrices $R^{d \times d}$ under the assumption that n = d². However, most of HE schemes [9, 14, 19] have a quite large number of plaintext slots (e.g. thousands) compared to the matrix dimension in some real-world applications, i.e., n ≫ d². If a ciphertext can encrypt only one matrix, most of plaintext slots would be wasted. This subsection introduces an idea that allows multiple matrices to be encrypted in a single ciphertext, thereby performing parallel matrix computation in a SIMD manner.

For simplicity, we assume that n is divisible by d² and let g = n/d². We modify the encoding map in Section 3.2 to make a 1-to-1 correspondence ι_g between $R^{n}$ and ${(R^{d \times d})}^{g}$ , which transforms an n-dimensional vector a g-tuple of square matrices of order d. Specifically, for an input vector a = (a_ℓ)_0≤ℓ<n, we define ι_g by

ι_{g} : a \mapsto {(A_{k} = (a_{g \cdot (d \cdot i + j) + k}))}_{0 \leq k < g} .

The components of a with indexes congruent to k modulo g are corresponding to the k-th matrix A_k.

We note that for an integer 0 ≤ ℓ < d², the rotation operation ρ(a;g · ℓ) represents the matrix-wise rotation by ℓ positions. It can be naturally extended to the other matrix-wise operations including scalar linear transformation and matrix multiplication. For example, we can encrypt g number of d × d matrices into a single ciphertext and perform the matrix multiplication operations between g pairs of matrices at once by applying our matrix multiplication algorithm on two ciphertexts. The total complexity remains the same as Algorithm 2, which results in a less amortized computational complexity of O(d/g) per matrix.

5. IMPLEMENTATION OF HOMOMORPHIC MATRIX OPERATIONS

In this section, we report the performance of our homomorphic matrix operations and analyze the performance of the implementation. For simplicity, we assume that d is a power-of-two integer. In general, we can pad zeros at the end of each row to set d as a power of two.

In our implementation, we employ a special encryption scheme suggested by Cheon et al. [14], which supports approximate computation over encrypted data, called HEAAN. A unique property of the HE scheme is the rescaling procedure to truncate a ciphertext into a smaller modulus, which leads to rounding of the plaintext. This plays an important role in controlling the magnitude of messages, and thereby achieving efficiency of approximate computation. For further details, we refer to [14] and [12].

All the experiments were performed on a Macbook Pro laptop with an Intel Core i7 running with 4 cores rated at 2.9 GHz. Our implementation is based on the HEAAN library [13] with C++ based Shoup’s NTL library [43].

5.1. Parameter Setting

We present how to select parameters of our underlying HE scheme to support homomorphic matrix operations described in Section 3 and 4. Our underlying HE scheme is based on the RLWE assumption over the cyclotomic ring $R = ℤ [X] / (X^{N} + 1)$ for a power-of-two integer N. Let us denote by [·]_q the reduction modulo q into the interval $(- q / 2, q / 2] \cap ℤ$ of the integer. We write $R_{q} = R / q R$ for the residue ring of $R$ modulo an integer q. The native plaintext space is represented as a set of (N/2)-dimensional complex vectors.

Suppose that all the elements are scaled by a factor of an integer p and then converted into the nearest integers for quantization. If we are multiplying a ciphertext by a plaintext vector, we assume that the constant vector is scaled by a factor of an integer p_c to maintain the precision. Thus, the rescaling procedure after homomorphic multiplication reduces a ciphertext modulus by p while the rescaling procedure after a constant multiplication reduces a modulus by p_c. For example, Algorithm 2 has depth of two constant multiplications for Step 1 and 2, and has additional depth of a single homomorphic multiplication for Step 3. This implies that an input ciphertext modulus is reduced by (2 logp_c + logp bits after the matrix multiplication algorithm. As a result, we obtain the following lower bound on the bit length of a fresh ciphertext modulus, denoted by logq:

log q = {\begin{array}{l} log q_{0} & for HE-MatAdd; \\ 2 log p_{c} + log p + log q_{0} & for HE-MatMult; \\ log p_{c} + log q_{0} & for HE-MatTrans, \end{array}

where q₀ is the output ciphertext modulus. The final ciphertext represents the desired vector but is scaled by a factor of p, which means that logq₀ should be larger than logp. In our implementation, we take logp = 24, logp_c = 15, and logq₀ = 32.

We use the discrete Gaussian distribution of standard deviation σ = 3.2 to sample error polynomials. The secret-key polynomials were sampled from the discrete ternary uniform distribution over {0, ±1}^N. The cyclotomic ring dimension is chosen as N = 2¹³ to achieve a 80-bit security level against the known attacks of the LWE problem based on the estimator of Albrecht et al. [2]. In short, we present three parameter sets and sizes of the corresponding fresh ciphertexts as follows:

(N, q, size) = {\begin{array}{l} (2^{13}, 2^{32}, 64 KB) & for HE-MatAdd; \\ (2^{13}, 2^{86}, 172 KB) & for HE-MatMult; \\ (2^{13}, 2^{47}, 94 KB) & for HE-MatTrans. \end{array}

5.2. Performance of Matrix Operations

Table 4 presents timing results for matrix addition, multiplication, and transposition for various matrix sizes from 4 to 64 where the throughput means the number of matrices being processed in parallel. We provide three distinct implementation variants: single-packed, sparsely-packed, and fully-packed. The single-packed implementation is that a ciphertext represents only a single matrix; two other implementations are to encode several matrices into sparsely or fully packed plaintext slots. We use the same parameters for all variants, and thus each ciphertext can hold up to N/2 = 2¹² plaintext values. For example, if we consider 4 × 4 matrices, we can process operations over 2¹²/(4 · 4 = 256 distinct matrices simultaneously. In the case dimension of 16, a ciphertext can represent up to 2¹²/(16 · 16) = 16 distinct matrices.

Table 4:

Benchmarks of homomorphic matrix operations

Dim	Throughput	Message size (KB)	Expansion rate	Encoding+ Encryption (s)	Decoding+ Decryption (s)	Amortized time per matrix
Dim	Throughput	Message size (KB)	Expansion rate	Encoding+ Encryption (s)	Decoding+ Decryption (s)	HE-MatAdd (ms)	HE-MatMult (s)	HE-MatTrans (s)
4	1	0.47	3670	0.034	0.009	0.622	0.779	0.363
	16	0.75	229	0.041	0.012	0.046	0.047	0.018
	256	12.0	14	0.095	0.081	0.025	0.003	0.001
16	1	0.75	229	0.033	0.013	0.622	2.501	0.847
	4	3.0	57	0.048	0.027	0.188	0.649	0.211
	16	12.0	14	0.097	0.078	0.043	0.162	0.049
64	1	12.0	14	0.108	0.076	0.622	9.208	2.557

Open in a new tab

5.2.1. Ciphertext Sizes.

As mentioned above, a ciphertext could hold 2¹² different plaintext slots, and thus we can encrypt one 64 × 64 matrix into a fully-packed ciphertext. We assumed that all the inputs had logp = 24 bits of precision, which means that an input matrix size is bounded by 64 × 64 × 24 bits or 12 KB. Since a single ciphertext is at most 172 KB for an evaluation of matrix multiplication, the encrypted version is 172/12 ≈ 14 times larger than the unencrypted version. In Table 4, the third column gives the size of input matrices and the fourth column gives an expansion ratio of the generated ciphertext to the input matrices.

5.2.2. Timing Results.

We conducted experiments over ten times and measured the average running times for all the operations. For the parameter setting in Section 5.1, the key generation takes about1.277 seconds. In Table 4, the fifth column gives timing for encoding input matrices and encrypting the resulting plaintext slots. Since matrix multiplication requires the largest fresh ciphertext modulus and takes more time than others, we just report the encryption timing results for the case. In the sixth column, we give timing for decrypting the output ciphertext and decoding to its matrix form. Note that encryption and decryption timings are similar each other; but encoding and decoding timings depend on the throughput. The last three columns give amortized time per matrix for homomorphic matrix computation. The entire execution time, called latency, is similar between the three variant implementations so the parallel matrix computation provides roughly a speedup as a factor of the throughput.

5.3. Performance of Rectangular Matrix Multiplication

We present the performance of Algorithm 3 described in Section 4.2. As a concrete example, we consider the rectangular matrix multiplication $ℝ^{16 \times 64} \times ℝ^{64 \times 64} \to ℝ^{16 \times 64}$ . As we described above, our optimized method has a better performance than a simple method exploiting Algorithm 2 for the multiplication between 64 × 64 matrices.

To be precise, the first step of Algorithms 2 or 3 generates two ciphertexts ct.A⁽⁰⁾ and ct.B⁽⁰⁾ by applying the linear transformations of U^σ and U^τ, thus both approaches have almost the same computational complexity. Next, in the second and third steps, two algorithms apply the same operations to the resulting ciphertexts but with different numbers: Algorithm 2 requires approximately four times more operations compared to Algorithm 3. As a result, Step 2 turns out to be the most time consuming part in Algorithm 2, whereas it is not the dominant procedure in Algorithm 3. Finally, Algorithm 3 requires some additional operations for Step 4, but we need only log(64/16) = 2 automorphisms.

Table 5 summarizes an experimental result based on the same parameter as in the above section. The total running times of two algorithms are 9.21 seconds and 4.29 seconds, respectively; therefore, Algorithm 3 achieves a speedup of 2X compared to Algorithm 2.

Table 5:

Performance comparison of homomorphic square and rectangular matrix multiplications

Algorithm	Step 1	Step 2	Step 3	Step 4	Total
HE-MatMult	2.36s	5.70s	1.16s	-	9.21s
HE-RMatMult	2.36s	1.53s	0.35s	0.05s	4.29s

Open in a new tab

6. E2DM: MAKING PREDICTION BASED ON ENCRYPTED DATA AND MODEL

In this section, we propose a novel framework E2DM to test encrypted convolutional neural networks model on encrypted data. We consider a new service paradigm where model providers offer encrypted trained classifier models to a public cloud and the cloud server provides on-line prediction service to data owners who uploaded their encrypted data. In this inference, the cloud should learn nothing about private data of the data owners, nor about the trained models of the model providers.

6.1. Neural Networks Architecture

The first viable example of CNN on image classification was AlexNet by Krizhevsky et al. [32] and it was dramatically improved by Simonyan et al. [44]. It consists of a stack of linear and non-linear layers. The linear layers can be convolution layers or FC layers. The non-linear layers can be max pooling (i.e., compute the maximal value of some components of the feeding layer), mean pooling (i.e., compute the average value of some components of the feeding layer), ReLu functions, or sigmoid functions.

Specifically, the convolution operator forms the fundamental basis of the convolutional layer. The convolution has kernels, or windows, of size k × k, a stride of (s, s), and a mapcount of h. Given an image $I \in ℝ^{w \times w}$ , and a kernel $K \in ℝ^{k \times k}$ , we compute the convolved image $Conv (I, K) \in ℝ^{d_{K} \times d_{K}}$ by

Conv {(I, K)}_{i^{'}, j^{'}} = \sum_{0 \leq i, j < k} K_{i, j} \cdot I_{s \cdot i^{'} + i, s \cdot j^{'} + j}

for 0 ≤ i′, j′ < d_K = ⌈(w − k)/s⌉ + 1. Here ⌈·⌉returns the least integer greater than or equal to the input. It can be extended to multiple kernels $K = {K^{(k)}}_{0 \leq k < h}$ as

Conv (I, K) = (Conv (I, K^{(0)}), \dots, Conv (I, K^{(h - 1)})) \in ℝ^{d_{K} \times d_{K} \times h} .

On the other hand, FC layer connects n_I nodes to n_O nodes, or equivalently, it can be specified by the matrix-vector multiplication of an n_O × n_I matrix. Note that the output of convolution layer has a form of tensor so it should be flatten before FC layer. Throughout this paper, we concatenate the rows of the tensor one by one and output a column vector, denoted by FL(·).

6.2. Homomorphic Evaluation of CNN

We present an efficient strategy to evaluate CNN prediction model on the MNIST dataset. Each image is a 28 × 28 pixel array, where the value of each pixel represents a level of gray. After an arbitrary number of hidden layers, each image is labeled with 10 possible digits. The training set has 60,000 images and the test set has 10,000 images. We assume that a neural network is trained with the plaintext dataset in the clear. We adapted a similar network topology to CryptoNets: one convolution layer and two FC layers with square activation function. Table 6 describes our neural networks to the MNIST dataset and summarizes the hyperparameters.

Table 6:

Description of our CNN to the MNIST dataset

Layer	Description
Convolution	Input image 28 × 28, kernel size 7 × 7, stride size of 3, number of output channels 4
1^st square	Squaring 256 input values
FC-1	Fully connecting with 256 inputs and 64 outputs: $ℝ^{64 \times 256} \times ℝ^{256 \times 1} \to ℝ^{64 \times 1}$
2^nd square	Squaring 64 input values
FC-2	Fully connecting with 64 inputs and 10 outputs: $ℝ^{10 \times 64} \times ℝ^{64 \times 1} \to ℝ^{10 \times 1}$

Open in a new tab

The final step of neural networks is usually to apply the softmax activation function for a purpose of probabilistic classification. We note that it is enough to obtain an index of maximum values of outputs in a prediction phase.

In the following, we explain how to securely test encrypted model on encrypted multiple data. In our implementation, we take N = 2¹³ as a cyclotomic ring dimension so each plaintext vector is allowed to have dimension less than 2¹² and one can predict 64 images simultaneously in a SIMD manner. We describe the parameter selection in more detail below.

6.2.1. Encryption of Images.

At the encryption phase, the data owner encrypts the data using the public key of an HE scheme. Suppose that the data owner has a two-dimensional image $I \in ℝ^{28 \times 28}$ . For 0 ≤ i′, j′ < d_K = 8, let us define an extracted image feature I[i′, j′] formed by taking the elements I_{3·i′+i,3·j′+j} for 0 ≤ i, j < 7. That is, a single image can be represented as the 64 image features of size 7×7. It can be extended to multiple images $I = {I^{(k)}}_{0 \leq k < 64}$ . For each 0 ≤ i, j < 7, the dataset is encoded into a Imatrix consisting of the (i, j)-th components of 64 features over 64 images and it is encrypted as follows:

ct. I_{i, j} = Enc [\begin{matrix} I^{(0)} {[0, 0]}_{i, j} & I^{(1)} {[0, 0]}_{i, j} & \dots & I^{(63)} {[0, 0]}_{i, j} \\ I^{(0)} {[0, 1]}_{i, j} & I^{(1)} {[0, 1]}_{i, j} & \dots & I^{(63)} {[0, 1]}_{i, j} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ I^{(0)} {[7, 7]}_{i, j} & I^{(1)} {[7, 7]}_{i, j} & \dots & I^{(63)} {[7, 7]}_{i, j} \end{matrix}] .

The resulting ciphertexts {ct.I_i,j}_0≤i,j<7 are sent the public cloud and stored in their encrypted form.

6.2.2. Encryption of Trained Model.

The model provider encrypts the trained prediction model values such as multiple convolution kernels’ values $K = {K^{(k)}}_{0 \leq k < 4}$ and weights (matrices) of FC layers. provider begins with a procedure for encrypting each of the convolution kernels separately. For 0 ≤ i, j < 7 and 0 ≤ k < 4, the (i, j)-th component of the kernel matrix K^(k) is copied into plaintext slots and the model provider encrypts the plaintext vector into a ciphertext, denoted by $ct . K_{i, j}^{(k)}$ .

Next, the first FC layer is specified by a 64 × 256 matrix and it can be divided into four square sub-matrices of size 64 × 64. For 0 ≤ k < 4, we write W_k to denote the k-th sub-matrix. Each matrix is encrypted into a single ciphertext using the matrix encoding method in Section 3.2, say the output ciphertext ct.W_k.

For the second FC layer, it can be expressed by a 10 × 64 matrix. The model provider pads zeros in the bottom to obtaina matrix V of size 16 × 64 and then generates a 64 × 64 matrix $\bar{V}$ containing four copies of V vertically, say the output ciphertext ct.V. Finally, the model provider transmits three distinct types of ciphertexts to the cloud: $ct . K_{i, j}^{(k)}$ , ct.W_k, and ct.V.

6.2.3. Homomorphic Evaluation of Neural Networks.

At the prediction phase, the public cloud takes ciphertexts of the images from the data owner and the neural network prediction model from the model provider. Since the data owner uses a SIMD technique to batch 64 different images, the first FC layer is specified as a matrix multiplication: $ℝ^{64 \times 256} \times ℝ^{256 \times 64} \to ℝ^{64 \times 64}$ . Similarly, the second FC layer is represented as a matrix multiplication: $ℝ^{10 \times 64} \times ℝ^{64 \times 64} \to ℝ^{10 \times 64}$ .

Homomorphic convolution layer.

The public cloud takes the ciphertexts ct.I_i,j and $ct . K_{i, j}^{(k)}$ for 0 ≤ i, j < 7 and 0 ≤ k < 4. We apply pure SIMD operations to efficiently compute dot-products between the kernel matrices and the extracted image features. For each 0 ≤ k < 4, the cloud performs the following computation on ciphertexts:

ct. C_{k} \leftarrow \sum_{0 \leq i, j < 7} Mult (ct. I_{i, j}, ct. K_{i, j}^{(k)}) .

By the definition of the convolution, the resulting ciphertext ct_k represents a square matrix C_k of order 64 such that

C_{k} = [\begin{matrix} ⋮ & ⋮ \\ FL (Conv (I^{(0)}, K^{(k)})) & \dots FL (Conv (I^{(63)}, K^{(k)})) \\ ⋮ & ⋮ \end{matrix}] .

That is, it is an encryption of the matrix C_k having the i-th column as the flatten convolved result between the the i-th image I⁽ⁱ⁾ and the k-th kernel K^(k).

The first square layer.

This step applies the square activation function to all the encrypted output images of the convolution in a SIMD manner. That is, for each 0 ≤ k < 4, the cloud computes as follows:

ct . S_{k}^{(1)} \leftarrow SQR (ct . C_{k})

where SQR(·) denotes the squaring operation of an HE scheme. Note that $ct . S_{k}^{(1)}$ is an encryption of the matrix C_k ʘ C_k.

The FC-1 layer.

This procedure requires a matrix multiplication between a 64 × 256 weight matrix W =(W₀|W₁|W₂|W₃)and a 256 × 64 input matrix C = (C₀ ʘ C₀; C₁ ʘ C₁; C₂ ʘ C₂; C₃ ʘ C₃). The matrix product W · C is formed by combining the blocks in the same way, that is,

W \cdot C = \sum_{0 \leq k < 4} (W_{k} \cdot (C_{k} ⊙ C_{k})) .

Thus the cloud performs the following computation:

ct. F \leftarrow \sum_{0 \leq k < 4} HE-MatMult(ct. W_{k}, ct. S_{k}^{(1)}) .

The second square layer.

This step applies the square activation function to all the output nodes of the first FC layer:

ct . S^{(2)} \leftarrow SQR (ct . F) .

The FC-2 layer.

This step performs the rectangular multiplication algorithm between the weight ciphertext ct.V and the output ciphertext ct.S⁽²⁾ of the second square layer:

ct.out \leftarrow HE-RMatMult(ct. V, ct. S^{(2)}) .

6.2.4. The Threat Model.

Suppose that one can ensure the IND-CPA security of an underlying HE scheme, which means that ciphertexts of any two messages are computationally indistinguishable. Since all the computations on the public cloud are performed over encryption, the cloud learns nothing from the encrypted data so we can ensure the confidentiality of the data against such a semi-honest server.

6.3. Performance Evaluation of E2DM

We evaluated our E2DM framework to classify encrypted handwritten images of the MNIST dataset. We used the library keras [15] with Tensorflow [1] to train the CNN model from 60,000 images of the dataset by applying the ADADELTA optimization algorithm [50].

6.3.1. Optimization Techniques.

Suppose that we are given an encryption ct.A of a d × d matrix A. Recall from Section 3 that we apply homomorphic liner transformations to generate the encryption ct.A^(ℓ) of a matrix ϕ^ℓ ◦ σ(A) for 0 ≤ ℓ < d. Sometimes one can pre-compute ϕ^ℓ ◦ σ(A) in the clear and generate the corresponding ciphertexts for free. Thus this approach gives us a space/time tradeoff: although it requires more space for d ciphertexts rather than a single ciphertext, it reduces the overhead of rotation operations from $(3 d + 5 \sqrt{d})$ to $(d + 2 \sqrt{d})$ , achieving a better performance. This method has another advantage, in that an input ciphertext modulus is reduced by (logp + logp_c) bits after matrix multiplication while (logp + 2 logp_c) in the original method. This is because the encryptions of ϕ^k ◦ σ(A) are given as fresh ciphertexts and it only requires additional depths to generate the encryptions of ψ^k ◦ τ(B).

We can apply this idea to the FC layers. For each 0 ≤ k < 4 and 0 ≤ ℓ < 64, the model provider generates a ciphertext $ct . W_{k}^{(l)}$ representing the matrix ϕ^ℓ ◦ σ(W_k) of the first FC layer. For the second FC layer, the provider generates an encryption ct.V ^(ℓ) of the matrix $ϕ^{l} \circ σ (\bar{V})$ for 0 ≤ ℓ < 16.

6.3.2. Parameters.

The convolution layer and the square activation layers have a depth of one homomorphic multiplication. As discussed before, the FC layers have depth of one homomorphic multiplication and one constant multiplication by applying the precomputation optimization technique. Therefore, the lower bound on the bit length of a fresh ciphertext modulus is 5 logp + 2 logp_c + logq₀. In our implementation, we assume that all the inputs had logp = 24 bits of precision and set the bit length of the output ciphertext modulus as logq₀ = logp + 8. In addition, we set logp_c = 15 for the bit precision of constant values. We could actually obtain the bit length of the largest ciphertext modulus around 182 and took the ring dimension N = 2¹³ to ensure 80 bits of security. This security was chosen to be consistent with other performance number reported from CryptoNets. Note that a fresh ciphertext has 0.355 MB under this parameter setting.

6.3.3. Ciphertext Sizes.

Each image is a 28 × 28 pixel array, where each pixel is in the range from 0 to 255. The data owner first chooses 64 images in the MNIST dataset, normalizes the data by dividing by the maximum value 255, and generates the ciphertexts {ct.I_i,j}_0≤i,j<7. The total size of ciphertexts is 0.355 × 49 ≈ 17.417 MB and a single ciphertext contains informations of 64 images, and therefore the total ciphertext size per image is 17.417/64 ≈ 0.272 MB or 278 KB. Since each image has approximately 28 × 28 × 24 bits, it is 121 times smaller than the encrypted one. Meanwhile, the model provider generates three distinct types of ciphertexts:

$ct . K_{i, j}^{(k)}$ for 0 ≤ i, j < 7 and 0 ≤ k < 4;
$ct . W_{k}^{(l)}$ for 0 ≤ k < 4 and 0 ≤ ℓ < 64;
ct.V^(ℓ) for 0 ≤ ℓ < 16.

The total size of ciphertexts is 0.355 × 468 ≈ 166.359 MB. After the homomorphic evaluation of E2DM, the cloud sends only a single ciphertext to an authority who is the legitimate owner of the secret key of the HE scheme. The ciphertext size is around 0.063 MB and the size per image is 0.063/64 MB ≈ 1 KB. Table 7 summarizes the numbers in the second and third columns.

Table 7:

Ciphertext sizes of E2DM

	Ciphertext size	Size per instance
Data owner → Cloud	17.417 MB	278 KB
Model provider → Cloud	166.359 MB	-
Cloud → Authority	0.063 MB	1 KB

Open in a new tab

6.3.4. Implementation Details.

The key generation takes about 1.38 seconds for the parameters setting in Section 6.3.2. The data owner takes about 1.56 seconds to encrypt 64 different number of images. Meanwhile, the model provider takes about 12.33 seconds to generate the encrypted prediction models. This procedure takes more time than the naive method but it is an one-time process before data outsourcing and so it is a negligible overhead.

In Table 8, we report timing results for the evaluation of E2DM. The third column gives timings for each step and the fourth column gives the relative time per image (if applicable). The dominant cost of evaluating the framework is that of performing the first FC layer. This step requires four matrix multiplication operations over encrypted 64×64 matrices so it expects to take about 9.21 ×4 ≈ 36.84 seconds from the result of Table 4. We further take advantage of the pre-computation method described in Section 6.3.1, and thereby it only took about 20.79 seconds to evaluate the layer (1.8 times faster). Similarly, we could apply this approach to the second FC layer, which leads to 1.97 seconds for the evaluation. In total, it took about 28.59 seconds to classify encrypted images from the encrypted training model, yielding an amortized rate of 0.45 seconds per image.

Table 8:

Performance results of E2DM for MNIST

	Stage	Latency (s)	Amortized time per image (ms)
Data owner	Encoding + Encryption	1.56	24.42
Model provider	Encoding + Encryption	12.33	-
Cloud	Convolution	5.68	88.75
	1^st square	0.10	1.51
	FC-1	20.79	324.85
	2^nd square	0.06	0.96
	FC-2	1.97	30.70
	Total	28.59	446.77
Authority	Decoding + Decryption	0.07	1.14

Open in a new tab

After the evaluation, the cloud returns only a single packed ciphertext that is transmitted the authority. Then the output can be decrypted with the secret key and the authority computes the argmax of 10 scores for each image to obtain the prediction. These procedures take around 0.07 seconds, yielding an amortized time of 1.14 milliseconds per image. In the end, the data owner gets the results from the authority.

This model achieves an accuracy of 98.1% on the test set. The accuracy is the same as the one obtained by the evaluation of the model in the clear, which implies that there is no precision loss from the approximate homomorphic encryption.

6.4. Comparison with Previous Work

Table 9 compares our benchmark result for the MNIST dataset with the state-of-the-art frameworks: CryptoNets [24], MiniONN [34], and GAZELLE [29]. The first column indicates the framework and the second column denotes the method used for preserving privacy. The last columns give running time and communication costs required for image classification.

Table 9:

MNIST Benchmark of privacy-preserving neural network frameworks

Framework	Methodology	Runtime (s)				Communication (MB)
Framework	Methodology	Offline	Online	Total	Amortized	Offline	Online	Total	Cost per instance
CryptoNets	HE	-	-	570	0.07	-	-	595.5	0.07
MiniONN	HE, MPC	0.88	0.40	1.28	1.28	3.6	44	47.6	47.6
GAZELLE	HE, MPC	0	0.03	0.03	0.03	0	0.5	0.5	0.5
E2DM	HE	-		28.59	0.45	-	-	17.48	0.27

Open in a new tab

6.4.1. HE-based Frameworks.

We used a similar network topology to CryptoNets (only different numbers of nodes in the hidden layers) but considered different scenario and underlying cryptographic primitive. CryptoNets took 570 seconds to perform a single prediction, yielding in an amortized rate of 0.07 seconds. In our case, data is represented in a matrix form and applied to the evaluation of neural networks using homomorphic matrix operations. As a result, E2DM achieves 20-fold reduction in latency and 34-fold reduction in message sizes. CryptoNets allows more SIMD parallelism, so it could give better amortized running time. However, this implies that CryptoNets requires a very large number of prediction to yield better amortized complexity, so its framework turns out to be less competitive in practice.

6.4.2. Mixed Protocol Frameworks.

Liu et al. [34] presented MiniONN framework of privacy-preserving neural networks by employing a ciphertext packing technique as a pre-processing tool. Recently, Juvekar et al. [29] presented GAZELLE that deploys automorphism structure of an underlying HE scheme to perform matrix-vector multiplication, thereby improving the performance significantly. It took 30 milliseconds to classify one image from the MNIST dataset and has an online bandwidth cost of 0.5 MB. Even though these mixed protocols achieve relatively fast run-time, they require interaction between protocol participants, resulting in high bandwidth usage.

7. RELATED WORKS

7.1. Secure Outsourced Matrix Computation

Matrix multiplication can be performed using a series of inner products. Wu and Haven [48] suggested the first secure inner product method in a SIMD environment. Their approach is to encrypt each row or column of a matrix into an encrypted vector and obtain component-wise product of two input vectors by performing a single homomorphic multiplication. However, it should aggregate all the elements over the plaintext slots in order to get the desired result and this procedure requires at least logd automorphisms. Since one can apply this solution to each row of A and each column of B, the total complexity of secure matrix multiplication is about d² multiplications and d² logd automorphisms.

Recently, several other approaches have been considered by applying the encoding methods of Lauter et al. [40] and Yasuda et al. [49] on an RLWE-based HE scheme. Duong et al. [18] proposed a method to encode a matrix as a constant polynomial in the native plaintext space. Then secure matrix multiplication requires only one homomorphic multiplication over packed ciphertexts. This method was later improved in [37]. However, this solution has a serious drawback for practical use: the resulting ciphertext has non-meaningful terms in its coefficients, so for more computation, it should be decrypted and re-encoded by removing the terms in the plaintext polynomial.

Most of related works focus on verifiable secure outsourcing of matrix computation [4, 11, 20, 38]. In their protocols, a client delegates a task to an untrusted server and the server returns the computation result with a proof of the correctness of the computation. There are general results [16, 20, 21] of verifiable secure computation outsourcing by applying a fully HE scheme with Yao’s Garbled circuit or pseudo-random functions. However, it is still far from practical to apply these theoretical approaches to real-world applications.

7.2. Privacy-preserving Neural Networks Predictions

Privacy-preserving deep learning prediction models were firstly considered by Gilad-Bachrach et al. [24]. The authors presented the private evaluation protocol CryptoNets for CNN. A number of subsequent works have improved it by normalizing weighted sums prior to applying the approximate activation function [10], or by employing a fully HE to apply an evaluation of an arbitrary deep neural networks [7].

There are other approaches for privacy-preserving deep learning prediction based on MPC [5, 41] or its combination with (additively) HE. The idea behind such hybrid protocols is to evaluate scalar products using HE and compute activation functions (e.g. threshold or sigmoid) using MPC technique. Mohassel and Zhang [39] applied the mixed-protocol framework of [17] to implement neural networks training and evaluation in a two-party computation setting. Liu et al. [34] presented MiniONN to transform an existing neural network to an oblivious neural network by applying a SIMD batching technique. Riazi et al. [42] designed Chameleon, which relies on a trusted third-party. Their frameworks were later improved in [29] by leveraging effective use of packed ciphertexts. Even though these hybrid protocols could improve efficiency, they result in high bandwidth and long network latency.

8. CONCLUSION

In this paper, we presented a practical solution for secure outsourced matrix computation. We did demonstrate its applicability by presenting a novel framework E2DM for secure evaluation of encrypted neural networks on encrypted data. Our benchmark shows that E2DM achieves lower messages sizes and latency than the solution of CryptoNets.

Our secure matrix computation primitive can be applied to various computing applications such as genetic testing and machine learning. In particular, we can investigate financial model evaluation based on our E2DM framework. Our another future work is to extend the matrix computation mechanism for more advanced operations.

CCS CONCEPTS.

Security and privacy → Cryptography;

ACKNOWLEDGMENTS

This work was supported in part by the Cancer Prevention Research Institute of Texas (CPRIT) under award number RR180012, UT STARs award, and the National Institute of Health (NIH) under award number U01TR002062, R01GM118574 and R01GM124111.

Contributor Information

Xiaoqian Jiang, University of Texas, Health Science Center.

Miran Kim, University of Texas, Health Science Center.

Kristin Lauter, Microsoft Research.

Yongsoo Song, University of California, San Diego.

REFERENCES

[1].Abadi Martín, Agarwal Ashish, Barham Paul, Brevdo Eugene, Chen Zhifeng, Citro Craig, Greg S Corrado Andy Davis, Dean Jeffrey, Devin Matthieu, et al. 2015. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. (2015). https://www.tensorflow.org.
[2].Albrecht Martin R, Player Rachel, and Scott Sam. 2015. On the concrete hardness of learning with errors. Journal of Mathematical Cryptology 9, 3 (2015), 169–203. [Google Scholar]
[3].Cloud Security Alliance. 2009. Security guidance for critical areas of focus in cloud computing. (2009). http://www.cloudsecurityalliance.org.
[4].Atallah Mikhail J and Frikken Keith B. 2010. Securely outsourcing linear algebra computations In Proceedings of the 5th ACM Symposium on Information, Computer and Communications Security. ACM, 48–59. [Google Scholar]
[5].Barni Mauro, Orlandi Claudio, and Piva Alessandro. 2006. A privacy-preserving protocol for neural-network-based computation In Proceedings of the 8th workshop on Multimedia and security. ACM, 146–151. [Google Scholar]
[6].Bos Joppe W, Lauter Kristin, Loftus Jake, and Naehrig Michael. 2013. Improved security for a ring-based fully homomorphic encryption scheme In Cryptography and Coding. Springer, 45–64. [Google Scholar]
[7].Bourse Florian, Minelli Michele, Minihold Matthias, and Paillier Pascal. 2017. Fast Homomorphic Evaluation of Deep Discretized Neural Networks. Cryptology ePrint Archive, Report 2017/1114. (2017). https://eprint.iacr.org/2017/1114.
[8].Brakerski Zvika. 2012. Fully Homomorphic Encryption without Modulus Switching from Classical GapSVP In Advances in Cryptology–CRYPTO 2012. Springer, 868–886. [Google Scholar]
[9].Brakerski Zvika, Gentry Craig, and Vaikuntanathan Vinod. 2012. (Leveled) fully homomorphic encryption without bootstrapping In Proc. of ITCS. ACM, 309–325. [Google Scholar]
[10].Chabanne Hervé, de Wargny Amaury, Milgram Jonathan, Morel Constance, and Prouff Emmanuel. 2017. Privacy-preserving classification on deep neural network. Cryptology ePrint Archive, Report 2017/035. (2017). https://eprint.iacr.org/2017/035.
[11].Chaum David and Pedersen Torben Pryds. 1992. Wallet databases with observers In Annual International Cryptology Conference. Springer, 89–105. [Google Scholar]
[12].Hee Cheon Jung, Han Kyoohyung, Kim Andrey, Kim Miran, and Song Yongsoo. 2018. Bootstrapping for Approximate Homomorphic Encryption In Advances in Cryptology–EUROCRYPT 2018. Springer, 360–384. [Google Scholar]
[13].Cheon Jung Hee, Kim Andrey, Kim Miran, and Song Yongsoo. 2016. Implementation of HEAAN. (2016). https://github.com/kimandrik/HEAAN.
[14].Cheon Jung Hee, Kim Andrey, Kim Miran, and Song Yongsoo. 2017. Homomorphic encryption for arithmetic of approximate numbers In Advances in Cryptology–ASIACRYPT 2017: 23rd International Conference on the Theory and Application of Cryptology and Information Security. Springer, 409–437. [Google Scholar]
[15].Chollet François et al. 2015. Keras. (2015). https://github.com/keras-team/keras.
[16].Chung Kai-Min, Kalai Yael Tauman, Liu Feng-Hao, and Raz Ran. 2011. Memory delegation In Annual Cryptology Conference. Springer, 151–168. [Google Scholar]
[17].Demmler Daniel, Schneider Thomas, and Zohner Michael. 2015. ABY-A Framework for Efficient Mixed-Protocol Secure Two-Party Computation. In NDSS. [Google Scholar]
[18].Duong Dung Hoang, Mishra Pradeep Kumar, and Yasuda Masaya. 2016. Efficient secure matrix multiplication over LWE-based homomorphic encryption. Tatra Mountains Mathematical Publications 67, 1 (2016), 69–83. [Google Scholar]
[19].Fan Junfeng and Vercauteren Frederik. 2012. Somewhat Practical Fully Homomorphic Encryption. Cryptology ePrint Archive, Report 2012/144. (2012). https://eprint.iacr.org/2012/144.
[20].Fiore Dario and Gennaro Rosario. 2012. Publicly verifiable delegation of large polynomials and matrix computations, with applications In Proceedings of the 2012 ACM conference on Computer and communications security. ACM, 501–512. [Google Scholar]
[21].Gennaro Rosario, Gentry Craig, and Parno Bryan. 2010. Non-interactive verifiable computing: Outsourcing computation to untrusted workers In Annual Cryptology Conference. Springer, 465–482. [Google Scholar]
[22].Gentry Craig et al. 2009. Fully homomorphic encryption using ideal lattices.. In STOC, Vol. 9 169–178. [Google Scholar]
[23].Gentry Craig, Halevi Shai, and Smart Nigel P. 2012. Homomorphic evaluation of the AES circuit In Advances in Cryptology–CRYPTO 2012 Springer, 850–867. [Google Scholar]
[24].Gilad-Bachrach Ran, Dowlin Nathan, Laine Kim, Lauter Kristin, Naehrig Michael, and Wernsing John. 2016. CryptoNets: Applying neural networks to encrypted data with high throughput and accuracy. In International Conference on Machine Learning. 201–210. [Google Scholar]
[25].Halevi Shai and Shoup Victor. 2014. Algorithms in HElib. In Advances in Cryptology–CRYPTO 2014. Springer, 554–571. [Google Scholar]
[26].Halevi Shai and Shoup Victor. 2015. Bootstrapping for HElib In Advances in Cryptology–EUROCRYPT 2015. Springer, 641–670. [Google Scholar]
[27].Halevi Shai and Shoup Victor. 2018. Faster Homomorphic Linear Transformations in HElib. Cryptology ePrint Archive, Report 2018/244. (2018). https://eprint.iacr.org/2018/244.
[28].Jiang Xiaoqian, Zhao Yongan, Wang Xiaofeng, Malin Bradley, Wang Shuang, Ohno-Machado Lucila, and Tang Haixu. 2014. A community assessment of privacy preserving techniques for human genomes. BMC Med. Inform. Decis. Mak 14 Suppl 1, Suppl 1 (Dec. 2014), S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
[29].Juvekar Chiraag, Vaikuntanathan Vinod, and Chandrakasan Anantha. 2018. GAZELLE: A Low Latency Framework for Secure Neural Network Inference In 27th USENIX Security Symposium (USENIX Security 18). USENIX Association, Baltimore, MD. [Google Scholar]
[30].Kim Miran and Lauter Kristin. 2015. Private genome analysis through homomorphic encryption. BMC medical informatics and decision making 15, Suppl 5 (2015), S3. [DOI] [PMC free article] [PubMed] [Google Scholar]
[31].Kim Miran, Song Yongsoo, Wang Shuang, Xia Yuhou, and Jiang Xiaoqian. 2018. Secure Logistic Regression based on Homomorphic Encryption: Design and Evaluation. JMIR medical informatics 6, 2 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
[32].Krizhevsky Alex, Sutskever Ilya, and Hinton Geoffrey E. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097–1105. [Google Scholar]
[33].LeCun Yann. 1998. The MNIST database of handwritten digits. http://yann.lecun.com/exdb/mnist/ (1998).
[34].Liu Jian, Juuti Mika, Lu Yao, and Asokan N. 2017. Oblivious neural network predictions via minionn transformations In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security. ACM, 619–631. [Google Scholar]
[35].Makri Eleftheria, Rotaru Dragos, Smart Nigel P, and Vercauteren Frederik. 2017. PICS: Private Image Classification with SVM. Cryptology ePrint Archive, Report 2017/1190. (2017). https://eprint.iacr.org/2017/1190.
[36].Miotto Riccardo, Wang Fei, Wang Shuang, Jiang Xiaoqian, and Dudley Joel T. 2017. Deep learning for healthcare: review, opportunities and challenges. Brief. Bioinform (May 2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
[37].Mishra Pradeep Kumar, Duong Dung Hoang, and Yasuda Masaya. 2017. Enhancement for Secure Multiple Matrix Multiplications over Ring-LWE Homomorphic Encryption In International Conference on Information Security Practice and Experience. Springer, 320–330. [Google Scholar]
[38].Mohassel Payman. 2011. Efficient and Secure Delegation of Linear Algebra. Cryptology ePrint Archive, Report 2011/605. (2011). https://eprint.iacr.org/2011/605.
[39].Mohassel Payman and Zhang Yupeng. 2017. SecureML: A system for scalable privacy-preserving machine learning In Security and Privacy (SP), 2017 IEEE Symposium on. IEEE, 19–38. [Google Scholar]
[40].Naehrig Michael, Lauter Kristin, and Vaikuntanathan Vinod. 2011. Can homomorphic encryption be practical? In Proceedings of the 3rd ACM workshop on Cloud computing security workshop. ACM, 113–124. [Google Scholar]
[41].Orlandi Claudio, Piva Alessandro, and Barni Mauro. 2007. Oblivious neural network computing via homomorphic encryption. EURASIP Journal on Information Security 2007, 1 (2007), 037343. [Google Scholar]
[42].Riazi M Sadegh, Weinert Christian, Tkachenko Oleksandr, Songhori Ebrahim M, Schneider Thomas, and Koushanfar Farinaz. 2018. Chameleon: A Hybrid Secure Computation Framework for Machine Learning Applications. arXiv preprint arXiv:1801.03239 (2018). [Google Scholar]
[43].Shoup Victor et al. 2001. NTL: A library for doing number theory. (2001).
[44].Simonyan Karen and Zisserman Andrew. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014). [Google Scholar]
[45].Smart Nigel P and Vercauteren Frederik. 2011. Fully homomorphic SIMD operations. Cryptology ePrint Archive, Report 2011/133. (2011). https://eprint.iacr.org/2011/133.
[46].Takabi Hassan, Joshi James BD, and Ahn Gail-Joon. 2010. Security and privacy challenges in cloud computing environments. IEEE Security & Privacy 8, 6 (2010), 24–31. [Google Scholar]
[47].Wang Shuang, Jiang Xiaoqian, Tang Haixu, Wang Xiaofeng, Bu Diyue, Carey Knox, Dyke Stephanie O M, Fox Dov, Jiang Chao, Lauter Kristin, and Others. 2017. A community effort to protect genomic data sharing, collaboration and outsourcing. npj Genomic Medicine 2, 1 (2017), 33. [DOI] [PMC free article] [PubMed] [Google Scholar]
[48].Wu David and Haven Jacob. 2012. Using homomorphic encryption for large scale statistical analysis. Technical Report; Technical Report: cs. stanford. edu/people/dwu4/papers/FHESI Report. pdf. [Google Scholar]
[49].Yasuda Masaya, Shimoyama Takeshi, Kogure Jun, Yokoyama Kazuhiro, and Koshiba Takeshi. 2015. New packing method in somewhat homomorphic encryption and its applications. Security and Communication Networks 8, 13 (2015), 2194–2213. [Google Scholar]
[50].Zeiler Matthew D. 2012. ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012).

[R1] [1].Abadi Martín, Agarwal Ashish, Barham Paul, Brevdo Eugene, Chen Zhifeng, Citro Craig, Greg S Corrado Andy Davis, Dean Jeffrey, Devin Matthieu, et al. 2015. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. (2015). https://www.tensorflow.org.

[R2] [2].Albrecht Martin R, Player Rachel, and Scott Sam. 2015. On the concrete hardness of learning with errors. Journal of Mathematical Cryptology 9, 3 (2015), 169–203. [Google Scholar]

[R3] [3].Cloud Security Alliance. 2009. Security guidance for critical areas of focus in cloud computing. (2009). http://www.cloudsecurityalliance.org.

[R4] [4].Atallah Mikhail J and Frikken Keith B. 2010. Securely outsourcing linear algebra computations In Proceedings of the 5th ACM Symposium on Information, Computer and Communications Security. ACM, 48–59. [Google Scholar]

[R5] [5].Barni Mauro, Orlandi Claudio, and Piva Alessandro. 2006. A privacy-preserving protocol for neural-network-based computation In Proceedings of the 8th workshop on Multimedia and security. ACM, 146–151. [Google Scholar]

[R6] [6].Bos Joppe W, Lauter Kristin, Loftus Jake, and Naehrig Michael. 2013. Improved security for a ring-based fully homomorphic encryption scheme In Cryptography and Coding. Springer, 45–64. [Google Scholar]

[R7] [7].Bourse Florian, Minelli Michele, Minihold Matthias, and Paillier Pascal. 2017. Fast Homomorphic Evaluation of Deep Discretized Neural Networks. Cryptology ePrint Archive, Report 2017/1114. (2017). https://eprint.iacr.org/2017/1114.

[R8] [8].Brakerski Zvika. 2012. Fully Homomorphic Encryption without Modulus Switching from Classical GapSVP In Advances in Cryptology–CRYPTO 2012. Springer, 868–886. [Google Scholar]

[R9] [9].Brakerski Zvika, Gentry Craig, and Vaikuntanathan Vinod. 2012. (Leveled) fully homomorphic encryption without bootstrapping In Proc. of ITCS. ACM, 309–325. [Google Scholar]

[R10] [10].Chabanne Hervé, de Wargny Amaury, Milgram Jonathan, Morel Constance, and Prouff Emmanuel. 2017. Privacy-preserving classification on deep neural network. Cryptology ePrint Archive, Report 2017/035. (2017). https://eprint.iacr.org/2017/035.

[R11] [11].Chaum David and Pedersen Torben Pryds. 1992. Wallet databases with observers In Annual International Cryptology Conference. Springer, 89–105. [Google Scholar]

[R12] [12].Hee Cheon Jung, Han Kyoohyung, Kim Andrey, Kim Miran, and Song Yongsoo. 2018. Bootstrapping for Approximate Homomorphic Encryption In Advances in Cryptology–EUROCRYPT 2018. Springer, 360–384. [Google Scholar]

[R13] [13].Cheon Jung Hee, Kim Andrey, Kim Miran, and Song Yongsoo. 2016. Implementation of HEAAN. (2016). https://github.com/kimandrik/HEAAN.

[R14] [14].Cheon Jung Hee, Kim Andrey, Kim Miran, and Song Yongsoo. 2017. Homomorphic encryption for arithmetic of approximate numbers In Advances in Cryptology–ASIACRYPT 2017: 23rd International Conference on the Theory and Application of Cryptology and Information Security. Springer, 409–437. [Google Scholar]

[R15] [15].Chollet François et al. 2015. Keras. (2015). https://github.com/keras-team/keras.

[R16] [16].Chung Kai-Min, Kalai Yael Tauman, Liu Feng-Hao, and Raz Ran. 2011. Memory delegation In Annual Cryptology Conference. Springer, 151–168. [Google Scholar]

[R17] [17].Demmler Daniel, Schneider Thomas, and Zohner Michael. 2015. ABY-A Framework for Efficient Mixed-Protocol Secure Two-Party Computation. In NDSS. [Google Scholar]

[R18] [18].Duong Dung Hoang, Mishra Pradeep Kumar, and Yasuda Masaya. 2016. Efficient secure matrix multiplication over LWE-based homomorphic encryption. Tatra Mountains Mathematical Publications 67, 1 (2016), 69–83. [Google Scholar]

[R19] [19].Fan Junfeng and Vercauteren Frederik. 2012. Somewhat Practical Fully Homomorphic Encryption. Cryptology ePrint Archive, Report 2012/144. (2012). https://eprint.iacr.org/2012/144.

[R20] [20].Fiore Dario and Gennaro Rosario. 2012. Publicly verifiable delegation of large polynomials and matrix computations, with applications In Proceedings of the 2012 ACM conference on Computer and communications security. ACM, 501–512. [Google Scholar]

[R21] [21].Gennaro Rosario, Gentry Craig, and Parno Bryan. 2010. Non-interactive verifiable computing: Outsourcing computation to untrusted workers In Annual Cryptology Conference. Springer, 465–482. [Google Scholar]

[R22] [22].Gentry Craig et al. 2009. Fully homomorphic encryption using ideal lattices.. In STOC, Vol. 9 169–178. [Google Scholar]

[R23] [23].Gentry Craig, Halevi Shai, and Smart Nigel P. 2012. Homomorphic evaluation of the AES circuit In Advances in Cryptology–CRYPTO 2012 Springer, 850–867. [Google Scholar]

[R24] [24].Gilad-Bachrach Ran, Dowlin Nathan, Laine Kim, Lauter Kristin, Naehrig Michael, and Wernsing John. 2016. CryptoNets: Applying neural networks to encrypted data with high throughput and accuracy. In International Conference on Machine Learning. 201–210. [Google Scholar]

[R25] [25].Halevi Shai and Shoup Victor. 2014. Algorithms in HElib. In Advances in Cryptology–CRYPTO 2014. Springer, 554–571. [Google Scholar]

[R26] [26].Halevi Shai and Shoup Victor. 2015. Bootstrapping for HElib In Advances in Cryptology–EUROCRYPT 2015. Springer, 641–670. [Google Scholar]

[R27] [27].Halevi Shai and Shoup Victor. 2018. Faster Homomorphic Linear Transformations in HElib. Cryptology ePrint Archive, Report 2018/244. (2018). https://eprint.iacr.org/2018/244.

[R28] [28].Jiang Xiaoqian, Zhao Yongan, Wang Xiaofeng, Malin Bradley, Wang Shuang, Ohno-Machado Lucila, and Tang Haixu. 2014. A community assessment of privacy preserving techniques for human genomes. BMC Med. Inform. Decis. Mak 14 Suppl 1, Suppl 1 (Dec. 2014), S1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] [29].Juvekar Chiraag, Vaikuntanathan Vinod, and Chandrakasan Anantha. 2018. GAZELLE: A Low Latency Framework for Secure Neural Network Inference In 27th USENIX Security Symposium (USENIX Security 18). USENIX Association, Baltimore, MD. [Google Scholar]

[R30] [30].Kim Miran and Lauter Kristin. 2015. Private genome analysis through homomorphic encryption. BMC medical informatics and decision making 15, Suppl 5 (2015), S3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] [31].Kim Miran, Song Yongsoo, Wang Shuang, Xia Yuhou, and Jiang Xiaoqian. 2018. Secure Logistic Regression based on Homomorphic Encryption: Design and Evaluation. JMIR medical informatics 6, 2 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] [32].Krizhevsky Alex, Sutskever Ilya, and Hinton Geoffrey E. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097–1105. [Google Scholar]

[R33] [33].LeCun Yann. 1998. The MNIST database of handwritten digits. http://yann.lecun.com/exdb/mnist/ (1998).

[R34] [34].Liu Jian, Juuti Mika, Lu Yao, and Asokan N. 2017. Oblivious neural network predictions via minionn transformations In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security. ACM, 619–631. [Google Scholar]

[R35] [35].Makri Eleftheria, Rotaru Dragos, Smart Nigel P, and Vercauteren Frederik. 2017. PICS: Private Image Classification with SVM. Cryptology ePrint Archive, Report 2017/1190. (2017). https://eprint.iacr.org/2017/1190.

[R36] [36].Miotto Riccardo, Wang Fei, Wang Shuang, Jiang Xiaoqian, and Dudley Joel T. 2017. Deep learning for healthcare: review, opportunities and challenges. Brief. Bioinform (May 2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R37] [37].Mishra Pradeep Kumar, Duong Dung Hoang, and Yasuda Masaya. 2017. Enhancement for Secure Multiple Matrix Multiplications over Ring-LWE Homomorphic Encryption In International Conference on Information Security Practice and Experience. Springer, 320–330. [Google Scholar]

[R38] [38].Mohassel Payman. 2011. Efficient and Secure Delegation of Linear Algebra. Cryptology ePrint Archive, Report 2011/605. (2011). https://eprint.iacr.org/2011/605.

[R39] [39].Mohassel Payman and Zhang Yupeng. 2017. SecureML: A system for scalable privacy-preserving machine learning In Security and Privacy (SP), 2017 IEEE Symposium on. IEEE, 19–38. [Google Scholar]

[R40] [40].Naehrig Michael, Lauter Kristin, and Vaikuntanathan Vinod. 2011. Can homomorphic encryption be practical? In Proceedings of the 3rd ACM workshop on Cloud computing security workshop. ACM, 113–124. [Google Scholar]

[R41] [41].Orlandi Claudio, Piva Alessandro, and Barni Mauro. 2007. Oblivious neural network computing via homomorphic encryption. EURASIP Journal on Information Security 2007, 1 (2007), 037343. [Google Scholar]

[R42] [42].Riazi M Sadegh, Weinert Christian, Tkachenko Oleksandr, Songhori Ebrahim M, Schneider Thomas, and Koushanfar Farinaz. 2018. Chameleon: A Hybrid Secure Computation Framework for Machine Learning Applications. arXiv preprint arXiv:1801.03239 (2018). [Google Scholar]

[R43] [43].Shoup Victor et al. 2001. NTL: A library for doing number theory. (2001).

[R44] [44].Simonyan Karen and Zisserman Andrew. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014). [Google Scholar]

[R45] [45].Smart Nigel P and Vercauteren Frederik. 2011. Fully homomorphic SIMD operations. Cryptology ePrint Archive, Report 2011/133. (2011). https://eprint.iacr.org/2011/133.

[R46] [46].Takabi Hassan, Joshi James BD, and Ahn Gail-Joon. 2010. Security and privacy challenges in cloud computing environments. IEEE Security & Privacy 8, 6 (2010), 24–31. [Google Scholar]

[R47] [47].Wang Shuang, Jiang Xiaoqian, Tang Haixu, Wang Xiaofeng, Bu Diyue, Carey Knox, Dyke Stephanie O M, Fox Dov, Jiang Chao, Lauter Kristin, and Others. 2017. A community effort to protect genomic data sharing, collaboration and outsourcing. npj Genomic Medicine 2, 1 (2017), 33. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R48] [48].Wu David and Haven Jacob. 2012. Using homomorphic encryption for large scale statistical analysis. Technical Report; Technical Report: cs. stanford. edu/people/dwu4/papers/FHESI Report. pdf. [Google Scholar]

[R49] [49].Yasuda Masaya, Shimoyama Takeshi, Kogure Jun, Yokoyama Kazuhiro, and Koshiba Takeshi. 2015. New packing method in somewhat homomorphic encryption and its applications. Security and Communication Networks 8, 13 (2015), 2194–2213. [Google Scholar]

[R50] [50].Zeiler Matthew D. 2012. ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012).

PERMALINK

Secure Outsourced Matrix Computation and Application to Neural Networks

Xiaoqian Jiang

Miran Kim

Kristin Lauter

Yongsoo Song

Abstract

1. INTRODUCTION

1.1. Our Contribution

1.2. Technical Details

Figure 1:

Table 1:

2. PRELIMINARIES

2.1. Homomorphic Encryption

2.2. Ciphertext Packing Technique

2.3. Linear Transformations

3. SECURE MATRIX MULTIPLICATION VIA HOMOMORPHIC ENCRYPTION

3.1. Permutations for Matrix Multiplication

3.2. Matrix Encoding Method

3.3. Matrix Multiplication on Packed Ciphertexts

3.3.1. Tweaks of Permutations.

3.3.2. Homomorphic Matrix Multiplication.

3.3.3. Further Improvements.

Table 2:

4. ADVANCED HOMOMORPHIC MATRIX COMPUTATIONS

4.1. Matrix Transposition on Packed Ciphertexts

4.2. Rectangular Matrix Multiplication

4.2.1. Refinements of Rectangular Matrix Multiplication.

4.2.2. Homomorphic Rectangular Matrix Multiplication.

Table 3:

4.3. Parallel Matrix Computation

5. IMPLEMENTATION OF HOMOMORPHIC MATRIX OPERATIONS

5.1. Parameter Setting

5.2. Performance of Matrix Operations

Table 4:

5.2.1. Ciphertext Sizes.

5.2.2. Timing Results.

5.3. Performance of Rectangular Matrix Multiplication

Table 5:

6. E2DM: MAKING PREDICTION BASED ON ENCRYPTED DATA AND MODEL

6.1. Neural Networks Architecture

6.2. Homomorphic Evaluation of CNN

Table 6:

6.2.1. Encryption of Images.

6.2.2. Encryption of Trained Model.

6.2.3. Homomorphic Evaluation of Neural Networks.

Homomorphic convolution layer.

The first square layer.

The FC-1 layer.

The second square layer.

The FC-2 layer.

6.2.4. The Threat Model.

6.3. Performance Evaluation of E2DM

6.3.1. Optimization Techniques.

6.3.2. Parameters.

6.3.3. Ciphertext Sizes.

Table 7:

6.3.4. Implementation Details.

Table 8:

6.4. Comparison with Previous Work

Table 9:

6.4.1. HE-based Frameworks.

6.4.2. Mixed Protocol Frameworks.

7. RELATED WORKS

7.1. Secure Outsourced Matrix Computation

7.2. Privacy-preserving Neural Networks Predictions

8. CONCLUSION

CCS CONCEPTS.

ACKNOWLEDGMENTS

Contributor Information

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases