Published in final edited form as: IEEE Trans Inf Forensics Secur. 2024 May 17;19:5751–5766. doi: 10.1109/tifs.2024.3402319

Efficient Privacy-preserving Logistic Model With Malicious Security

Guanhong Miao 1, Samuel S Wu 1
PMCID: PMC11236440  NIHMSID: NIHMS1998323  PMID: 38993695

Abstract

Conducting secure computations to protect against malicious adversaries is an emerging field of research. Current models designed for malicious security typically necessitate the involvement of two or more servers in an honest-majority setting. Among privacy-preserving data mining techniques, significant attention has been focused on the classification problem. Logistic regression emerges as a well-established classification model, renowned for its impressive performance. We introduce a novel matrix encryption method to build a maliciously secure logistic model. Our scheme involves only a single semi-honest server and is resilient to malicious data providers that may deviate arbitrarily from the scheme. The d-transformation ensures that our scheme achieves indistinguishability (i.e., no adversary can determine, in polynomial time, which of the plaintexts corresponds to a given ciphertext in a chosen-plaintext attack). Malicious activities of data providers can be detected in the verification stage. A lossy compression method is implemented to minimize communication costs while preserving negligible degradation in accuracy. Experiments illustrate that our scheme is highly efficient in analyzing large-scale datasets and achieves accuracy similar to non-private models. The proposed scheme outperforms other maliciously secure frameworks in terms of computation and communication costs.

Index Terms—: Privacy-preserving, logistic model, malicious adversary, indistinguishability

I. Introduction

The Internet of Things (IoT) is gradually entering our lives, with wireless communication systems increasingly employed as the technology driver for smart monitoring and applications. An IoT system can be depicted as smart devices that interact on a collaborative basis toward a common goal. Smart cities are incorporating a wide range of advanced IoT infrastructures, resulting in a large amount of data gathered from different devices deployed in many domains, such as health care, energy transmission, and transportation [1]. Smart things provide efficient tools for ubiquitous data collection and tracking, but they also face privacy threats.

To address the challenges arising from IoT data processing and analysis, an increasing number of innovations have emerged recently. For instance, collaborative learning is a desirable and empowering paradigm for smart IoT systems. Collaborative learning enables multiple data providers to learn models utilizing all of their data jointly [2], [3]. Typical collaborative systems are distributed computing systems such as secure multi-party computation (SMC) frameworks [2], [4]. SMC enables parties to jointly compute on private inputs without revealing anything but the result.

Collaborative learning has benefited society, including medical research [5]. Data containing healthcare informatics are usually collected in medical centers such as hospitals. Generally, a study center does not share data with other institutes, considering the confidentiality of participants. To study disease mechanisms, especially for rare diseases for which each center has limited cases, it is important to perform data analysis combining data from multiple institutes. Collaborative learning provides great promise for connecting healthcare data sources. Since sharing individual-level data is not permitted by law or regulation in many domains, various privacy-preserving techniques have been developed to perform collaborative learning.

Many privacy-preserving techniques assume semi-honest models, in which the server and clients follow the protocol specification. Because clients could be any arbitrary entity, it is less likely that all the clients (i.e., data providers) would be semi-honest. Recently, maliciously secure models [6], [7] have been proposed to achieve privacy in the presence of malicious adversaries that could deviate arbitrarily from the protocol specification. Based on the assumption about the number of servers that can be malicious in the protocol, maliciously secure frameworks operate in either an honest-majority setting [6]–[14] or a malicious-majority setting [15]–[18]. These frameworks typically rely on multiple servers (e.g., the three-server model [6], [7], [9], [10] and the four-server model [8], [12]–[14]), with the most common assumption being an honest-majority setting (i.e., a majority of servers are semi-honest). In contrast, the malicious-majority setting anticipates a scenario where a majority of servers may behave maliciously. This setting enhances security in environments where a significant portion of servers may be untrustworthy, providing a more realistic and robust solution in adversarial conditions. Since the efficiency of SMC protocols is highly dependent on the number of honest servers [13], maliciously secure frameworks in the malicious-majority setting are less efficient than those in the honest-majority setting.

In this paper, we propose a privacy-preserving logistic model scheme, assuming a dishonest majority in a maliciously secure setting. We assume that data are horizontally distributed among data providers (i.e., each data provider is a client that collects information of the same features for different samples). Our contributions are summarized as follows:

  1. We propose a novel matrix encryption technique to build a maliciously secure logistic model. Unlike state-of-the-art frameworks that necessitate the involvement of two or more servers in an honest-majority setting, our scheme involves only a single semi-honest server and is resilient to malicious attacks conducted by data providers. Malicious behaviors conducted by data providers are detectable during the verification stage.

  2. The proposed matrix encryption method combines Gaussian matrix encryption with the d-transformation and commutative matrix encryption. The d-transformation ensures that random records within any energy range are indistinguishable. The commutative matrix encryption is applied to preserve data utility.

  3. We utilize a lossy compression method to reduce communication costs while ensuring negligible degradation in accuracy. Compared with other maliciously secure frameworks, our scheme is more efficient in analyzing large-scale datasets in terms of computation and communication costs.

II. Related work

Secure multi-party computation (SMC)

SMC frameworks with a small number of parties have proven particularly attractive recently. Among these frameworks, homomorphic encryption (HE) [22], [23] has been widely used to protect data privacy [10], [11], [18]–[20]. Recent advances on garbled circuits [24], [25] have led to a set of privacy-preserving protocols [15]–[17] for SMC tolerating an arbitrary number of malicious corruptions. Garbled circuit and HE techniques require large volumes of ciphertexts to be transferred or have high computation complexity. In terms of efficient constructions, various secure frameworks in an honest-majority setting [6]–[14] have drawn phenomenal attention. The details of these frameworks are summarized in Table I.

Table I:

Recent secure multi-party computation (SMC) frameworks

Framework No. of parties/servers Encryption method Threat model Collusion assumption
[15] 2 Garbled circuit Malicious Malicious-majority
[16] ≥2 Garbled circuit Malicious Malicious-majority
[17] ≥2 Garbled circuit Malicious Malicious-majority
[8] 4 Garbled circuit Malicious Honest-majority
[9] 3 Garbled circuit Malicious Honest-majority
[18] Homomorphic encryption Malicious Malicious-majority
[19] 1 Homomorphic encryption Semi-honest Passive adversary
[10] 3 Mixed Malicious Honest-majority
[11] 2 Mixed Malicious Honest-majority
[20] 2 Mixed Semi-honest Passive adversary
[12] 3, 4 Joint message passing Malicious Honest-majority
[13] 3, 4 SPDZ Malicious Honest-majority
[21] 2 Secret sharing Semi-honest Passive adversary
[14] 4 Secret sharing Malicious Honest-majority
[6] 3 Secret sharing Malicious Honest-majority
[7] 3 Secret sharing Malicious Honest-majority
Our 1 Matrix encryption Malicious Malicious-majority

Malicious model: the entity deviates arbitrarily from the protocol specification. Semi-honest model: the entity follows the prescribed protocol but attempts to gain unauthorized information by covertly observing the communication or computations of other entities involved. Mixed encryption method: the framework applies both garbled circuit and homomorphic encryption.

Differential privacy

Differential privacy (DP) [26] has been widely incorporated into distributed deep learning [27]–[29] by adding noise to input data, loss functions, gradients, weights, or output classes. Moreover, DP has been applied to enable secure exchanges of intermediate data and to obtain models resilient to adversarial inference in federated learning [30]–[32]. Implementing DP in practice remains challenging, however, since training robust and accurate models requires high privacy budgets, and the level of privacy achieved in practice remains unclear [33].

Matrix encryption

Matrix encryption has been extensively utilized in the development of compressed sensing (CS)-based cryptosystems [34]–[37]. This approach is well-suited for ensuring the security of practical applications, such as the Internet of Things and multimedia. The Gaussian one-time sensing CS-based cryptosystem, which employs a random Gaussian matrix and renews the matrix at each encryption, has been proven to be asymptotically secure for the plaintext with constant energy [34]–[36]. It is challenging to practically implement these CS-based cryptosystems because the indistinguishability of Gaussian matrix encryption is highly sensitive to variations in the energy of plaintexts.

To summarize, a majority of maliciously secure models necessitate the involvement of two or more servers in an honest-majority setting. Moreover, existing secure models have relatively low efficiency for large-scale data analysis. Previous studies of Gaussian matrix encryption ensure indistinguishability only among records with constant energy (i.e., Euclidean norm), which poses a practical challenge when data have arbitrary energy ranges. This paper introduces a maliciously secure logistic model that ensures indistinguishability among random records within any energy range. Our model assumes the malicious-majority setting and is highly efficient in analyzing datasets of substantial size.

III. Preliminaries

A. Logistic model

Consider a set of data $\mathcal{D}=\{(x_1,y_1),\ldots,(x_n,y_n)\}$, where $x_i\in\mathbb{R}^q$ and $y_i\in\{0,1\}$ denotes the binary outcome (such as case/control status) of $x_i$ $(i=1,\ldots,n)$. Without loss of generality, a constant 1 is typically added as the first element of each record $x_i$ to account for the intercept. The logistic model [38], [39] has the form

$$\log\frac{\Pr(y_i=1\mid x_i)}{\Pr(y_i=0\mid x_i)}=x_i^T\beta, \quad (1)$$

where $\beta=(\beta_1,\ldots,\beta_q)^T$ is a $q$-dimensional coefficient vector and $\Pr(\cdot)$ is the probability function. The logistic model is typically fitted through maximum likelihood, using the conditional likelihood. The log-likelihood is

$$\ell(\beta)=\sum_{i=1}^n\big[y_i\log p(x_i;\beta)+(1-y_i)\log(1-p(x_i;\beta))\big] \quad (2)$$

where $p(x_i;\beta)=\Pr(y_i=1\mid x_i;\beta)=\frac{\exp(x_i^T\beta)}{1+\exp(x_i^T\beta)}$.

For the ridge-regularized logistic model, we maximize the log-likelihood subject to a size constraint on the $L_2$-norm (i.e., Euclidean norm) of the coefficients. The ridge estimate is

$$\hat{\beta}^{\text{ridge}}=\arg\min_\beta\Big\{-\ell(\beta)+\frac{\lambda}{2}\|\beta\|_2^2\Big\} \quad (3)$$

where $\lambda\geq 0$ is the ridge parameter.

Let $Y=(y_1,\ldots,y_n)^T$ denote the outcome, $X=(x_1,\ldots,x_n)^T$ denote the $n\times q$ feature matrix, and let $W$ be the $n\times n$ diagonal matrix of weights (Equation 4):

$$W\equiv\mathrm{diag}\big(p(x_1;\beta)(1-p(x_1;\beta)),\ldots,p(x_n;\beta)(1-p(x_n;\beta))\big). \quad (4)$$

We use Newton's method to fit the logistic model. Given $\beta^{\text{old}}$, a single Newton update is

$$\beta^{\text{new}}=\beta^{\text{old}}-\Big(\frac{\partial^2\ell(\beta)}{\partial\beta\,\partial\beta^T}\Big)^{-1}\frac{\partial\ell(\beta)}{\partial\beta} \quad (5)$$

where the derivatives are evaluated at $\beta^{\text{old}}$. The update can be expressed in matrix notation as

$$\beta^{\text{new}}=(X^TW^{\text{old}}X+\Lambda)^{-1}\big[X^TW^{\text{old}}X\beta^{\text{old}}+X^T(Y-p^{\text{old}})\big] \quad (6)$$

where $p^{\text{old}}=(p(x_1;\beta^{\text{old}}),\ldots,p(x_n;\beta^{\text{old}}))^T$ and $p(x_i;\beta^{\text{old}})(1-p(x_i;\beta^{\text{old}}))$ is the $i$-th diagonal element of the diagonal matrix $W^{\text{old}}$. $\Lambda$ is a matrix of zeros for the non-regularized logistic model. For the ridge-regularized logistic model, $\Lambda$ is a diagonal matrix with diagonal elements $\{0,\lambda,\ldots,\lambda\}$.
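The update in Equation 6 maps directly to a few lines of linear algebra. The sketch below is our own minimal NumPy illustration, not the authors' Matlab implementation; the function name, tolerance, and iteration cap are illustrative assumptions.

```python
# Minimal sketch of the Newton update in Equation 6 for the (optionally
# ridge-regularized) logistic model. Illustrative only; not the paper's code.
import numpy as np

def newton_logistic(X, Y, lam=0.0, tol=1e-6, max_iter=50):
    n, q = X.shape
    beta = np.zeros(q)
    # Lambda: zeros for the non-regularized model; diag(0, lam, ..., lam)
    # for ridge, so the intercept is not penalized.
    Lam = np.diag(np.r_[0.0, np.full(q - 1, lam)])
    for _ in range(max_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))           # p(x_i; beta)
        w = p * (1.0 - p)                             # diagonal of W
        H = X.T @ (w[:, None] * X) + Lam              # X^T W X + Lambda
        rhs = X.T @ (w * (X @ beta)) + X.T @ (Y - p)  # X^T W X beta + X^T (Y - p)
        beta_new = np.linalg.solve(H, rhs)
        if np.max(np.abs(beta_new - beta)) < tol:
            return beta_new
        beta = beta_new
    return beta
```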

B. Indistinguishability

Indistinguishability has been widely used as the security measure in recent cryptosystems. Using different notations (e.g., Definitions 3.9 and 3.10 in [40], Definition 2.1 in [41], Definition 2 in [42], Definition "PrvInd" in [43], Definition 2 in [44], Definition 1 in [45], Definition 1 in [35], Section III in [36]), all these definitions express the same security level: a cryptosystem is indistinguishable if no adversary can determine in polynomial time which of two plaintexts corresponds to a ciphertext with probability significantly better than that of a random guess. In other words, within an indistinguishable cryptosystem, an adversary cannot learn any partial information about the plaintext in polynomial time given a ciphertext. Comprehensive comparisons of indistinguishability with other security measures (e.g., differential privacy) are given in [41], [42]. In line with other cryptosystems utilizing matrix encryption methods [35], [36], [44], we provide the formal definition of indistinguishability, denoted as Definition 1, following Definition 1 in [35], Section III in [36], and Definition 2 in [44].

Definition 1. Let $p_d$ be the probability that an adversary can successfully discern which of two plaintexts corresponds to the ciphertext using any algorithm that operates within polynomial time. Then a cryptosystem is indistinguishable if there is a negligible function $\epsilon(q)$ such that for all plaintext lengths $q$,

$$p_d\leq\frac{1}{2}+\epsilon(q). \quad (7)$$

$\epsilon(q)$ is negligible if for every positive constant $c$ there exists an integer $q_c$ such that $\epsilon(q)<q^{-c}$ for all $q>q_c$.

Let $d_{TV}(p_1,p_2)$ be the total variation (TV) distance [46] between $p_1=\Pr(y\mid t_1)$ and $p_2=\Pr(y\mid t_2)$, where $p_i$ is the probability distribution of $y$ conditioned on $t_i$ $(i=1,2)$. Based on [47], the probability of successfully distinguishing the plaintexts is bounded by

$$p_d\leq\frac{1}{2}+\frac{d_{TV}(p_1,p_2)}{2}, \quad (8)$$

where $d_{TV}(p_1,p_2)\in[0,1]$. If $d_{TV}(p_1,p_2)=0$, the probability of success is at most that of a random guess, leading to indistinguishability [40].

Computing $d_{TV}(p_1,p_2)$ directly is difficult [48], so we employ the Hellinger distance [46] to bound the TV distance. Let $d_H(p_1,p_2)$ be the Hellinger distance [46]; it gives both lower and upper bounds on the TV distance [49], i.e.,

$$d_H^2(p_1,p_2)\leq d_{TV}(p_1,p_2)\leq d_H(p_1,p_2)\sqrt{2-d_H^2(p_1,p_2)} \quad (9)$$

where $d_H(p_1,p_2)\in[0,1]$. Moreover, if $p_1$ and $p_2$ are multivariate Gaussian distributions (i.e., the ciphertext $y$ conditioned on $t_h$ follows a Gaussian distribution with zero mean and covariance matrix $C_h$, $h\in\{1,2\}$), the Hellinger distance between $p_1$ and $p_2$ is given by [50] and [51] as

$$d_H(p_1,p_2)=\sqrt{1-\frac{|C_1|^{1/4}|C_2|^{1/4}}{|C_3|^{1/2}}} \quad (10)$$

where $C_3$ is the average of $C_1$ and $C_2$ (i.e., $C_3\equiv\frac{C_1+C_2}{2}$). Formal definitions and properties of the total variation and Hellinger distances are given in [46]–[48].
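For intuition, the quantities in Equations 8-10 can be evaluated numerically. The following sketch is ours (the covariance values are arbitrary); it computes the Hellinger distance between two zero-mean Gaussians via Equation 10 and the induced TV and success-probability bounds.

```python
# Numeric sketch of Equations 8-10: Hellinger distance between two zero-mean
# Gaussians with covariances C1 and C2, and the induced bounds on the total
# variation distance and on the distinguishing probability p_d.
import numpy as np

def hellinger_gaussian(C1, C2):
    C3 = (C1 + C2) / 2.0                                 # Equation 10
    (_, ld1), (_, ld2), (_, ld3) = (np.linalg.slogdet(C) for C in (C1, C2, C3))
    # |C1|^(1/4) |C2|^(1/4) / |C3|^(1/2) computed in log space for stability.
    return np.sqrt(1.0 - np.exp(0.25 * ld1 + 0.25 * ld2 - 0.5 * ld3))

def tv_bounds(dH):
    return dH**2, dH * np.sqrt(2.0 - dH**2)              # Equation 9

C1, C2 = 4.0 * np.eye(5), 9.0 * np.eye(5)                # arbitrary example
dH = hellinger_gaussian(C1, C2)
tv_low, tv_up = tv_bounds(dH)
pd_bound = 0.5 + tv_up / 2.0                             # Equation 8
```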

C. Adversarial attacks on matrix encryption methods

In previous privacy-preserving frameworks using matrix encryption techniques [37], [52]–[54], adversarial attack models are classified into four levels: ciphertext-only attack (COA), known-plaintext attack (KPA), chosen-plaintext attack (CPA), and chosen-ciphertext attack (CCA).

Ciphertext-only attack (level 1):

The adversary is assumed to have access to ciphertexts and no other information. Within the COA, the adversary attempts to retrieve sensitive information using the ciphertexts.

Known-plaintext attack (level 2):

The adversary has access to the ciphertexts and corresponding plaintexts. Within the KPA, the attacker attempts to recover sensitive information by analyzing ciphertexts and their corresponding plaintexts.

Chosen-plaintext attack (level 3):

Given any plaintext, the adversary can get its corresponding ciphertext within the CPA. The adversary attempts to recover the encryption key or algorithm by examining associations between plaintexts and ciphertexts.

Chosen-ciphertext attack (level 4):

Within the CCA, the adversary has the capability to obtain the decryption of any ciphertexts of its choice. The adversary attempts to determine the plaintext that was encrypted to give some other ciphertexts.

In our scheme, no ciphertext is ever decrypted, so it is impossible for adversaries to obtain the decryption of any ciphertext. Therefore, adversaries cannot perform a CCA, and we only consider the first three attacks.

D. Matrix encryption

Assume $x$ is a random row in the dataset $X$ containing $n$ rows and $q$ columns. To encrypt $x$, a random Gaussian matrix $B$ of dimensions $q\times q$ is generated, where each element follows a Gaussian distribution $N(\mu,\sigma^2)$ with parameters $\mu$ and $\sigma$. The encryption function for row $x$ and data $X$ can be summarized as $f_{B,r}(x)\equiv xB$ and $f_{B,r}(X)\equiv XB$. Each column in the dataset $X$ can be encrypted similarly. Specifically, let $x_c$ be a random column in $X$. An $n\times n$ random Gaussian matrix $A$ is generated to encrypt $x_c$. The encryption function for column $x_c$ and data $X$ can be summarized as $f_{A,l}(x_c)\equiv Ax_c$ and $f_{A,l}(X)\equiv AX$.
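Both encryption functions are plain matrix products. A short illustrative sketch follows; all sizes, distribution parameters, and the seed are our own choices.

```python
# Sketch of the row- and column-wise Gaussian matrix encryption functions
# f_{B,r}(X) = XB and f_{A,l}(X) = AX. Parameters are illustrative.
import numpy as np

rng = np.random.default_rng(0)
n, q = 6, 4
X = rng.standard_normal((n, q))                   # plaintext

B = rng.normal(loc=0.0, scale=1.0, size=(q, q))   # q x q right-multiplier
A = rng.normal(loc=0.0, scale=1.0, size=(n, n))   # n x n left-multiplier

enc_rows = X @ B        # encrypts each row:    f_{B,r}(X) = XB
enc_cols = A @ X        # encrypts each column: f_{A,l}(X) = AX
```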

IV. System Overview

A. System model

We investigate collaborative learning in which data are collected and owned by different data providers, referred to as clients. The goal is to build an efficient logistic model using data from all the clients while ensuring privacy protection. We consider that data are horizontally distributed, i.e., clients have different sets of samples and the same set of features (Figure 1A).

Fig. 1: A: an example showing the horizontal partitioning scenario with three data providers (referred to as clients); B: workflow of the proposed privacy-preserving logistic model.

The proposed scheme involves multiple clients and one server who is responsible for the secure computation. The privacy-preserving scheme contains four stages: encryption, modeling, decryption and verification (Figure 1B). Clients perform data encryption as the initial step. After the encryption process, the data are sent to the server for the secure computation. The server then sends encrypted model results back to clients. Subsequently, clients decrypt the model results. Finally, the server and clients initiate the verification stage to identify any malicious activity conducted by the clients.

B. Threat model

Maliciously secure frameworks with multiple servers have been studied extensively [6], [7], [12], [13]. These frameworks assume at least one server is semi-honest and does not collude with malicious adversaries. Following these frameworks, the sole server in our scheme is assumed to be semi-honest, while clients are allowed to be malicious. Precisely, we assume that the server faithfully executes the delegated computations but may be curious about the intermediate data and try to learn or infer sensitive information. In contrast, clients may act maliciously (i.e., arbitrarily deviate from the predefined scheme to cheat others). Clients may collude with each other, while the server is not allowed to collude with malicious clients. The possible adversarial behaviors of malicious clients and the semi-honest server are summarized as follows.

  1. To perform the chosen-plaintext attack (CPA), clients insert fake plaintexts into the privacy-preserving scheme and collude with each other to share both plaintexts and their corresponding ciphertexts. A detailed description of the CPA is given in Section VI.

  2. Malicious clients do not follow the proposed encryption method to encrypt data (i.e., each client’s data should be encrypted sequentially by all the clients using commutative matrices). After getting sufficient data for CPA, malicious clients may choose to skip subsequent computations in order to reduce the computation cost.

  3. Malicious clients do not follow the decryption procedure (i.e., the encrypted model result derived by the server should be decrypted sequentially by all the clients using commutative matrices which have been used for data encryption). Once malicious clients have gathered enough information for CPA, they may choose to skip the computations and send fake data to other clients in the decryption procedure.

  4. The semi-honest server attempts to retrieve sensitive information from received ciphertexts.

C. Design goals

Our design goals contain four aspects. Privacy: The private data must remain confidential at all times. Learning verifiability: There should be a verification stage to check whether all the clients behave honestly. Correctness: The scheme derives the correct model result if all the clients and the server behave honestly. Efficiency: The scheme is computationally efficient and achieves high accuracy.

V. Proposed scheme

A. Data encryption, modeling and decryption

Suppose there are $K$ clients and client $i$ owns the data $X_i$ (i.e., plaintext) with $n_i$ samples and $q$ features $(i=1,2,\ldots,K)$. These $K$ clients collect the same $q$ features for different samples. The analytical model is built using the aggregated data, i.e., $X=\begin{pmatrix}X_1\\ \vdots\\ X_K\end{pmatrix}$. To ensure data confidentiality, we introduce a novel privacy-preserving logistic model that employs random matrices for encryption. Specifically, client $i$ encrypts $X_i$ using random encryption matrices $A_i$ and $B_i$. Aggregating these encrypted datasets, we get $X^{enc}=\begin{pmatrix}A_1X_1B_1\\ \vdots\\ A_KX_KB_K\end{pmatrix}$. Since $A_i$ and $B_i$ $(i=1,\ldots,K)$ are generated randomly by client $i$, this aggregated dataset does not preserve data utility. In order to maintain data utility, we require that 1) the $B_i$ $(i=1,\ldots,K)$ are designed to be commutative (i.e., $B_iB_j=B_jB_i$ for $i\neq j$, Appendix A), 2) $X_i$ is subsequently encrypted by every other client $j$ using $B_j$ $(j\neq i)$, and 3) the encryption matrix $A_i$ is decrypted by the server prior to the secure computation. The commutative nature of the random encryption matrices $B_i$ guarantees that the resulting encrypted dataset is independent of the order in which clients perform encryption. Clients send the encrypted data (i.e., ciphertexts) to the server after the encryption. The server then decrypts $A_i$ and obtains the aggregated data $\begin{pmatrix}X_1B\\ \vdots\\ X_KB\end{pmatrix}=XB$ where $B=\prod_{i=1}^K B_i=B_1B_2\cdots B_K$, as $B_i$ and $B_j$ are designed to be commutative (i.e., $B_iB_j=B_jB_i$ for $i\neq j$). Table II summarizes the symbols used in the proposed scheme.

Table II:

Notations

Notation Description
$K$ Number of clients (data providers)
$X_i, Y_i$ Data (plaintext) collected by client $i$ $(i=1,\ldots,K)$
$Z_i$ Transformed outcome $Z_i=Y_i^TX_i$
$n_i$ Number of samples in $X_i$ and $Y_i$
$q$ Number of features in $X_i$
$X, Y$ Aggregated data (plaintext)
$n$ Number of samples in $X$ and $Y$
$x_i$ The $i$-th record (row) in $X$
$x_{c,i}$ The $i$-th column in $X$
$W$ A diagonal matrix (Equation 4)
$A_i$ Random Gaussian matrix generated by client $i$
$A_i^{-1}$ Inverse of matrix $A_i$
$B_0$ Random Gaussian matrix shared among the $K$ clients
$b_{ij}$ Random coefficients generated by client $i$ $(j=1,\ldots,q)$
$B_i$ Commutative matrix generated by client $i$: $B_i=\sum_{j=1}^q b_{ij}B_0^j$
$B$ Encryption matrix $B=\prod_{i=1}^K B_i$
$B^{-1}$ Inverse of matrix $B$
$X^{enc}$ Encrypted $X$ (ciphertext)
$Z^{enc}$ Encrypted outcome (ciphertext)
$X^T$ Transpose of $X$
$d$ A constant to ensure indistinguishability (Algorithm 1)
$\beta$ Model estimate (a $q$-dimensional vector) of the non-secure model
$\beta^{enc}$ Model estimate (a $q$-dimensional vector) derived by the server
$\|x_i\|$ Euclidean norm of $x_i$
$d_H$ Hellinger distance
$d_{TV}$ Total variation (TV) distance
$d_{TV,low}$ Lower bound of the TV distance
$d_{TV,up}$ Upper bound of the TV distance
$p_d$ Success probability in the indistinguishability experiment
$\epsilon(q)$ A negligible function
$Y_s$ Pseudo outcome $Y_s=X^TX\mathbf{1}$
$Y_s^{enc}$ Encrypted pseudo outcome
$f_{A_i,l}(X)$ Encryption function $f_{A_i,l}(X)=A_iX$
$f_{B_0,r}(X)$ Encryption function $f_{B_0,r}(X)=XB_0$
$f_r(X)$ Encryption function $f_r(X)=X\sum_{j=1}^q b_jB_0^{j-1}$

$\mathbf{1}=(1,1,\ldots,1)^T$.

Pre-processing

Before data encryption, a pre-processing procedure is conducted by each client. Specifically, client $i$ generates a pseudo record with all values equal to 1 (i.e., $(1,1,\ldots,1)$) and adds it to $X_i$ as the first row. The added row is used for malicious behavior detection. To encrypt the outcome information (i.e., $Y_i$ collected by client $i$), client $i$ computes $Z_i=Y_i^TX_i$ and concatenates it to $X_i$ as the last row. Additionally, client $i$ calculates the Euclidean norm of each column in $X_i$. Let $c_j$ be the Euclidean norm of the $j$-th column $(j=1,\ldots,q)$ and $c_m=\max_j c_j$. Client $i$ generates a vector $c_v$ whose $j$-th element is $\sqrt{c_m^2-c_j^2}$; $c_v$ is added to $X_i$ as the last row. This procedure guarantees that the Euclidean norm of each column equals $c_m$ and is essential to ensure the indistinguishability of our encryption approach.

Without loss of generality, we include an intercept in the logistic model. Specifically, a vector of ones is added to $X_i$ $(i=1,\ldots,K)$ as the first column. To achieve indistinguishability, each client multiplies the elements in the first column by a constant $d$, where $d$ is selected by Algorithm 1 (the d-transformation). The d-transformation is performed before data encryption.
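To make the pre-processing concrete, here is a hedged sketch assuming one particular ordering of the steps (pseudo row, outcome row $Z_i$, norm-equalizing row $c_v$, then the d-transformation); the paper's exact interleaving follows Algorithms 1-2, and all names are ours.

```python
# Sketch of client i's pre-processing (Section V-A). Assumes the intercept
# column already sits in column 0 and Yi is a 1-D 0/1 label vector.
import numpy as np

def preprocess(Xi, Yi, d):
    q = Xi.shape[1]
    Zi = Yi @ Xi                                  # Z_i = Y_i^T X_i
    Xi = np.vstack([np.ones(q), Xi, Zi])          # pseudo row first, Z_i last
    c = np.linalg.norm(Xi, axis=0)                # per-column Euclidean norms
    cv = np.sqrt(c.max() ** 2 - c ** 2)           # makes every column norm c_m
    Xi = np.vstack([Xi, cv])
    Xi[:, 0] = Xi[:, 0] * d                       # d-transformation of intercept
    return Xi
```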

The proposed encryption procedures can be categorized into two layers: internal and external. The data are first encrypted by its owner internally and then subsequently encrypted by other clients, referred to as the external encryption.

Internal encryption

Client $i$ first generates a random Gaussian matrix $A_i$ to encrypt $X_i$. To improve the computational efficiency of encryption for clients with a large sample size, we construct $A_i$ as a block-diagonal matrix; the detailed description is given in Appendix B. Client $i$ shares $A_i$ with the server and encrypts the data as $A_iX_i$. Client $i$ further encrypts $A_iX_i$ using a specifically designed matrix $B_i$. To generate this $B_i$, a random $q\times q$ Gaussian matrix $B_0$ is generated and shared among the $K$ clients. Client $i$ then generates a random coefficient vector $(b_{i1},\ldots,b_{iq})$ and a client-specific matrix $B_i=\sum_{j=1}^q b_{ij}B_0^j$ $(i=1,2,\ldots,K)$. This ensures that $B_i$ (generated by client $i$) and $B_j$ (generated by client $j$, $i\neq j$) are commutative, i.e., $B_iB_j=B_jB_i$ (Appendix A). Client $i$ computes $X_i^{enc}=A_iX_iB_i$ and sends $X_i^{enc}$ to the other clients for external encryption.

External encryption

Upon receiving $X_i^{enc}=A_iX_iB_i$ from client $i$, client $i+1$ further encrypts it using $B_{i+1}$ (i.e., $X_i^{enc}=A_iX_iB_iB_{i+1}$) and sends the updated ciphertext to client $i+2$. Client $i+2$ then encrypts the ciphertext using $B_{i+2}$ and sends it to client $i+3$. After all $K-1$ other clients complete the external encryption, the ciphertext has the form $X_i^{enc}=A_iX_iB$ where $B=\prod_{i=1}^K B_i=B_1B_2\cdots B_K$. $X_i^{enc}$ is then sent to the server.
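A compact sketch of the two encryption layers for $K$ clients follows; make_Bi builds the commutative matrix polynomial of Appendix A, and all sizes, names, and the seed are illustrative assumptions (in the real scheme $A_i$ is block-diagonal, per Appendix B).

```python
# Sketch of the internal layer (A_i X_i B_i) and the external layer
# (multiplication by the remaining clients' B_j) of Section V-A.
import numpy as np

rng = np.random.default_rng(1)
q, K = 5, 3
B0 = rng.standard_normal((q, q))                  # shared among clients

def make_Bi(B0, b):
    Bi, P = np.zeros_like(B0), np.eye(B0.shape[0])
    for bj in b:                                   # B_i = sum_j b_j B0^j
        P = P @ B0
        Bi += bj * P
    return Bi

Bs = [make_Bi(B0, rng.standard_normal(q)) for _ in range(K)]
Xs = [rng.standard_normal((8, q)) for _ in range(K)]   # pre-processed data
As = [rng.standard_normal((8, 8)) for _ in range(K)]

ciphertexts = []
for i in range(K):
    C = As[i] @ Xs[i] @ Bs[i]                     # internal: A_i X_i B_i
    for j in range(K):                             # external: remaining B_j
        if j != i:
            C = C @ Bs[j]
    ciphertexts.append(C)   # = A_i X_i B with B = prod_j B_j (order-free)
```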

To build the ridge regression model, client $i$ computes $B_i^TB_i$ and sends it to client $i+1$. Client $i+1$ calculates $B_{i+1}^TB_i^TB_iB_{i+1}$ and sends it to client $i+2$. Each of the $K$ clients conducts this encryption sequentially. Once all clients complete the encryption, $B_K^T\cdots B_1^TB_1\cdots B_K=B^TB$ is sent to the server for the ridge model computation.

Algorithm 2 describes the detailed encryption procedures and Figure 2 gives an example of the encryption procedures with three clients. Table III summarizes the internal and external encryption procedures. The primary goal of the internal encryption layer is to protect against malicious adversaries. To preserve data utility, the data are further encrypted by the other clients using commutative matrices in the external encryption layer.

Fig. 2: An example showing the proposed encryption procedures, including data transformation details, with three clients. $X_i$ denotes the data collected by client $i$ $(i=1,\ldots,K=3)$. $Y_i^TX_i$ is integrated into $X_i$ prior to the data encryption. Data are encrypted by all the clients sequentially. The arrows connect the origin and endpoint of each data transmission.

Table III:

Encryption details for the plaintext Xi (owned by client i)

Encryption layer Client Rationale Affected by malicious adversaries? Encryption matrix
Internal i Withstand malicious adversaries No Ai,Bi
External $1,2,\ldots,i-1,i+1,\ldots,K$ Preserve data utility Yes $\prod_{j\neq i}B_j$

Modeling

Upon receiving $X^{enc}=\begin{pmatrix}A_1X_1B\\ \vdots\\ A_KX_KB\end{pmatrix}$ (and $B^TB$ for ridge regression), the server decrypts $A_i$ to get $\begin{pmatrix}X_1B\\ \vdots\\ X_KB\end{pmatrix}=XB$. For the subsequent analysis, the server redefines $X^{enc}\equiv XB$ and further eliminates the pseudo records in $X^{enc}$. The detailed procedures are described in Algorithm 3. The server retrieves the encrypted outcome from $XB$ and denotes it as $Z_i^{enc}$. According to the encryption procedure, $Z_i^{enc}=Y_i^TX_iB$. The server then derives the $q$-dimensional vector $Z^{enc}\equiv\sum_{i=1}^K Z_i^{enc}$. Given the encrypted data, the Newton update becomes

$$\beta_{\text{new}}^{enc}=\big(X_{enc}^TW_{enc}X_{enc}+\Lambda B^TB\big)^{-1}\times\big[X_{enc}^TW_{enc}X_{enc}\beta_{\text{old}}^{enc}+(Z^{enc})^T-X_{enc}^Tp_{enc}\big] \quad (11)$$

where $p_{enc}=(p(x_1;\beta_{\text{old}}^{enc}),\ldots,p(x_n;\beta_{\text{old}}^{enc}))^T$,

$$p(x_i;\beta_{\text{old}}^{enc})=p(x_i^{enc};\beta_{\text{old}}^{enc})=\frac{\exp((x_i^{enc})^T\beta_{\text{old}}^{enc})}{1+\exp((x_i^{enc})^T\beta_{\text{old}}^{enc})} \quad (12)$$

for $i=1,\ldots,n$, $W_{enc}$ is an $n\times n$ diagonal matrix with the $i$-th diagonal element being $p(x_i;\beta_{\text{old}}^{enc})(1-p(x_i;\beta_{\text{old}}^{enc}))$, and $\Lambda$ is a matrix of zeros for the non-regularized logistic model while $\Lambda$ is a diagonal matrix with diagonal elements $\{0,\lambda,\ldots,\lambda\}$ for ridge regression. The server iterates Equation 11 until convergence (e.g., the absolute difference between $\beta_{\text{new}}^{enc}$ and $\beta_{\text{old}}^{enc}$ falls below $10^{-6}$).

Theorem 1. The privacy-preserving logistic model converges as long as the non-secure logistic model converges.

Proof. Let $\beta_{(0)}^{enc}$ denote the initial point of Newton's method within the privacy-preserving logistic model. According to Theorem 9 (Appendix C), this is equivalent to setting $B\beta_{(0)}^{enc}$ as the initial point within the non-secure model. Given the initial point $B\beta_{(0)}^{enc}$, suppose the non-secure model converges after $s$ iterations with model estimate $\beta$. Based on Theorem 9 (Appendix C), our privacy-preserving model also converges after $s$ iterations with model estimate $\beta^{enc}=B^{-1}\beta$. □

Decryption

The server sends the converged model estimate $\beta^{enc}$ to the clients. As shown in Theorem 9 (Appendix C), $\beta^{enc}=\big(\prod_{i=1}^K B_i\big)^{-1}\beta$ where $\beta$ is the estimate of the non-secure model. To get the true model estimate $\beta$, client $i$ $(i=1,\ldots,K)$ uses its encryption matrix $B_i$ to decrypt $\beta^{enc}$. Figure 3 shows the detailed decryption procedure. $\beta=\big(\prod_{i=1}^K B_i\big)\beta^{enc}$ is the result once all clients complete the decryption.
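Continuing the earlier sketch, the server-side removal of $A_i$ and the clients' sequential decryption might look as follows (function names are ours; the order of the $B_i$ products is irrelevant because they commute).

```python
# Sketch: the server strips A_i to obtain the working data XB, and clients
# later multiply by their B_i to recover beta = (prod_i B_i) beta_enc.
import numpy as np

def server_strip_Ai(ciphertexts, As):
    # X_i B = A_i^{-1} (A_i X_i B); stacking the blocks gives XB.
    return np.vstack([np.linalg.solve(A, C) for A, C in zip(As, ciphertexts)])

def clients_decrypt(beta_enc, Bs):
    beta = beta_enc
    for Bi in Bs:                  # each client applies its own B_i in turn
        beta = Bi @ beta
    return beta                    # equals (prod_i B_i) beta_enc
```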

Fig. 3: The decryption procedure. Client $i$ decrypts $\beta^{enc}$ sequentially $(i=1,\ldots,K)$. The data above each arrow are those transferred between clients.

B. Multiclass classification

Our scheme can be modified to solve the multiclass classification problem. Suppose the outcome contains a total of $e_y$ classes. Client $i$ defines $e_y$ sub-outcomes $Y_i^{(1)},Y_i^{(2)},\ldots,Y_i^{(e_y)}$ using the indicator function $1_j(Y_i)$ as follows:

$$Y_i^{(j)}=1_j(Y_i)=\begin{cases}1 & Y_i=j,\\ 0 & Y_i\neq j,\end{cases}\quad\text{for } j=1,\ldots,e_y. \quad (13)$$

Following the scheme described above, client $i$ calculates $Z_i^{(j)}=(Y_i^{(j)})^TX_i$ $(j=1,\ldots,e_y)$. Before data encryption, client $i$ adds $Z_i^{(1)},\ldots,Z_i^{(e_y)}$ to the feature matrix $X_i$ as the last $e_y$ rows. Define the sub-outcome for the $j$-th class as $Z^{(j)}=\sum_{i=1}^K(Y_i^{(j)})^TX_i$ for $j=1,\ldots,e_y$. After data encryption, the server performs the secure logistic model computation (Algorithm 3) for each of the $e_y$ sub-outcomes using each pair of feature matrix and outcome $(X^{enc},Z^{(j),enc})$ for $j=1,\ldots,e_y$.
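The one-vs-rest reduction in Equation 13 takes only a few lines in practice. The sketch below is ours and assumes labels coded $1,\ldots,e_y$; it builds the $e_y$ rows $Z_i^{(j)}$ that are appended before encryption.

```python
# Sketch of Equation 13: build the e_y sub-outcome rows Z_i^(j) = (Y_i^(j))^T X_i.
import numpy as np

def sub_outcomes(Xi, Yi, ey):
    # Yi holds integer labels 1..ey; row j-1 of the result is Z_i^(j).
    return np.stack([(Yi == j).astype(float) @ Xi for j in range(1, ey + 1)])
```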

C. Lossy compression with SZ

To reduce the communication cost, we employ the SZ lossy compression technique [55] for data compression. SZ is an error-bounded lossy compression scheme [55]–[57]. In our scheme, the data are compressed before being transferred between clients and the server.

D. Verification: malicious behavior detection

To identify if any client has conducted malicious behavior, a designated pseudo outcome Ys is subjected to the same procedures as the original outcome Y. Assuming all clients adhere to our scheme for both Y and Ys, predetermined outputs are anticipated after the verification stage.

Specifically, client $i$ defines a constant $\tau_i\equiv Y_i^TX_i\mathbf{1}$ where $\mathbf{1}=(1,1,\ldots,1)^T$ and shares $\tau_i$ with the server. For the verification, client $i$ generates a pseudo outcome $Y_{s_i}\equiv X_i^TX_i\mathbf{1}$. $Y_s$ is the sum of the $Y_{s_i}$, i.e., $Y_s\equiv\sum_{i=1}^K Y_{s_i}$, and can be expressed as $Y_s=X^TX\mathbf{1}$. Client $i$ further encrypts $Y_{s_i}$ using $B_i$ (i.e., $Y_{s_i}^{enc}=B_i^TY_{s_i}$) and sends the ciphertext $Y_{s_i}^{enc}$ to the other clients. $Y_{s_i}^{enc}$ is subsequently encrypted by client $j$ using $B_j$ $(j=1,\ldots,i-1,i+1,\ldots,K)$. After all the clients have completed the encryption process, $Y_{s_i}^{enc}$ is shared with the server. Upon receiving $Y_{s_i}^{enc}$, the server verifies whether $Z_i^{enc}\big((X_i^{enc})^TX_i^{enc}\big)^{-1}Y_{s_i}^{enc}=\tau_i$ where $Z_i^{enc}=Y_i^TX_iB$. If any client exhibits malicious behavior during the encryption process, the equation will not hold (Theorem 2). Since $(X^TX)^{-1}X^TX\mathbf{1}=\mathbf{1}$, we design the following process to verify whether the clients follow the proposed decryption procedure. The server first calculates the sum of the encrypted pseudo outcomes (i.e., $Y_s^{enc}\equiv\sum_{i=1}^K Y_{s_i}^{enc}$) and the estimate $\beta_s^{enc}\equiv(X_{enc}^TX_{enc})^{-1}Y_s^{enc}$. $\beta_s^{enc}$ can be simplified as follows:

$$\beta_s^{enc}=(X_{enc}^TX_{enc})^{-1}Y_s^{enc}=(X_{enc}^TX_{enc})^{-1}\sum_{i=1}^K Y_{s_i}^{enc}=(X_{enc}^TX_{enc})^{-1}\sum_{i=1}^K B^TX_i^TX_i\mathbf{1}=\big((XB)^TXB\big)^{-1}B^TX^TX\mathbf{1}=B^{-1}\mathbf{1}. \quad (14)$$

The server shares $\beta_s^{enc}$ with the client who performs the verification (e.g., client 1). To confirm that no malicious behavior occurred within the decryption process, client 1 combines $\beta_s^{enc}$ with $\beta^{enc}$. Specifically, client 1 generates two random constants, $\alpha_1$ and $\alpha_2$, and defines a new estimate $\tilde{\beta}^{enc}\equiv\alpha_1\beta^{enc}+\alpha_2\beta_s^{enc}$. $\tilde{\beta}^{enc}$ is decrypted by all the clients following the procedure in Figure 3. Let $\tilde{\beta}$ be the decrypted estimate. Upon obtaining $\tilde{\beta}$, client 1 calculates $\beta_s\equiv\frac{1}{\alpha_2}\big(\tilde{\beta}-\alpha_1B\beta^{enc}\big)$ where $B\beta^{enc}$ is the decrypted model estimate. $\beta_s$ is expected to be a vector of ones if all the clients correctly decrypt both $\tilde{\beta}^{enc}$ and $\beta^{enc}$ following the proposed decryption procedure (Theorem 3). Algorithm 4 summarizes the verification process.
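Both verification steps reduce to small linear-algebra tests. A hedged sketch follows (function names and tolerances are ours; the paper's exact procedure is Algorithm 4).

```python
# Sketch of the two verification checks: the server-side encryption check
# Z_i^enc ((X_i^enc)^T X_i^enc)^{-1} Y_si^enc = tau_i (Equation 17), and the
# client-side decryption check that beta_s is a vector of ones (Equation 20).
import numpy as np

def server_check_encryption(Zi_enc, Xi_enc, Ysi_enc, tau_i, atol=1e-6):
    G = Xi_enc.T @ Xi_enc
    lhs = Zi_enc @ np.linalg.solve(G, Ysi_enc)
    return np.isclose(lhs, tau_i, atol=atol)

def client_check_decryption(beta_tilde, B_beta_enc, a1, a2, atol=1e-6):
    beta_s = (beta_tilde - a1 * B_beta_enc) / a2
    return np.allclose(beta_s, 1.0, atol=atol)   # expect a vector of ones
```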

In order to preserve data utility, all the clients need to follow three encryption criteria. We use a case study involving two clients to illustrate these criteria; the encryption procedures are shown in Figure 4. First, clients must employ uniform encryption matrices to encrypt datasets owned by other clients, which implies that $B_3=B_2$ and $B_4=B_1$ in the example. Second, these encryption matrices must be commutative with each other, i.e., $B_1B_2=B_2B_1$. Third, clients must use uniform inputs across the entire encryption process; more precisely, no client may alter the data owned by other clients during data encryption. To violate the third criterion, client 2 may selectively encrypt specific rows within the dataset $A_1X_1B_1$ or substitute $A_1X_1B_1$ with fake data prior to transmitting it to the server.

Fig. 4: An example of the proposed encryption procedures with two clients. $X_i$ denotes the data collected by client $i$ $(i=1,2)$. $Y_i^TX_i$ is integrated into $X_i$ prior to the data encryption. The internal encryption is performed by each client, which encrypts its own data internally before transmitting it to the other client. The external encryption is then conducted by the other client. Finally, the server decrypts $A_i$ from the received data and retrieves the encrypted outcome information.

Let $\tilde{X}_1^{enc}$ and $\tilde{X}_2^{enc}$ denote the ciphertexts transmitted to the server from client 2 and client 1, respectively. The server then decrypts $A_i$ and extracts the encrypted outcome $\tilde{Z}_i^{enc}$ $(i=1,2)$. If all the clients adhere to these three criteria, the server obtains the ciphertexts in Equations 15 and 16 (C-1, C-2, and C-3 refer to criteria 1 through 3).

$$X^{enc}=\begin{pmatrix}\tilde{X}_1^{enc}\\ \tilde{X}_2^{enc}\end{pmatrix}\overset{\text{C-3}}{=}\begin{pmatrix}A_1X_1B_1B_3\\ A_2X_2B_2B_4\end{pmatrix}\overset{\text{C-1}}{=}\begin{pmatrix}A_1X_1B_1B_2\\ A_2X_2B_2B_1\end{pmatrix}\overset{\text{C-2}}{=}\begin{pmatrix}A_1X_1B_1B_2\\ A_2X_2B_1B_2\end{pmatrix}=\begin{pmatrix}A_1X_1\\ A_2X_2\end{pmatrix}B_1B_2\xrightarrow{\text{Decryption}}\begin{pmatrix}X_1\\ X_2\end{pmatrix}B_1B_2. \quad (15)$$
$$Z^{enc}=\begin{pmatrix}\tilde{Z}_1^{enc}\\ \tilde{Z}_2^{enc}\end{pmatrix}\overset{\text{C-3}}{=}\begin{pmatrix}Y_1^TX_1B_1B_3\\ Y_2^TX_2B_2B_4\end{pmatrix}\overset{\text{C-1}}{=}\begin{pmatrix}Y_1^TX_1B_1B_2\\ Y_2^TX_2B_2B_1\end{pmatrix}\overset{\text{C-2}}{=}\begin{pmatrix}Y_1^TX_1B_1B_2\\ Y_2^TX_2B_1B_2\end{pmatrix}=\begin{pmatrix}Y_1^TX_1\\ Y_2^TX_2\end{pmatrix}B_1B_2. \quad (16)$$

To preserve data utility, 1) $X_i$ and $X_j$ $(i\neq j)$ need to be encrypted by the same encryption matrix $\prod_{k=1}^K B_k$, and 2) $X_i$ and $Y_i^TX_i$ $(i=1,\ldots,K)$ need to be encrypted by the same encryption matrix $\prod_{k=1}^K B_k$. Malicious behavior by any client during data encryption results in a breach of one or both of these two requirements, thereby destroying the utility of the data.

Theorem 2. The proposed verification algorithm can identify malicious behavior during the encryption process.

Proof. Firstly, the server verifies whether $X_i$ and $X_j$ $(i\neq j)$ are encrypted by the same encryption matrix by checking whether the first rows in $X_i^{enc}$ and $X_j^{enc}$ are identical. To meet this requirement, all clients must adhere to the three encryption criteria indicated in Equations 15 and 16. Secondly, the server checks whether $Z_i^{enc}\big((X_i^{enc})^TX_i^{enc}\big)^{-1}Y_{s_i}^{enc}$ equals $\tau_i$ to ascertain whether $X_i$ and $Y_i^TX_i$ have been encrypted using the same encryption matrix. Upon receiving $Z_i^{enc}=Y_i^TX_iB$ and $X_i^{enc}=X_iB'$, the server verifies whether $B=B'$ by calculating

$$Z_i^{enc}\big((X_i^{enc})^TX_i^{enc}\big)^{-1}Y_{s_i}^{enc}=Y_i^TX_iB\big((X_iB')^TX_iB'\big)^{-1}B'^TY_{s_i}=Y_i^TX_iBB'^{-1}(X_i^TX_i)^{-1}(B'^T)^{-1}B'^TX_i^TX_i\mathbf{1}\overset{B=B'}{=}Y_i^TX_i\mathbf{1}=\tau_i \quad (17)$$

for $i=1,\ldots,K$. The server affirms the absence of malicious activity by validating the fulfillment of the two conditions mentioned above. □

In the decryption procedure, client $i$ should use its encryption matrix $B_i$ to decrypt $\beta^{enc}$ (the output of Algorithm 3); we refer to this as the decryption criterion.

Theorem 3. The proposed verification algorithm can identify whether a client violates the decryption criterion.

Proof. In our scheme, both $\beta^{enc}$ and $\tilde{\beta}^{enc}$ need to be decrypted following the decryption procedures. Let $B'\beta^{enc}$ and $B''\tilde{\beta}^{enc}$ be the decrypted data of $\beta^{enc}$ and $\tilde{\beta}^{enc}$, respectively. Since

$$\tilde{\beta}^{enc}\equiv\alpha_1\beta^{enc}+\alpha_2\beta_s^{enc}, \quad (18)$$

we have

$$\tilde{\beta}\equiv B''\tilde{\beta}^{enc}=\alpha_1B''\beta^{enc}+\alpha_2B''\beta_s^{enc}. \quad (19)$$

Suppose client $i$ $(i=1,\ldots,K)$ follows the proposed decryption procedures to decrypt both $\beta^{enc}$ and $\tilde{\beta}^{enc}$; then $B'$ and $B''$ should be identical (i.e., $B'=B''=B$ where $B=\prod_{i=1}^K B_i$). So

$$\beta_s\equiv\frac{1}{\alpha_2}\big(\tilde{\beta}-\alpha_1B'\beta^{enc}\big)=\frac{1}{\alpha_2}\big(\alpha_1B''\beta^{enc}+\alpha_2B''\beta_s^{enc}-\alpha_1B'\beta^{enc}\big)\overset{B'=B''=B}{=}B\beta_s^{enc}. \quad (20)$$

According to Equation 14,

$$B\beta_s^{enc}=B(X_{enc}^TX_{enc})^{-1}Y_s^{enc}=\mathbf{1}. \quad (21)$$

Based on Equations 20 and 21, $\beta_s$ is expected to be a vector of ones if no malicious activity occurs in the decryption procedures. Therefore our verification stage can identify whether a client breaks the decryption criterion. □

VI. Security analysis

The encryption matrices $A_i$ and $B_i$ may be recovered if the ciphertexts of different plaintexts are distinguishable. Once an encryption matrix is recovered, a client can recover other clients' data. Potential attacks to achieve such goals include the CPA, KPA, COA and CCA [34]–[36], [40] described in Section III-C. It is impossible to perform a CCA against our scheme because adversaries cannot obtain the decryption of any ciphertext. Since the CPA is more threatening than the KPA and COA, a scheme that protects against the CPA is also resilient to the KPA and COA.

A CPA [37], [52]–[54] is realistic in our scheme. Consider a strong threat model in which all clients except one can be compromised in a collusion attack (Figure 5). Suppose client 1 is the only honest client. In the external encryption layer, client $i$ $(i\neq 1)$ generates fake data $X_i$ and sends it to client 1 for encryption. Client 1 uses $B_1$ to encrypt $X_i$ and returns the ciphertext to the other clients for further encryption. During this process, the colluding clients share the ciphertexts received from client 1 and are able to match each plaintext $X_i$ with its ciphertext $X_iB_1$. In a collusion attack, the colluding clients cooperate as a group to share plaintexts and ciphertexts with each other, so the colluding group is able to insert arbitrary plaintexts and obtain the corresponding ciphertexts for the purpose of a CPA. The colluding group will try to first recover the encryption matrix $B_1$ and then retrieve the plaintext $X_1$ owned by client 1.

Fig. 5: An example of the strong threat model in which all the clients except one can be compromised. Suppose client 1 is the only honest client and $B_1$ denotes its commutative matrix for data encryption. $X_i$ denotes the plaintext from client $i$ $(i=2,\ldots,K)$.

To be resilient to CPA, the encrypted data in our privacy-preserving model should have indistinguishability for any random plaintexts. In this section, we demonstrate that the ciphertexts of two arbitrary plaintexts are indistinguishable in our privacy-preserving scheme.

Define the encryption functions $f_{M,l}(X)\equiv MX$ and $f_{M,r}(X)\equiv XM$ where $M$ is a random Gaussian matrix, i.e., each element of $M$ follows a Gaussian distribution. Since $B_i=B_0\sum_{j=1}^q b_jB_0^{j-1}$, the internal encryption function $f(X_i)=A_iX_iB_i$ can be split into three sub-functions. Specifically, $f(X_i)=A_iX_iB_i=A_iX_iB_0\sum_{j=1}^q b_jB_0^{j-1}=f_r(f_{B_0,r}(f_{A_i,l}(X_i)))$ where $f_{A_i,l}(X)=A_iX$, $f_{B_0,r}(X)=XB_0$, and $f_r(X)=X\sum_{j=1}^q b_jB_0^{j-1}$.

The clients have access to the ciphertext $A_iX_iB$. In contrast, the server receives the encryption matrix $A_i$ from client $i$ $(i=1,\ldots,K)$ and derives the ciphertext $X_iB=A_i^{-1}(A_iX_iB)$. $A_i$ is only used in the internal encryption layer, and the function $f_{A_i,l}(X)=A_iX$ is employed to ensure that clients cannot conduct an effective CPA. We first prove that records in $A_iX_iB$ are indistinguishable to the clients (Section VI-A) and then demonstrate that records in $X_iB$ are indistinguishable to the server (Section VI-B).

A. Indistinguishability of $A_iX_iB$: security against clients

Theorem 4. Given the Gaussian matrix encryption function $f_{M,l}(X)=MX$, where $M$ is a random Gaussian matrix, the worst-case lower and upper bounds on $d_{TV}(p_1,p_2)$ are

$$d_{TV,low}=1-\left(\frac{2\|x_{c,1}\|\|x_{c,2}\|}{\|x_{c,1}\|^2+\|x_{c,2}\|^2}\right)^{q/2}, \quad (22)$$
$$d_{TV,up}=\sqrt{1-\left(\frac{2\|x_{c,1}\|\|x_{c,2}\|}{\|x_{c,1}\|^2+\|x_{c,2}\|^2}\right)^{q}}, \quad (23)$$

where $p_1=\Pr(z\mid x_{c,1})$, $p_2=\Pr(z\mid x_{c,2})$ and $x_{c,h}$ $(h=1,2)$ are two arbitrary columns in $X$.

Proof. Based on the proof of [[35], Lemma 1], the covariance matrix of $z_h=Mx_{c,h}$ conditioned on the plaintext $x_{c,h}$ is $C_h=\|x_{c,h}\|^2I$ where $\|x_{c,h}\|$ denotes the Euclidean norm of $x_{c,h}$ and $I$ is the identity matrix. Therefore, $C_3\equiv\frac{C_1+C_2}{2}=\frac{\|x_{c,1}\|^2+\|x_{c,2}\|^2}{2}I$. Because $C_1$, $C_2$ and $C_3$ are diagonal matrices, their determinants are

$$|C_h|=\|x_{c,h}\|^{2q},\quad h\in\{1,2\}, \quad (24)$$

and

$$|C_3|=\left(\frac{\|x_{c,1}\|^2+\|x_{c,2}\|^2}{2}\right)^{q}. \quad (25)$$

So

$$d_H(p_1,p_2)=\sqrt{1-\frac{|C_1|^{1/4}|C_2|^{1/4}}{|C_3|^{1/2}}}=\sqrt{1-\left(\frac{2\|x_{c,1}\|\|x_{c,2}\|}{\|x_{c,1}\|^2+\|x_{c,2}\|^2}\right)^{q/2}}. \quad (26)$$

According to the inequality relating the Hellinger distance and the TV distance (Equation 9), the lower and upper bounds of the TV distance follow as

$$d_{TV,low}=d_H^2(p_1,p_2)=1-\left(\frac{2\|x_{c,1}\|\|x_{c,2}\|}{\|x_{c,1}\|^2+\|x_{c,2}\|^2}\right)^{q/2},\quad d_{TV,up}=d_H(p_1,p_2)\sqrt{2-d_H^2(p_1,p_2)}=\sqrt{1-\left(\frac{2\|x_{c,1}\|\|x_{c,2}\|}{\|x_{c,1}\|^2+\|x_{c,2}\|^2}\right)^{q}}. \quad (27)$$

□

Theorem 5. Given the Gaussian matrix encryption function $f_{M,r}(X)=XM$, where $M$ is a random Gaussian matrix, the worst-case lower and upper bounds on $d_{TV}(p_1,p_2)$ are

$$d_{TV,low}=1-\left(\frac{2\|x_1\|\|x_2\|}{\|x_1\|^2+\|x_2\|^2}\right)^{q/2}, \quad (28)$$
$$d_{TV,up}=\sqrt{1-\left(\frac{2\|x_1\|\|x_2\|}{\|x_1\|^2+\|x_2\|^2}\right)^{q}}, \quad (29)$$

where $p_1=\Pr(z\mid x_1)$, $p_2=\Pr(z\mid x_2)$ and $x_h$ $(h=1,2)$ are two arbitrary rows in $X$.

Proof. The proof is identical to that of Theorem 4. □

Corollary 1. The success probability of an adversary in the indistinguishability experiment is bounded by

$$p_d\leq\frac{1}{2}+\frac{1}{2}\sqrt{1-\left(\frac{2\|x_1\|\|x_2\|}{\|x_1\|^2+\|x_2\|^2}\right)^{q}}. \quad (30)$$

If each plaintext has constant Euclidean norm (i.e., $\|x_1\|=\|x_2\|$ for two random records $x_1$ and $x_2$), the cryptosystem has indistinguishability since $p_d\leq 0.5$.

Corollary 1 ensures that no adversary can learn any partial information about the plaintext from a given ciphertext, as long as each plaintext has constant Euclidean norm. Because the Euclidean norms of all the columns in the plaintext $X_i$ are designed to be the same (Section V-A), any two arbitrary columns in $A_iX_i$ are indistinguishable.

Theorem 6. The TV distance does not increase under the encryption functions $f_{B_0,r}$ and $f_r$. In other words, $d_{TV,up}(\tilde{p}_1,\tilde{p}_2)\leq d_{TV,up}(p_1,p_2)$ where $\tilde{p}_h$ and $p_h$ denote the probability distributions of a random row in $A_iX_hB_i$ and $A_iX_h$, respectively $(h\in\{1,2\})$.

Proof. The Hellinger distance can be expressed as a function of the Rényi divergence [58], i.e.,

$$d_H(p_1,p_2)=\sqrt{1-e^{-\frac{1}{2}D_{1/2}(p_1\|p_2)}}, \quad (31)$$

where $D_{1/2}(p_1\|p_2)$ denotes the Rényi divergence of order $1/2$ between $p_1$ and $p_2$. Based on the data processing inequality [[58], Theorem 1], $D_{1/2}(\tilde{p}_1\|\tilde{p}_2)\leq D_{1/2}(p_1\|p_2)$, so $d_H(\tilde{p}_1,\tilde{p}_2)\leq d_H(p_1,p_2)$. Because $0\leq d_H(p_1,p_2)\leq 1$ and the function $u\sqrt{2-u^2}$ is monotonically increasing on $0\leq u\leq 1$,

$$d_{TV}(\tilde{p}_1,\tilde{p}_2)\leq d_H(\tilde{p}_1,\tilde{p}_2)\sqrt{2-d_H^2(\tilde{p}_1,\tilde{p}_2)}\leq d_H(p_1,p_2)\sqrt{2-d_H^2(p_1,p_2)}. \quad (32)$$

In other words, $d_{TV,up}(\tilde{p}_1,\tilde{p}_2)\leq d_{TV,up}(p_1,p_2)$. □

Corollary 2. Clients are unable to learn any partial information about $X_i$ in polynomial time within our privacy-preserving scheme, thereby ensuring that our scheme is resilient to the CPA conducted by colluding clients.

Proof. According to Theorems 4 and 6 and Corollary 1, the internal encryption function $f(X_i)=A_iX_iB_i$ is indistinguishable. Theorem 6 indicates that the TV distance does not increase under the external encryption layer, and thus $A_iX_iB$ is indistinguishable where $B=\prod_{i=1}^K B_i$. Therefore, clients cannot perform an effective CPA to learn the sensitive information of other clients. □

B. Indistinguishability of $X_iB$: security against the server

We further illustrate that any two arbitrary rows in $X_iB$ are indistinguishable. In the initial model training process (Algorithm 3), the server decrypts $A_i$ from $A_iX_iB$ and thus has access to $X_iB$. With the indistinguishability of the encryption function $f(X_i)=X_iB$, the server cannot learn any partial information about $X_i$ in polynomial time (Corollary 3).

Theorem 7. Given γ=x1x2(x1 and x2 are two arbitrary rows in Xi) and a negligible function ϵ(q),x1B and x2B are indistinguishable if γ satisfies γ+1γ21-4ϵ2(q)-1/q where q is the number of features.

Proof. The encryption function fXi=XiB=XiB0j=1qbjB0(j-1) can be split into 2 sub-functions, i.e., fB0,r(X)=XB0 and fr(X)=Xj=1qbjB0(j-1). According to Theorem 5 and the data processing inequality [[58], Theorem 1], the upper bound of the TV distance between Px1Bx1 and Px2Bx2 is dTV,up=1-2γγ2+1q. Indistinguishability requires that pd12+ϵ(q). To achieve this, we require that dTV,up2ϵ(q). So 1-4ϵ2(q)1/q2γγ2+1 for a given q. This leads to γ+1γ21-4ϵ2(q)-1/q. □

We propose a data transformation method, the "d-transformation", to ensure indistinguishability for arbitrary records, irrespective of whether they possess a consistent Euclidean norm. Assume $\|x_1\|<\|x_2\|$; then $\frac{2\gamma}{\gamma^2+1}$ is monotonically increasing on $0<\gamma<1$, so a decrease in $\gamma$ leads to an increase in $d_{TV,up}=\sqrt{1-\left(\frac{2\gamma}{\gamma^2+1}\right)^q}$. To keep $d_{TV,up}$ within an acceptable range, the Euclidean norms of any two arbitrary records need to be close to each other (i.e., $\gamma\approx 1$). To achieve this, we perform the d-transformation on each record. More precisely, we use a vector of the constant $d$ instead of a vector of ones as the intercept in $X_i$; the first element of $x_1$ and $x_2$ becomes $d$ instead of 1. As presented in Figure 6A, a large $d$ ensures that $d_{TV,up}$ is close to 0 for a fixed $q$. For fixed $d$, $d_{TV,up}$ increases as $q$ rises (Figure 6B).

Fig. 6: A: $d_{TV,up}$ given $q=100$ and different values of $d$. B: $d_{TV,up}$ given $d=2{,}000$ and different values of $q$. $\gamma=\frac{\|x_1\|}{\|x_2\|}$ and $0<\gamma<1$.

Theorem 8. The d-transformation ensures indistinguishability for any $q$ and $\gamma$.

Proof. As described in Theorem 7, our scheme achieves indistinguishability depending on $\gamma$. Since $\gamma+\frac{1}{\gamma}$ is monotonically decreasing on $0<\gamma<1$, there exists a minimum threshold $\underline{\gamma}$ such that $\gamma+\frac{1}{\gamma}\leq 2\big(1-4\epsilon^2(q)\big)^{-1/q}$ for any $\gamma>\underline{\gamma}$. Let $\gamma_{original}=\frac{\|x_1\|}{\|x_2\|}$ where $x_1$ and $x_2$ are two random records in the plaintext. Define $\gamma_{new}=\frac{\|\tilde{x}_1\|}{\|\tilde{x}_2\|}$ where $\tilde{x}_1$ and $\tilde{x}_2$ are the d-transformed $x_1$ and $x_2$, respectively. So we have $\gamma_{new}\geq\sqrt{\frac{\gamma_{original}^2+d^2}{1+d^2}}$. For $0<\gamma<1$, $\gamma_{new}$ increases as $d$ goes up. So there exists a constant $d$ such that $\gamma_{new}\approx 1$, implying that $\gamma_{new}+\frac{1}{\gamma_{new}}\leq 2\big(1-4\epsilon^2(q)\big)^{-1/q}$ for any given $q$ and $\gamma$. To conclude, the d-transformation guarantees indistinguishability for any $q$ and $\gamma$. □

Considering that $\epsilon(q)$ (e.g., $\epsilon(q)\equiv 2^{-q}$) can be large for small $q$, we set an upper bound for $\epsilon(q)$ (e.g., $\epsilon(q)=\min(10^{-5},2^{-q})$). As shown in Figure 6B, $d_{TV,up}$ increases as $q$ goes up. Given $\epsilon(q)=\min(10^{-5},2^{-q})$, $d=2{,}000$ is sufficient to ensure indistinguishability when $q\leq 20$ (Figure 6B). For a larger $q$, clients follow Algorithm 1 to select a constant $d$ such that the encryption function in our scheme achieves indistinguishability.
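Algorithm 1 is not reproduced here, but the selection logic implied by Theorem 7 can be sketched as follows; the doubling search and the default cap on $\epsilon(q)$ are our own assumptions.

```python
# Sketch of d-selection per Theorem 7: grow d until
# gamma + 1/gamma <= 2 (1 - 4 eps(q)^2)^(-1/q) holds for the two most
# dissimilar record norms, which guarantees d_TV,up <= 2 eps(q).
import numpy as np

def select_d(row_norms, q, eps=None, d0=1.0):
    eps = eps if eps is not None else min(1e-5, 2.0 ** (-q))
    bound = 2.0 * (1.0 - 4.0 * eps**2) ** (-1.0 / q)
    lo, hi = min(row_norms), max(row_norms)
    d = d0
    while True:
        gamma = np.sqrt((lo**2 + d**2) / (hi**2 + d**2))   # worst-case ratio
        if gamma + 1.0 / gamma <= bound:
            return d
        d *= 2.0   # enlarge d; gamma -> 1, so the loop terminates
```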

Corollary 3. With the d-transformation (Algorithm 1), the encryption function $f(X_i)=X_iB$ ensures indistinguishability among any arbitrary records in $X_i$. This demonstrates that the server cannot learn any partial information about $X_i$ in polynomial time.

Proof. Given any negligible function $\epsilon(q)$ and data $X_i$, Algorithm 1 selects a constant $d$ such that $\gamma+\frac{1}{\gamma}\leq 2\big(1-4\epsilon^2(q)\big)^{-1/q}$ where $\gamma=\min_{j,k}\sqrt{\frac{\|x_j\|^2+d^2}{\|x_k\|^2+d^2}}$ ($x_j$ and $x_k$ are two arbitrary records in $X_i$). With the d-transformation, any two arbitrary records in $X_i$ are indistinguishable (Theorem 8). □

The d-transformation (i.e., multiplication of the intercept by a constant $d$) does not alter the logistic model estimates, except for the intercept estimate. After multiplying the intercept by $d$, the dataset becomes $X_{new}=XI_D$, where $I_D$ represents a diagonal matrix with diagonal elements $\{d,1,1,\ldots,1\}$. Based on Equation 6, the model estimate for $X_{new}$ becomes $\beta_{new}=I_D^{-1}\beta$ where $I_D^{-1}$ is a diagonal matrix with diagonal elements $\{1/d,1,1,\ldots,1\}$. So only the intercept estimate is altered by multiplication with the constant $d$, while the estimates for the remaining features stay unchanged.

VII. Performance evaluation

We perform experiments using the MNIST dataset from the UCI repository [59]. The MNIST dataset consists of hand-written digit images, each comprising a 28 by 28 pixel grid. Each image is associated with an integer label ranging from 0 to 9. The dataset consists of 60,000 images for training and 10,000 images for testing. In our privacy-preserving learning, we assume samples in each dataset are evenly distributed among $K$ clients, with each subset encompassing all the features. All experiments are performed in Matlab on the University of Florida HiPerGator 3.0 high-performance computing cluster with 1 CPU and 40 GB of RAM.

Our privacy-preserving logistic model is applied to differentiate label 9 from the remaining labels (0 to 8). To evaluate model performance on large-scale datasets, we apply the bootstrap method [60] to create three datasets with sample sizes of n=100,000, n=500,000, and n=1,000,000, respectively. To minimize communication cost, we employ SZ for lossy compression, ensuring a relative error below 0.01 (i.e., the difference between the raw data and the compressed data is less than 0.01 × (maximum value − minimum value)). As shown in Table IV, our scheme with SZ compression achieves, after 20 iterations, an accuracy level equivalent to that of the non-secure model. The non-secure model is constructed using aggregated data from all clients without incorporating privacy protection. The utilization of SZ compression leads to a notable decrease in communication costs. In scenarios where fewer than 10 clients are involved in the privacy-preserving learning framework, our scheme with SZ compression incurs lower communication costs than the non-secure model (Figure 7A). Figure 7B shows that our scheme has high computational efficiency in analyzing large-scale datasets.

Table IV:

Model accuracy of the proposed privacy-preserving logistic model (relative error bound < 0.01 in the SZ compression)

Dataset Non-secure model Our scheme w/o SZ Our scheme with SZ (Iter=5) (Iter=10) (Iter=20) (Iter=50)
n=60,000 97.27% 97.27% 96.73% 97.22% 97.27% 97.27%
n=100,000 97.0% 97.0% 96.46% 96.97% 97.0% 97.0%
n=500,000 97.25% 97.25% 96.69% 97.18% 97.24% 97.25%
n=1,000,000 97.25% 97.25% 96.70% 97.20% 97.23% 97.25%

Non-secure model: the model built on the aggregated data from all the clients without considering privacy protection. w/o SZ: without SZ compression. Iter: iteration times. n: No. of samples in the aggregated data.

Fig. 7: A: communication cost of the proposed scheme. B: computation time of 20 iterations in the proposed scheme. w/o SZ: without SZ compression. n: No. of samples in the aggregated data. K: No. of clients.

We further compare the performance of our scheme with four state-of-the-art frameworks that provide malicious security. First, we evaluate the performance of our scheme for the binary classification problem and compare with two maliciously secure frameworks, SWIFT [12] and Fantastic4 [13]. Specifically, we construct the secure logistic model to distinguish between the digits 4 and 9 in the MNIST dataset (binary classification). A total of 11,791 samples are included in the training set, while the testing set comprises 1,991 samples. Moreover, we apply our scheme to solve the multiclass classification problem (Section V-B) and compare our model with state-of-the-art privacy-preserving neural networks, SecureNN [6] and Falcon [7]. The multiclass classification utilizes 60,000 images from the MNIST dataset for training and the remaining 10,000 images for testing.

Table V summarizes the results of our scheme and the other privacy-preserving frameworks. The computation and communication costs of our scheme grow with an increasing number of clients. For the comparison, we consider three scenarios, with the number of clients being 10, 20, or 50. Compared with SWIFT [12], our scheme is computationally faster for up to 50 clients and has competitive communication cost for up to 20 clients. In contrast, our scheme has improved communication efficiency but higher computation cost compared with Fantastic4 [13]. Our model has higher accuracy than both of these maliciously secure frameworks. Compared with their respective 3-server variants, the 4-server frameworks in [12], [13] perform better in both computation and communication. Given that the secure frameworks in [12], [13] rely on the honest-majority setting (where a malicious adversary can corrupt at most one server), the inclusion of an extra server imposes a more stringent security prerequisite for the successful execution of these frameworks. In contrast, our scheme conducts a secure logistic model that is resilient to malicious clients while utilizing only a single semi-honest server. For multiclass classification, our scheme has better computation and communication performance than the maliciously secure neural networks [6], [7], which assume an honest majority of semi-honest servers. The accuracy of our secure scheme is also comparable to these two neural network models with malicious security.

Table V:

Comparison between our model and other maliciously secure frameworks using MNIST dataset

Data Framework Comp. Comm. Accuracy
MNIST (4 vs. 9) SWIFT [12] (3PC) 12 mins 96.5 Mb
SWIFT [12] (4PC) 8.6 mins 44 Mb
Fantastic4 [13] (3PC) 8.5 s 2.8 Gb 96.5%
Fantastic4 [13] (4PC) 3 s 167 Mb 96.5%
Our (w/o SZ, K=10) 17.5 s 204 Mb 98.9%
Our (SZ, K=10) 17.7 s 29.2 Mb 98.9%
Our (SZ, K=20) 29.6 s 58.4 Mb 98.9%
Our (SZ, K=50) 65 s 146 Mb 98.9%
MNIST SecureNN [6] (3PC) 1.03 hrs 110 Gb 93.4%
Falcon [7] (3PC) 33.6 mins 88 Gb 97.4%
Our (w/o SZ, K=10) 17.6 mins 1.1 Gb 98.0%
Our (SZ, K=10) 17.8 mins 164 Mb 98.0%
Our (SZ, K=20) 18.5 mins 328 Mb 98.0%
Our (SZ, K=50) 20.7 mins 821 Mb 98.0%

The performance of binary classification is compared with that of SWIFT [12] and Fantastic4 [13], while the performance of multiclass classification is compared with that of SecureNN [6] and Falcon [7].

The performance statistics of SWIFT [12] are sourced from [13], whereas the statistics for the other three SMC frameworks are obtained from their respective publications.

Comp.: computation cost; Comm.: communication cost.

3PC: 3-party computation (i.e., 3 servers); 4PC: 4-party computation (i.e., 4 servers).

K: No. of clients participating in our privacy-preserving scheme.

w/o SZ: without SZ compression.

VIII. Conclusion

In this paper, we introduce a maliciously secure logistic model for horizontally distributed data, utilizing a novel matrix encryption technique. Unlike state-of-the-art secure frameworks that require the participation of two or more servers in an honest-majority setting, our scheme utilizes only a single semi-honest server. Our scheme ensures that any two arbitrary records are indistinguishable through the d-transformation. A verification stage can detect any deviations from the proposed scheme among malicious data providers. Lossy compression is employed to minimize the communication cost while ensuring negligible degradation in accuracy. Compared with other maliciously secure models, our scheme has higher computational and communication efficiency. One prospective avenue for future research involves extending our secure scheme to other nonlinear models, such as support vector machines and neural networks.

Acknowledgments

This work was supported by the National Institutes of Health [R01 LM014027, U24 AA029959-01]. The authors would like to thank anonymous reviewers for many helpful comments.

Appendix A. Commutative matrix

Matrices $B_1$ and $B_2$ are commutative if $B_1B_2=B_2B_1$. To ensure negligible degradation in accuracy, the proposed privacy-preserving scheme generates commutative matrices to encrypt the plaintexts. The commutative encryption matrix is constructed as a matrix polynomial (i.e., a polynomial with a matrix as the variable) [61]. For instance, assume there are 2 clients in the collaborative learning. The clients first share a common encryption key $B_0$ (a random nonsingular matrix). Client 1 then generates a vector of random coefficients $(b_{11},b_{12},\ldots,b_{1q})$ and an encryption matrix $B_1=\sum_{j=1}^q b_{1j}B_0^j=b_{11}B_0+b_{12}B_0^2+b_{13}B_0^3+\cdots+b_{1q}B_0^q$. Similarly, client 2 generates a vector of random coefficients $(b_{21},b_{22},\ldots,b_{2q})$ and an encryption matrix $B_2=\sum_{j=1}^q b_{2j}B_0^j=b_{21}B_0+b_{22}B_0^2+b_{23}B_0^3+\cdots+b_{2q}B_0^q$. $B_1$ and $B_2$ are commutative (i.e., $B_1B_2=B_2B_1$) because both are matrix polynomials in the common matrix $B_0$.
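A quick numeric check of this construction (sizes and seed are arbitrary choices of ours):

```python
# Two matrix polynomials in the same B0 commute: B1 B2 = B2 B1.
import numpy as np

rng = np.random.default_rng(2)
q = 4
B0 = rng.standard_normal((q, q))
powers = [np.linalg.matrix_power(B0, j) for j in range(1, q + 1)]
B1 = sum(b * P for b, P in zip(rng.standard_normal(q), powers))
B2 = sum(b * P for b, P in zip(rng.standard_normal(q), powers))
assert np.allclose(B1 @ B2, B2 @ B1)
```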

Appendix B. Pre-processing and internal encryption for data with large sample size

To enhance the computational efficiency of encryption for clients with large sample sizes, we construct the encryption matrix $A_i$ as a block-diagonal matrix. Specifically, client $i$ generates $A_i$ as

$$A_i = \begin{pmatrix} A_{0i} & & & \\ & A_{0i} & & \\ & & \ddots & \\ & & & A_{0i} \end{pmatrix} \tag{33}$$

where $A_{0i}$ is a $100 \times 100$ random Gaussian matrix. $X_i$ is also partitioned into sub-matrices. Following the pre-processing procedure, a pseudo record is added to each sub-matrix to ensure indistinguishability. Specifically, $X_i$ (assuming that a vector of 1s is already included as the first row) is partitioned into sub-matrices $X_{il}$, $l = 1, 2, \ldots$, with each sub-matrix containing 99 samples and all the features. Before the sub-matrices are encrypted by $A_{0i}$, client $i$ computes the Euclidean norm of each column in $X_{il}$. Let $c_{lj}$ be the Euclidean norm of the $j$-th column ($j = 1, \ldots, q$) in $X_{il}$ and $c_{lm} = \max_j c_{lj}$. Client $i$ generates a vector $c_{lv}$ whose $j$-th element is $\sqrt{c_{lm}^2 - c_{lj}^2}$, and appends $c_{lv}$ to $X_{il}$ as the last row. Subsequently, the $l$-th sub-matrix consists of 100 samples, and the Euclidean norm of every column equals $c_{lm}$. As the total number of samples in $X_i$ may not be divisible by 99, client $i$ generates a random set of pseudo records to be vertically concatenated with the original matrix so that each sub-matrix contains 99 samples. For example, consider a dataset $X_i$ containing 1,070 samples. Client $i$ generates 19 pseudo records, which are vertically concatenated with $X_i$; the result can then be split into 11 sub-matrices $X_{il}$ ($l = 1, \ldots, 11$). Client $i$ first appends the pseudo record $c_{lv}$ to each $X_{il}$ and then encrypts each $X_{il}$ with $A_{0i}$. After data encryption, client $i$ sends $A_i$ and the row indices of the pseudo records to the server.
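The padding and norm-equalization steps can be summarized in a short numpy sketch (ours, under the assumptions above: rows are samples, columns are features, and the pseudo records are drawn from a standard normal; the function name pad_and_equalize is hypothetical):

```python
import numpy as np

def pad_and_equalize(X: np.ndarray, block: int = 99, rng=None) -> list:
    """Pad X with random pseudo records until its row count is a multiple
    of `block`, split it into sub-matrices of `block` rows, and append to
    each a pseudo record that raises every column's Euclidean norm to the
    sub-matrix's maximum column norm."""
    if rng is None:
        rng = np.random.default_rng()
    n, p = X.shape
    n_pad = (-n) % block                       # e.g., 19 for n = 1070
    Xp = np.vstack([X, rng.standard_normal((n_pad, p))])
    subs = []
    for Xl in Xp.reshape(-1, block, p):        # consecutive 99-row blocks
        norms = np.linalg.norm(Xl, axis=0)     # per-column Euclidean norms
        extra = np.sqrt(norms.max() ** 2 - norms ** 2)
        subs.append(np.vstack([Xl, extra]))    # 100 rows, equal column norms
    return subs
```

Each returned 100-row sub-matrix can then be encrypted by left-multiplying it with the $100 \times 100$ block $A_{0i}$.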

Appendix C. Logistic model estimate using encrypted and original data

Theorem 9. Data matrices and model estimates of the secure and non-secure computations have the following properties.

  1. $p(x_i^{enc}; \beta^{enc}) = p(x_i; \beta)$;

  2. $W^{enc} = W$;

  3. $\beta^{enc} = B^{-1}\beta$, where $B = \prod_{i=1}^{K} B_i$.

Proof. Let

$$P_1 \triangleq \operatorname{diag}\big(p(x_1;\beta),\, p(x_2;\beta),\, \ldots,\, p(x_n;\beta)\big) \tag{34}$$

and

$$P_2 \triangleq \operatorname{diag}\big(1-p(x_1;\beta),\, 1-p(x_2;\beta),\, \ldots,\, 1-p(x_n;\beta)\big). \tag{35}$$

$P_1$ and $P_2$ are diagonal matrices. Let

$$P_1^{enc} \triangleq P_1\big(X^{enc}, \beta^{enc}\big) \quad \text{and} \quad P_2^{enc} \triangleq P_2\big(X^{enc}, \beta^{enc}\big) \tag{36}$$

be the corresponding matrices in the privacy-preserving model.

In iteration $m = 0$ (initial setup), set the starting point as $\beta_{(0)}^{enc}$ for the privacy-preserving model and $\beta_{(0)} \triangleq B\beta_{(0)}^{enc}$ for the non-secure model. Because $x_i^{enc} = B^T x_i$, we have

$$p\big(x_i^{enc}; \beta_{(0)}^{enc}\big) = \Pr\big(y_i = 1 \mid x_i^{enc}; \beta_{(0)}^{enc}\big) = \frac{\exp\big(x_i^{enc\,T}\beta_{(0)}^{enc}\big)}{1+\exp\big(x_i^{enc\,T}\beta_{(0)}^{enc}\big)} = \frac{\exp\big(x_i^T B\beta_{(0)}^{enc}\big)}{1+\exp\big(x_i^T B\beta_{(0)}^{enc}\big)} = p\big(x_i; B\beta_{(0)}^{enc}\big) = p\big(x_i; \beta_{(0)}\big). \tag{37}$$

According to Equations 34, 35 and 37, we have $P_1^{enc} = P_1$ and $P_2^{enc} = P_2$. $W$ (Equation 4) can be expressed as $W = P_1 P_2$. Thus we have

$$W^{enc} = (P_1 P_2)^{enc} = P_1^{enc} P_2^{enc} = P_1 P_2 = W. \tag{38}$$

Therefore properties 1–3 hold in the initial step.

Next we prove that properties 1–3 hold in the $(m+1)$-th iteration assuming that they hold in the $m$-th iteration. Let notations with superscript or subscript $(m)$ denote parameters derived during the $m$-th iteration. Given $\beta_{(m)}$, $p(x_i; \beta)$ is updated as

$$p\big(x_i; \beta_{(m)}\big)_{(m+1)} = \frac{\exp\big(x_i^T\beta_{(m)}\big)}{1+\exp\big(x_i^T\beta_{(m)}\big)}. \tag{39}$$

Assuming that properties 1–3 hold in the $m$-th iteration, we have $\beta_{(m)}^{enc} = B^{-1}\beta_{(m)}$. Since $x_i^{enc} = B^T x_i$, we have

$$p\big(x_i^{enc}; \beta_{(m)}^{enc}\big)_{(m+1)} = \frac{\exp\big(x_i^{enc\,T}\beta_{(m)}^{enc}\big)}{1+\exp\big(x_i^{enc\,T}\beta_{(m)}^{enc}\big)} = \frac{\exp\big(x_i^T B B^{-1}\beta_{(m)}\big)}{1+\exp\big(x_i^T B B^{-1}\beta_{(m)}\big)} = \frac{\exp\big(x_i^T\beta_{(m)}\big)}{1+\exp\big(x_i^T\beta_{(m)}\big)} = p\big(x_i; \beta_{(m)}\big)_{(m+1)}. \tag{40}$$

Similar to the proof for iteration $m = 0$, we have $W^{enc} = W$, so properties 1–2 hold in the $(m+1)$-th iteration. Moreover, based on Equation 11, we have

$$\beta_{(m+1)}^{enc} = \big(X^{enc\,T} W_{(m)}^{enc} X^{enc} + \Lambda B^T B\big)^{-1} \big(X^{enc\,T} W_{(m)}^{enc} X^{enc} \beta_{(m)}^{enc} + Z^{enc\,T} - X^{enc\,T} p_{(m)}^{enc}\big) = \big(B^T X^T W_{(m)} X B + \Lambda B^T B\big)^{-1} \big(B^T X^T W_{(m)} X B B^{-1}\beta_{(m)} + B^T Z^T - B^T X^T p_{(m)}\big) = B^{-1}\beta_{(m+1)}. \tag{41}$$

To conclude, properties 1–3 hold for all iterations. □
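The invariances in Theorem 9 are easy to check numerically. The following minimal numpy sketch (ours; variable names are illustrative) verifies property 1 for a random composite key $B$:

```python
import numpy as np

rng = np.random.default_rng(1)
n, p = 8, 3
X = rng.standard_normal((n, p))        # rows are records x_i^T
B = rng.standard_normal((p, p))        # composite key (nonsingular w.h.p.)
beta = rng.standard_normal(p)

X_enc = X @ B                          # x_i_enc = B^T x_i, applied row-wise
beta_enc = np.linalg.solve(B, beta)    # beta_enc = B^{-1} beta

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
# Property 1: encrypted and plaintext predicted probabilities coincide.
assert np.allclose(sigmoid(X @ beta), sigmoid(X_enc @ beta_enc))
```

Since $X^{enc}\beta^{enc} = XBB^{-1}\beta = X\beta$, the two probability vectors agree up to floating-point error.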

References

  • [1]. Zhang Y, Yu R, Nekovee M, Liu Y, Xie S, and Gjessing S, "Cognitive machine-to-machine communications: visions and potentials for the smart grid," IEEE Network, vol. 26, no. 3, pp. 6–13, 2012.
  • [2]. Zhao C, Zhao S, Zhao M, Chen Z, Gao C-Z, Li H, and Tan Y-a, "Secure multi-party computation: Theory, practice and applications," Information Sciences, vol. 476, pp. 357–372, 2019.
  • [3]. Li Q, Wen Z, Wu Z, Hu S, Wang N, Li Y, Liu X, and He B, "A survey on federated learning systems: Vision, hype and reality for data privacy and protection," IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 4, pp. 3347–3366, 2023.
  • [4]. Lindell Y, "Secure multiparty computation," Commun. ACM, vol. 64, no. 1, pp. 86–96, 2020.
  • [5]. Thapa C and Camtepe S, "Precision health data: Requirements, challenges and existing techniques for data security and privacy," Computers in Biology and Medicine, vol. 129, p. 104130, 2021.
  • [6]. Wagh S, Gupta D, and Chandran N, "SecureNN: 3-party secure computation for neural network training," Proc. Priv. Enhancing Technol., vol. 2019, no. 3, pp. 26–49, 2019.
  • [7]. Wagh S, Tople S, Benhamouda F, Kushilevitz E, Mittal P, and Rabin T, "Falcon: Honest-majority maliciously secure framework for private deep learning," Proceedings on Privacy Enhancing Technologies, vol. 2021, pp. 188–208, January 2021.
  • [8]. Gordon SD, Ranellucci S, and Wang X, "Secure computation with low communication from cross-checking," in Advances in Cryptology - ASIACRYPT 2018: 24th International Conference on the Theory and Application of Cryptology and Information Security, Brisbane, QLD, Australia, December 2–6, 2018, Proceedings, Part III. Springer-Verlag, 2018, pp. 59–85.
  • [9]. Patra A and Suresh A, "BLAZE: Blazing fast privacy-preserving machine learning," in 27th Annual Network and Distributed System Security Symposium, NDSS 2020, San Diego, California, USA, February 23–26, 2020. The Internet Society, 2020.
  • [10]. Mohassel P and Rindal P, "ABY3: A mixed protocol framework for machine learning," in Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security. New York, NY, USA: Association for Computing Machinery, 2018, pp. 35–52.
  • [11]. Lehmkuhl R, Mishra P, Srinivasan A, and Popa RA, "Muse: Secure inference resilient to malicious clients," in 30th USENIX Security Symposium (USENIX Security 21). USENIX Association, Aug. 2021, pp. 2201–2218.
  • [12]. Koti N, Pancholi M, Patra A, and Suresh A, "SWIFT: Super-fast and robust privacy-preserving machine learning," in 30th USENIX Security Symposium (USENIX Security 21). USENIX Association, 2021, pp. 2651–2668.
  • [13]. Dalskov A, Escudero D, and Keller M, "Fantastic four: Honest-majority four-party secure computation with malicious security," in 30th USENIX Security Symposium (USENIX Security 21). USENIX Association, 2021, pp. 2183–2200.
  • [14]. Byali M, Chaudhari H, Patra A, and Suresh A, "FLASH: Fast and robust framework for privacy-preserving machine learning," Proceedings on Privacy Enhancing Technologies, vol. 2020, pp. 459–480, April 2020.
  • [15]. Wang X, Ranellucci S, and Katz J, "Authenticated garbling and efficient maliciously secure two-party computation," in Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, ser. CCS '17. Association for Computing Machinery, 2017, pp. 21–37.
  • [16]. Wang X, Ranellucci S, and Katz J, "Global-scale secure multiparty computation," in Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, ser. CCS '17. Association for Computing Machinery, 2017, pp. 39–56.
  • [17]. Zhu R, Cassel D, Sabry A, and Huang Y, "NANOPI: Extreme-scale actively-secure multi-party computation," in Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, ser. CCS '18. Association for Computing Machinery, 2018, pp. 862–879.
  • [18]. Zheng W, Popa RA, Gonzalez JE, and Stoica I, "Helen: Maliciously secure coopetitive learning for linear models," in 2019 IEEE Symposium on Security and Privacy (SP), 2019, pp. 724–738.
  • [19]. Fan Y, Bai J, Lei X, Zhang Y, Zhang B, Li K-C, and Tan G, "Privacy preserving based logistic regression on big data," Journal of Network and Computer Applications, vol. 171, p. 102769, 2020.
  • [20]. Mohassel P and Zhang Y, "SecureML: A system for scalable privacy-preserving machine learning," in 2017 IEEE Symposium on Security and Privacy (SP). Los Alamitos, CA, USA: IEEE Computer Society, 2017, pp. 19–38.
  • [21]. Patra A, Schneider T, Suresh A, and Yalame H, "ABY2.0: Improved mixed-protocol secure two-party computation," in 30th USENIX Security Symposium (USENIX Security 21), 2021, pp. 2165–2182.
  • [22]. Rivest RL, Adleman L, and Dertouzos ML, "On data banks and privacy homomorphisms," Foundations of Secure Computation, vol. 4, no. 11, pp. 169–180, 1978.
  • [23]. Acar A, Aksu H, Uluagac AS, and Conti M, "A survey on homomorphic encryption schemes: Theory and implementation," ACM Comput. Surv., vol. 51, no. 4, pp. 1–35, 2018.
  • [24]. Yao AC-C, "How to generate and exchange secrets," in 27th Annual Symposium on Foundations of Computer Science (SFCS 1986). IEEE, 1986, pp. 162–167.
  • [25]. Goldreich O, Micali S, and Wigderson A, "How to play ANY mental game," in Proceedings of the Nineteenth Annual ACM Symposium on Theory of Computing. Association for Computing Machinery, 1987, pp. 218–229.
  • [26]. Dwork C, "Differential privacy," in Proceedings of the 33rd International Conference on Automata, Languages and Programming - Volume Part II. Springer-Verlag, 2006, pp. 1–12.
  • [27]. Zhu T, Ye D, Wang W, Zhou W, and Yu PS, "More than privacy: Applying differential privacy in key areas of artificial intelligence," IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 6, pp. 2824–2843, 2022.
  • [28]. Zhao L, Wang Q, Zou Q, Zhang Y, and Chen Y, "Privacy-preserving collaborative deep learning with unreliable participants," IEEE Transactions on Information Forensics and Security, vol. 15, pp. 1486–1500, 2020.
  • [29]. Phan N, Vu MN, Liu Y, Jin R, Dou D, Wu X, and Thai MT, "Heterogeneous Gaussian mechanism: Preserving differential privacy in deep learning with provable robustness," in Proceedings of the 28th International Joint Conference on Artificial Intelligence. AAAI Press, 2019, pp. 4753–4759.
  • [30]. Li W, Milletarì F, Xu D, Rieke N, Hancox J, Zhu W, Baust M, Cheng Y, Ourselin S, Cardoso MJ, et al., "Privacy-preserving federated brain tumour segmentation," in Machine Learning in Medical Imaging: 10th International Workshop, MLMI 2019, Held in Conjunction with MICCAI 2019, Shenzhen, China, October 13, 2019, Proceedings 10. Springer, 2019, pp. 133–141.
  • [31]. Wei K, Li J, Ding M, Ma C, Yang HH, Farokhi F, Jin S, Quek TQS, and Poor HV, "Federated learning with differential privacy: Algorithms and performance analysis," IEEE Transactions on Information Forensics and Security, vol. 15, pp. 3454–3469, 2020.
  • [32]. Truex S, Baracaldo N, Anwar A, Steinke T, Ludwig H, Zhang R, and Zhou Y, "A hybrid approach to privacy-preserving federated learning," in Proceedings of the 12th ACM Workshop on Artificial Intelligence and Security, ser. AISec'19. New York, NY, USA: Association for Computing Machinery, 2019, pp. 1–11.
  • [33]. Jayaraman B and Evans D, "Evaluating differentially private machine learning in practice," in 28th USENIX Security Symposium (USENIX Security 19). Santa Clara, CA: USENIX Association, 2019, pp. 1895–1912.
  • [34]. Bianchi T, Bioglio V, and Magli E, "Analysis of one-time random projections for privacy preserving compressed sensing," IEEE Transactions on Information Forensics and Security, vol. 11, no. 2, pp. 313–327, 2016.
  • [35]. Yu NY, "Indistinguishability and energy sensitivity of Gaussian and Bernoulli compressed encryption," IEEE Transactions on Information Forensics and Security, vol. 13, no. 7, pp. 1722–1735, 2018.
  • [36]. Cho W and Yu NY, "Secure and efficient compressed sensing-based encryption with sparse matrices," IEEE Transactions on Information Forensics and Security, vol. 15, pp. 1999–2011, 2020.
  • [37]. Kuldeep G and Zhang Q, "Design prototype and security analysis of a lightweight joint compression and encryption scheme for resource-constrained IoT devices," IEEE Internet of Things Journal, vol. 9, no. 1, pp. 165–181, 2022.
  • [38]. Hastie T, Tibshirani R, and Friedman JH, The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, 2009, vol. 2.
  • [39]. Maalouf M, "Logistic regression in data analysis: An overview," Int. J. Data Anal. Tech. Strateg., vol. 3, no. 3, pp. 281–299, 2011.
  • [40]. Katz J and Lindell Y, Introduction to Modern Cryptography, 2nd ed. Chapman & Hall/CRC, 2014.
  • [41]. He X, Machanavajjhala A, Flynn C, and Srivastava D, "Composing differential privacy and secure computation: A case study on scaling private record linkage," in Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, ser. CCS '17. Association for Computing Machinery, 2017, pp. 1389–1406.
  • [42]. Wang W, Ying L, and Zhang J, "On the relation between identifiability, differential privacy, and mutual-information privacy," IEEE Transactions on Information Theory, vol. 62, no. 9, pp. 5018–5029, 2016.
  • [43]. Bellare M, Hoang VT, and Rogaway P, "Foundations of garbled circuits," in Proceedings of the 2012 ACM Conference on Computer and Communications Security. New York, NY, USA: Association for Computing Machinery, 2012, pp. 784–796.
  • [44]. Liu C, Hu X, Chen X, Wei J, and Liu W, "SDIM: A subtly designed invertible matrix for enhanced privacy-preserving outsourcing matrix multiplication and related tasks," IEEE Transactions on Dependable and Secure Computing, pp. 1–18, 2023.
  • [45]. Canetti R, "Universally composable security," J. ACM, vol. 67, no. 5, 2020.
  • [46]. Gibbs AL and Su FE, "On choosing and bounding probability metrics," International Statistical Review / Revue Internationale de Statistique, vol. 70, no. 3, pp. 419–435, 2002.
  • [47]. Le Cam L, Asymptotic Methods in Statistical Decision Theory. Springer, New York, NY, 1986.
  • [48]. DasGupta A, Asymptotic Theory of Statistics and Probability. Springer, New York, NY, 2008.
  • [49]. Guntuboyina A, Saha S, and Schiebinger G, "Sharp inequalities for f-divergences," IEEE Transactions on Information Theory, vol. 60, no. 1, pp. 104–121, 2014.
  • [50]. Kailath T, "The divergence and Bhattacharyya distance measures in signal selection," IEEE Transactions on Communication Technology, vol. 15, no. 1, pp. 52–60, 1967.
  • [51]. Abou-Moustafa KT and Ferrie FP, "A note on metric properties for some divergence measures: The Gaussian case," in Proceedings of the Asian Conference on Machine Learning, ser. Proceedings of Machine Learning Research, vol. 25. Singapore Management University, Singapore: PMLR, 2012, pp. 1–15.
  • [52]. Sun X, Tian C, Hu C, Tian W, Zhang H, and Yu J, "Privacy-preserving and verifiable SRC-based face recognition with cloud/edge server assistance," Computers & Security, vol. 118, p. 102740, 2022.
  • [53]. Liu C, Hu X, Zhang Q, Wei J, and Liu W, "An efficient biometric identification in cloud computing with enhanced privacy security," IEEE Access, vol. 7, pp. 105363–105375, 2019.
  • [54]. Jasmine RM and Jasper J, "A privacy preserving based multi-biometric system for secure identification in cloud environment," Neural Processing Letters, vol. 54, no. 1, pp. 303–325, 2022.
  • [55]. Di S and Cappello F, "Fast error-bounded lossy HPC data compression with SZ," in 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2016, pp. 730–739.
  • [56]. Cappello F, Di S, Li S, Liang X, Gok AM, Tao D, Yoon CH, Wu X-C, Alexeev Y, and Chong FT, "Use cases of lossy compression for floating-point data in scientific data sets," The International Journal of High Performance Computing Applications, vol. 33, no. 6, pp. 1201–1220, 2019.
  • [57]. Zhao K, Di S, Lian X, Li S, Tao D, Bessac J, Chen Z, and Cappello F, "SDRBench: Scientific data reduction benchmark for lossy compressors," in 2020 IEEE International Conference on Big Data (Big Data), 2020, pp. 2716–2724.
  • [58]. van Erven T and Harremos P, "Rényi divergence and Kullback-Leibler divergence," IEEE Transactions on Information Theory, vol. 60, no. 7, pp. 3797–3820, 2014.
  • [59]. Dua D and Graff C, "UCI Machine Learning Repository," 2017. [Online]. Available: http://archive.ics.uci.edu/ml
  • [60]. Kulesa A, Krzywinski M, Blainey P, and Altman N, "Sampling distributions and the bootstrap," Nature Methods, vol. 12, pp. 477–478, 2015.
  • [61]. Hou S, Uehara T, Yiu S, Hui LC, and Chow K, "Privacy preserving confidential forensic investigation for shared or remote servers," in 2011 Seventh International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2011, pp. 378–383.
