. Author manuscript; available in PMC: 2026 Feb 19.

Published in final edited form as: Proceedings (IEEE Int Conf Bioinformatics Biomed). 2026 Jan 19;2025:5454–5462. doi: 10.1109/bibm66473.2025.11356407

Algorithm 1:

The sampling algorithm.

Input: A longitudinal dataset $D$ consisting of visit-level records for $N$ participants. For each participant $i$ , a visit set is denoted as $V_{i} = \{v_{i 1}, v_{i 2}, \dots, v_{i T_{i}}\}$ , where $T_{i}$ indicates a number of total visits, including: participant ID and baseline demographics at $T = 1$ , and cognitive impairment status, comorbidities and NPS indicators over time
Output: Set of bootstrap samples $𝓑 = \{B_{1}, B_{2}, \dots, B_{100}\}$ with each dataset $B_{b} \in ℝ^{N \times d}$
1	Initialize $ℬ \leftarrow \emptyset$
2	for $b = 1$ to 100 do
	Initialize $B_{b} \leftarrow \emptyset$
	for $i = 1$ to $N$ do
	Let $V_{i} = {\{v_{i j}\}}_{j = 1}^{T_{i}}$ , sorted by time
	Partition $V_{i} = V^{n} (normal cognition) \cup V^{c} (cognitively impaired)$
	Let status $s_{i T_{i}} \in {normal cognition, cognitively impaired}$
	Extract $x_{i}^{0} \leftarrow$ baseline demographics from $v_{i 1}$
	if $s_{i T_{i}} = cognitively impaired$ then
	Random sample $S_{i}^{'} \subset V^{n} \cup V^{c}, \|S_{i}^{'}\| = 3, s . t .$
	$S_{i}^{'} = \{\begin{array}{l} \{v_{i t_{1}}^{n}, v_{i t_{2}}^{c}, v_{i t_{3}}^{c}\} \\ \{v_{i t_{1}}^{n}, v_{i t_{2}}^{n}, v_{i t_{3}}^{c}\} \end{array}$
	Random sample $S_{i}^{c} \subset V_{i}^{c}, \|S_{i}^{c}\| = 3$
	else
	Random sample $S_{i}^{'} \subset V_{i}^{n}, \|S_{i}^{'}\| = 3$
	end if
	Let $x_{i}^{'}$ disease and NPS statuses from $S_{i}^{'}$
	Let $x_{i} \in ℝ^{d} \leftarrow concatenate x_{i}^{0} and x_{i}^{'}$
	Append $x_{i}$ to $B_{b}$
	end for
	Append $B_{b}$ to $ℬ$
	end for
3	Return $ℬ$