. Author manuscript; available in PMC: 2019 Sep 28.

Published in final edited form as: Proc AAAI Conf Artif Intell. 2019 Jan-Feb;33:4763–4771. doi: 10.1609/aaai.v33i01.33014763

Table 2:

Glossary of variables and symbols used in this paper.

Symbol	Used for
X	Data point, X ∈ $X$
n	Number of data points
Y_s	Label for one of the t classification tasks, Y_s ∈ {1,...,k_s}
t	Number of tasks
Y	Vector of task labels Y = [Y₁, Y₂,...,Y_t]^T
r	Cardinality of the output space, r = \| $Y$ \|
G _task	Task structure graph
$Y$	Output space of allowable task labels defined by G_task, Y ∈ $Y$
$D$	Distribution from which we assume (X, Y) data points are sampled i.i.d.
s_i	Weak supervision source, a function mapping X to a label vector
$λ_{i}$	Label vector $λ_{i}$ ∈ $Y$ output by the ith source for X
m	Number of sources
λ	m × t matrix of labels output by the m sources for X
$Y$ ₀	Source output space, which is $Y$ augmented to include elements set to zero
τ_i	Coverage set of $λ_{i}$ - the tasks s_i gives non-zero labels to; for convenience, τ₀ = {1,…,t}
$Y$ _{τ_i}	The output space for $λ_{i}$ given coverage set τ_i
$Y_{τ_{i}}^{min}$	The output space $Y_{τ_{i}}$ with all but the first value, for defining a minimal set of statistics
G _source	Source dependency graph, G_source = (V, E), V = {Y, $λ_{1}$ ,..., $λ_{m}$ }
$C$	Cliqueset (maximal and non-maximal) of G_source
$\tilde{C}, S$	The maximal cliques (nodes) and separator sets of the junction tree over G_source
Ψ(C, y_C)	The indicator variable for the variables in clique C ∈ $C$ taking on values y_C, (y_C)_i ∈ $Y$ _{τ_i}
μ	The parameters of our label model we aim to estimate; $μ = E [ψ]$
O	The set of observable cliques, i.e. those corresponding to cliques without Y
Σ	Generalized covariance matrix of O ⊆ $S$ , Σ ≡ Cov [Ψ(O ⊆ $S$ )]
K	The inverse generalized covariance matrix K = Σ⁻¹
d_O, d _$S$	The dimensions of O and $S$ respectively
G _aug	The augmented source dependencies graph G_aug = (Ψ, E_aug)
Ω	The edge set of the inverse graph of G_aug
P	Diagonal matrix of class prior probabilities, P(Y)
P_μ (Y, λ)	The label model parameterized by μ
$\tilde{Y}$	The probabilistic training label, i.e. P_μ(Y\|λ)
f_w (X)	The end model trained using (X, $\tilde{Y}$ )