|
|
| Algorithm 1: Outline of algorithm for determining outliers via Eigenvalue method |
|
|
| Initialize: Determine initial cluster membership for all observations; |
| Return: Number of clusters: G and |
| Cluster variance estimates: for g = 1,…, G; |
| for Observations i = 1,…, N do |
| For observation i assigned to cluster g, hold other N − 1 cluster memberships constant; |
| (A) Determine variance structure when removing observation i; |
| With N-1 observations, use the M-step (from the EM algorithm) to calculate parameter estimates for all G groups under all specified variance structures Select best variance structure through maximization of the BIC; |
| Return: Cluster variance estimate: ; |
| if Selected variance structure differs from original variance structure, |
| then re-estimate cluster variance for g = 1,…, G with all N observations under the best variance structure selected for N − 1 observations ; |
| (B) Calculate the minimum eigenvalue
of R−i, where |
| Define: the # of observations in cluster g as Ng, trimming value and tj for j = 1,…, 5 to create ∼ 4 equally spaced intervals from 1,…, T; |
| for g = 1,…, G do |
| if Ng > T then |
| Compute thresholds with trimmed cluster-specific eigenvalues |
| for j = 1,…, 5, do Mt,g = Mean( ) − 5 SD( ) |
| for j = 2,…, 5, do = Mean |
| If |
| Then and |
| Else Cg = M1,g |
| else Cluster g denoted as “outlier cluster”, |
| Return: Observations with Eg ≤ Cg, for g = 1,…, G |
|
|