Abstract
Hierarchical mixed-effects models with three trees (3Trees models) are a recent statistical learning approach to mixed-effects modeling. These models use the classification and regression trees (CART) algorithm to select the best tree within a backfitting algorithm. However, CART relies on a greedy search, making the trees prone to overfitting, biased in split selection, and often far from the optimal solution, ultimately affecting model performance. Two novel methods, 3Trees-EvTree and 3Trees-CTree, are proposed to address these limitations. The proposed methods are compared with the existing methods through several simulation exercises under different settings and on real datasets. The simulation study confirms that the 3Trees-EvTree method outperforms the previous method in terms of parameter estimation and prediction accuracy under the clusMSE and clusPMSE criteria. Meanwhile, the 3Trees-CTree model performs well in low-correlation scenarios and for the semilinear function. In addition, the results of the real application confirm the superiority of the proposed methods over the competing ones. Some highlights of the proposed method are:
• 3Trees-EvTree and 3Trees-CTree models are presented to improve the prediction accuracy and reduce the bias of the 3Trees model.
• MSE, ClusMSE, PMSE, ClusPMSE, and bias criteria are used to evaluate model performance.
• Applied to estimate and predict a household expenditure per capita dataset.
Keywords: Regression tree, Evolutionary learning, Conditional inference, Hierarchical data, Machine learning
Method name: 3Trees-EvTree and 3Trees-CTree
Specifications table

Subject area | Mathematics and Statistics
---|---
More specific subject area | Statistical learning, machine learning, mixed-effects model
Name of your method | 3Trees-EvTree and 3Trees-CTree
Name and reference of original method | A. Gottard, G. Vannucci, L. Grilli, C. Rampichini, Mixed-effect models with trees, Adv. Data Anal. Classif. 17 (2) (2023) 431–461, doi: 10.1007/s11634-022-00509-3.
Resource availability | The household expenditure per capita dataset of West Java and its predictor variables (individual level and village level), distributed among 27 regencies and cities, can be accessed on the BPS official website (https://silastik.bps.go.id).
Background
Linear mixed-effects models (LMMs), also known as multilevel, hierarchical, or random-effects models [1], have become increasingly popular among statisticians and data scientists. Owing to their ability to analyze dependent data structures, such as clustered/nested or longitudinal data, these models have been widely applied across various fields, such as education [[2], [3], [4]], biostatistics [5,6], psychology [7,8], and economics [[9], [10], [11]]. These models are designed to capture both the variance between units at the higher level (e.g., differences between schools, regions, or countries) and the intra-unit dependencies within those higher-level units (e.g., students within a school, patients within a healthcare facility, households within a region).
At present, the estimation procedures of hierarchical mixed-effects models have been extended with machine learning methods. These approaches are considered more powerful as they are not constrained by the assumptions typically required by statistical models [12]. Hajjem et al. [13] proposed the mixed-effects regression tree (MERT) model, designed to handle clustered or hierarchical data. This model extends the standard regression tree by replacing the linear fixed-effects component of LMMs with a tree structure built with the CART algorithm, while the random effects are estimated based on a node-invariant linear structure. Sela and Simonoff [14] also developed a data mining technique, RE-EM trees, intended for clustered and longitudinal data with numerical response variables. This approach integrates the flexibility of data mining with the distinct characteristics of clustered or longitudinal datasets, and their simulation results confirm that RE-EM trees generally outperform classical linear mixed models. Additionally, the literature has been extended with numerous variants and extensions that allow, for instance, response or target outcome distributions that deviate from the Gaussian assumption (see Fontana et al. [15] and Pellagatti et al. [16]).
Regression trees are highly effective in estimating the fixed components of mixed-effects models and generating accurate predictions. However, they require many splits to build a sufficiently complex tree structure, for instance when approximating semilinear functions, and this added complexity can substantially influence the model's predictive performance (see Vannucci [17] and Hastie et al. [18]). To overcome this issue, Gottard et al. [19] proposed a new method called the hierarchical mixed-effects with trees model (known as the 3Trees model). This model integrates a linear part and regression trees in a single equation and effectively balances the predictive and generative perspectives: the linear term helps to keep the three tree structures as short as possible, while the tree components function as weak learners for prediction. The resulting model is simpler to understand than a single tree or a random forest, and it delivers better predictive results than a linear mixed-effects model. The 3Trees model employs three separate decision trees to capture non-linearity and interactions among explanatory variables effectively, all built upon the foundational CART algorithm [19]. The estimation of the 3Trees model parameters is performed through an iterative procedure, similar to backfitting, which alternates between fitting the linear component and the tree components.
Although the CART algorithm has shown excellent performance in regression and classification settings [[20], [21], [22]], including within the 3Trees model [19], various studies have identified notable limitations (see, e.g., Grubinger et al. [23] and Hothorn et al. [24]). The approach has three fundamental problems: overfitting, a selection bias towards certain covariates [24], and locally optimal solutions [23]. CART tends to overfit because it performs an exhaustive search over all possible splits, maximizing an information measure of node impurity without considering other aspects. Its selection bias arises because it favors covariates with many potential split points, primarily when the measurement scales differ between covariates, which can lead to skewed results. Finally, CART uses a greedy forward stepwise search that selects the best split at each step to maximize the homogeneity of the child nodes, without considering the overall tree structure; because it optimizes splits one step at a time rather than globally, it often fails to find the best possible tree, resulting in suboptimal, locally optimal trees that for some problems are far from the optimal solution. Ultimately, this could reduce the performance of the 3Trees model.
This paper addresses these issues by modifying the regression tree method used to select the best trees in the 3Trees algorithm, utilizing alternative single-tree approaches. Two new 3Trees models, called the 3Trees-EvTree and 3Trees-CTree algorithms, are proposed; they use the EvTree and CTree methods, respectively, to fit the tree components of the 3Trees model. The EvTree algorithm avoids the locally optimal solutions of CART's greedy heuristic, resulting in potentially more compact and accurate models, especially for complex datasets. Meanwhile, the CTree algorithm avoids the known bias of CART towards variables with many possible splits or missing values, and offers well-defined stopping criteria based on statistical hypothesis testing. Compared with the previous algorithm, the newly proposed methods can potentially enhance the performance of the 3Trees model. Both methods are expected to yield smaller prediction errors than the previous approach, both during the tree selection phase and in the best-model estimation stage of the 3Trees backfitting algorithm.
Method details
Hierarchical mixed-effects model with trees (3Trees)
The 3Trees model, a hierarchical mixed-effects model involving three trees, was first proposed by Gottard et al. [19]. This additive model combines linear components with three decision trees within a mixed-effects framework. The first tree aims to capture interactions and non-linearity at the individual level (level 1), the second tree captures interactions and non-linearity at the cluster/group level (level 2), and the third tree focuses on capturing cross-level interactions and non-linearity between levels 1 and 2. The linear component helps to keep the tree structures as concise as possible. In contrast, the three trees act as weak learners, allowing the model to address complex hierarchical data structures flexibly.
Suppose that the data are $(y_{ij}, \mathbf{x}_{ij}, \mathbf{z}_j)$, where $i = 1, \ldots, n_j$ is an index for individuals and $j = 1, \ldots, J$ is an index for clusters; $y_{ij}$ denotes the response variable. The hierarchical mixed-effects model with trees (Eq. (1)) is

$$y_{ij} = \mathbf{x}_{ij}^{\top}\boldsymbol{\beta} + \mathbf{z}_{j}^{\top}\boldsymbol{\gamma} + g_1(\mathbf{x}_{ij}) + g_2(\mathbf{z}_j) + g_3(\mathbf{x}_{ij}, \mathbf{z}_j) + u_j + \varepsilon_{ij} \qquad (1)$$

with the tree components $g_1$, $g_2$, and $g_3$ estimated using the CART algorithm. Note that Eq. (1), when employing the tree selection process using CART, is referred to as the 3Trees-CART model. Meanwhile, if the tree selection process utilizes EvTree or CTree, it is referred to as the 3Trees-EvTree or 3Trees-CTree model, respectively. $\boldsymbol{\beta}$ and $\boldsymbol{\gamma}$ represent the fixed-effect parameters for the observation units and group units, respectively. The covariate matrix at the individual level has $N = \sum_{j=1}^{J} n_j$ rows, while the matrix at the group level has $J$ rows. The within-group errors $\varepsilon_{ij}$ and the random effects $u_j$ are assumed to be independent, identically distributed (iid) stochastic errors, following the distributions $\varepsilon_{ij} \sim N(0, \sigma^2_{\varepsilon})$ and $u_j \sim N(0, \sigma^2_u)$, and both errors are assumed to be independent of each other. The random component $u_j$ represents the random intercept for each group $j$, with $j = 1, \ldots, J$.
Specifically, $\mathcal{P}_1$ is the partition space for level-1 predictors, $\mathcal{P}_2$ is the partition space for level-2 predictors, and $\mathcal{P}_3$ is the combined partition space for both level-1 and level-2 predictors. The model can be expressed as an additive model equation (Eq. (2)):

$$y_{ij} = \mathbf{x}_{ij}^{\top}\boldsymbol{\beta} + \mathbf{z}_{j}^{\top}\boldsymbol{\gamma} + \sum_{m=1}^{M_1} c_{1m}\,\mathbb{1}\{\mathbf{x}_{ij} \in R_{1m}\} + \sum_{m=1}^{M_2} c_{2m}\,\mathbb{1}\{\mathbf{z}_{j} \in R_{2m}\} + \sum_{m=1}^{M_3} c_{3m}\,\mathbb{1}\{(\mathbf{x}_{ij}, \mathbf{z}_{j}) \in R_{3m}\} + u_j + \varepsilon_{ij} \qquad (2)$$

with $R_{km}$ representing the $m$-th partition region of tree $k$, where each tree is a factor with categories $m = 1, \ldots, M_k$. Each partition is identified by multiplying all dummy variables determined by binary splits along the path from the root node to the corresponding leaf of the tree. The tree parameters are denoted by $c_{km}$, with $T_k$ representing the tree structure. The linear component parameters include $\boldsymbol{\beta}$ and $\boldsymbol{\gamma}$, while the variance parameters for the random effects and error components are $\sigma^2_u$ and $\sigma^2_{\varepsilon}$, respectively. The variance components for random effects are estimated using Restricted Maximum Likelihood (REML).
Gottard et al. [19] developed the 3Trees algorithm (Alg-3Trees) for estimating and predicting the 3Trees model. The pseudocode for the Alg-3Trees algorithm is presented in Algorithm 1. This algorithm consists of two major stages: the selection stage and the estimation stage. During the selection stage, the process identifies the best tree for tree 1, tree 2, and tree 3. This stage involves an iterative procedure with convergence criteria based on two conditions: first, if the difference in mean square error (MSE) between two consecutive iterations falls below a predefined threshold, and second, if the maximum allowed number of iterations is reached. Following the selection stage, the estimation stage is carried out using maximum likelihood estimation and Restricted Maximum Likelihood (REML) methods. The 3Trees-EvTree and 3Trees-CTree algorithms are structured similarly to Algorithm 1, with the primary distinction being in the tree selection step.
Algorithm 1.
Alg-3Trees.
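To make the alternating estimation concrete, the following is a minimal R sketch of the selection stage, using rpart for the tree components as in 3Trees-CART; swapping the rpart calls for ctree or evtree calls yields the proposed variants. The function and column names (fit_3trees, cluster, y) are illustrative rather than the authors' implementation, and the convergence check is simplified.

```r
library(lme4)
library(rpart)

# Selection stage of Alg-3Trees (sketch): alternate between the linear
# mixed part and the three trees, each fitted to the others' residuals.
fit_3trees <- function(data, x_vars, z_vars, max_iter = 100, tol = 1e-4) {
  data$t1 <- data$t2 <- data$t3 <- 0            # tree contributions, start at 0
  ctl <- rpart.control(maxdepth = 3, minsplit = 10, cp = 0)
  mse_old <- Inf
  for (iter in seq_len(max_iter)) {
    ## 1. Linear mixed part on the response minus current tree contributions
    data$y_lin <- data$y - data$t1 - data$t2 - data$t3
    lmm <- lmer(reformulate(c(x_vars, z_vars, "(1 | cluster)"), "y_lin"),
                data = data, REML = TRUE)
    data$res <- data$y - predict(lmm, re.form = NA)   # fixed-effects residuals
    ## 2. Backfit each tree on the residuals of the other components
    tree1 <- rpart(reformulate(x_vars, "r"),
                   data = transform(data, r = res - t2 - t3), control = ctl)
    data$t1 <- predict(tree1, newdata = data)
    tree2 <- rpart(reformulate(z_vars, "r"),
                   data = transform(data, r = res - t1 - t3), control = ctl)
    data$t2 <- predict(tree2, newdata = data)
    tree3 <- rpart(reformulate(c(x_vars, z_vars), "r"),
                   data = transform(data, r = res - t1 - t2), control = ctl)
    data$t3 <- predict(tree3, newdata = data)
    ## 3. Stop when the change in MSE falls below the threshold
    mse_new <- mean((data$res - data$t1 - data$t2 - data$t3)^2)
    if (abs(mse_old - mse_new) < tol) break
    mse_old <- mse_new
  }
  list(lmm = lmm, trees = list(tree1, tree2, tree3))
}
```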
CART algorithm
The CART algorithm, introduced by Breiman [25], is one of the most widely used regression tree approaches in recent years. This method is based on a binary splitting process, performed recursively using predictor variables to generate terminal nodes that contain predictions for the target outcome. Each variable in the dataset is divided at candidate split points, followed by the computation of the sum of squared errors or the MSE. The sum of squared errors is calculated separately for each partition and then summed, and the splitting point that yields the smallest sum of squared errors is selected as the split for the root node. Given a dataset with predictors $x_i$ and responses $y_i$, $i = 1, \ldots, N$, the squared error of node $m$ with region $R_m$ containing $N_m$ observations is defined as follows (Eq. (3)):

$$Q_m(T) = \frac{1}{N_m} \sum_{x_i \in R_m} (y_i - \hat{c}_m)^2 \qquad (3)$$

where for each node $m$, $\hat{c}_m$ is formulated as $\hat{c}_m = \frac{1}{N_m} \sum_{x_i \in R_m} y_i$. Furthermore, if $s$ is defined as a split of node $m$ into children $m_L$ and $m_R$, then the split is selected by maximizing the decrease in cost with the following function (Eq. (4)):

$$\Delta Q(s, m) = Q_m(T) - \left( \frac{N_{m_L}}{N_m} Q_{m_L}(T) + \frac{N_{m_R}}{N_m} Q_{m_R}(T) \right) \qquad (4)$$
The CART algorithm is briefly outlined in pseudocode form in Algorithm 2 [17].
Algorithm 2.
Regression trees.
To avoid excessive splitting that may lead to increasingly complex trees and result in overfitting, the CART method incorporates a pruning process. Decision tree pruning uses cost complexity, which aims to find the optimal balance between prediction accuracy and model complexity, thereby producing a model that generalizes well to new data. The cost-complexity criterion $C_{\alpha}(T)$, with $|T|$ denoting the number of terminal nodes of tree $T$ and tuning parameter $\alpha \ge 0$, is given by the following equation (Eq. (5)):

$$C_{\alpha}(T) = \sum_{m=1}^{|T|} N_m Q_m(T) + \alpha |T| \qquad (5)$$
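As an illustration of Eq. (5) in practice, a single tree can be grown large and then pruned back using rpart's cost-complexity table; df, y, x1, and x2 are placeholder names.

```r
library(rpart)

# Grow a deliberately large tree: cp = 0 disables the complexity penalty
# during growing, so only maxdepth/minsplit limit the tree
fit <- rpart(y ~ x1 + x2, data = df,
             control = rpart.control(cp = 0, maxdepth = 3, minsplit = 10))

# Prune back to the subtree minimizing the cross-validated error, i.e.,
# pick the cp (alpha) that balances accuracy against tree size
best_cp <- fit$cptable[which.min(fit$cptable[, "xerror"]), "CP"]
pruned  <- prune(fit, cp = best_cp)
```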
CTree algorithm
The CTree algorithm is a machine learning technique that employs robust statistical testing (significance testing) to construct regression trees, resulting in more reliable and well-generalized models [24,26]. The significance test involves the permutation of variables to determine whether the splits at each step of the decision tree construction are meaningful, facilitating unbiased recursive partitioning of predictors [24,27]. The purpose of this process is to ensure that each split made within the decision tree significantly enhances the tree's ability to predict the target variable based on the permutation of target variable values. If a split does not provide significant improvement, it is deemed unimportant and not performed.
The CTree algorithm for selecting the tree components involves the following stages:

1. Let $\mathbf{y}$ be the response vector consisting of $n$ observations, and let $\mathbf{x}$ and $\mathbf{z}$ be the predictor vectors of dimensions $p$ and $q$, respectively.

2. Evaluating a potential split is conducted through a hypothesis test, where the global null hypothesis $H_0$ asserts that no relationship exists between the predictors and the response variable. This hypothesis is assessed by examining a set of partial null hypotheses, one per predictor. If the null hypothesis is rejected, it indicates a statistically significant association between the predictor and the response, justifying the split. Conversely, if the null hypothesis cannot be rejected, the split is deemed unnecessary and is not performed.

3. Calculating the p-value for each partial null hypothesis. The p-value is a statistical measure that indicates the strength of the evidence against the null hypothesis. If the minimum computed p-value is lower than the predefined significance level, the global null hypothesis is rejected and a split is made at the current node. Otherwise, the recursive process stops and the node is designated as a terminal node.

4. Determining the threshold for the minimum p-value is based on adjusting for multiple comparisons while maintaining a fixed significance level $\alpha$. At this stage, the Bonferroni method can be applied, where the significance level is divided by the number of comparisons being made.

5. Selecting the variable for the splitting process involves evaluating the appropriate test statistic under the partial null hypothesis $H_0^{j}$. If a split is to be made, the variable to be split is chosen based on the test statistic corresponding to $H_0^{j}$; the same statistic is also used to calculate the p-value needed to assess its significance.
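For illustration, the five stages above correspond to a single call in R; the ctree_control interface of partykit (one of the implementations used later) exposes the significance level and the Bonferroni adjustment directly, with df, y, x1, and x2 as placeholder names.

```r
library(partykit)

# Conditional inference tree: a node is split only where the permutation
# test rejects the Bonferroni-adjusted null of independence at alpha = 0.05
fit <- ctree(y ~ x1 + x2, data = df,
             control = ctree_control(testtype = "Bonferroni",
                                     alpha = 0.05, maxdepth = 3))
plot(fit)  # terminal nodes are those where the global null was not rejected
```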
The CTree algorithm can be summarized in pseudocode form as follows in Algorithm 3 [17].
Algorithm 3.
Conditional Inference Regression tree.
EvTree algorithm
The EvTree method, developed as a globally optimizing alternative to CART, is an evolutionary algorithm aimed at finding globally optimal tree structures rather than the locally optimal solutions typically generated by traditional methods such as CART and C4.5. This approach leverages evolutionary concepts such as mutation, crossover, and selection to refine a population of candidate trees, optimizing them according to a fitness function that balances prediction accuracy and model simplicity. Unlike conventional tree models, which use a greedy algorithm to optimize each split individually, EvTree conducts a broader, more comprehensive search over the parameter space, resulting in models that are not only more accurate but often simpler and more efficient [23].
Step-by-step process of the EvTree algorithm:
1. Initialization: The process starts with generating a population of random trees. Each tree is initialized by assigning a valid split rule at the root node. If a valid split cannot be generated, the process is repeated until all trees in the population are initialized.

2. Parent Selection: In each iteration, trees are selected as "parents" to undergo modification. If the crossover operation is selected, two parents are chosen randomly for recombination.

3. Variation Operators: The evolutionary process relies on several variation operators (see Fig. 1):
   a) Split Mutation: A random terminal node is selected, and a new split rule is applied, converting it into an internal node.
   b) Prune Mutation: A random internal node is selected and pruned, reducing the complexity of the tree.
   c) Major Split Rule Mutation: This changes both the splitting variable and the split point at a given internal node.
   d) Minor Split Rule Mutation: This operator makes small adjustments to the split point without changing the splitting variable.
   e) Crossover: Subtrees from two parent trees are exchanged to create new trees.

4. Evaluation Function: The quality of each tree is assessed using an evaluation function that balances prediction accuracy (e.g., misclassification rate or MSE) and complexity (e.g., the number of terminal nodes). Trees with better performance are favored in the next generation.

5. Survivor Selection: After each iteration, a competition occurs between parent trees and their offspring. The tree with the better evaluation score is retained, ensuring the population evolves towards better solutions.

6. Termination: The algorithm terminates when the quality of the best solutions stabilizes over several iterations or after a predefined number of iterations. The best-performing tree is then selected as the final solution.
Fig. 1.
Variation operators of the EvTree algorithm: (a) grow, (b) prune, (c) mutation, (d) crossover.
The EvTree algorithm is briefly outlined in pseudocode in Algorithm 4 [17].
Algorithm 4.
Evtree.
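As a usage sketch, fitting a single evolutionary tree in R with the evtree package (used later for the 3Trees-EvTree model) looks as follows; df, y, x1, and x2 are placeholder names, and the seed matters because the search is stochastic.

```r
library(evtree)

set.seed(2024)  # the evolutionary search is stochastic
# Search globally over tree structures; candidate trees are scored by an
# evaluation function trading off accuracy against the number of leaves
fit <- evtree(y ~ x1 + x2, data = df,
              control = evtree.control(maxdepth = 3, niterations = 10000))
plot(fit)
```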
Method validation
Simulation study design
In this section, simulation studies are performed to evaluate the performance of the 3Trees model using the proposed algorithms, CTree and EvTree, to estimate the tree components, in comparison with the previous algorithm. The simulated datasets involve hierarchical mixed-effects models under three functional forms of the response variable, namely linear without interaction (M1), semilinear with interaction (M2), and nonlinear (M3), across 12 distinct settings. These models are derived from the tree-based mixed-effects framework outlined by Gottard et al. [19]. The detailed simulation design is provided in Table 1. Random effects were restricted to random intercepts. Predictors at both level 1 and level 2 were generated following normal and binomial distributions to assess the optimization of the proposed tree-based approaches, particularly regarding the unbiased recursive partitioning of mixed-scale variables. Both the fixed-effect parameters and regression tree components, including the Intraclass Correlation Coefficient (ICC) values and covariate correlations, were varied. The complete set of fixed- and random-effect parameters is also outlined in Table 1.
Table 1.

Simulation parameters for fixed and random effects in the simulation settings ($\beta$: level-1 fixed effects; $\gamma$: level-2 fixed effects; $\delta$: tree/interaction-term coefficients).

Scenario | Type of Model | $\beta_0$ | $\beta_1$ | $\beta_2$ | $\beta_3$ | $\beta_4$ | $\gamma_1$ | $\gamma_2$ | $\gamma_3$ | $\delta_1$ | $\delta_2$ | $\delta_3$ | ICC | $\sigma^2_u$ | $\sigma^2_\varepsilon$ | Correlation | $\rho$
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---
M1 | Linear | 5 | 2 | 0 | 3 | 1 | 3 | 0 | 1 | 0 | 0 | 0 | high | 3 | 1 | high | 0.8
M1 | Linear | 5 | 2 | 0 | 3 | 1 | 3 | 0 | 1 | 0 | 0 | 0 | small | 0.3 | 0.5 | high | 0.8
M1 | Linear | 5 | 2 | 0 | 3 | 1 | 3 | 0 | 1 | 0 | 0 | 0 | high | 3 | 1 | small | 0.2
M1 | Linear | 5 | 2 | 0 | 3 | 1 | 3 | 0 | 1 | 0 | 0 | 0 | small | 0.3 | 0.5 | small | 0.2
M2 | Semilinear + interaction | 5 | 2 | 0 | 3 | 1 | 2 | 0 | 1 | 3 | 2 | −2 | high | 3 | 1 | high | 0.8
M2 | Semilinear + interaction | 5 | 2 | 0 | 3 | 1 | 2 | 0 | 1 | 3 | 2 | −2 | small | 0.3 | 0.5 | high | 0.8
M2 | Semilinear + interaction | 5 | 2 | 0 | 3 | 1 | 2 | 0 | 1 | 3 | 2 | −2 | high | 3 | 1 | small | 0.2
M2 | Semilinear + interaction | 5 | 2 | 0 | 3 | 1 | 2 | 0 | 1 | 3 | 2 | −2 | small | 0.3 | 0.5 | small | 0.2
M3 | Nonlinear | 5 | 2 | 0 | 2 | 2 | 3 | 0 | 2 | 2 | 1 | 0 | high | 3 | 1 | high | 0.8
M3 | Nonlinear | 5 | 2 | 0 | 2 | 2 | 3 | 0 | 2 | 2 | 1 | 0 | small | 0.3 | 0.5 | high | 0.8
M3 | Nonlinear | 5 | 2 | 0 | 2 | 2 | 3 | 0 | 2 | 2 | 1 | 0 | high | 3 | 1 | small | 0.2
M3 | Nonlinear | 5 | 2 | 0 | 2 | 2 | 3 | 0 | 2 | 2 | 1 | 0 | small | 0.3 | 0.5 | small | 0.2
For each simulated dataset, the data-generating process is as follows. Four random variables are first generated at level 1 and three at level 2, collected in the vectors $\mathbf{x}_{ij}$ and $\mathbf{z}_j$, respectively. Five of these variables are drawn from a multivariate normal distribution.

Note that two covariance structures are considered for the covariates at each level. Two correlation specifications are set, $\rho = 0.2$ (small) and $\rho = 0.8$ (high), and the covariance matrices are taken to have unit variances with common off-diagonal correlation $\rho$.

Additionally, the two remaining variables are generated as binary covariates.
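A minimal sketch of this covariate-generation step, assuming unit variances with common off-diagonal correlation rho and, as one possible split of the seven variables, three continuous plus one binary variable at level 1 and two continuous plus one binary variable at level 2:

```r
library(MASS)

# J clusters of n observations each; rho is the common covariate correlation
gen_covariates <- function(J, n, rho) {
  S1 <- matrix(rho, 3, 3); diag(S1) <- 1   # level-1 covariance (unit variances)
  S2 <- matrix(rho, 2, 2); diag(S2) <- 1   # level-2 covariance (unit variances)
  X <- mvrnorm(J * n, mu = rep(0, 3), Sigma = S1)              # level-1 continuous
  Z <- mvrnorm(J, mu = rep(0, 2), Sigma = S2)[rep(1:J, each = n), ]
  data.frame(x1 = X[, 1], x2 = X[, 2], x3 = X[, 3],
             x4 = rbinom(J * n, 1, 0.5),                       # level-1 binary
             z1 = Z[, 1], z2 = Z[, 2],
             z3 = rep(rbinom(J, 1, 0.5), each = n),            # level-2 binary
             cluster = rep(seq_len(J), each = n))
}
```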
In the following simulations, the random components, the between- and within-cluster terms, are drawn from normal distributions, $u_j \sim N(0, \sigma^2_u)$ and $\varepsilon_{ij} \sim N(0, \sigma^2_\varepsilon)$, respectively. To obtain ICC values of 0.75 and 0.375, $(\sigma^2_u, \sigma^2_\varepsilon) = (3, 1)$ and $(0.3, 0.5)$ are used (Table 1). Next, we generate the response variable based on the above information under three model scenarios with the following equations:
a) Scenario M1: linear mixed-effect model,
$$y_{ij} = \beta_0 + \sum_{k=1}^{4} \beta_k x_{kij} + \sum_{l=1}^{3} \gamma_l z_{lj} + u_j + \varepsilon_{ij} \qquad (6)$$
b) Scenario M2: semilinear mixed-effect model including an interaction term (Eq. (7)), whose nonlinear and interaction terms carry the coefficients $\delta_1$, $\delta_2$, $\delta_3$ of Table 1;
c) Scenario M3: nonlinear mixed-effect model (Eq. (8)).
Note that the true parameters for all model scenarios are given in Table 1. We also set parameters at both level 1 and level 2 ($\beta_2$ and $\gamma_2$) to zero, or considered them negligible, to examine the algorithm's ability to handle irrelevant predictors or noise variables.
A hierarchical structure of $J$ clusters with $n_j$ observations each is randomly generated, resulting in a total of 3500 observations for each of M1, M2, and M3. Moreover, to investigate the consistency of the models in estimating parameters and to compare prediction accuracy, several datasets of increasing size are generated to test the model under scenario M2, with both the correlation and the ICC set to their high levels. The data are split into training and testing sets with a 50 % ratio, so that the training and testing sets contain the same number of clusters and observations; this procedure is applied to all sample sizes considered. The whole process is repeated 100 times to create 100 datasets.
For each simulation setting, we run six different models: our proposed models 3Trees-CTree and 3Trees-EvTree, 3Trees-CART [19] with two distinct complexity parameter values, 3Trees-TSCTree, and a linear mixed-effects model. The 3Trees-CART models, proposed by Gottard et al. [19], are fitted using the R package rpart [28] for the tree components and lme4 for the linear mixed-effects component of Eq. (1). The rpart function in this package implements the recursive partitioning method based on the CART algorithm. All tuning parameters of the rpart function are set to their default values, except for the complexity parameter (cp = 0 and cp = 0.0001), maxdepth=3, and nmin=10. Meanwhile, the package party [29] is used to estimate the tree components of 3Trees-CTree. The ctree function deploys a conditional inference tree algorithm that recursively partitions the data based on statistically significant associations between covariates and the response variable, while controlling for overfitting through hypothesis testing. Only maxdepth=3 is adjusted, while all other hyperparameters are left at their default settings. This function is also available in the partykit package [30], which is used for the 3Trees-TSCTree model included in this analysis. The evtree package is used for the 3Trees-EvTree model; this package provides an implementation of evolutionary learning for constructing globally optimal CART-type trees in R. The maximum tree depth of the evtree function is set to 3, while the default settings are preserved for all other tuning parameters. For the linear mixed model, we utilize the lmer function available in the lme4 package.
The model performances are assessed as follows: (i) the absolute bias average, the difference between the mean of the estimated parameters and the true parameters; and (ii) predictive performance, for which the models are compared in terms of MSE and predictive MSE (PMSE), computed from predictions that consider only the fixed-effects component, and in terms of clusMSE and clusPMSE, which use predictions involving both the fixed and random effects. All criteria can be written in the form (Eq. (9)):

$$\mathrm{MSE} = \frac{1}{N} \sum_{j=1}^{J} \sum_{i=1}^{n_j} \left( y_{ij} - \hat{y}_{ij} \right)^2 \qquad (9)$$

where $\hat{y}_{ij}$ is the fixed-effects-only prediction for MSE and PMSE, and additionally includes the predicted random intercept $\hat{u}_j$ for clusMSE and clusPMSE; MSE and clusMSE are computed on training data, while PMSE and clusPMSE are computed on testing data.
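With an lme4-style fit, these criteria can be computed as in the sketch below, where fit, train, and test are placeholders and re.form = NA drops the random effects from the prediction; for the 3Trees models the estimated tree contributions would be added to each prediction.

```r
mse <- function(y, yhat) mean((y - yhat)^2)

MSE      <- mse(train$y, predict(fit, newdata = train, re.form = NA))  # fixed only
clusMSE  <- mse(train$y, predict(fit, newdata = train))                # + random effects
PMSE     <- mse(test$y,  predict(fit, newdata = test, re.form = NA))
clusPMSE <- mse(test$y,  predict(fit, newdata = test, allow.new.levels = TRUE))
```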
Results of simulation studies
In our comparative analysis of the 3Trees (CART, EvTree, CTree, TSCTree) and LMMs algorithms, each setting was run 100 times to ensure robustness of the findings. First, the performance of the methods is evaluated in terms of MSE, PMSE, clusMSE, and clusPMSE. The averages of these criteria are summarized in Table 2, Table 3, along with their standard deviations in parentheses. The left part reports the results corresponding to high ICC values, while the right part reports the results for low ICC values; the two tables differ in the correlation values. Bold values mark the smallest error rates for each scenario. As shown in these tables, the prediction results of models including random effects generally outperform those without random effects in both training and testing data across all scenarios; it is notable that the clusMSE and clusPMSE values are smaller than the MSE and PMSE values, confirming that the constructed simulation model conforms to the expected specifications. Furthermore, all 3Trees models, including the LMMs, display relatively higher MSE values under fully nonlinear conditions (scenario M3). This finding is consistent with Gottard et al. [19], who suggest that as the complexity of the function increases, such as with nonlinear terms, a greater maximum depth is required to reduce the MSE. Also, since the base model is a linear mixed-effects model, these models may not be suitable for such conditions and may fail to capture the true nature of the data, a problem highlighted by Blozis and Harring [31].
Table 2.

Results of the 100 simulation runs in terms of the averages and standard deviations of MSE, ClusMSE, PMSE and ClusPMSE for six models. The smallest values are marked in bold.
Model ($\rho = 0.8$, ICC $= 0.75$) | MSE | ClusMSE | PMSE | ClusPMSE | Model ($\rho = 0.8$, ICC $= 0.375$) | MSE | ClusMSE | PMSE | ClusPMSE
---|---|---|---|---|---|---|---|---|---
Scenario M1 | | | | | Scenario M1 | | | |
3Trees-CART (cp=0.0001) | 3.569 (0.608) | 0.914 (0.029) | 3.674 (0.592) | 1.086 (0.040) | 3Trees-CART (cp=0.0001) | 6.747 (0.479) | 4.604 (0.136) | 7.393 (0.529) | 5.451 (0.195) |
3Trees-CART (cp=0) | 3.150 (0.469) | 0.913 (0.03) | 3.264 (0.460) | 1.087 (0.041) | 3Trees-CART (cp=0) | 6.725 (0.462) | 4.604 (0.137) | 7.355 (0.515) | 5.451 (0.192) |
3Trees-CTree | 3.235 (0.420) | 0.946 (0.029) | 3.305 (0.405) | 1.050 (0.039) | 3Trees-CTree | 7.059 (0.453) | 4.767 (0.137) | 7.340 (0.484) | 5.272 (0.187) |
3Trees-EvTree | 3.639 (0.487) | 0.946 (0.029) | 3.673 (0.477) | 1.051 (0.040) | 3Trees-EvTree | 7.478 (0.508) | 4.780 (0.139) | 7.652 (0.537) | 5.250 (0.183) |
LMMs | 3.808 (0.492) | 0.954 (0.029) | 3.862 (0.487) | 1.040 (0.039) | LMMs | 7.696 (0.523) | 4.818 (0.140) | 7.780 (0.531) | 5.202 (0.184) |
3Trees-TSCTree | 3.860 (0.502) | 0.954 (0.029) | 3.883 (0.489) | 1.040 (0.039) | 3Trees-TSCTree | 7.765 (0.551) | 4.820 (0.140) | 7.810 (0.549) | 5.202 (0.184) |
Scenario M2 | Scenario M2 | ||||||||
3Trees-CART (cp=0.0001) | 3.774 (0.846) | 1.153 (0.373) | 3.865 (0.839) | 1.338 (0.430) | 3Trees-CART (cp=0.0001) | 7.384 (0.729) | 4.965 (0.452) | 7.938 (0.759) | 5.791 (0.492) |
3Trees-CART (cp=0) | 3.683 (0.813) | 1.189 (0.381) | 3.781 (0.834) | 1.379 (0.444) | 3Trees-CART (cp=0) | 7.353 (0.742) | 4.967 (0.451) | 7.917 (0.784) | 5.792 (0.489) |
3Trees-CTree | 5.107 (0.874) | 1.878 (0.104) | 5.207 (0.891) | 2.076 (0.112) | 3Trees-CTree | 9.048 (0.908) | 5.670 (0.204) | 9.365 (0.874) | 6.288 (0.217) |
3Trees-EvTree | 3.689 (0.532) | 0.944 (0.032) | 3.707 (0.524) | 1.064 (0.042) | 3Trees-EvTree | 7.638 (0.503) | 4.760 (0.156) | 7.830 (0.556) | 5.270 (0.180) |
LMMs | 7.443 (0.779) | 3.296 (0.178) | 7.485 (0.759) | 3.575 (0.172) | LMMs | 11.479 (0.788) | 7.143 (0.294) | 11.619 (0.775) | 7.756 (0.314) |
3Trees-TSCTree | 7.510 (0.780) | 3.296 (0.178) | 7.505 (0.750) | 3.575 (0.172) | 3Trees-TSCTree | 11.561 (0.790) | 7.145 (0.294) | 11.655 (0.773) | 7.756 (0.314) |
Scenario M3 | Scenario M3 | ||||||||
3Trees-CART (cp=0.0001) | 4.808 (0.599) | 2.269 (0.263) | 5.169 (0.579) | 2.805 (0.299) | 3Trees-CART (cp=0.0001) | 8.362 (0.590) | 5.894 (0.296) | 9.201 (0.727) | 7.115 (0.440) |
3Trees-CART (cp=0) | 4.760 (0.593) | 2.271 (0.265) | 5.122 (0.587) | 2.806 (0.300) | 3Trees-CART (cp=0) | 8.348 (0.579) | 5.890 (0.297) | 9.200 (0.724) | 7.119 (0.443) |
3Trees-CTree | 6.539 (0.716) | 4.130 (0.589) | 7.036 (0.715) | 4.767 (0.621) | 3Trees-CTree | 10.448 (0.74) | 7.960 (0.543) | 11.262 (0.909) | 9.193 (0.727) |
3Trees-EvTree | 4.768 (0.548) | 1.876 (0.194) | 5.239 (0.531) | 2.505 (0.227) | 3Trees-EvTree | 8.820 (0.602) | 5.818 (0.305) | 9.620 (0.711) | 7.099 (0.382) |
LMMs | 12.985 (1.057) | 9.800 (0.878) | 13.152 (0.974) | 10.64 (0.812) | LMMs | 16.819 (1.069) | 13.528 (0.873) | 17.28 (1.220) | 14.858 (0.985) |
3Trees-TSCTree | 13.041 (1.068) | 9.807 (0.879) | 13.163 (0.968) | 10.639 (0.811) | 3Trees-TSCTree | 16.882 (1.074) | 13.539 (0.874) | 17.29 (1.219) | 14.856 (0.987) |
Table 3.

Results of the 100 simulation runs in terms of the averages and standard deviations of MSE, ClusMSE, PMSE and ClusPMSE under six models. The smallest values are marked in bold (continued).
Model ($\rho = 0.2$, ICC $= 0.75$) | MSE | ClusMSE | PMSE | ClusPMSE | Model ($\rho = 0.2$, ICC $= 0.375$) | MSE | ClusMSE | PMSE | ClusPMSE
---|---|---|---|---|---|---|---|---|---
Scenario M1 | | | | | Scenario M1 | | | |
3Trees-CART (cp=0.0001) | 3.537 (0.548) | 0.913 (0.034) | 3.633 (0.540) | 1.089 (0.040) | 3Trees-CART (cp=0.0001) | 6.811 (0.481) | 4.597 (0.154) | 7.363 (0.480) | 5.421 (0.190) |
3Trees-CART (cp=0) | 3.141 (0.442) | 0.912 (0.034) | 3.247 (0.433) | 1.088 (0.039) | 3Trees-CART (cp=0) | 6.778 (0.459) | 4.596 (0.153) | 7.347 (0.476) | 5.424 (0.194) |
3Trees-CTree | 3.164 (0.410) | 0.947 (0.034) | 3.211 (0.406) | 1.052 (0.039) | 3Trees-CTree | 7.060 (0.444) | 4.748 (0.161) | 7.328 (0.440) | 5.262 (0.173) |
3Trees-EvTree | 3.598 (0.462) | 0.947 (0.035) | 3.618 (0.459) | 1.051 (0.037) | 3Trees-EvTree | 7.526 (0.484) | 4.760 (0.162) | 7.675 (0.496) | 5.238 (0.173) |
LMMs | 3.755 (0.488) | 0.954 (0.034) | 3.779 (0.498) | 1.042 (0.037) | LMMs | 7.747 (0.491) | 4.797 (0.164) | 7.835 (0.514) | 5.198 (0.170) |
3Trees-TSCTree | 3.812 (0.49) | 0.954 (0.034) | 3.812 (0.495) | 1.042 (0.037) | 3Trees-TSCTree | 7.797 (0.489) | 4.799 (0.164) | 7.851 (0.509) | 5.197 (0.170) |
Scenario M2 | Scenario M2 | ||||||||
3Trees-CART (cp=0.0001) | 4.326 (1.037) | 1.413 (0.432) | 4.466 (1.018) | 1.642 (0.492) | 3Trees-CART (cp=0.0001) | 7.930 (0.936) | 5.285 (0.426) | 8.542 (0.944) | 6.221 (0.523) |
3Trees-CART (cp=0) | 4.216 (1.066) | 1.392 (0.432) | 4.362 (1.060) | 1.619 (0.493) | 3Trees-CART (cp=0) | 7.913 (0.946) | 5.285 (0.421) | 8.536 (0.960) | 6.228 (0.517) |
3Trees-CTree | 3.780 (0.730) | 1.350 (0.389) | 3.890 (0.743) | 1.507 (0.415) | 3Trees-CTree | 7.768 (0.645) | 5.182 (0.405) | 8.072 (0.694) | 5.786 (0.450) |
3Trees-EvTree | 3.758 (0.561) | 0.947 (0.034) | 3.825 (0.540) | 1.071 (0.045) | 3Trees-EvTree | 7.732 (0.591) | 4.780 (0.163) | 7.867 (0.613) | 5.305 (0.180) |
LMMs | 7.594 (0.769) | 3.303 (0.168) | 7.661 (0.813) | 3.589 (0.214) | LMMs | 11.575 (0.832) | 7.152 (0.287) | 11.721 (0.803) | 7.779 (0.326) |
3Trees-TSCTree | 7.676 (0.791) | 3.304 (0.168) | 7.701 (0.825) | 3.589 (0.214) | 3Trees-TSCTree | 11.653 (0.856) | 7.154 (0.288) | 11.75 (0.813) | 7.779 (0.326) |
Scenario M3 | Scenario M3 | ||||||||
3Trees-CART (cp=0.0001) | 4.799 (0.645) | 2.324 (0.254) | 5.145 (0.678) | 2.820 (0.312) | 3Trees-CART (cp=0.0001) | 8.464 (0.621) | 6.002 (0.325) | 9.322 (0.626) | 7.188 (0.364) |
3Trees-CART (cp=0) | 4.786 (0.638) | 2.324 (0.254) | 5.130 (0.674) | 2.819 (0.313) | 3Trees-CART (cp=0) | 8.466 (0.634) | 6.002 (0.325) | 9.340 (0.622) | 7.192 (0.363) |
3Trees-CTree | 7.436 (0.866) | 5.088 (0.763) | 7.801 (0.887) | 5.676 (0.735) | 3Trees-CTree | 11.156 (0.891) | 8.736 (0.673) | 12.166 (0.918) | 10.098 (0.726) |
3Trees-EvTree | 4.769 (0.604) | 1.925 (0.174) | 5.198 (0.624) | 2.539 (0.208) | 3Trees-EvTree | 8.792 (0.623) | 5.872 (0.282) | 9.683 (0.614) | 7.150 (0.352) |
LMMs | 13.042 (1.003) | 9.922 (0.788) | 12.958 (1.01) | 10.537 (0.759) | LMMs | 16.859 (0.906) | 13.642 (0.68) | 17.261 (1.173) | 14.855 (0.99) |
3Trees-TSCTree | 13.114 (1.016) | 9.929 (0.789) | 12.982 (1.019) | 10.536 (0.760) | 3Trees-TSCTree | 16.924 (0.911) | 13.654 (0.681) | 17.284 (1.158) | 14.854 (0.99) |
With respect to the ICC value, the predictions of all 3Trees and LMMs models perform well when the random-effect variance is relatively large, regardless of the (small or high) correlation considered. In contrast, an increase in prediction error is observed for all methods when the ICC is low, across all criteria. This implies that the smaller the between-group variance, the more difficult it is for all models to predict the response variable accurately. Meanwhile, the MSE values do not show a marked difference between small and high correlation for any of the compared methods, suggesting that the 3Trees models are largely insensitive to the correlation among predictors. This finding corresponds to Gottard et al.'s study of the 3Trees model, which demonstrated that the 3Trees algorithm remains unaffected by correlation between predictors, showcasing the robustness of the 3Trees approach to multicollinearity and making it especially effective in scenarios involving highly correlated variables.
With regard to the linear function (scenario M1), on average, 3Trees-CART outperforms the other models when the complexity parameter is set to zero (cp=0), except under the PMSE and clusPMSE criteria, where the LMMs and 3Trees-TSCTree models perform slightly better. However, 3Trees-CART predicts the response variable more effectively than LMMs and 3Trees-TSCTree for the M1 and M2 scenarios. In Table 2, Table 3, notice that if the true underlying model is semilinear (scenario M2) or nonlinear (scenario M3), as expected, the EvTree algorithm performs better than the alternatives in terms of clusMSE and clusPMSE. Notably, our proposed EvTree-based method achieves superior performance in all evaluated criteria when the correlation among explanatory variables is low. The findings reveal that the evtree algorithm performs optimally in selecting trees within the 3Trees framework, minimizing prediction errors. However, this method does not outperform the previous method when the scenario involves a linear function (scenario M1).
The other proposed algorithm, 3Trees-CTree, stands out after 3Trees-EvTree for the M2 scenario when the correlation among covariates is small. However, this method shows no marked improvement in the remaining scenarios. On average, LMMs and 3Trees-TSCTree perform rather poorly, except under the PMSE criterion for scenario M1. Moreover, in the nonlinear model of scenario M3, these models show MSE values substantially higher than those of the other models. Note that the performance of the 3Trees-TSCTree model is notably inferior to the other models in the 3Trees framework, positioning it close to the results of the LMMs model. This similarity arises mainly because the TSCTree node predictions tend to produce a singular fit matrix, a result of collinearity among the invariant structures within the nodes.
Boxplots of the prediction errors for $\rho = 0.8$ and ICC $= 0.75$ are also displayed in Fig. 2. These boxplots depict the distribution of the prediction error values over the 100 repetitions. In this figure, each panel (Fig. 2a-2d) contains six boxplots of prediction accuracy for the three scenarios, from which the performance of our proposed methods with respect to all criteria is easily seen. Within scenario M1, all compared models have nearly the same variability. For scenario M2, the 3Trees-EvTree and 3Trees-CTree methods show shorter boxplots with lower medians under the clusMSE and clusPMSE criteria; their distributions are narrow, suggesting that they provide more stable and consistent predictions. The 3Trees-EvTree model, in particular, has the lowest median prediction error, making it the best performer overall. Meanwhile, the 3Trees-CART method exhibits high variability, with a wide range of these criteria across the simulations, although it shows lower variability than 3Trees-CTree in scenario M2. In contrast, EvTree produces consistent results. Furthermore, there is almost no difference in the prediction errors of LMMs and 3Trees-TSCTree across the M1, M2, and M3 scenarios. Boxplots for the other combinations of correlation and ICC levels are not included in this paper because they do not differ substantially from those shown.
Fig. 2.
Boxplots for distributions of evaluation metrics (a) MSE (b) ClusMSE (c) PMSE (d) ClusPMSE.
A further consideration is the estimation of the parameters in the linear component at both level 1 and level 2. Table 4 summarizes the statistics obtained from the 100 runs for each method, considering $\rho = 0.8$ and ICC $= 0.75$. This table contains the mean, standard deviation (in parentheses), and bias. For scenario M1, the biases of the linear component parameters are very small for all methods, as the means of the estimates are close to the true values for almost all parameters. LMMs and 3Trees-TSCTree have the most parameters with the smallest bias, and 3Trees-EvTree also estimates these parameters quite accurately. However, the 3Trees-CTree method shows substantial bias in several parameters, such as the intercept, $\gamma_1$, and $\gamma_3$, while the 3Trees-CART methods (cp=0 and cp=0.0001) reveal bias only in the intercept. In scenario M2, most methods slightly over- or underestimate some parameters, while LMMs and 3Trees-TSCTree stand out with significant bias in several of them, notably the intercept, $\beta_1$, and $\gamma_2$. As in scenario M1, almost all parameters are estimated with low bias in most settings of scenario M3; however, the 3Trees-CTree method shows bias in nearly all of its parameter estimates. Similar results are obtained in the other settings (Table 10, Appendix B).
Table 4.

Parameter averages, standard deviations, and bias of the estimated fixed-effect components produced by the 3Trees and LMMs models ($\rho = 0.8$ and ICC $= 0.75$).

Method | $\beta_0$ | Bias | $\beta_1$ | Bias | $\beta_2$ | Bias | $\beta_3$ | Bias | $\beta_4$ | Bias | $\gamma_1$ | Bias | $\gamma_2$ | Bias | $\gamma_3$ | Bias
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---
Scenario M1 | ||||||||||||||||
True Value | 5.000 | – | 2.000 | – | 0.000 | – | 3.000 | – | 1.000 | – | 3.000 | – | 0.000 | – | 1.000 | – |
3Trees-CART (cp=0.0001) | 2.728 (0.748) | 2.272 | 2.003 (0.054) | −0.003 | −0.008 (0.054) | 0.008 | 3.002 (0.053) | −0.002 | 0.990 (0.048) | 0.010 | 3.014 (0.379) | −0.014 | −0.047 (0.373) | 0.047 | 1.033 (0.423) | −0.033 |
3Trees-CART (cp=0) | 1.123 (1.343) | 3.877 | 1.997 (0.054) | 0.003 | −0.006 (0.054) | 0.006 | 3.006 (0.054) | −0.006 | 0.992 (0.048) | 0.008 | 2.943 (0.425) | 0.057 | −0.038 (0.419) | 0.038 | 1.057 (0.419) | −0.057 |
3Trees-CTree | 0.844 (1.478) | 4.156 | 1.961 (0.063) | 0.039 | 0.001 (0.045) | −0.001 | 2.917 (0.084) | 0.083 | 0.990(0.049) | 0.010 | 1.164 (0.656) | 1.836 | −0.067 (0.328) | 0.067 | 0.592 (0.543) | 0.408 |
3Trees-EvTree | 4.932 (0.706) | 0.068 | 2.002 (0.047) | −0.002 | 0.001 (0.047) | −0.001 | 2.995 (0.048) | 0.005 | 0.992 (0.048) | 0.008 | 2.984 (0.354) | 0.016 | −0.029 (0.348) | 0.029 | 1.041 (0.402) | −0.041 |
LMMs | 4.987 (0.483) | 0.013 | 2.001 (0.045) | −0.001 | 0.001 (0.045) | −0.001 | 2.996 (0.045) | 0.004 | 0.991 (0.049) | 0.009 | 3.019 (0.363) | −0.019 | −0.022 (0.358) | 0.022 | 1.045 (0.432) | −0.045 |
3Trees-TSCTree | 4.986 (0.293) | 0.014 | 2.001 (0.045) | −0.001 | 0.001 (0.045) | −0.001 | 2.996 (0.045) | 0.004 | 0.991 (0.049) | 0.009 | 3.014 (0.349) | −0.014 | −0.02 (0.345) | 0.020 | 1.048 (0.416) | −0.048 |
Scenario M2 | ||||||||||||||||
True Value | 5.000 | – | 2.000 | – | 0.000 | – | 3.000 | – | 1.000 | – | 2.000 | – | 0.000 | – | 1.000 | – |
3Trees-CART (cp=0.0001) | 0.981 (1.303) | 4.019 | 2.619 (0.139) | −0.619 | 0.025 (0.052) | −0.025 | 3.015 (0.051) | −0.015 | 1.002 (0.053) | −0.002 | 2.086 (0.446) | −0.086 | −0.656 (0.55) | 0.656 | 0.945 (0.466) | 0.055 |
3Trees-CART (cp=0) | 0.409 (1.439) | 4.591 | 2.625 (0.143) | −0.625 | 0.023 (0.053) | −0.023 | 3.016 (0.052) | −0.016 | 1.003 (0.054) | −0.003 | 2.069 (0.447) | −0.069 | −0.79 (0.576) | 0.790 | 0.988 (0.484) | 0.012 |
3Trees-CTree | 3.393 (0.862) | 1.607 | 1.978 (0.084) | 0.022 | 0.005 (0.064) | −0.005 | 2.817 (0.119) | 0.183 | 0.996 (0.068) | 0.004 | 1.384 (0.479) | 0.616 | −0.934 (0.424) | 0.934 | 0.650 (0.665) | 0.350 |
3Trees-EvTree | 5.375 (0.635) | −0.375 | 2.206 (0.081) | −0.206 | 0.009 (0.045) | −0.009 | 3.003 (0.045) | −0.003 | 1.003 (0.048) | −0.003 | 2.059 (0.363) | −0.059 | −0.134 (0.456) | 0.134 | 0.950 (0.407) | 0.050 |
LMMs | 6.005 (0.582) | −1.005 | 3.585 (0.084) | −1.585 | 0.022 (0.084) | −0.022 | 2.989 (0.084) | 0.011 | 1.007 (0.090) | −0.007 | 2.061 (0.436) | −0.061 | −1.635 (0.436) | 1.635 | 0.943 (0.518) | 0.057 |
3Trees-TSCTree | 6.021 (0.356) | −1.021 | 3.585 (0.084) | −1.585 | 0.022 (0.084) | −0.022 | 2.989 (0.084) | 0.011 | 1.007 (0.090) | −0.007 | 2.062 (0.419) | −0.062 | −1.639 (0.421) | 1.639 | 0.949 (0.500) | 0.051 |
Scenario M3 | ||||||||||||||||
True Value | 5.000 | – | 2.000 | – | 0.000 | – | 2.000 | – | 2.000 | – | 3.000 | – | 0.000 | – | 2.000 | – |
3Trees-CART (cp=0.0001) | 2.832 (1.329) | 2.168 | 2.032 (0.142) | −0.032 | −0.001 (0.071) | 0.001 | 1.997 (0.070) | 0.003 | 2.011 (0.076) | −0.011 | 3.058 (0.446) | −0.058 | −0.582 (0.442) | 0.582 | 2.058 (0.504) | −0.058 |
3Trees-CART (cp=0) | 2.668 (1.425) | 2.332 | 2.031 (0.142) | −0.031 | −0.004 (0.071) | 0.004 | 1.996 (0.070) | 0.004 | 2.011 (0.076) | −0.011 | 3.065 (0.450) | −0.065 | −0.572 (0.441) | 0.572 | 2.062 (0.507) | −0.062 |
3Trees-CTree | 3.782 (1.288) | 1.218 | −0.222 (0.109) | 2.222 | 0.008 (0.094) | −0.008 | 1.301 (0.124) | 0.699 | 1.897 (0.154) | 0.103 | 1.568 (0.594) | 1.432 | −0.626 (0.343) | 0.626 | 0.249 (1.105) | 1.751 |
3Trees-EvTree | 20.035 (0.864) | −15.035 | 2.007 (0.112) | −0.007 | 0.006 (0.087) | −0.006 | 1.996 (0.064) | 0.004 | 2.012 (0.068) | −0.012 | 3.031 (0.375) | −0.031 | −0.667 (0.418) | 0.667 | 2.041 (0.417) | −0.041 |
LMMs | 6.787 (0.520) | −1.787 | 2.015 (0.144) | −0.015 | 0.006 (0.145) | −0.006 | 2.005 (0.144) | −0.005 | 2.031 (0.155) | −0.031 | 3.037 (0.388) | −0.037 | −0.727 (0.39) | 0.727 | 2.065 (0.458) | −0.065 |
3Trees-TSCTree | 6.908 (0.321) | −1.908 | 2.014 (0.144) | −0.014 | 0.006 (0.144) | −0.006 | 2.005 (0.144) | −0.005 | 2.031 (0.155) | −0.031 | 3.042 (0.374) | −0.042 | −0.732 (0.376) | 0.732 | 2.066 (0.440) | −0.066 |
In addition to estimating fixed effects, we also compare the within- and between-group variances, as accurately estimating variance components is critically important [[32], [33], [34]]: when variance components are estimated precisely, the model better captures the true underlying data structure. This procedure also evaluates the performance of the REML method in estimating variance components. Fig. 3, Fig. 4 (ridgeline plots) display the distributions of the estimated standard deviations of the random effects and random errors, respectively, for all methods. The true values of these standard deviations, $\sigma_u = \sqrt{3} \approx 1.73$ and $\sigma_\varepsilon = 1$, are represented by the red vertical dashed lines in each figure. For scenario M1 (Fig. 3a), overall, the LMMs and 3Trees-TSCTree models appear to provide the most accurate and consistent estimates of the level-2 standard deviation among the methods evaluated, closely followed by 3Trees-EvTree and 3Trees-CTree. The 3Trees-CART methods show greater variability, especially with a complexity parameter of 0.0001.
Fig. 3.
Density ridgeline plots of standard deviations corresponding to random effects (a) scenario M1, (b) scenario M2 and (c) scenario M3.
Fig. 4.
Density ridgeline plots of standard deviations corresponding to random error (a) scenario M1, (b) scenario M2 and (c) scenario M3.
Regarding scenario M2 (Fig. 3b), the 3Trees-CTree method exhibits the best performance in estimating the standard deviation of primary interest, showing minimal bias, while the 3Trees-EvTree method provides greater consistency in the variability estimates. However, both the 3Trees-TSCTree and LMMs methods display the largest biases, indicating a tendency to overestimate the standard deviations. For the nonlinear functional form (scenario M3, Fig. 3c), the 3Trees-EvTree method provides the most consistent and least biased results. The 3Trees-CTree method is also reasonably reliable, although it appears to underestimate the standard deviations. In contrast, the 3Trees-TSCTree method shows the least bias, while 3Trees-CART shows slightly more variability in the estimates.
Besides the random-effect components, the variability of the estimated random-error components is analyzed. The findings for the standard deviation of the residual errors, derived from the 100 runs, are illustrated in Fig. 4. The true value of this standard deviation is represented by a vertical red dashed line at 1 in each ridgeline plot. Almost no meaningful differences exist among the models in estimating the error variance under linear conditions (scenario M1), except for the CART model, which is slightly more biased than the others. The EvTree method is remarkably effective in estimating the variance under quasi-linear conditions (scenario M2), with minimal bias and superior reliability relative to the other methods. In contrast, the LMMs and TSCTree methods display significant bias and often overestimate the variance, a trend also observed for the CTree method. Furthermore, the presence of multiple peaks indicates that the CART method may lack reliability and stability in variance estimation. In contrast to the M1 and M2 settings, all methods are inclined to overestimate the residual variance under nonlinear conditions (scenario M3).
The results of the sample-size variation are displayed in order to assess the consistency of the predictions of the 3Trees (CART, EvTree, CTree, and TSCTree) and LMMs methods as the number of observations increases. Fig. 5 presents the average PMSE (left panel) and clusPMSE (right panel) values for the semilinear functional form (scenario M2) with varying sample sizes, where $\rho = 0.8$ and ICC $= 0.75$ are used. We note that as the sample size grows, all models perform consistently better in terms of clusPMSE (the PMSE criterion is less stable), with the criterion values becoming progressively smaller. Clearly, increasing the number of units has a favorable impact, reducing the relative prediction error. Moreover, the proposed EvTree method is consistently superior to all the others in terms of clusPMSE, especially at larger sample sizes, where its error decreases monotonically. However, when the criterion excluding random effects is applied, the EvTree method outperforms the others only when the sample size reaches 10,000 observations. The performance of 3Trees-CART (cp=0) is similar to that of 3Trees-CART (cp=0.0001) (the lines overlap); both rank second in their consistency in reducing prediction error. The 3Trees-CTree method produces less accurate outcomes than the CART algorithm, while LMMs and 3Trees-TSCTree (whose lines coincide) perform worst.
Fig. 5.

Average MSE of predictions on testing data with and without random effects under scenario M2 (100 repetitions): (a) PMSE, (b) ClusPMSE.
In addition to investigating the behavior of the prediction accuracy, the behavior of the estimated fixed-effect parameters across varying numbers of observations is considered. Fig. 6 shows the average absolute bias of the estimated level-1 and level-2 parameters for varying sample sizes under scenario M2, with $\rho$ and ICC set at 0.8 and 0.75, respectively. Only a single fixed-effect parameter is shown for each of level 1 and level 2. For the 3Trees-EvTree and 3Trees-CTree methods, as expected, the increase in sample size results in a decrease in the absolute bias of the estimators, although the 3Trees-CTree method shows a slight inconsistency in estimating the level-2 parameter. This confirms that as the sample size increases, the estimated parameters correspond more closely to their true values in the simulation design, emphasizing the importance of large sample sizes in achieving precise parameter estimation. The LMMs and 3Trees-TSCTree methods display similar performance, while 3Trees-CART (cp = 0.0001) appears unstable at some sample sizes.
Fig. 6.

Average absolute bias of the estimated fixed-effect parameters for various sample sizes: (a) level-1 parameter, (b) level-2 parameter.
Information on the estimated cluster and residual variance components is also provided for moderate to large sample sizes. Fig. 7 shows the simulation results for variance estimation (within and between clusters) based on varying sample sizes from the assumed M2 model. The average absolute bias of the variance parameters is calculated for each sample size considered; the left panel corresponds to the random effects and the right panel to the random errors. The results show that the accuracy of estimating $\sigma^2_u$ and $\sigma^2_\varepsilon$ increases with the sample size. As $n$ increases, the performance of the 3Trees-EvTree method in estimating both components improves (the absolute biases decrease), confirming that 3Trees-EvTree produces good estimates of the random effects and their standard errors. Meanwhile, LMMs, 3Trees-TSCTree, 3Trees-CART (cp=0), and 3Trees-CTree are often less stable in estimating the random effects, except for the 3Trees-CART (cp=0.0001) method, which offers slightly greater consistency.
Fig. 7.

Average absolute bias of the estimated variance components for various sample sizes: (a) random effects, (b) random errors.
Finally, to explore the breakdown point of the proposed methods, an extra simulation scenario is designed that includes a random slope for one explanatory variable in each scenario model. The prediction-error performance of all methods is shown in Table 8, Table 9 (Appendix A), and the distributions of the estimation bias (Table 11, Table 12, Appendix B) and of the prediction error (Fig. 10, Appendix C) are also presented. We observe that the performance of our proposed models under the random slope models is almost the same as under the random intercept models.
Fig. 10.

Boxplots of the distributions of evaluation metrics under random slope models: (a) MSE, (b) ClusMSE, (c) PMSE, (d) ClusPMSE.
Case study: Expenditure per Capita
The empirical dataset, household expenditure per capita in West Java, was drawn from the 2021 National Socio-Economic Survey (SUSENAS) of the Central Agency of Statistics (BPS), Indonesia. The dataset covers 25,813 households, the total remaining after the removal of incomplete records. The dependent variable is household expenditure per capita. The dataset is hierarchically structured, with households nested within villages, which makes it possible to collect variables at both the household level (first level) and the village level (second level).
Our model includes ten explanatory variables at the household level. The study area consists of 2200 villages in West Java Province, distributed among 27 regencies and cities; these villages are designated as the clusters for the random effects in our model. The predictor variables at the cluster level were sourced from the 2021 Village Potential Data Collection (PODES), BPS. There are five variables at the village level (Table 5).
Table 5.
Explanatory variables.
Variables | Label | Type of variable
---|---|---
Household level | |
Gender | gender | factor (1: male, 2: female)
Age | age | continuous
Level of education | education | factor (1: not graduated, 2: primary school, 3: junior high school, 4: senior high school, 5: diploma, 6: bachelor's degree and graduate studies)
Primary employment status | employment_status | factor (1: self-employed, 2: self-employed with temporary or unpaid workers, 3: self-employed with regular or paid employees, 4: worker/employee/staff, 5: freelancer, 6: unpaid family worker, 7: unemployed)
Household size | member | continuous
Regional status | regional_status | factor (1: city, 2: village)
Social security BPJS status | social_security_status | factor (1: yes, 2: no)
Housing tenure status | housing_tenure | factor (1: personal ownership, 2: contract/rent, 3: free of charge, 4: official residence)
Lighting sources | lightning_source | factor (1: PLN electricity with a meter, 2: PLN electricity without a meter, 3: non-PLN power source, 4: non-electricity)
Regular government assistance status | regular_assistence | factor (1: yes, 2: no)
Village level | |
Household income sources | income_source | factor (1: agriculture and natural resources, 2: energy and utilities, 3: manufacturing and construction, 4: trade and repair, 5: transportation and accommodation, 6: financial services and real estate, 7: government, education, and health, 8: other services)
Type of transportation infrastructure to/from agricultural production centers | road_type | factor (1: asphalt/concrete, 2: gravelled (gravel, stone, etc.), 3: land, 4: water, 5: others)
Drinking water sources (group) | water_source_gr | factor (1: purified drinking water, 2: water from the supply and distribution system, 3: water from an unprotected or natural source)
Number of educational facilities | num_education | continuous
Number of health facilities | num_health | continuous
The 3Trees-CART with cp=0, 3Trees-EvTree, 3Trees-CTree, and LMMs methods under the random intercept assumption were applied to examine the determinants of household expenditure per capita; the TSCTree and CART (cp=0.0001) methods were omitted from fitting and prediction. The tuning parameters for each method were configured according to the simulation settings. The resulting prediction performances, assessed with the same four criteria (MSE, clusMSE, PMSE, clusPMSE), are presented in Table 6. According to Table 6, the 3Trees-EvTree model, which has the lowest prediction error in terms of clusMSE and clusPMSE, fits and predicts the data slightly better than the other methods.
Table 6.
Prediction accuracy.
Model | MSE | clusMSE | PMSE | clusPMSE |
---|---|---|---|---|
3Trees-EvTree | 1.489 | 1.224 | 1.658 | 1.533 |
3Trees-CART | 1.482 | 1.226 | 1.654 | 1.534 |
3Trees-CTree | 1.505 | 1.247 | 1.687 | 1.566 |
LMMs | 1.566 | 1.282 | 1.750 | 1.611 |
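As a rough guide to how the comparison above can be reproduced in R, the sketch below fits the LMM baseline with lme4 and illustrates the two tree engines used inside the proposed methods. The response name log_expenditure, the data frames train and test, and the assumption that the "clus" criteria include the estimated random effects in the predictions are ours; the full 3Trees backfitting loop is not reproduced here.

```r
library(lme4)      # LMM baseline
library(evtree)    # evolutionary trees (engine behind 3Trees-EvTree)
library(partykit)  # conditional inference trees (engine behind 3Trees-CTree)

# Random-intercept LMM over villages (variable names follow Table 5)
lmm <- lmer(log_expenditure ~ gender + age + education + employment_status +
              member + regional_status + social_security_status +
              housing_tenure + lightning_source + regular_assistence +
              income_source + road_type + water_source_gr + num_education +
              num_health + (1 | village_id), data = train)

# Single-tree fits with the two proposed engines (illustrative depths)
ev <- evtree(log_expenditure ~ education + member + age + regional_status,
             data = train, control = evtree.control(maxdepth = 3))
ct <- ctree(log_expenditure ~ education + member + age + regional_status,
            data = train, control = ctree_control(maxdepth = 3))

# Accuracy criteria for the LMM baseline; the "clus" variants are assumed
# to add the estimated random effects, the marginal variants exclude them
mse      <- mean((train$log_expenditure - predict(lmm, re.form = NA))^2)
clus_mse <- mean((train$log_expenditure - predict(lmm))^2)
pmse     <- mean((test$log_expenditure -
                    predict(lmm, newdata = test, re.form = NA,
                            allow.new.levels = TRUE))^2)
```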
The estimated regression coefficients (linear and tree components) associated with the household- and village-level characteristics, together with standard errors and p-values, are given in Table 7. The 3Trees-EvTree model revealed that age, education (junior high school, senior high school, and diploma and above), employment_status (self-employed with regular or paid employees, freelancer, and unemployed), social_security_status (no), member, housing_tenure (contract/rent and free of charge), lightning_source (PLN electricity without a meter), regular_assistence (no), num_education, income_source (financial services and real estate; government, education, and health), and water_source_gr (water from the supply and distribution system; water from an unprotected and natural source) were significant predictors of household expenditure per capita in West Java (p-values < 0.05). At the household level, for instance, households with higher education levels (diploma and above) show a noticeably larger coefficient, suggesting that these households tend to have much higher per capita expenditures than those in the reference category (not graduated).
Table 7.
Parameter estimates for the 3Trees-EvTree model of household- and village-level factors for expenditure per capita.
Variable | Parameter | Standard Error | p-value |
---|---|---|---|
Household level | |||
Intercept | 2.263 | 0.156 | 0.001* |
Regional_status (village) | 0.059 | 0.036 | 0.111 |
Gender (female) | 0.053 | 0.035 | 0.122 |
Age | 0.002 | 0.001 | 0.038* |
Education (primary school) | 0.091 | 0.071 | 0.201 |
Education (junior high school) | 0.224 | 0.076 | 0.003* |
Education (senior high school) | 1.024 | 0.109 | 0.001* |
Education (diploma, undergraduate, graduate) | 2.307 | 0.113 | 0.001* |
Employment_status (self-employed with temporary or unpaid workers) | −0.046 | 0.041 | 0.269 |
Employment_status (self-employed with regular or paid employees) | 0.609 | 0.059 | 0.001* |
Employment_status (worker/employee/staff) | 0.027 | 0.029 | 0.352 |
Employment_status (freelancer) | −0.127 | 0.038 | 0.001* |
Employment_status (unpaid family worker) | −0.098 | 0.116 | 0.394 |
Employment_status (unemployed) | −0.143 | 0.039 | 0.001* |
Social_security_status (no) | 0.218 | 0.023 | 0.001* |
Member | −0.187 | 0.011 | 0.001* |
Housing_tenure (contract/rent) | −0.480 | 0.042 | 0.001* |
Housing_tenure (free of charge) | −0.328 | 0.036 | 0.001* |
Housing_tenure (official residence) | 0.208 | 0.228 | 0.361 |
Lighting_source (PLN electricity without a meter) | −0.086 | 0.038 | 0.024* |
Lighting_source (Non-PLN power source) | −0.158 | 0.212 | 0.456 |
Lighting_source (Non-electricity) | −0.451 | 0.360 | 0.211 |
regular_assistence (no) | 0.155 | 0.047 | 0.001* |
Village Level | |||
Num_education | 0.019 | 0.004 | 0.001* |
Num_health | 0.004 | 0.003 | 0.211 |
Road_type (gravelled) | −0.036 | 0.043 | 0.400 |
Road_type (land) | −0.002 | 0.056 | 0.968 |
Road_type (water) | 0.163 | 0.242 | 0.503 |
Road_type (others) | −0.078 | 0.185 | 0.674 |
Income_source (energy and utilities) | −0.237 | 0.449 | 0.598 |
Income_source (manufacturing and construction) | 0.143 | 0.244 | 0.556 |
Income_source (trade and repair) | 0.286 | 0.245 | 0.241 |
Income_source (transportation and accommodation) | 0.411 | 0.277 | 0.138 |
Income_source (financial services and real estate) | 0.571 | 0.274 | 0.037* |
Income_source (government, education, and health) | 1.004 | 0.276 | 0.001* |
Income_source (other services) | 0.327 | 0.249 | 0.189 |
Water_source (water from the supply and distribution system) | −0.083 | 0.029 | 0.005* |
Water_source (water from an unprotected and natural source) | −0.113 | 0.047 | 0.017* |
Tree component | |||
First tree, region R1.5 | −0.988 | 0.112 | 0.001* |
First tree, region R1.7 | −0.952 | 0.055 | 0.001* |
First tree, region R1.8 | −0.646 | 0.061 | 0.001* |
First tree, region R1.11 | −0.239 | 0.064 | 0.002* |
First tree, region R1.12 | −0.549 | 0.078 | 0.001* |
Second tree, region R2.3 | −0.103 | 0.035 | 0.003* |
Third tree, region R3.4 | −0.683 | 0.099 | 0.001* |
Third tree, region R3.5 | −0.919 | 0.089 | 0.001* |
Random component | | | |
Between-village standard deviation | 0.408 | | |
Within-village (residual) standard deviation | 1.147 | | |
Note: *Statistically significant at the 5 % significance level.
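The two random-component entries at the foot of Table 7 are the between-village and residual (within-village) standard deviations. With an lme4 fit such as the hypothetical lmm object sketched above, they can be read off as follows.

```r
# Random-effect and residual standard deviations (sdcor column);
# grp = "village_id" gives the between-village SD and grp = "Residual"
# the within-village SD, corresponding to 0.408 and 1.147 in Table 7
vc <- as.data.frame(VarCorr(lmm))
vc[, c("grp", "sdcor")]
```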
The standard deviations of the random-effect estimates revealed more variation within villages (1.147) than between villages (0.408); hence, within-village differences contribute relatively more to the level of expenditure per capita (see Table 7). In addition, confidence intervals for each parameter estimate were constructed. The coefficient plots in Fig. 8 visualize the interval estimates of the regression coefficients under all models: the blue and red dots and horizontal lines mark the point estimates and 95 % confidence intervals of the corresponding linear components and tree parts. These interval estimates provide an overview of household expenditure per capita in the population. The confidence intervals of the linear parameters do not differ appreciably across models. For the tree parts, however, the 3Trees-CART produces wider confidence intervals, while those of our proposed models are somewhat narrower.
Fig. 8.
95 % confidence intervals of parameter estimates in the analysis of the household expenditure per capita data using various methods; (a) LMMs, (b) 3Trees-EvTree, (c) 3Trees-CART, (d) 3Trees-CTree.
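The intervals in Fig. 8 follow the usual Wald-type construction, estimate ± z(0.975) × standard error. A minimal sketch, using two coefficients taken from Table 7, is:

```r
# Wald 95% confidence intervals from Table 7 estimates and standard errors
est <- c(member = -0.187, housing_contract_rent = -0.480)
se  <- c(member =  0.011, housing_contract_rent =  0.042)
data.frame(estimate = est,
           lower = est - qnorm(0.975) * se,   # lower bound
           upper = est + qnorm(0.975) * se)   # upper bound
```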
To identify interaction terms among the explanatory variables, the tree structures produced by the 3Trees-EvTree method for household expenditure per capita are depicted in Fig. 9. For the first tree (see Fig. 9a), the selected split variables are level of education, household size (member), age, and regional status. In both education groups, the next split is based on household size (member). For senior high school and higher education levels, the third split is regional status (nodes 4 and 5) and age (nodes 7 and 8). For the split at node 8, age ≤ 49 plays a markedly different role in household expenditure per capita than age > 49. The other part of the tree covers households with primary or lower secondary education, comprising two paths, from node 1 to node 11 and from node 1 to node 13; these pathways show a negative impact on household expenditure per capita. Moreover, for households with fewer than 6 members as well as those with 6 or more, and regardless of whether age was below or above 67, expenditure per capita remained low compared to households with senior high school or higher education levels.
Fig. 9.
Trees generated by the proposed method (EvTree) for household expenditure per capita; (a) Tree 1, (b) Tree 2, and (c) Tree 3.
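Because evtree returns objects in partykit's "party" representation, panels such as those in Fig. 9 can be drawn with the standard plot method. A sketch, assuming the hypothetical ev fit from the earlier snippet:

```r
library(partykit)
plot(ev)  # terminal-node boxplots of the response by region

# Terminal-node (region) means, i.e., rough analogues of the tree-component
# parameters R1.k reported in Table 7
node_means <- tapply(train$log_expenditure,
                     predict(ev, newdata = train, type = "node"), mean)
node_means
```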
Meanwhile, the 3Trees-CART model tends to generate a greater number of tree parameters (terminal-node regions) than the 3Trees-EvTree. This result is consistent with a fundamental characteristic of the EvTree algorithm: it keeps the tree depth small whenever the functional form of the dependence is assumed to be quasi-linear.
Then, the 3Trees-EvTree selected the number of educational facilities, a village-level variable, as the only partitioning variable; consequently, a single treatment interaction is depicted at the terminal nodes in Fig. 9(b). This tree highlights that villages with <5 educational facilities and those with >5 have an almost equivalent impact on per capita household expenditure. Surprisingly, the third tree (Fig. 9c) does not reveal interactions between levels of predictor variables; its pathways only display member and age as partitioning variables. To capture interactions across levels, a larger maxdepth value may be necessary; however, this would increase model complexity, potentially making the tree more difficult to interpret.
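Tree depth is governed by the engines' control objects, so allowing cross-level interactions amounts to raising maxdepth. A sketch under the same hypothetical names as above (the depth value 4 is arbitrary):

```r
# Deeper trees to allow cross-level splits, at the cost of interpretability
ct_deep <- ctree(log_expenditure ~ member + age + num_education,
                 data = train,
                 control = ctree_control(maxdepth = 4))
ev_deep <- evtree(log_expenditure ~ member + age + num_education,
                  data = train,
                  control = evtree.control(maxdepth = 4))
```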
Conclusions
In this paper, we proposed the EvTree and CTree algorithms to modify the tree-selection step of the 3Trees algorithm. Our goal was to obtain more nearly optimal trees through this selection process and to reduce prediction error in terms of MSE, clusMSE, PMSE, and clusPMSE. With regard to predictive accuracy (clusMSE and clusPMSE) and parameter estimation, the performance of the proposed models was evaluated over a variety of scenarios, where 3Trees-EvTree consistently showed promising performance from both perspectives. The 3Trees-CTree method performed second best, after 3Trees-EvTree, when the correlation among covariates was small in the semilinear model scenario. The real data analysis likewise showed that 3Trees-EvTree outperformed the CART- and CTree-based approaches when the random components were included in prediction.
Limitations
Although the 3Trees-EvTree offers clear benefits, the method is considerably more time-consuming than the other models, particularly as the number of observations increases. Note that a practical objective in machine learning applications is to balance training time against the prediction accuracy of the desired outcomes. The evtree package requires approximately 40–50 min and about 400 MB of main memory when fitting large datasets (see Grubinger et al. [23] and Zhang et al. [35]). A key challenge moving forward is therefore to reduce running time, whether by improving the hardware used or by developing new algorithms. Moreover, evtree does not support parallel computing. In addition, the 3Trees algorithm is only suitable for Gaussian (normally distributed) responses, so there is considerable scope for extending the method to non-Gaussian data (such as binary outcomes, beta-distributed proportions, or count data).
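A simple way to gauge this cost on a given machine is to time the evolutionary search directly; a sketch, again under the hypothetical names used above:

```r
# Wall-clock cost of one evolutionary tree search (evtree is single-threaded;
# niterations = 10000 is the package default)
timing <- system.time(
  evtree(log_expenditure ~ ., data = train,
         control = evtree.control(niterations = 10000))
)
timing["elapsed"]  # elapsed seconds
```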
Ethics statements
The SUSENAS and PODES datasets are not publicly available in order to preserve individuals' privacy under the Central Bureau of Statistics (Indonesia). Access to these datasets may be requested at https://silastik.bps.go.id/v3/index.php/site/login/
CRediT author statement
Asrirawan: Conceptualization, methodology, data curation, funding acquisition, formal analysis, and writing—original draft preparation. Khairil Anwar Notodiputro: supervision, data and analysis validation, and review and editing. Budi Susetyo: Investigation, resources, supervision, and review and editing. Sachnaz Desta Oktarina: Software development, Project administration, supervision, and final manuscript approval.
Supplementary material and/or additional information
None.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
This work was supported by the Directorate General of Higher Education, Research, and Technology of the Ministry of Education, Culture, Research, and Technology through the 2024 Doctoral Research Scheme under Research Contract Number 027/E5/PG.02.00.PL/2024, dated June 11, 2024.
Footnotes
Related research article: None.
Appendix A
Table 8.
Results of the 100 simulation runs in terms of the averages and standard deviations of MSE, ClusMSE, PMSE and ClusPMSE for six models under the random slope model (X1). The smallest values are marked in bold.
ρ = 0.8, ICC = 0.75 | ρ = 0.8, ICC = 0.375 |
Model | MSE | ClusMSE | PMSE | ClusPMSE | Model | MSE | ClusMSE | PMSE | ClusPMSE |
---|---|---|---|---|---|---|---|---|---|
Scenario M1 | Scenario M1 | ||||||||
3Trees-CART (cp=0.0001) | 3.873 (0.71) | 0.790 (0.050) | 4.138 (0.734) | 1.283 (0.080) | 3Trees-CART (cp=0.0001) | 7.130 (0.657) | 4.013 (0.239) | 8.560 (0.799) | 6.236 (0.368) |
3Trees-CART (cp=0) | 3.846 (0.663) | 0.790 (0.051) | 4.100 (0.699) | 1.283 (0.081) | 3Trees-CART (cp=0) | 7.137 (0.641) | 4.009 (0.243) | 8.571 (0.802) | 6.236 (0.382) |
3Trees-CTree | 4.020 (0.593) | 0.857 (0.054) | 4.152 (0.624) | 1.186 (0.073) | 3Trees-CTree | 7.800 (0.686) | 4.318 (0.248) | 8.545 (0.811) | 5.857 (0.336) |
3Trees-EvTree | 4.415 (0.707) | 0.857 (0.054) | 4.474 (0.697) | 1.189 (0.071) | 3Trees-EvTree | 8.214 (0.723) | 4.339 (0.252) | 8.694 (0.82) | 5.790 (0.308) |
LMMs | 4.577 (0.683) | 0.873 (0.054) | 4.672 (0.701) | 1.160 (0.068) | LMMs | 8.528 (0.719) | 4.412 (0.251) | 8.835 (0.859) | 5.649 (0.296) |
3Trees-TSCTree | 4.666 (0.693) | 0.873 (0.054) | 4.682 (0.697) | 1.161 (0.068) | 3Trees-TSCTree | 8.631 (0.734) | 4.422 (0.253) | 8.848 (0.859) | 5.651 (0.297) |
Scenario M2 | Scenario M2 | ||||||||
3Trees-CART (cp=0.0001) | 8.574 (1.002) | 4.275 (0.27) | 10.102 (1.113) | 6.727 (0.391) | 3Trees-CART (cp=0.0001) | 8.404 (1.055) | 4.242 (0.312) | 9.814 (1.072) | 6.681 (0.421) |
3Trees-CART (cp=0) | 5.372 (1.145) | 0.995 (0.136) | 5.654 (1.183) | 1.592 (0.225) | 3Trees-CART (cp=0) | 8.426 (1.063) | 4.247 (0.308) | 9.844 (1.086) | 6.690 (0.410) |
3Trees-CTree | 6.542 (1.114) | 1.184 (0.094) | 6.790 (1.214) | 1.645 (0.150) | 3Trees-CTree | 10.331 (1.186) | 4.654 (0.299) | 10.89 (1.063) | 6.349 (0.348) |
3Trees-EvTree | 4.820 (0.764) | 0.860 (0.073) | 4.971 (0.839) | 1.252 (0.106) | 3Trees-EvTree | 10.223 (1.261) | 4.623 (0.29) | 10.609 (1.186) | 6.283 (0.342) |
LMMs | 8.397 (1.002) | 2.430 (0.232) | 8.587 (1.004) | 3.241 (0.313) | LMMs | 12.287 (1.115) | 6.052 (0.421) | 12.452 (1.027) | 7.742 (0.508) |
3Trees-TSCTree | 8.654 (1.041) | 2.433 (0.232) | 8.719 (1.042) | 3.241 (0.313) | 3Trees-TSCTree | 12.531 (1.146) | 6.066 (0.424) | 12.542 (1.032) | 7.742 (0.507) |
Scenario M3 | Scenario M3 | ||||||||
3Trees-CART (cp=0.0001) | 5.435 (0.722) | 1.937 (0.235) | 6.179 (0.853) | 3.528 (0.594) | 3Trees-CART (cp=0.0001) | 8.769 (0.779) | 5.268 (0.442) | 10.498 (0.943) | 8.336 (0.755) |
3Trees-CART (cp=0) | 5.416 (0.703) | 1.935 (0.237) | 6.165 (0.840) | 3.523 (0.589) | 3Trees-CART (cp=0) | 8.770 (0.784) | 5.266 (0.441) | 10.487 (0.939) | 8.334 (0.746) |
3Trees-CTree | 7.282 (1.006) | 3.464 (0.472) | 8.259 (1.138) | 5.777 (1.028) | 3Trees-CTree | 11.072 (0.927) | 7.044 (0.622) | 12.554 (1.309) | 10.388 (1.275) |
3Trees-EvTree | 5.720 (0.839) | 1.762 (0.241) | 6.450 (0.814) | 3.420 (0.560) | 3Trees-EvTree | 10.228 (0.976) | 5.819 (0.504) | 11.293 (1.038) | 8.643 (0.791) |
LMMs | 13.521 (1.493) | 7.427 (0.847) | 14.006 (1.610) | 12.373 (1.603) | LMMs | 17.936 (1.606) | 11.276 (0.966) | 18.372 (1.827) | 16.895 (1.827) |
3Trees-TSCTree | 13.695 (1.517) | 7.45 (0.853) | 14.039 (1.643) | 12.357 (1.602) | 3Trees-TSCTree | 18.102 (1.617) | 11.321 (0.974) | 18.395 (1.836) | 16.872 (1.835) |
Table 9.
Results of the 100 simulation runs in terms of the averages and standard deviations of MSE, ClusMSE, PMSE and ClusPMSE for six models under the random slope model (slope X1). The smallest values are marked in bold (continued).
ρ = 0.2, ICC = 0.75 | ρ = 0.2, ICC = 0.375 |
Model | MSE | ClusMSE | PMSE | ClusPMSE | Model | MSE | ClusMSE | PMSE | ClusPMSE |
---|---|---|---|---|---|---|---|---|---|
Scenario M1 | Scenario M1 | ||||||||
3Trees-CART (cp=0.0001) | 3.903 (0.631) | 0.787 (0.036) | 4.127 (0.592) | 1.290 (0.079) | 3Trees-CART (cp=0.0001) | 7.283 (0.681) | 4.062 (0.264) | 8.673 (0.737) | 6.301 (0.370) |
3Trees-CART (cp=0) | 3.855 (0.590) | 0.787 (0.036) | 4.086 (0.577) | 1.290 (0.079) | 3Trees-CART (cp=0) | 7.280 (0.680) | 4.060 (0.264) | 8.674 (0.743) | 6.301 (0.377) |
3Trees-CTree | 4.053 (0.587) | 0.852 (0.039) | 4.153 (0.576) | 1.199 (0.067) | 3Trees-CTree | 7.771 (0.643) | 4.383 (0.265) | 8.516 (0.666) | 5.903 (0.342) |
3Trees-EvTree | 4.447 (0.623) | 0.856 (0.038) | 4.483 (0.63) | 1.194 (0.067) | 3Trees-EvTree | 8.304 (0.699) | 4.418 (0.267) | 8.708 (0.720) | 5.819 (0.328) |
LMMs | 4.641 (0.670) | 0.872 (0.039) | 4.719 (0.659) | 1.172 (0.065) | LMMs | 8.626 (0.739) | 4.492 (0.265) | 8.818 (0.740) | 5.683 (0.310) |
3Trees-TSCTree | 4.743 (0.685) | 0.872 (0.039) | 4.729 (0.665) | 1.173 (0.065) | 3Trees-TSCTree | 8.724 (0.749) | 4.503 (0.266) | 8.832 (0.746) | 5.682 (0.309) |
Scenario M2 | Scenario M2 | ||||||||
3Trees-CART (cp=0.0001) | 5.924 (1.153) | 1.050 (0.104) | 6.133 (1.061) | 1.695 (0.171) | 3Trees-CART (cp=0.0001) | 8.574 (1.002) | 4.275 (0.270) | 10.102 (1.113) | 6.727 (0.391) |
3Trees-CART (cp=0) | 5.903 (1.129) | 1.050 (0.105) | 6.124 (1.050) | 1.696 (0.169) | 3Trees-CART (cp=0) | 8.558 (0.977) | 4.277 (0.269) | 10.087 (1.125) | 6.726 (0.388) |
3Trees-CTree | 5.096 (0.848) | 1.098 (0.162) | 5.320 (0.857) | 1.551 (0.231) | 3Trees-CTree | 8.676 (0.998) | 4.613 (0.298) | 9.533 (1.103) | 6.232 (0.397) |
3Trees-EvTree | 4.780 (0.773) | 0.850 (0.068) | 4.902 (0.752) | 1.258 (0.114) | 3Trees-EvTree | 10.204 (1.181) | 4.658 (0.277) | 10.724 (1.211) | 6.265 (0.348) |
LMMs | 8.455 (0.928) | 2.459 (0.221) | 8.589 (0.98) | 3.285 (0.321) | LMMs | 12.235 (1.137) | 6.064 (0.366) | 12.499 (1.089) | 7.69 (0.507) |
3Trees-TSCTree | 8.738 (0.944) | 2.462 (0.221) | 8.708 (0.986) | 3.285 (0.322) | 3Trees-TSCTree | 12.49 (1.174) | 6.079 (0.368) | 12.623 (1.086) | 7.691 (0.506) |
Scenario M3 | Scenario M3 | ||||||||
3Trees-CART (cp=0.0001) | 5.612 (0.698) | 2.109 (0.331) | 6.384 (0.772) | 3.678 (0.533) | 3Trees-CART (cp=0.0001) | 8.927 (0.969) | 5.338 (0.482) | 10.595 (0.913) | 8.458 (0.641) |
3Trees-CART (cp=0) | 5.605 (0.701) | 2.109 (0.330) | 6.388 (0.780) | 3.684 (0.533) | 3Trees-CART (cp=0) | 8.919 (0.969) | 5.336 (0.483) | 10.605 (0.916) | 8.468 (0.644) |
3Trees-CTree | 8.112 (0.999) | 4.238 (0.612) | 9.218 (1.148) | 7.029 (1.176) | 3Trees-CTree | 11.868 (1.405) | 7.711 (0.831) | 13.44 (1.265) | 11.467 (1.159) |
3Trees-EvTree | 5.846 (0.735) | 1.875 (0.272) | 6.616 (0.753) | 3.500 (0.481) | 3Trees-EvTree | 10.268 (0.966) | 5.809 (0.511) | 11.313 (1.083) | 8.617 (0.744) |
LMMs | 13.742 (1.445) | 7.603 (0.858) | 14.085 (1.331) | 12.356 (1.528) | LMMs | 17.731 (1.656) | 11.079 (1.026) | 18.194 (1.619) | 16.739 (1.413) |
3Trees-TSCTree | 13.907 (1.464) | 7.627 (0.862) | 14.103 (1.377) | 12.343 (1.523) | 3Trees-TSCTree | 17.897 (1.663) | 11.123 (1.033) | 18.233 (1.621) | 16.716 (1.409) |
Appendix B
Table 10.
Parameter averages, standard deviations, and bias of the estimated fixed-effect components produced by the 3Trees and LMM models.
Model | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Scenario M1 | | | | | | | | | | | | | | | | |
3Trees-CART (cp=0.0001) | −0.233 (1.335) | 5.233 | 2.007 (0.078) | −0.007 | 0.008 (0.081) | −0.008 | 2.994 (0.079) | 0.006 | 1.009 (0.108) | −0.009 | 3.014 (0.308) | −0.014 | 0.032 (0.304) | −0.032 | 0.974 (0.41) | 0.026 |
3Trees-CART (cp=0) | −0.394 (1.455) | 5.394 | 2.002 (0.08) | −0.002 | 0.007 (0.082) | −0.007 | 2.991 (0.078) | 0.009 | 1.011 (0.108) | −0.011 | 3.007 (0.311) | −0.007 | 0.015 (0.313) | −0.015 | 0.953 (0.415) | 0.047 |
3Trees-CTree | 0.310 (1.583) | 4.690 | 1.858 (0.095) | 0.142 | 0 (0.056) | 0.000 | 2.72 (0.152) | 0.280 | 1.006 (0.109) | −0.006 | 1.062 (0.620) | 1.938 | −0.012 (0.211) | 0.012 | 0.544 (0.526) | 0.456 |
3Trees-EvTree | 4.938 (0.807) | 0.062 | 2.003 (0.061) | −0.003 | 0.009 (0.065) | −0.009 | 3.001 (0.061) | −0.001 | 1.011 (0.108) | −0.011 | 3.025 (0.236) | −0.025 | 0.002 (0.232) | −0.002 | 0.992 (0.413) | 0.008 |
LMMs | 4.998 (0.505) | 0.002 | 1.998 (0.056) | 0.002 | 0 (0.057) | 0.000 | 3.001 (0.056) | −0.001 | 1.010 (0.109) | −0.010 | 3.014 (0.226) | −0.014 | 0.022 (0.231) | −0.022 | 0.980 (0.444) | 0.020 |
3Trees-TSCTree | 5.030 (0.309) | −0.030 | 1.998 (0.056) | 0.002 | 0 (0.057) | 0.000 | 3.001 (0.056) | −0.001 | 1.010 (0.109) | −0.010 | 3.015 (0.217) | −0.015 | 0.022 (0.222) | −0.022 | 0.984 (0.427) | 0.016 |
Scenario M2 | | | | | | | | | | | | | | | | |
3Trees-CART (cp=0.0001) | −0.266 (1.242) | 5.266 | 3.120 (0.260) | −1.120 | 0 (0.071) | 0.000 | 3.006 (0.069) | −0.006 | 0.998 (0.116) | 0.002 | 1.899 (0.292) | 0.101 | −1.024 (0.525) | 1.024 | 1.029 (0.487) | −0.029 |
3Trees-CART (cp=0) | −0.670 (1.400) | 5.670 | 3.117 (0.260) | −1.117 | 0.002 (0.071) | −0.002 | 3.006 (0.069) | −0.006 | 1.000 (0.115) | 0.000 | 1.901 (0.292) | 0.099 | −1.020 (0.535) | 1.020 | 1.003 (0.488) | −0.003 |
3Trees-CTree | 3.031 (1.038) | 1.969 | 1.992 (0.097) | 0.008 | 0.003 (0.059) | −0.003 | 2.674 (0.149) | 0.326 | 0.991 (0.113) | 0.009 | 1.045 (0.406) | 0.955 | −0.092 (0.344) | 0.092 | 0.697 (0.459) | 0.303 |
3Trees-EvTree | 5.113 (0.725) | −0.113 | 2.213 (0.122) | 0.213 | 0.002 (0.056) | −0.002 | 3.004 (0.056) | −0.004 | 0.999 (0.109) | 0.001 | 1.944 (0.236) | 0.056 | −0.202 (0.382) | 0.202 | 0.975 (0.425) | 0.025 |
LMMs | 6.057 (0.610) | −1.057 | 3.585 (0.069) | −1.585 | 0.008 (0.069) | −0.008 | 3.005 (0.069) | −0.005 | 0.991 (0.133) | 0.009 | 1.962 (0.274) | 0.038 | −1.668 (0.278) | 1.668 | 0.991 (0.537) | 0.009 |
3Trees-TSCTree | 5.992 (0.372) | −0.992 | 3.585 (0.069) | −1.585 | 0.008 (0.069) | −0.008 | 3.005 (0.069) | −0.005 | 0.991 (0.133) | 0.009 | 1.959 (0.264) | 0.041 | −1.668 (0.268) | 1.668 | 1.005 (0.517) | −0.005 |
Scenario M3 | | | | | | | | | | | | | | | | |
3Trees-CART (cp=0.0001) | 2.699 (1.254) | 2.301 | 2.061 (0.234) | −0.061 | 0.010 (0.065) | −0.010 | 1.994 (0.064) | 0.006 | 1.987 (0.123) | 0.013 | 2.996 (0.334) | 0.004 | −0.581 (0.305) | 0.581 | 1.980 (0.498) | 0.020 |
3Trees-CART (cp=0) | 2.551 (1.382) | 2.449 | 2.036 (0.236) | −0.036 | 0.008 (0.065) | −0.008 | 1.994 (0.064) | 0.006 | 1.985 (0.124) | 0.015 | 3.030 (0.332) | −0.030 | −0.589 (0.307) | 0.589 | 2.037 (0.518) | −0.037 |
3Trees-CTree | 3.382 (1.490) | 1.618 | 0.221 (0.099) | 1.779 | −0.002 (0.076) | 0.002 | 1.653 (0.161) | 0.347 | 1.875 (0.195) | 0.125 | 1.414 (0.579) | 1.586 | −0.459 (0.221) | 0.459 | 0.814 (0.752) | 1.186 |
3Trees-EvTree | 19.932 (1.102) | −14.930 | 2.045 (0.165) | −0.045 | −0.004 (0.075) | 0.004 | 1.996 (0.062) | 0.004 | 1.993 (0.121) | 0.007 | 3.003 (0.246) | −0.003 | −0.640 (0.323) | 0.640 | 1.968 (0.423) | 0.032 |
LMMs | 7.054 (0.526) | −2.054 | 2.011 (0.095) | −0.011 | 0.010 (0.095) | −0.010 | 1.992 (0.095) | 0.008 | 1.977 (0.183) | 0.023 | 3.014 (0.239) | −0.014 | −0.646 (0.237) | 0.646 | 1.955 (0.464) | 0.045 |
3Trees-TSCTree | 7.039 (0.329) | −2.039 | 2.011 (0.095) | −0.011 | 0.010 (0.095) | −0.010 | 1.992 (0.094) | 0.008 | 1.977 (0.183) | 0.023 | 3.014 (0.231) | −0.014 | −0.645 (0.229) | 0.645 | 1.958 (0.448) | 0.042 |
Table 11.
Parameter averages, standard deviations, and bias of the estimated fixed-effect components produced by the 3Trees and LMMs (random slope model).
Model | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Scenario M1 | ||||||||||||||||
3Trees-CART (cp=0.0001) | 1.634 (1.179) | 3.366 | 1.999 (0.166) | 0.001 | −0.010 (0.092) | 0.010 | 3.000 (0.089) | 0.000 | 1.011 (0.076) | −0.011 | 2.885 (0.510) | 0.115 | 0.157 (0.506) | −0.157 | 1.000 (0.536) | 0.000 |
3Trees-CART (cp=0) | 1.463 (1.254) | 3.537 | 1.998 (0.166) | 0.002 | −0.010 (0.091) | 0.010 | 3.007 (0.088) | −0.007 | 1.012 (0.076) | −0.012 | 2.917 (0.518) | 0.083 | 0.117 (0.508) | −0.117 | 1.003 (0.529) | −0.003 |
3Trees-CTree | 1.774 (1.385) | 3.226 | 1.913 (0.171) | 0.087 | 0.004 (0.073) | −0.004 | 2.879 (0.132) | 0.121 | 1.002 (0.078) | −0.002 | 1.497 (0.669) | 1.503 | 0.010 (0.397) | −0.010 | 0.812 (0.534) | 0.188 |
3Trees-EvTree | 5.055 (0.701) | −0.055 | 1.977 (0.159) | 0.023 | 0.007 (0.078) | −0.007 | 3.012 (0.076) | −0.012 | 1.006 (0.077) | −0.006 | 2.919 (0.428) | 0.081 | 0.077 (0.405) | −0.077 | 1.063 (0.461) | −0.063 |
LMMs | 5.054 (0.565) | −0.054 | 1.983 (0.159) | 0.017 | 0.005 (0.073) | −0.005 | 3.007 (0.073) | −0.007 | 1.004 (0.078) | −0.004 | 2.965 (0.431) | 0.035 | 0.083 (0.431) | −0.083 | 1.063 (0.506) | −0.063 |
3Trees-TSCTree | 4.994 (0.346) | 0.006 | 1.983 (0.157) | 0.017 | 0.005 (0.073) | −0.005 | 3.007 (0.073) | −0.007 | 1.004 (0.078) | −0.004 | 2.973 (0.409) | 0.027 | 0.072 (0.408) | −0.072 | 1.054 (0.479) | −0.054 |
Scenario M2 | ||||||||||||||||
3Trees-CART (cp=0.0001) | 0.704 (1.539) | 4.296 | 3.006 (0.293) | −1.006 | −0.007 (0.088) | 0.007 | 3.002 (0.087) | −0.002 | 0.987 (0.090) | 0.013 | 2.030 (0.598) | −0.030 | −1.019 (0.658) | 1.019 | 1.013 (0.630) | −0.013 |
3Trees-CART (cp=0) | 0.588 (1.598) | 4.412 | 2.997 (0.292) | −0.997 | −0.005 (0.087) | 0.005 | 3.001 (0.086) | −0.001 | 0.987 (0.090) | 0.013 | 2.010 (0.591) | −0.010 | −0.977 (0.670) | 0.977 | 0.946 (0.638) | 0.054 |
3Trees-CTree | 3.491 (0.874) | 1.509 | 2.013 (0.217) | −0.013 | −0.002 (0.085) | 0.002 | 2.806 (0.149) | 0.194 | 0.981 (0.091) | 0.019 | 1.530 (0.527) | 0.470 | −0.953 (0.489) | 0.953 | 0.800 (0.740) | 0.200 |
3Trees-EvTree | 5.556 (0.886) | −0.556 | 2.347 (0.197) | −0.347 | 0.003 (0.073) | −0.003 | 2.994 (0.073) | 0.006 | 0.994 (0.078) | 0.006 | 1.900 (0.448) | 0.100 | −0.119 (0.556) | 0.119 | 1.011 (0.500) | −0.011 |
LMMs | 5.977 (0.698) | −0.977 | 3.672 (0.222) | −1.672 | −0.006 (0.121) | 0.006 | 3.003 (0.121) | −0.003 | 1.013 (0.130) | −0.013 | 1.903 (0.525) | 0.097 | −1.316 (0.521) | 1.316 | 0.972 (0.620) | 0.028 |
3Trees-TSCTree | 5.910 (0.430) | −0.910 | 3.672 (0.220) | −1.672 | −0.006 (0.121) | 0.006 | 3.003 (0.121) | −0.003 | 1.013 (0.129) | −0.013 | 1.906 (0.499) | 0.094 | −1.318 (0.494) | 1.318 | 0.986 (0.588) | 0.014 |
Scenario M3 | ||||||||||||||||
3Trees-CART (cp=0.0001) | 3.000 (1.292) | 2.000 | 2.016 (0.274) | −0.016 | 0.032 (0.114) | −0.032 | 1.985 (0.111) | 0.015 | 1.981 (0.118) | 0.019 | 3.154 (0.545) | −0.154 | −0.764 (0.53) | 0.764 | 1.826 (0.631) | 0.174 |
3Trees-CART (cp=0) | 2.801 (1.361) | 2.199 | 2.017 (0.273) | −0.017 | 0.029 (0.114) | −0.029 | 1.985 (0.111) | 0.015 | 1.982 (0.118) | 0.018 | 3.160 (0.549) | −0.160 | −0.753 (0.533) | 0.753 | 1.890 (0.632) | 0.110 |
3Trees-CTree | 4.131 (1.347) | 0.869 | 0.061 (0.232) | 1.939 | −0.010 (0.144) | 0.010 | 1.306 (0.195) | 0.694 | 1.844 (0.241) | 0.156 | 1.838 (0.673) | 1.162 | −0.633 (0.398) | 0.633 | 1.092 (0.896) | 0.908 |
3Trees-EvTree | 16.707 (1.053) | −11.707 | 2.002 (0.236) | −0.002 | 0.002 (0.121) | −0.002 | 1.998 (0.103) | 0.002 | 1.981 (0.111) | 0.019 | 3.162 (0.427) | −0.162 | −0.806 (0.487) | 0.806 | 1.895 (0.474) | 0.105 |
LMMs | 6.868 (0.633) | −1.868 | 2.014 (0.309) | −0.014 | 0.015 (0.209) | −0.015 | 1.998 (0.208) | 0.002 | 1.997 (0.224) | 0.003 | 3.120 (0.467) | −0.120 | −0.776 (0.47) | 0.776 | 1.939 (0.553) | 0.061 |
3Trees-TSCTree | 6.828 (0.387) | −1.828 | 2.014 (0.306) | −0.014 | 0.016 (0.208) | −0.016 | 1.998 (0.208) | 0.002 | 1.998 (0.223) | 0.002 | 3.115 (0.445) | −0.115 | −0.764 (0.447) | 0.764 | 1.943 (0.525) | 0.057 |
Table 12.
Parameter averages, standard deviations, and bias of the estimated fixed-effect components produced by the 3Trees and LMMs (random slope model, continued).
Model | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias | Estimate (SD) | Bias |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Scenario M1 | ||||||||||||||||
3Trees-CART (cp=0.0001) | −1.124 (1.511) | 6.124 | 1.979 (0.199) | 0.021 | −0.001 (0.129) | 0.001 | 3.003 (0.136) | −0.003 | 0.992 (0.170) | 0.008 | 2.934 (0.370) | 0.066 | 0.017 (0.369) | −0.017 | 0.902 (0.505) | 0.098 |
3Trees-CART (cp=0) | −1.054 (1.534) | 6.054 | 1.981 (0.201) | 0.019 | 0.003 (0.128) | −0.003 | 2.998 (0.135) | 0.002 | 0.995 (0.170) | 0.005 | 2.938 (0.373) | 0.062 | 0.022 (0.372) | −0.022 | 0.892 (0.511) | 0.108 |
3Trees-CTree | 0.229 (1.587) | 4.771 | 1.712 (0.206) | 0.288 | −0.011 (0.089) | 0.011 | 2.492 (0.234) | 0.508 | 0.983 (0.172) | 0.017 | 1.239 (0.656) | 1.761 | 0.011 (0.245) | −0.011 | 0.581 (0.592) | 0.419 |
3Trees-EvTree | 4.562 (0.915) | 0.438 | 1.978 (0.175) | 0.022 | −0.004 (0.105) | 0.004 | 2.989 (0.106) | 0.011 | 0.999 (0.172) | 0.001 | 2.971 (0.279) | 0.029 | −0.021 (0.275) | 0.021 | 0.881 (0.482) | 0.119 |
LMMs | 5.064 (0.613) | −0.064 | 1.982 (0.170) | 0.018 | −0.013 (0.090) | 0.013 | 2.995 (0.090) | 0.005 | 0.998 (0.174) | 0.002 | 2.975 (0.281) | 0.025 | −0.001 (0.275) | 0.001 | 0.922 (0.533) | 0.078 |
3Trees-TSCTree | 5.008 (0.375) | −0.008 | 1.982 (0.168) | 0.018 | −0.013 (0.090) | 0.013 | 2.995 (0.090) | 0.005 | 0.998 (0.173) | 0.002 | 2.984 (0.266) | 0.016 | −0.001 (0.260) | 0.001 | 0.922 (0.507) | 0.078 |
Scenario M2 | ||||||||||||||||
3Trees-CART (cp=0.0001) | −0.924 (1.51) | 5.924 | 3.441 (0.407) | −1.441 | 0.011 (0.116) | −0.011 | 3.003 (0.124) | −0.003 | 1.004 (0.176) | −0.004 | 1.982 (0.363) | 0.018 | −0.965 (0.576) | 0.965 | 0.899 (0.574) | 0.101 |
3Trees-CART (cp=0) | −1.020 (1.538) | 6.020 | 3.440 (0.408) | −1.440 | 0.003 (0.116) | −0.003 | 3.001 (0.125) | −0.001 | 1.007 (0.176) | −0.007 | 1.978 (0.362) | 0.022 | −0.987 (0.587) | 0.987 | 0.936 (0.579) | 0.064 |
3Trees-CTree | 2.477 (1.199) | 2.523 | 1.976 (0.226) | 0.024 | 0.004 (0.092) | −0.004 | 2.559 (0.213) | 0.441 | 0.990 (0.177) | 0.010 | 1.095 (0.447) | 0.905 | −0.267 (0.391) | 0.267 | 0.654 (0.599) | 0.346 |
3Trees-EvTree | 5.738 (0.924) | −0.738 | 2.598 (0.311) | −0.598 | 0.015 (0.093) | −0.015 | 3.009 (0.095) | −0.009 | 1.012 (0.178) | −0.012 | 2.007 (0.312) | −0.007 | −0.985 (0.356) | 0.985 | 0.926 (0.550) | 0.074 |
LMMs | 5.945 (0.727) | −0.945 | 3.671 (0.209) | −1.671 | 0.013 (0.105) | −0.013 | 3.012 (0.105) | −0.012 | 1.016 (0.202) | −0.016 | 1.963 (0.327) | 0.037 | −1.337 (0.328) | 1.337 | 0.917 (0.635) | 0.083 |
3Trees-TSCTree | 5.979 (0.443) | −0.979 | 3.670 (0.206) | −1.670 | 0.012 (0.105) | −0.012 | 3.012 (0.105) | −0.012 | 1.015 (0.202) | −0.015 | 1.967 (0.311) | 0.033 | −1.341 (0.311) | 1.341 | 0.910 (0.603) | 0.090 |
Scenario M3 | ||||||||||||||||
3Trees-CART (cp=0.0001) | 1.608 (1.380) | 3.392 | 1.895 (0.379) | 0.105 | 0.003 (0.111) | −0.003 | 2.001 (0.107) | −0.001 | 2.021 (0.194) | −0.021 | 2.988 (0.392) | 0.012 | −0.620 (0.400) | 0.620 | 2.041 (0.565) | −0.041 |
3Trees-CART (cp=0) | 1.605 (1.380) | 3.395 | 1.899 (0.377) | 0.101 | 0 (0.111) | 0.000 | 2.003 (0.107) | −0.003 | 2.019 (0.194) | −0.019 | 2.994 (0.392) | 0.006 | −0.621 (0.401) | 0.621 | 2.045 (0.566) | −0.045 |
3Trees-CTree | 3.472 (1.609) | 1.528 | 0.343 (0.226) | 1.657 | 0.001 (0.116) | −0.001 | 1.519 (0.236) | 0.481 | 1.770 (0.384) | 0.230 | 1.428 (0.633) | 1.572 | −0.438 (0.269) | 0.438 | 1.111 (1.009) | 0.889 |
3Trees-EvTree | 16.663 (1.284) | −11.663 | 1.945 (0.285) | 0.055 | 0.016 (0.104) | −0.016 | 1.980 (0.106) | 0.020 | 2.014 (0.197) | −0.014 | 3.002 (0.307) | −0.002 | −0.624 (0.306) | 0.624 | 2.108 (0.506) | −0.108 |
LMMs | 6.718 (0.682) | −1.718 | 1.974 (0.271) | 0.026 | 0.007 (0.140) | −0.007 | 1.986 (0.141) | 0.014 | 2.029 (0.271) | −0.029 | 3.029 (0.308) | −0.029 | −0.615 (0.309) | 0.615 | 2.062 (0.591) | −0.062 |
3Trees-TSCTree | 6.712 (0.420) | −1.712 | 1.974 (0.267) | 0.026 | 0.007 (0.139) | −0.007 | 1.985 (0.141) | 0.015 | 2.029 (0.270) | −0.029 | 3.031 (0.292) | −0.031 | −0.608 (0.292) | 0.608 | 2.069 (0.559) | −0.069 |
Appendix C
Data availability
The authors do not have permission to share data.
References
- 1.Snijders T.A., Bosker R.J. Multilevel Analysis. 2nd Edition. Sage Publications; London, UK: 2011.
- 2.Muradoglu M., Cimpian J.R., Cimpian A. Mixed-effects models for cognitive development researchers. J. Cogn. Dev. 2023;24(3):307–340. doi: 10.1080/15248372.2023.2176856.
- 3.Nurfadilah K., Aidi M.N., Notodiputro K.A., Susetyo B. Multilevel regressions for modeling mean scores of national examinations. BAREKENG J. Ilmu Mat. dan Terap. 2024;18(1):0323–0332. doi: 10.30598/barekengvol18iss1pp0323-0332.
- 4.Zhou H., Jiang S., Liu X. Regression analysis of intelligent education based on linear mixed effect model. J. Ambient Intell. Humaniz. Comput. 2021. doi: 10.1007/s12652-021-03038-7.
- 5.Bolker B.M., Brooks M.E., Clark C.J., Geange S.W., Poulsen J.R., Stevens M.H.H., White J.S.S. Generalized linear mixed models: a practical guide for ecology and evolution. Trends Ecol. Evol. 2009;24(3):127–135. doi: 10.1016/j.tree.2008.10.008.
- 6.Harrison X.A., Donaldson L., Correa-Cano M.E., Evans J., Fisher D.N., Goodwin C.E.D., Robinson B.S., Hodgson D.J., Inger R. A brief introduction to mixed effects modelling and multi-model inference in ecology. PeerJ. 2018;2018(5):1–32. doi: 10.7717/peerj.4794.
- 7.Cugnata F., Martoni R.M., Ferrario M., Di Serio C., Brombin C. Modeling physiological responses induced by an emotion recognition task using latent class mixed models. PLoS One. 2018;13(11):1–16. doi: 10.1371/journal.pone.0207123.
- 8.Moscatelli A., Mezzetti M., Lacquaniti F. Modeling psychophysical data at the population-level: the generalized linear mixed model. J. Vis. 2012;12(11):1–17. doi: 10.1167/12.11.26.
- 9.Damrah S., Elian M.I., Atyeh M., Shawtari F.A., Bani-Mustafa A. A linear mixed model approach for determining the effect of financial inclusion on bank stability: comparative empirical evidence for Islamic and conventional banks in Kuwait. Mathematics. 2023;11(7). doi: 10.3390/math11071698.
- 10.Gabrio A., Plumpton C., Banerjee S., Leurent B. Linear mixed models to handle missing at random data in trial-based economic evaluations. Health Econ. 2022;31(6):1276–1287. doi: 10.1002/hec.4510.
- 11.Kaplan B.A., Franck C.T., McKee K., Gilroy S.P., Koffarnus M.N. Applying mixed-effects modeling to behavioral economic demand: an introduction. Perspect. Behav. Sci. 2021;44(2–3):333–358. doi: 10.1007/s40614-021-00299-7.
- 12.Ali A., Jayaraman R., Azar E., Maalouf M. A comparative analysis of machine learning and statistical methods for evaluating building performance: a systematic review and future benchmarking framework. Build. Environ. 2024;252. doi: 10.1016/j.buildenv.2024.111268.
- 13.Hajjem A., Bellavance F., Larocque D. Mixed effects regression trees for clustered data. Stat. Probab. Lett. 2011;81(4):451–459. doi: 10.1016/j.spl.2010.12.003.
- 14.Sela R.J., Simonoff J.S. RE-EM trees: a data mining approach for longitudinal and clustered data. Mach. Learn. 2012;86(2):169–207. doi: 10.1007/s10994-011-5258-3.
- 15.Fontana L., Masci C., Ieva F., Paganoni A.M. Performing learning analytics via generalised mixed-effects trees. Data (Basel). 2021;6(7):1–31. doi: 10.3390/data6070074.
- 16.Pellagatti M., Masci C., Ieva F., Paganoni A.M. Generalized mixed-effects random forest: a flexible approach to predict university student dropout. Stat. Anal. Data Min. 2021;14(3):241–257. doi: 10.1002/sam.11505.
- 17.Vannucci G. Interpretable Semilinear Regression Trees. Dissertation; 2018.
- 18.Hastie T., Tibshirani R., Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer; New York: 2009.
- 19.Gottard A., Vannucci G., Grilli L., Rampichini C. Mixed-effect models with trees. Adv. Data Anal. Classif. 2023;17(2):431–461. doi: 10.1007/s11634-022-00509-3.
- 20.Ngufor C., Van Houten H., Caffo B.S., Shah N.D., McCoy R.G. Mixed effect machine learning: a framework for predicting longitudinal change in hemoglobin A1c. J. Biomed. Inform. 2019;89:56–67. doi: 10.1016/j.jbi.2018.09.001.
- 21.Zhang T., Geng G., Liu Y., Chang H.H. Application of Bayesian additive regression trees for estimating daily concentrations of PM2.5 components. Atmosphere (Basel). 2020;11(11). doi: 10.3390/atmos11111233.
- 22.Zhao Y., Zheng W., Zhuo D.Y., Lu Y., Ma X., Liu H., Zeng Z., Laird G. Bayesian additive decision trees of biomarker by treatment interactions for predictive biomarker detection and subgroup identification. J. Biopharm. Stat. 2018;28(3):534–549. doi: 10.1080/10543406.2017.1372770.
- 23.Grubinger T., Zeileis A., Pfeiffer K.P. evtree: evolutionary learning of globally optimal classification and regression trees in R. J. Stat. Softw. 2014;61(1):1–29. doi: 10.18637/jss.v061.i01.
- 24.Hothorn T., Hornik K., Zeileis A. Unbiased recursive partitioning: a conditional inference framework. J. Comput. Graph. Stat. 2006;15(3):651–674. doi: 10.1198/106186006X133933.
- 25.Breiman L. Classification and Regression Trees. Wadsworth; Belmont, CA: 1984.
- 26.Ferré Á., Poca M.A., Calzada M.D.D., Moncho D., Urbizu A., Romero O., Sampol G., Sahuquillo J. A conditional inference tree model for predicting sleep-related breathing disorders in patients with Chiari malformation type 1: description and external validation. J. Clin. Sleep Med. 2019;15(1):89–99. doi: 10.5664/jcsm.7578.
- 27.Castellví I., Castillo D., Corominas H., Mariscal A., Orozco S., Benito N., Pomar V., Baucells A., Mur I., Rosa-Carrillo D.D.L., Lobo D., Millan A.M., Sosa N.H.D., Filella D., Matas L., Martinez-Martinez L., Juarez C., Casademont J., Domingo P. Krebs von den Lungen-6 glycoprotein circulating levels are not useful as prognostic marker in COVID-19 pneumonia: a large prospective cohort study. Front. Med. 2022;9. doi: 10.3389/fmed.2022.973918.
- 28.Therneau T., Atkinson B., Ripley B. rpart: Recursive Partitioning and Regression Trees. CRAN R package documentation. 2023. https://cran.r-project.org/web/packages/rpart/rpart.pdf
- 29.Hothorn T., Hornik K., Strobl C., Zeileis A. party: A Laboratory for Recursive Partytioning. R package version 0.9-0. 2015. http://party.r-forge.r-project.org/
- 30.Hothorn T., Zeileis A. partykit: a modular toolkit for recursive partytioning in R. J. Mach. Learn. Res. 2015;16:3905–3909.
- 31.Blozis S.A., Harring J.R. On the estimation of nonlinear mixed-effects models and latent curve models for longitudinal data. Struct. Equ. Model. 2016;23(6):904–920. doi: 10.1080/10705511.2016.1190932.
- 32.Chinchilli V.M., Esinhart J.D., Miller W.G. Partial likelihood analysis of within-unit variances in repeated measurement experiments. Biometrics. 1995;51(1):205. doi: 10.2307/2533326.
- 33.Lin X., Raz J., Harlow S.D. Linear mixed models with heterogeneous within-cluster variances. Biometrics. 1997;53(3):910. doi: 10.2307/2533552.
- 34.Welsh A.H., Richardson A.M. Approaches to the robust estimation of mixed models. Handb. Stat. 1997;15:343–384. doi: 10.1016/S0169-7161(97)15015-5.
- 35.Zhang R., Xin R., Seltzer M., Rudin C. Optimal sparse survival trees. Proc. Mach. Learn. Res. 2024;238:352–360.