PSHREG: A SAS macro for proportional and nonproportional subdistribution hazards regression

Maria Kohl; Max Plischke; Karen Leffondré; Georg Heinze

doi:10.1016/j.cmpb.2014.11.009

Comput Methods Programs Biomed. 2015 Feb;118(2):218–233.

PMCID: PMC4673318 PMID: 25572709

Highlights

• The %pshreg SAS macro fits Fine-Gray models for competing risks.
• The macro first modifies a given data set and then uses PROC PHREG for analysis.
• Many useful features of PROC PHREG can now be applied to a Fine-Gray model.
• Time-dependent effects can be accommodated by time-by-covariate interactions.
• For small data sets, the Firth correction is available.

Keywords: Regression analysis, Survival analysis, Competing risks, SAS software, Subdistribution hazard ratio, Cumulative incidence

Abstract

We present a new SAS macro %pshreg that can be used to fit a proportional subdistribution hazards model for survival data subject to competing risks. Our macro first modifies the input data set appropriately and then applies SAS's standard Cox regression procedure, PROC PHREG, to the modified data set, using weights and the counting-process style of specifying survival times. The modified data set can also be used to estimate cumulative incidence curves for the event of interest. The application of PROC PHREG has several advantages, e.g., it directly enables the user to apply the Firth correction, which has been proposed as a solution to the problem of undefined (infinite) maximum likelihood estimates in Cox regression, frequently encountered in small sample analyses.

Deviation from proportional subdistribution hazards can be detected by both inspecting Schoenfeld-type residuals and testing correlation of these residuals with time, or by including interactions of covariates with functions of time. We illustrate application of these extended methods for competing risk regression using our macro, which is freely available at: http://cemsiis.meduniwien.ac.at/en/kb/science-research/software/statistical-software/pshreg, by means of analysis of a real chronic kidney disease study. We discuss differences in features and capabilities of %pshreg and the recent (January 2014) SAS PROC PHREG implementation of proportional subdistribution hazards modelling.

1. Introduction

Competing risks arise in the analysis of time-to-event data, if for some subjects the event of interest is precluded by a different type of event occurring before. Competing risks may be encountered, e.g., if interest focuses on a specific non-terminal event type such as dialysis onset. In such a situation, death before dialysis onset constitutes a competing risk.

It has frequently been pointed out that in the presence of competing risks, the standard product-limit method for estimating the survival function for the event of interest yields biased results; cf., e.g., Bakoyannis and Touloumi [1]. The main assumption of this method is that any subject whose survival time is censored will experience the event of interest if followed up long enough. This does not hold if competing risks are present, as the occurrence of the event of interest is precluded or its probability of occurrence is modified by an antecedent competing event. As a remedy, the cumulative distribution function, generally denoted cumulative incidence function (CIF), proposed by Kalbfleisch and Prentice [2] can be used. While the naive product-limit estimate of the CIF, which treats the competing risk as an independent censoring mechanism, will reach 1 with infinite follow-up time, the proper CIF estimate never reaches 1 because a certain proportion of subjects will experience the competing event.

Two different ways of modelling competing risks data have been proposed. The first one analyses the cause-specific hazard of each event type separately, by applying Cox regression targeting each event type in turn, and censoring all other event types. From a complete analysis of all event types with such cause-specific Cox models, cumulative incidence curves for an event of interest can be estimated using specialized software [3]. By contrast, the proportional subdistribution hazards (PSH) model proposed by Fine and Gray [4] directly aims at modelling differences in the cumulative incidence of an event of interest. Its estimation is based on modified risk sets, where subjects experiencing the competing event are retained even after their competing event. In case of censoring (which is the rule rather than the exception), a modification of this simple principle was proposed such that the weight of those subjects artificially retained in the risk sets is gradually reduced according to the conditional probability of being under follow-up had the competing event not occurred.

The two different approaches to competing risks modelling are both appropriate but have different aims: while modelling the cause-specific hazards targets aetiologic research questions by modelling the effect of covariates on event rates among subjects at risk, the PSH model is useful for medical decision making and prognosis, as it models the absolute risk of an event. Differences between these two approaches are discussed, e.g., in Wolbers et al. [5] and Andersen et al. [6]. Here we focus on the latter approach.

Programs for fitting a Fine-Gray regression model are available in R (e.g., package cmprsk by Gray [7]) and STATA (stcrreg command by StataCorp. [8]). Waltoft [9] describes a SAS macro for cumulative incidence curve estimation via Poisson regression. Zhang and Zhang [10] provide a SAS macro for the special case of computing adjusted cumulative incidence curves for two treatment groups. However, a publicly available and fairly general implementation of the Fine-Gray regression model in SAS has so far been missing. Our SAS macro %pshreg was written to fill this gap. The macro modifies an input data set by separating follow-up periods of patients with competing events into several disjoint sub-periods with declining weights, following the suggestions in Fine and Gray [4] and Geskus [11]. This allows using SAS's PROC PHREG to compute the proportional subdistribution hazards model. Thus, all options offered by PROC PHREG for verifying and relaxing the assumption of proportional subdistribution hazards can be used, including the computation and display of unscaled and scaled Schoenfeld-type residuals or extending the model by time-dependent effects of covariates.

In very small data sets with few events, monotone pseudo-likelihood may cause parameter estimates to diverge to ±∞. This phenomenon typically occurs if events are observed in only one of two levels of a binary covariate. The application of the Firth correction [12], which is readily implemented in PROC PHREG, may be useful in such circumstances [13], [14].

In the remainder of this manuscript we first briefly review the estimation of proportional subdistribution hazards models with time-fixed and time-dependent effects, and illustrate Firth's bias correction method. Later we explain the most important macro options in detail and explain the structure of the macro code. Subsequently, a worked example illustrates different aspects of application of the macro. The manuscript closes by discussing what has been achieved and the differences in features and capabilities of %pshreg and the recent SAS implementation of PSH modelling in PROC PHREG (SAS/STAT Version 13.1, released in January 2014).

2. Methodological background

2.1. An example for competing risks

As a motivating example, we consider a study on chronic kidney disease (CKD) recently performed at the Medical University of Vienna including 273 patients, who were diagnosed with CKD between stages I and IV at their first visit to the University's outpatient department [15]. In this study, researchers were interested in modelling the time to dialysis onset, i.e., the time to progression to end stage renal disease. The continuous variables age per decade (age), log₂ of urine osmolarity (logUosm), log₂ of creatinine clearance (logCCR), log₂ of proteinuria (logProt) and the binary variables beta-blocker (bblock), diuretic therapies (diur) and type of underlying renal disease (pkd), all measured at the day of diagnosis, were considered important for the prognosis of the cumulative incidence of dialysis onset. The median follow-up time of the study was 92 months. The event of interest, dialysis onset, could be observed in 105 patients (38.46%). CKD patients are at higher risk of mortality than the normal population, and thus, in our study, death before dialysis onset constitutes a competing event (n = 35, 12.82%).

2.2. Notation

Generally, we assume observations on m covariates in n subjects, and we let x_il, t_i and ɛ_i denote subject i's (i = 1, …, n) values of covariate l (l = 1, …, m), observed survival time and follow-up status, respectively. For the latter, we consider ɛ_i = 0, ɛ_i = 1, and ɛ_i = 2 as denoting a censored time t_i, an event of interest occurred at t_i, and a competing event occurred at t_i, respectively. Furthermore, we assume that there are r distinct times at which events of interest have occurred, and that t_(1), …, t_(r) denote the corresponding ordered event times.

2.3. Non-parametric cumulative incidence estimation

Without loss of generality, we assume that there is one event type of interest, dialysis onset in our example, and only one competing event, death. These two types of events are in the sequel denoted by event type 1 and event type 2. In the competing risks literature, the event types are also denoted as causes of an event. Let λ_k(t) and Λ_k(t) denote the cause-specific hazard functions and cause-specific cumulative hazard functions, respectively, for event type (cause) k, k = 1, 2. The cause-specific cumulative incidence function F₁(t), describing the cumulative probability of subjects experiencing event type 1 up to time t, is given by

F_{1} (t) = \int_{0}^{t} S (s) λ_{1} (s) d s

Note that S(t) is the survival function of time to the first of the two event types, given by S(t) = e^{−Λ₁(t)−Λ₂(t)}. F_k(t) has also been denoted as the ‘subdistribution’, reflecting the fact that it does not reach 1 in the presence of a competing risk. In the absence of competing risks, the cumulative hazard and cumulative incidence (one minus survivor) function are connected by the relationship F(t) = 1 − e^{−Λ(t)}. This unique correspondence is lost with competing risks, because the cumulative incidence for the event of interest depends on the cause-specific hazard of the competing event [6]. Consequently, the Kaplan-Meier estimator of the cause-specific cumulative incidence function, obtained by simply censoring all observations with a competing event, is biased since 1 − S_k(t) ≥ F_k(t) (k = 1, 2) [1]. Instead, the cumulative incidence function F₁(t) at the ordered event times t_(j), j = 1, …, r, should be estimated by the non-parametric plug-in estimator [1]

{\hat{F}}_{1} (t_{(j)}) = \sum_{i = 1}^{j} {\hat{λ}}_{1} (t_{(i)}) \hat{S} (t_{(i - 1)}) = \sum_{i = 1}^{j} \frac{d_{1 (i)}}{n_{(i)}} \hat{S} (t_{(i - 1)})

where d_1(j) is the number of events of type 1 observed at t_(j), n_(j) is the number of patients at risk for both events just before t_(j), $\hat{S} (t)$ is the Kaplan-Meier estimator of the survival function of time to the first event at t and $\hat{S} (t_{(0)}) = 1$.
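As a minimal numerical sketch of the plug-in estimator, consider a hypothetical toy data set (not taken from the CKD study) of five subjects with observed times 1, 2, 3, 4 and 5. The subjects at times 1, 3 and 5 experience event type 1, the subject at time 2 experiences the competing event, and the subject at time 4 is censored. We assume here that the Kaplan-Meier factor is read as the estimated probability of being free of any event just before the respective event time, written $\hat{S}(t^-)$ below, so that the drop at the competing event time 2 is included.

```latex
\begin{align*}
\hat{S}(1^-) &= 1, \qquad
\hat{S}(3^-) = \tfrac{4}{5}\cdot\tfrac{3}{4} = \tfrac{3}{5}, \qquad
\hat{S}(5^-) = \tfrac{3}{5}\cdot\tfrac{2}{3} = \tfrac{2}{5} \\
\hat{F}_1(5) &= \tfrac{1}{5}\cdot 1 + \tfrac{1}{3}\cdot\tfrac{3}{5} + \tfrac{1}{1}\cdot\tfrac{2}{5} = \tfrac{4}{5} \\
1 - \hat{S}_1^{\mathrm{KM}}(5) &= 1 - \tfrac{4}{5}\cdot\tfrac{2}{3}\cdot 0 = 1
\end{align*}
```

In this toy example the naive Kaplan-Meier approach, which censors the subject with the competing event, gives a cumulative incidence of 1 at time 5, whereas the plug-in estimate is 0.8. The remaining 0.2 is the estimated cumulative incidence of the competing event, (1/4)(4/5), which illustrates the inequality 1 − S_k(t) ≥ F_k(t).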

2.4. Proportional subdistribution hazard regression

The proportional subdistribution hazard model was proposed by Fine and Gray [4] in order to estimate the effect of covariates on the cumulative incidence of the event of interest.

2.4.1. The Fine-Gray model

We consider T as the (partly unobservable) random variable describing the time at which the first event of any type occurs in an individual, and ɛ ∈ {1, 2} as the event type related to that time. The subdistribution hazard of event type 1, h₁(t, X), is defined as

h_{1} (t, X) = \lim_{Δ t \to 0} \frac{1}{Δ t} \Pr \{t \leq T \leq t + Δ t \land ε = 1 \mid T \geq t \lor (T \leq t \land ε = 2), X\}

(1)

with X denoting a row vector of covariates, and ' ∨ ' and ' ∧ ' denoting the logical OR and AND, respectively. Following Fine and Gray [4], the subdistribution hazard in Eq. (1) can be modelled as a function of a parameter vector β through

h_{1} (t, X) = h_{1,0} (t) e^{X β}

where h_1,0 is an unspecified baseline subdistribution hazard function.

The partial likelihood of the proportional subdistribution hazards model was defined by Fine and Gray as

L (β) = \prod_{j = 1}^{r} \frac{\exp (x_{(j)} β)}{\sum_{i \in ℛ (t_{(j)})} w_{i} (t_{(j)}) \exp (x_{i} β)}

where x_(j) is the covariate row vector of the subject experiencing an event of type 1 at t_(j). For simplicity, no ties in event times are assumed here, but both the Breslow and Efron tie corrections can incorporate weights and can thus be used in this model. The risk set ℛ(t) is defined as

ℛ (t) = \{i; t_{i} \geq t \lor (t_{i} \leq t \land ε_{i} = 2)\}

and includes the set of individuals who, at time t, are at risk of event type 1 as well as those who have had a competing event before t. The subject- and time-specific weights $w_{i} (t)$ are needed as soon as censoring occurs. Subjects who are at risk for an event of type 1 at time t, and who have not failed from an event of type 2 before t, participate fully in ℛ(t) with $w_{i} (t) = 1$, whereas $w_{i} (t) \leq 1$ for subjects with competing events at t_i < t. Formally,

w_{i} (t) = \begin{cases} 1 & \text{if } t_{i} \geq t \\ \frac{\hat{G} (t)}{\hat{G} (t_{i})} & \text{if } ε_{i} = 2 \land t_{i} < t \end{cases}

(2)

Here, $\hat{G} (t)$ denotes an estimator of the survival function of the censoring distribution at t, i.e., the cumulative probability of still being followed-up at t. $\hat{G} (t)$ can be estimated by the product-limit method with reverse meaning of the censoring indicator:

\hat{G} (t) = \prod_{t_{i} < t} \left( 1 - \frac{\sum_{j = 1}^{n} I (t_{j} = t_{i} \land ε_{j} = 0)}{\sum_{j = 1}^{n} I (t_{j} \geq t_{i})} \right)

(3)

If there is no random censoring, i.e., every subject was followed up until a specified administrative censoring date (end of follow-up), the Fine-Gray model simplifies considerably. It can then be estimated by a Cox regression model in which times to competing events are replaced by times to administrative censoring and treated as censored [1]. This simplification assumes that subjects who experienced an event could have been followed up until the administrative censoring date.
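To make Eqs. (2) and (3) concrete, the following worked sketch uses a hypothetical toy data set (not taken from the CKD study): five subjects with observed times 1, 2, 3, 4 and 5, with events of type 1 at times 1, 3 and 5, a competing event at time 2, and a censored observation at time 4. The only censoring occurs at time 4, when two subjects (those with times 4 and 5) are still under observation.

```latex
\begin{align*}
\hat{G}(t) &= 1 \quad \text{for } t \leq 4, \qquad
\hat{G}(5) = 1 - \tfrac{1}{2} = \tfrac{1}{2} \\
w(3) &= \frac{\hat{G}(3)}{\hat{G}(2)} = 1, \qquad
w(5) = \frac{\hat{G}(5)}{\hat{G}(2)} = \tfrac{1}{2}
  \quad \text{(subject with the competing event at } t_i = 2\text{)} \\
\sum_{i \in ℛ(1)} w_i(1) &= 5, \qquad
\sum_{i \in ℛ(3)} w_i(3) = 3 + 1 = 4, \qquad
\sum_{i \in ℛ(5)} w_i(5) = 1 + \tfrac{1}{2} = \tfrac{3}{2}
\end{align*}
```

The subject with the competing event therefore stays in the risk sets at times 3 and 5, but contributes only half a subject at time 5, after the censoring at time 4 has reduced the estimated probability of still being followed up.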

2.4.2. Fitting a Fine-Gray model with standard software

The proportional subdistribution hazards model can be estimated using any standard software for Cox regression that allows representation of times in counting process style, i.e., in (start, stop] syntax, and weighting [11]. In the following, we describe how to model the time to event type 1; in this case, unmodified data can be used for the subjects who either experience event type 1 or who are censored, but the data of subjects who experience event type 2 have to be modified.

In particular, the event history of each subject i is represented in counting process style as one or several disjoint time intervals. For individuals who did not fail from the event of type 2 before failing from the event of type 1, these episodes, described by (start time, stop time, status indicator), are just $(0, t_{i}, ε_{i}^{*})$, where the modified censoring indicator $ε_{i}^{*}$ is 1 for event type 1, and 0 for a censored time. However, for subjects experiencing event type 2 before failing from the event of type 1, additional time intervals are created after their time to event of type 2. Assuming that the event times t_i of such subjects are such that t_(j) ≤ t_i < t_(j+1), the first time interval is given by (0, t_i, 0), the second one by (t_i, t_(j+1), 0) and further time intervals by (t_(j+1), t_(j+2), 0), (t_(j+2), t_(j+3), 0), …, (t_(r−1), t_(r), 0). These time intervals are assigned the decreasing weights 1, $w_{i} (t_{(j + 1)}), w_{i} (t_{(j + 2)}), \dots, w_{i} (t_{(r)})$, respectively (Table 1). With this modification and weighting of the original data, the parameters of a PSH model can now be estimated using SAS's PROC PHREG.

Table 1.

Counting process representation and weighting of original survival information for a Fine-Gray model describing time to event type 1.

Subject	Observed survival time	Observed status	Counting process representation (Fine-Gray model)	Weight (Fine-Gray model)
A	t_A	censored	(0, t_A, 0)	1
B	t_B	event 1	(0, t_B, 1)	1
C	t_C^*	event 2	(0, t_C, 0)	1
			(t_C, t_(j+1), 0)	$\frac{\hat{G} (t_{(j + 1)})}{\hat{G} (t_{C})}$
			(t_(j+1), t_(j+2), 0)	$\frac{\hat{G} (t_{(j + 2)})}{\hat{G} (t_{C})}$
			etc.	etc.
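The layout of Table 1 can be sketched by hand for a very small case. The data below are a hypothetical toy example (not taken from the CKD study): five subjects with events of type 1 at times 1, 3 and 5, a competing event at time 2 (subject 2) and a censored time at 4, so that the weights of subject 2 are 1 and 0.5 as implied by Eqs. (2) and (3). The variable names _id_, _start_, _stop_, _censcrr_ and _weight_ follow the PROC PHREG listings in this paper; the data set name toy_crr and the covariate x are assumptions made for illustration only.

```sas
/* Hypothetical toy data in counting-process form (not from the CKD study). */
/* Subject 2 has a competing event at t=2 and is retained in later risk     */
/* sets: interval (2,3] with weight G(3)/G(2)=1, (3,5] with G(5)/G(2)=0.5.  */
data toy_crr;
  input _id_ _start_ _stop_ _censcrr_ _weight_ x;
  datalines;
1 0 1 1 1.0 1
2 0 2 0 1.0 1
2 2 3 0 1.0 1
2 3 5 0 0.5 1
3 0 3 1 1.0 0
4 0 4 0 1.0 0
5 0 5 1 1.0 1
;
run;

/* Weighted Cox regression on the expanded data. COVS(AGGREGATE) together */
/* with the ID statement requests robust standard errors that are summed  */
/* over all rows belonging to the same subject.                           */
proc phreg data=toy_crr covs(aggregate);
  model (_start_, _stop_)*_censcrr_(0) = x;
  id _id_;
  weight _weight_;
run;
```

With real data, the expansion and the weights would be produced by the macro rather than typed in by hand; the sketch only shows what the resulting rows look like for a subject with a competing event.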

Parameter	Definition
data	specifies the input SAS data set. The default value is _LAST_.
cens	specifies a variable containing the censoring indicator corresponding to each observation in time. There is no default value. The censoring indicator variable is expected to assume the value 1 for a time to the event of interest, the value 2 for a time to the competing event, and 0 for censored times.
time	specifies a variable containing the times to the first event, or time to censoring. There is no default value.
admin	specifies a variable containing administrative censoring times. Use this option if there is purely administrative censoring and no random censoring. If this option is used, the Fine-Gray model can be estimated using a Cox model by replacing competing event times by administrative censoring times, and censoring competing events [1].
varlist	specifies a list of independent variables, separated by blanks. There is no default value.
class	specifies a subset of the variables of varlist, which should be interpreted as factor variables with multiple levels instead of continuous covariates.
out	specifies the output SAS data set including all covariables, the start and stop times and the weights of the observations. The default name is dat_crr.
firth=1	requests the Firth penalization (default=0).
action=estimate	requests the estimation of the Fine-Gray proportional subdistribution hazards model using as covariates all variables specified in varlist (default). If action=code the PSH model is not estimated but the code needed to obtain the PSH analysis using PROC PHREG is printed in the Log window.
by	specifies a variable which can be used to define subsets of a larger data set; useful for efficient processing of multiple data sets of the same structure. This option can also be used for computing weights separately in different strata. The input data set is automatically sorted by this variable.
options	specifies options which are passed to PROC PHREG's MODEL statement. Examples: options=%str(rl=pl) requests profile likelihood confidence limits for subdistribution hazard ratios; options=%str(selection=backward slstay=0.05) requests backward variable selection at a 5% significance level; options=%str(ties=efron) employs Efron's tie correction instead of the default Breslow correction. The SAS-specific %str(arg) construct causes the argument arg to be used as a string without evaluation. Its use is necessary to prevent SAS from interpreting arg before passing it to the MODEL statement of PROC PHREG.
id	specifies a patient identifier variable (usually not needed if each distinct patient is represented by exactly one observation in the input data set).
cuminc=1	requests a plot of unadjusted cumulative incidence curves estimated from the empirical subdistribution hazard, stratified by the levels of the first variable specified in varlist. The default value is 0 (no cumulative incidence curve estimation).
call	specifies an output SAS data set which collects all values of macro options for later reference.
clean=1	requests cleaning of the working data set (default), i.e., only the relevant variables mentioned in the macro call are kept.
missing=drop	drops observations with missing values from the modified data set (default). To keep observations with missing values in any variable of varlist, set missing=keep.
delwork=1	deletes all working data sets on exit (default).
tiedcens=after	specifies whether censored times that are tied with event times should be interpreted as occurring slightly after the event times (the default) or slightly before them.
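The options above can be combined in a single macro call. The following calls are a minimal sketch, assuming that the %pshreg macro has already been compiled in the current SAS session and that a data set named CKDstudy with the variables Time, status and the listed covariates exists (names as in the worked example of this paper). Only options documented in the table are used.

```sas
/* Default analysis: estimate the Fine-Gray model; the modified data set */
/* is written to dat_crr, the default value of out=.                     */
%pshreg(data=CKDstudy, time=Time, cens=status,
        varlist=age logUosm logProt logCCR pkd bblocker diur);

/* Do not estimate the model; print the PROC PHREG code needed for the   */
/* analysis to the log instead, so that it can be edited and re-run.     */
%pshreg(data=CKDstudy, time=Time, cens=status,
        varlist=age logUosm logProt logCCR pkd bblocker diur,
        action=code);

/* Firth penalization with profile likelihood confidence limits; %str()  */
/* passes rl=pl unevaluated to the MODEL statement of PROC PHREG.        */
%pshreg(data=CKDstudy, time=Time, cens=status,
        varlist=age logUosm logProt logCCR pkd bblocker diur,
        firth=1, options=%str(rl=pl));
```

The second form is the natural starting point when the analysis needs PROC PHREG features that the macro does not expose directly, such as programming statements for time-dependent effects.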

Prognostic factor	SAS variable name	Median (1st, 3rd quartiles) or N (%)
Age (in decades)	age	5.6 (4.2, 6.7)
Osmolarity (mOsmol/L)	logUosm^*	510 (414, 622)
Proteinuria (g/L)	logProt^*	0.87 (0.23, 2.35)
Creatinine clearance (ml/min)	logCCR^*	47.5 (30.3, 79.0)
PKD	pkd	18 (6.59%)
Beta-blocker	bblock	119 (47.0%)
Diuretics	diur	120 (47.4%)

The PSHREG macro: summary of macro options

Macro option	Assigned value	Remark
data	CKDstudy	Input data set
time	Time	Time variable
cens	status	Censoring variable
failcode	1	Code for event of interest
cencode	0	Code for censored observation
tiedcens	after	How censored times tied with event times should be treated
admin		Administrative censoring time variable
varlist	age	List of covariables
	logUosm
	logProt
	logCCR
	pkd
	bblocker
	diur
class		List of class variables
options		Options to be passed to PROC PHREG
firth	0	Standard ML estimation, no Firth correction
id		Subject identifier
by		BY processing variable
cuminc	0	Requests cumulative incidence curves
action	code	No output produced (see log file)
weights	0	Standard model, no weighting of risk sets
clean	1	Unnecessary variables removed
call	_pshregopt	Data set with this call's macro options
out	dat_crr	Output data set for standard Fine-Gray model
missing	drop	Delete observations with missing covariate values
statustab	1	Summary of status variable requested
delwork	1	Temporary data sets deleted on exit
------------	-----------	------------------------------------------
macro version	2014.09
build	201409301506

The PSHREG macro: Summary of missing values in covariates and outcome variables

Remark	Covariates	Outcome (Time, status)	Total
Observations deleted in input data set because of missing values:	21	0	21

The PSHREG macro: Summary of status variable

Obs	_status	COUNT	PERCENT
1	Censored	119	47.2222
2	Events of interest	100	39.6825
3	Competing events	33	13.0952

The PHREG Procedure

Analysis of Maximum Likelihood Estimates

Parameter	DF	Parameter Estimate	Standard Error	StdErr Ratio	Chi-Square	Pr > ChiSq

age	1	−0.13910	0.08002	1.171	3.0219	0.0821
logUosm	1	0.71233	0.33328	1.110	4.5682	0.0326
logProt	1	0.61339	0.07247	0.955	71.6483	<.0001
logCCR	1	−1.91238	0.23115	1.033	68.4487	<.0001
PKD	1	1.23414	0.34888	1.010	12.5133	0.0004
BBlocker	1	0.42904	0.23475	1.089	3.3403	0.0676
diur	1	0.48149	0.23248	1.048	4.2895	0.0383

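/* Fine-Gray model fitted to the modified, weighted data set written by %pshreg */
PROC PHREG DATA=dat_crr covs(aggregate);
	MODEL (_start_,_stop_)*_censcrr_(0)=age logUosm logProt logCCR pkd bblocker diur/rl;
	WEIGHT _weight_;
	/* predicted curves for the covariate patterns stored in datameans */
	BASELINE out=cuminccurves covariates=datameans survival=_surv_;
RUN;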
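The BASELINE statement in the listing above stores an estimated "survival" function in the variable _surv_. Because the model is fitted to the modified, weighted data set, one minus this quantity can be read as the model-based cumulative incidence of the event of interest. The following lines are a minimal sketch of that conversion, assuming that the data set cuminccurves and the variable _surv_ from the listing above exist; the data set name cuminc_pred and the variable name _cuminc_ are hypothetical.

```sas
/* Convert the subdistribution "survival" estimate into a cumulative */
/* incidence estimate for each covariate pattern and time point.     */
data cuminc_pred;
  set cuminccurves;
  _cuminc_ = 1 - _surv_;
  label _cuminc_ = "Predicted cumulative incidence";
run;
```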

PROC PHREG DATA=dat_crr covs(aggregate) outest=estimates;
	MODEL (_start_,_stop_)*_censcrr_(0)=age logUosm logProt logCCR pkd bblocker diur;
	OUTPUT out=schoenfeld_data wtressch=WSR_age WSR_logUosm WSR_logProt
	WSR_logCCR WSR_pkd WSR_bblocker WSR_diur;
	ID _id_;
	WEIGHT _weight_;
	BY _by_;
RUN;

DATA schoenfeld_data;
	MERGE schoenfeld_data (keep=WSR_logCCR _stop_ _by_) estimates;
	BY _by_;
	/* add the estimated coefficient (from OUTEST=) to the weighted Schoenfeld residual */
	rescaled_WSR_logCCR=WSR_logCCR+logCCR;
	LABEL rescaled_WSR_logCCR="beta(t) of log2 of creatinine clearance" _stop_="Time in years";
RUN;

ODS GRAPHICS on;
ODS SELECT fitplot;
PROC LOESS DATA=schoenfeld_data
	PLOTS=residuals(smooth);
	MODEL rescaled_WSR_logCCR=_stop_/clm;
RUN;
ODS GRAPHICS off;

The PHREG Procedure

Analysis of Maximum Likelihood Estimates

Parameter	Hazard Ratio	95% Hazard Ratio Confidence Limits

age	0.870	0.744	1.018
logUosm	2.039	1.061	3.918
logProt	1.847	1.602	2.129
logCCR	0.148	0.094	0.232
PKD	3.435	1.734	6.807
BBlocker	1.536	0.969	2.433
diur	1.618	1.026	2.553

The CORR Procedure

Pearson Correlation Coefficients
Prob > |r| under H0: Rho=0
Number of Observations

	_stop_	logstop
WSR_age	−0.08184	−0.04724
Standardized Schoenfeld Residual age	0.4182	0.6407
	100	100
WSR_logUosm	−0.11007	−0.15490
Standardized Schoenfeld Residual logUosm	0.2756	0.1238
	100	100
WSR_logProt	−0.12749	−0.04825
Standardized Schoenfeld Residual logProt	0.2062	0.6336
	100	100
WSR_logCCR	0.30715	0.34600
Standardized Schoenfeld Residual logCCR	0.0019	0.0004
	100	100
WSR_pkd	0.08729	0.09667
Standardized Schoenfeld Residual PKD	0.3878	0.3387
	100	100
WSR_bblocker	0.08781	0.07293
Standardized Schoenfeld Residual BBlocker	0.3850	0.4709
	100	100
WSR_diur	−0.07860	−0.08767
Standardized Schoenfeld Residual diur	0.4370	0.3857
	100	100

The PHREG Procedure

Analysis of Maximum Likelihood Estimates

Parameter	DF	Parameter Estimate	Standard Error	StdErr Ratio	Chi-Square	Pr>ChiSq

age	1	−0.12462	0.07846	1.128	2.5228	0.1122
logUosm	1	0.65759	0.31375	1.055	4.3927	0.0361
logProt	1	0.60890	0.07306	0.955	69.4532	<.0001
logCCR	1	−2.64152	0.26778	0.812	97.3099	<.0001
PKD	1	1.14126	0.33683	0.968	11.4801	0.0007
BBlocker	1	0.40915	0.22757	1.049	3.2326	0.0722
diur	1	0.47390	0.22469	1.005	4.4484	0.0349
logstop1*logCCR	1	0.83933	0.19486	0.781	18.5532	<.0001

Analysis of Maximum Likelihood Estimates

Parameter	Hazard Ratio	95% Hazard Ratio Confidence Limits		Label

age	0.883	0.757	1.030	age
logUosm	1.930	1.044	3.570
logProt	1.838	1.593	2.121
logCCR	.	.	.
PKD	3.131	1.618	6.058
BBlocker	1.506	0.964	2.352	BBlocker
diur	1.606	1.034	2.495
logstop1*logCCR	.	.	.	logstop1 * logCCR

PROC CORR DATA=schoenfeld_data;
	VAR _stop_ logstop;
	WITH WSR_age WSR_logUosm WSR_logProt WSR_logCCR WSR_pkd WSR_bblocker
	WSR_diur;
RUN;

PROC PHREG DATA=dat_crr covs(aggregate);
	MODEL (_start_,_stop_)*_censcrr_(0)=age logUosm logProt logCCR pkd bblocker diur logCCR*logstop1/rl;
	/* time-dependent covariate, evaluated at each event time */
	logstop1=log(_stop_+1/12);
	ID _id_;
	WEIGHT _weight_;
	HAZARDRATIO logCCR/at(logstop1=-2.49 -0.54 0.08 1.13 1.63);
RUN;

Hazard Ratios for logCCR

Description	Point Estimate	95% Wald Robust Confidence Limits

logCCR Unit=1 At logstop1=−2.49	0.009	0.002	0.033
logCCR Unit=1 At logstop1=−0.54	0.045	0.023	0.088
logCCR Unit=1 At logstop1=0.08	0.076	0.046	0.127
logCCR Unit=1 At logstop1=1.13	0.184	0.118	0.286
logCCR Unit=1 At logstop1=1.63	0.280	0.165	0.474
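The hazard ratios in this table follow from the two logCCR coefficients of the time-dependent model. As a worked sketch using the estimates reported above (−2.64152 for logCCR and 0.83933 for the interaction logstop1*logCCR), the subdistribution hazard ratio (SHR) per unit of logCCR at a given value of logstop1 = log(_stop_ + 1/12) can be reproduced as follows:

```latex
\begin{align*}
\mathrm{SHR}(t) &= \exp\{-2.64152 + 0.83933 \cdot \log(t + 1/12)\} \\
\text{logstop1} = -2.49: \quad & \exp(-2.64152 - 2.08993) = \exp(-4.73145) \approx 0.009 \\
\text{logstop1} = 0.08: \quad & \exp(-2.64152 + 0.06715) = \exp(-2.57437) \approx 0.076 \\
\text{logstop1} = 1.63: \quad & \exp(-2.64152 + 1.36811) = \exp(-1.27341) \approx 0.280
\end{align*}
```

If time is measured in years, as the axis label in the residual plotting code suggests, the logstop1 values −2.49, −0.54, 0.08, 1.13 and 1.63 correspond to roughly 0, 0.5, 1, 3 and 5 years, so the estimated protective association of logCCR weakens over follow-up.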

Prognostic factor	SAS variable name	Median (1st, 3rd quartiles) or N (%)
Age, binary (>60 years)	dage60	17 (8.5%)
Osmolarity, binary (>500 mOsmol/L)	dUosm^*	158 (79%)
Proteinuria, binary (>3 g/L)	dProt^*	26 (13%)
Creatinine clearance (ml/min)	logCCR^*	91.9 (80.9, 132.86)

PROC PHREG DATA=small_crr;
	/* firth: Firth-penalized estimation; rl=pl: profile likelihood confidence limits */
	MODEL (_start_,_stop_)*_censcrr_(0)=dage60 dUosm dProt logCCR/rl=pl firth;
	ID _id_;
	WEIGHT _weight_;
	TITLE "Firth-corrected PSH model";
RUN;

Firth-corrected PSH model

The PHREG Procedure

Analysis of Maximum Likelihood Estimates

Parameter	DF	Parameter Estimate	Standard Error	Chi-Square	Pr>ChiSq

dage60	1	−2.05496	1.47804	1.9330	0.1644
dUosm	1	0.18740	0.47225	0.1575	0.6915
dProt	1	0.94149	0.49547	3.6108	0.0574
logCCR	1	−2.49539	0.66178	14.2183	0.0002

PROC PHREG DATA=small_crr COVS(AGGREGATE);
	MODEL (_start_,_stop_)*_censcrr_(0)=dage60 dUosm dProt logCCR/rl;
	ID _id_;
	WEIGHT _weight_;
	TITLE "Uncorrected PSH model";
RUN;

Firth-corrected analysis
Variable	Parameter estimate	95% Wald CL, lower	95% Wald CL, upper	95% PPL CL, lower	95% PPL CL, upper
dage60	−2.06	−5.00	0.84	−6.90	−0.03
dUosm	0.19	−0.74	1.11	−0.68	1.17
dProt	0.94	−0.03	1.91	−0.08	1.86
logCCR	−2.50	−3.80	−1.20	−3.90	−1.30
