Skip to main content
Journal of Clinical Laboratory Analysis logoLink to Journal of Clinical Laboratory Analysis
letter
. 2021 May 27;35(7):e23849. doi: 10.1002/jcla.23849

To handle the inflation of odds ratios in a retrospective study with a profile penalized log‐likelihood approach

I‐Shiang Tzeng 1,2,
PMCID: PMC8274998  PMID: 34043251

Dear Editor,

We read with great interest the paper authored by Chen et al. 1 entitled “Integrin α7 is overexpressed and correlates with higher pathological grade increased T stage advanced TNM stage as well as worse survival in clear cell renal cell carcinoma patients: A retrospective study,” published in January 2020 in the Journal of Clinical Laboratory Analysis. The authors aimed to explore the association of integrin α7 with clinicopathological characteristics and overall survival (OS) in patients with clear cell renal cell carcinoma (ccRCC) based on a retrospective study. They found that high tumor integrin α7 expression was an independent predictor of poor OS (< 0.001) in a multivariate Cox proportional hazards regression model. Although the present study makes a valuable contribution to this area of research some methodological points need further clarification.

The authors compared the OS between tumor integrin α7 high expression and low expression groups and found that the crude hazard ratio (HR) of OS increased 4.453 times in patients with high integrin α7 expression levels compared to those with low integrin α7 expression levels (ccRCC patients with high expression [n = 69] vs. ccRCC patients with low integrin α7 expression [n = 110]; HR = 4.453; 95% confidence interval [CI]: 2.633–7.532). Furthermore another significant factor that affects the OS of patients with ccRCC is the node (N) stage. The univariate Cox regression results showed that the crude HR of OS increased 8.197 times in patients with N1 stage compared to those with N0 stage (ccRCC patients with N1 stage [n = 12] vs. ccRCC patients with N0 stage [n = 167]; HR = 8.197; 95% confidence interval (CI): 4.297–15.636). Several authors have argued that large effect estimates may result from special conditions in a data set and this is known as “monotone likelihood”. 2 , 3 Thus researchers cannot be certain about the true value of HR if the CI of the HR is too wide. The CI along with the large effect estimate suggests reconstruction of the interval estimation based on profile penalized log likelihood (PPL). 3

To demonstrate this an analysis of a breast cancer data 4 set with the coxphf package 5 provided an implementation of the Firth's logistic regression. Of the 100 patients 74 were censored with a median follow‐up time of 78 months (range 11–172 months). The survival time of 100 patients and four potential risk factors were investigated. The four potential risk factors included tumor stage (T) nodal status (N) histological grading (G) and cathepsin D immunoreactivity (CD). We summarized the analysis results in Table 1 which shows that the HR and 95% CI of histological grading were shrinking after modification based on PPL. Figure 1 also shows the histological grading 95% CI which was estimated based on a PPL likelihood that was narrower than the 95% CI without modification.

TABLE 1.

Summary of analysis of breast cancer data under Cox's proportional hazards regression model before and after modifying based on profile penalized log likelihood

Characteristic HR 95% CI p‐Value Modified HR 95% CI p‐Value
Tumor stage 3.593 1.343–9.616 0.0109* 3.402 1.363–9.472 0.0082*
Nodal status 2.576 1.120–5.926 0.0260* 2.507 1.120–5.833 0.0253*
Historical grading 100100000 0.000–Inf 0.9966 11.296 1.466–1451.946 0.0136*
Cathepsin D immunoreactivity 1.492 0.626–3.558 0.3670 1.488 0.627–3.512 0.3645

Abbreviations: CI, confidence interval; HR, hazard ratio; Inf, infinite.

*

<0.05.

FIGURE 1.

FIGURE 1

Plot of the profile penalized log‐likelihood function for variable historical grading

As shown in Table 1 in the study conducted by Chen et al. only 12 cases of patients with ccRCC were of N1 stage which means that their data were subject to sparse events and may increase the probability of the occurrence of monotone likelihood. We estimated that there will be fewer cases of patients with ccRCC with N1 stage due to patient survival (Chen et al. do not provide details). Different methods have been introduced to address the occurrence of monotone likelihood,but penalization through data augmentation is an efficient method that was published in 2001. 2 After a multivariate analysis we found that the integrin α7 high expression factor (HR = 3.353; 95% CI: 1.900–5.918) preserved and the N1 stage factor did not in Cox's proportional hazards regression model (with forward stepwise method). We can conclude that the problem is not limited to univariate models if the N1 stage factor is preserved in the final multivariate model. 2 The sample size number of events and magnitudes of association with the outcome degree of balance and number of binary covariates affect the prevalence of monotone likelihood when considering multivariate regression. The monotone likelihood phenomenon was observed in Cox's proportional hazards regression model. Hence we suggest that Chen et al. add to their text that a monotone likelihood limitation exists in their clinical data.

DECLARATION OF FINANCIAL/OTHER RELATIONSHIPS

The authors of this letter have no relevant financial or other relationships to disclose.

FUNDING INFORMATION

This research was funded by Taipei Tzu Chi Hospital; grant number TCRD‐TPE‐109‐39 (1/2) and TCRD‐TPE‐109‐39 (2/2).

DATA AVAILABILITY STATEMENT

The data used to support the findings of this study are included within the article.

REFERENCES

  • 1. Chen Y, Wang Y, Cai Z, Fan X, Zhang Y. Integrin α7 is overexpressed and correlates with higher pathological grade, increased T stage, advanced TNM stage as well as worse survival in clear cell renal cell carcinoma patients: a retrospective study. J Clin Lab Anal. 2020;34(1):e23034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Heinze G, Schemper M. A solution to the problem of monotone likelihood in Cox regression. Biometrics. 2001;57(1):114‐119. [DOI] [PubMed] [Google Scholar]
  • 3. Heinze G, Ploner M. SAS and SPLUS programs to perform Cox regression without convergence problems. Comput Methods Programs Biomed. 2002;67(3):217‐223. [DOI] [PubMed] [Google Scholar]
  • 4. Lösch A, Tempfer C, Kohlberger P, et al. Prognostic value of cathepsin D expression and association with histomorphological subtypes in breast cancer. Br J Cancer. 1998;78(2):205‐209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Ploner M, Heinze G. Coxphf: cox regression with Firth’s penalized likelihood. R package version. 2015;1:11. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data used to support the findings of this study are included within the article.


Articles from Journal of Clinical Laboratory Analysis are provided here courtesy of Wiley

RESOURCES