Main Content

Cox proportional hazards regression is a semiparametric method
for adjusting survival rate estimates to quantify the effect of predictor
variables. The method represents the effects of explanatory variables
as a multiplier of a common baseline hazard function, *h*_{0}(*t*).
The hazard function is the nonparametric part of the Cox proportional
hazards regression function, whereas the impact of the predictor variables
is a loglinear regression. For a baseline relative to 0, this model
corresponds to

$$h\left({X}_{i},t\right)={h}_{0}(t)\mathrm{exp}\left[{\displaystyle \sum _{j=1}^{p}{x}_{ij}{b}_{j}}\right],$$

where $${X}_{i}=({x}_{i1},{x}_{i2},\cdots ,{x}_{ip})$$ is
the predictor variable for the *i*th subject, *h*(*X*_{i},*t*)
is the hazard rate at time *t* for *X*_{i},
and *h*_{0}(*t*)
is the baseline hazard rate function.

The Cox proportional hazards model relates the hazard rate for
individuals or items at the value *X*_{i},
to the hazard rate for individuals or items at the baseline value.
It produces an estimate for the hazard ratio:

$$HR({X}_{i})=\frac{h\left({X}_{i},t\right)}{{h}_{0}\left(t\right)}=\mathrm{exp}\left[{\displaystyle \sum _{j=1}^{p}{x}_{ij}{b}_{j}}\right].$$

The model is based
on the assumption that the baseline hazard function depends on time, *t*,
but the predictor variables do not. This assumption is also called
the proportional hazards assumption, which states that the hazard
ratio does not change over time for any individual.

The hazard ratio represents the relative risk of instant failure
for individuals or items having the predictive variable value *X*_{i} compared
to the ones having the baseline values. For example, if the predictive
variable is smoking status, where nonsmoking is the baseline category,
the hazard ratio shows the relative instant failure rate of smokers
compared to the baseline category, that is, nonsmokers. For a baseline
relative to *X*^{*} and the
predictor variable value *X*_{i},
the hazard ratio is

$$HR({X}_{i})=\frac{h\left({X}_{i},t\right)}{h\left({X}^{*},t\right)}=\mathrm{exp}\left[{\displaystyle \sum _{j=1}^{p}\left({x}_{ij}-{x}_{j}{}^{*}\right){b}_{j}}\right].$$

For example, if the baseline is
the mean values of the predictor variables (`mean(X)`

),
then the hazard ratio becomes

$$HR({X}_{i})=\frac{h\left({X}_{i},t\right)}{h\left(\overline{X},t\right)}=\mathrm{exp}\left[{\displaystyle \sum _{j=1}^{p}\left({x}_{ij}-{\overline{x}}_{j}\right){b}_{j}}\right].$$

Hazard rates are related to survival rates, such that the survival
rate at time *t* for an individual with the explanatory
variable value *X*_{i} is

$${S}_{{X}_{i}}\left(t\right)={S}_{0}{\left(t\right)}^{HR({X}_{i})},$$

where *S*_{0}(*t*)
is the survivor function with the baseline hazard rate function *h*_{0}(*t*),
and *HR*(*X*_{i})
is the hazard ratio of the predictor variable value *X*_{i} relative
to the baseline value.

When you have variables that do not satisfy the proportional hazards (PH) assumption, you can consider using two extensions of Cox proportional hazards model: the stratified Cox model and the Cox model with time-dependent variables.

If the variables that do not satisfy the PH assumption are categorizable, use the stratified Cox model:

$${h}_{s}\left({X}_{i},t\right)={h}_{0s}(t)\mathrm{exp}\left[{\displaystyle \sum _{j=1}^{p}{x}_{ij}{b}_{j}}\right],$$

where the subscript *s* indicates
the *s*th stratum. The stratified Cox model has a
different baseline hazard rate function for each stratum but shares
coefficients. Therefore, it has the same hazard ratio across all strata
if the predictor variable values are the same. You can include stratification
variables in `coxphfit`

by using the name-value
pair `'Strata'`

.

If the variables that do not satisfy the PH assumption are time-dependent variables, use the Cox model with time-dependent variables:

$$h\left({X}_{i},t\right)={h}_{0}(t)\mathrm{exp}\left[{\displaystyle \sum _{j=1}^{{p}_{1}}{x}_{ij}{b}_{j}}+{\displaystyle \sum _{k=1}^{{p}_{2}}{x}_{ik}(t){c}_{k}}\right],$$

where *x*_{ij} is
an element of a time-independent predictor and *x*_{ik}(*t*)
is an element of a time-dependent predictor. For an example of how
to include time-dependent variables in `coxphfit`

,
see Cox Proportional Hazards Model with Time-Dependent Covariates.

A point estimate of the effect of each explanatory variable,
that is, the estimated hazard ratio for the effect of each explanatory
variable is exp(*b*), given all other variables are
held constant, where *b* is the coefficient estimate
for that variable. The coefficient estimates are found by maximizing
the partial likelihood function of the model. The partial likelihood
function for the proportional hazards regression model is based on
the observed order of events. It is the product of partial likelihoods
of failures estimated for each failure time. If there are *n* failures
at *n* distinct failure times, $${t}_{1}<{t}_{2}<\cdots <{t}_{n}$$,
then the partial likelihood is

$$L=\frac{HR\left({X}_{1}\right)}{{\displaystyle {\sum}_{j=1}^{n}HR\left({X}_{j}\right)}}\times \frac{HR\left({X}_{2}\right)}{{\displaystyle {\sum}_{j=2}^{n}HR\left({X}_{j}\right)}}\times \cdot \cdot \cdot \times \frac{HR\left({X}_{n}\right)}{HR\left({X}_{n}\right)}={\displaystyle \prod _{i=1}^{n}\frac{HR\left({X}_{i}\right)}{{\displaystyle {\sum}_{j=i}^{n}HR\left({X}_{j}\right)}}}.$$

You can rewrite the
partial likelihood by using a risk set *R*_{i}:

$$L={\displaystyle \prod _{i=1}^{n}\frac{HR({X}_{i})}{{\displaystyle \sum _{j\in {R}_{i}}HR({X}_{j})}}},$$

where *R*_{i} represents
the index set of subjects who are under study but do not experience
the event until the *i*th failure time.

You can use a likelihood ratio test to assess the significance
of adding a term or terms in a model. Consider the two models where
the first model has *p* predictive variables and
the second model has *p* + *r* predictive
variables. Then, comparing the two models, –2*(*L*_{1}/*L*_{2})
has a chi-square distribution with *r* degrees of
freedom (the number of terms being tested).

When you have tied events, `coxphfit`

approximates
the partial likelihood of the model by either Breslow’s method
(default) or Efron’s method, instead of computing the exact
partial likelihood. Computing the exact partial likelihood requires
a large amount of computation, which involves an entire permutation
of the risk sets for the tied event times.

The simplest approximation method is Breslow’s method. This method uses the same denominator for each tied set.

$$L={\displaystyle \prod _{i=1}^{d}{\displaystyle \prod _{j\in {D}_{i}}\frac{HR({X}_{j})}{{\displaystyle \sum _{k\in {R}_{i}}HR({X}_{k})}}}},$$

where *d* is
the number of distinct event times, and *D*_{i} is
the index set of all subjects whose event time is equal to the *i*th
event time.

Efron’s method is more accurate than Breslow’s method, yet simple. This method adjusts the denominator of the tied events as follows:

$$L={\displaystyle \prod _{i=1}^{d}{\displaystyle \prod _{j\in {D}_{i}}\frac{HR({X}_{j})}{{\displaystyle \sum _{k\in {R}_{i}}HR({X}_{k})-\frac{j-1}{{d}_{i}}{\displaystyle \sum _{k\in {D}_{i}}HR({X}_{k})}}}}},$$

where *d*_{i} is
the number of indexes in *D*_{i}.

For an example, assume that the first two events are tied, that
is, *t*_{1} = *t*_{2} and $${t}_{2}<{t}_{3}<\cdots <{t}_{n}$$.
In Breslow’s method, the denominators of the first two terms
are the same:

$$L=\frac{HR\left({X}_{1}\right)}{{\displaystyle {\sum}_{j=1}^{n}HR\left({X}_{j}\right)}}\times \frac{HR\left({X}_{2}\right)}{{\displaystyle {\sum}_{j=1}^{n}HR\left({X}_{j}\right)}}\times \frac{HR\left({X}_{3}\right)}{{\displaystyle {\sum}_{j=3}^{n}HR\left({X}_{j}\right)}}\times \frac{HR\left({X}_{4}\right)}{{\displaystyle {\sum}_{j=4}^{n}HR\left({X}_{j}\right)}}\times \cdot \cdot \cdot \times \frac{HR\left({X}_{n}\right)}{HR\left({X}_{n}\right)}.$$

Efron’s method adjusts the denominator of the second term:

$$L=\frac{HR\left({X}_{1}\right)}{{\displaystyle {\sum}_{j=1}^{n}HR\left({X}_{j}\right)}}\times \frac{HR\left({X}_{2}\right)}{0.5HR\left({X}_{1}\right)+0.5HR\left({X}_{2}\right)+{\displaystyle {\sum}_{j=3}^{n}HR\left({X}_{j}\right)}}\times \frac{HR\left({X}_{3}\right)}{{\displaystyle {\sum}_{j=3}^{n}HR\left({X}_{j}\right)}}\times \frac{HR\left({X}_{4}\right)}{{\displaystyle {\sum}_{j=4}^{n}HR\left({X}_{j}\right)}}\times \cdot \cdot \cdot \times \frac{HR\left({X}_{n},{t}_{n}\right)}{HR\left({X}_{n},{t}_{n}\right)}.$$

You can specify an approximation method by using the name-value
pair `'Ties'`

in `coxphfit`

.

The Cox proportional hazards model can incorporate with the
frequency or weights of observations. Let *w*_{i} be
the weight of the *i*th observation. Then, the partial
likelihoods of the Cox model with weights become as follows:

Partial likelihood with weights

$$L={\displaystyle \prod _{i=1}^{n}\frac{H{R}_{w}({X}_{i})}{{\displaystyle \sum _{j\in {R}_{i}}{w}_{j}HR({X}_{j})}}},$$

where

$$H{R}_{w}\left({X}_{i}\right)=\mathrm{exp}\left[{\displaystyle \sum _{j=1}^{p}{w}_{j}{x}_{ij}{b}_{j}}\right].$$

Partial likelihood with weights and Breslow’s method

$$L={\displaystyle \prod _{i=1}^{d}{\displaystyle \prod _{j\in {D}_{i}}\frac{H{R}_{w}({X}_{j})}{{\left[{\displaystyle \sum _{k\in {R}_{i}}{w}_{k}HR({X}_{k})}\right]}^{\frac{1}{{d}_{i}}{\displaystyle \sum _{j\in {D}_{i}}{w}_{j}}}}}}$$

Partial likelihood with weights and Efron’s method

$$L={\displaystyle \prod _{i=1}^{d}{\displaystyle \prod _{j\in {D}_{i}}\frac{H{R}_{w}({X}_{j})}{{\left[{\displaystyle \sum _{k\in {R}_{i}}{w}_{k}HR({X}_{k})-\frac{j-1}{{d}_{i}}{\displaystyle \sum _{k\in {D}_{i}}{w}_{k}HR({X}_{k})}}\right]}^{\frac{1}{{d}_{i}}{\displaystyle \sum _{j\in {D}_{i}}{w}_{j}}}}}}$$

You can specify the frequency or weights of observations
by using the name-value pair `'Frequency'`

in `coxphfit`

.

[1] Cox, D. R., and D. Oakes. *Analysis
of Survival Data*. London: Chapman & Hall, 1984.

[2] Lawless, J. F. *Statistical
Models and Methods for Lifetime Data*. Hoboken, NJ: Wiley-Interscience,
2002.

[3] Kleinbaum, D. G., and M. Klein. *Survival Analysis*.
Statistics for Biology and Health. 2nd edition. Springer, 2005.

[4] Klein, J. P., and M. L. Moeschberger. *Survival
Analysis*. Statistics for Biology and Health. 2nd edition.
Springer, 2003.

- Hazard and Survivor Functions for Different Groups
- Survivor Functions for Two Groups
- Cox Proportional Hazards Model for Censored Data
- Cox Proportional Hazards Model with Time-Dependent Covariates