active library

dfr.dist

Dynamic failure rate distributions for survival analysis and reliability engineering in R

Started 2023 HTML

Resources & Distribution

Package Registries

dfr.dist

R-CMD-check

Dynamic Failure Rate Distributions for Survival Analysis

The dfr.dist package provides a flexible framework for specifying survival distributions through their hazard (failure rate) functions. Instead of choosing from a fixed catalog of distributions, you directly specify the hazard function, giving complete control over time-varying failure patterns.

Why dfr.dist?

Featuredfr.distsurvivalflexsurv
Custom hazard functionsYesNoLimited
Built-in distributionsExp, Weibull, Gompertz, Log-logisticWeibull, ExpMany
Automatic differentiationYes (via femtograd)NoNo
Exact Hessian computationYesNoNo
Likelihood model interfaceFullBasicPartial
Right-censoring supportYesYesYes

Features

  • Flexible hazard specification: Define any hazard function h(t, par, …)
  • Built-in distributions: Exponential, Weibull, Gompertz, Log-logistic with optimized implementations
  • Complete distribution interface: hazard, survival, CDF, PDF, quantiles, sampling
  • Likelihood model support: Log-likelihood, score, Hessian for MLE
  • Automatic differentiation: Exact gradients and Hessians via femtograd integration
  • Model diagnostics: Residuals (Cox-Snell, Martingale) and Q-Q plots
  • Censoring support: Handle exact and right-censored survival data
  • Ecosystem integration: Works with algebraic.dist, likelihood.model, algebraic.mle

Installation

Install from GitHub:

# install.packages("devtools")
devtools::install_github("queelius/dfr_dist")

Quick Start

library(dfr.dist)

Built-in Distributions

Use the convenient constructors for classic survival distributions:

# Exponential: constant hazard (memoryless)
exp_dist <- dfr_exponential(lambda = 0.5)

# Weibull: power-law hazard (wear-out or infant mortality)
weib_dist <- dfr_weibull(shape = 2, scale = 3)

# Gompertz: exponentially increasing hazard (aging)
gomp_dist <- dfr_gompertz(a = 0.01, b = 0.1)

# Log-logistic: non-monotonic hazard (increases then decreases)
ll_dist <- dfr_loglogistic(alpha = 10, beta = 2)

All distribution functions are automatically available:

S <- surv(exp_dist)
S(2)  # Survival probability at t=2
#> [1] 0.3678794

h <- hazard(weib_dist)
h(1)  # Hazard at t=1
#> [1] 0.2222222

Maximum Likelihood Estimation

# Simulate failure times
set.seed(42)
times <- rexp(50, rate = 1)
df <- data.frame(t = times, delta = 1)

# Fit via MLE
solver <- fit(dfr_exponential())
result <- solver(df, par = c(0.5), method = "BFGS")
coef(result)  # Estimated rate
#> [1] 0.8808457

Custom Hazard Functions

Model complex failure patterns like bathtub curves:

# h(t) = a*exp(-b*t) + c + d*t^k
# Infant mortality + useful life + wear-out
bathtub <- dfr_dist(
  rate = function(t, par, ...) {
    par[1] * exp(-par[2] * t) + par[3] + par[4] * t^par[5]
  },
  par = c(a = 1, b = 2, c = 0.02, d = 0.001, k = 2)
)

h <- hazard(bathtub)
curve(sapply(x, h), 0, 15, xlab = "Time", ylab = "Hazard rate",
      main = "Bathtub hazard curve")

Model Diagnostics

Check model fit with residual analysis:

# Fit exponential to data
fitted_exp <- dfr_exponential(lambda = coef(result))

# Cox-Snell residuals Q-Q plot
qqplot_residuals(fitted_exp, df)

Mathematical Background

For a lifetime TT, the hazard function is:

h(t)=f(t)S(t)h(t) = \frac{f(t)}{S(t)}

From the hazard, all other quantities follow:

FunctionFormulaMethod
Cumulative hazardH(t)=0th(u)duH(t) = \int_0^t h(u) ducum_haz()
SurvivalS(t)=eH(t)S(t) = e^{-H(t)}surv()
CDFF(t)=1S(t)F(t) = 1 - S(t)cdf()
PDFf(t)=h(t)S(t)f(t) = h(t) \cdot S(t)density()

Likelihood for Survival Data

For exact observations: logL=logh(t)H(t)\log L = \log h(t) - H(t)

For right-censored: logL=H(t)\log L = -H(t)

# Mixed data with censoring
df <- data.frame(
  t = c(1, 2, 3, 4, 5),
  delta = c(1, 1, 0, 1, 0)  # 1 = exact, 0 = censored
)

ll <- loglik(dfr_exponential())
ll(df, par = c(0.5))
#> [1] -9.579442

Documentation

Getting Started:

Creating Custom Distributions:

Applications:

Discussion