Wei.series.md.c1.c2.c3

Below you will find pages that utilize the taxonomy term “Wei.series.md.c1.c2.c3”

mdrelax: When Masking Conditions Don't Hold

December 3, 2025

mdrelax extends my work on series system reliability by handling cases where the standard masking assumptions break down.

Background: The C1-C2-C3 Framework

My master’s thesis developed maximum likelihood techniques for series systems with masked failure data. The standard framework assumes three conditions:

C1: The failed component is always in the candidate set
C2: Non-informative masking (uniform probability within candidate set)
C3: Masking mechanism is independent of system parameters

When these hold, the masking probabilities factor out and you can ignore them for parameter estimation. The expo-masked-fim paper derives closed-form Fisher Information for the exponential case, and maskedcauses implements the general framework.

The Problem

In practice, C2 and C3 are often violated.

Informative masking (C2 violation): Diagnostic tests may be better at identifying certain failure modes than others. A component that fails catastrophically is easier to identify than one that degrades subtly.

Parameter-dependent masking (C3 violation): The masking mechanism might depend on component reliabilities. Components with shorter lifetimes fail more often, so technicians get more practice diagnosing them.

If you pretend C2 and C3 hold when they don’t, your parameter estimates are biased. Sometimes badly.

What mdrelax Does

The package implements likelihood-based inference with relaxed conditions:

library(mdrelax)

# Generate masked data with Bernoulli candidate sets
md <- md_bernoulli_cand_C1_C2_C3(data, p = 0.3)

# Sample candidate sets
md <- md_cand_sampler(md)

# MLE for exponential series system
fit <- md_mle_exp_series_C1_C2_C3(md)

# Fisher information matrix
fim <- md_fim_exp_series_C1_C2_C3(md, params(fit))

Key Features

Flexible masking models: Bernoulli, rank-based, KL-divergence constrained
Identifiability analysis: Tools to check when parameters can actually be estimated
Fisher information: Efficiency analysis under relaxed conditions
Simulation utilities: Monte Carlo studies for method validation

Relationship to Other Work

This package sits at the end of a progression toward generality:

Project	Focus
expo-masked-fim	Closed-form FIM for exponential case
maskedcauses	General R framework for masked data likelihood
reliability-estimation-in-series-systems	Master’s thesis implementation
wei.series.md.c1.c2.c3	Weibull series systems under C1-C2-C3
mdrelax	Relaxed conditions (C2, C3 violations)

The progression:

Exponential + C1-C2-C3: Closed-form solutions
Weibull + C1-C2-C3: Numerical MLE
Weibull + relaxed conditions: mdrelax

Each step trades analytical tractability for realism.

When to Use It

Use mdrelax when you suspect:

Diagnostic accuracy varies by component type
Masking patterns correlate with component reliabilities
Standard C1-C2-C3 assumptions are too restrictive for your data

The trade-off is real: relaxed models have more parameters and may need larger samples for reliable estimation. But biased estimates from wrong assumptions aren’t free either.

Model Selection for Weibull Series Systems: When Simpler Models Suffice

December 3, 2025

When can you safely use a simpler model for a series system? I ran extensive simulation studies with likelihood ratio tests to get a quantitative answer.

The Problem

In series system reliability, you estimate component parameters from masked failure data. For Weibull components, that means estimating $2m$ parameters: shape $k_j$ and scale $\lambda_j$ for each of $m$ components.

But what if the components have similar failure characteristics? A reduced model with homogeneous shape parameters uses only $m+1$ parameters (one common $k$ plus $m$ scales). This roughly halves the parameter count and has a nice property: the system itself becomes Weibull-distributed.

The question is when this simplification is justified.

Key Findings

Robustness of the Reduced Model

For well-designed series systems (components with similar failure characteristics), the result is striking:

The reduced homogeneous-shape model cannot be rejected even with sample sizes approaching 30,000, far larger than anything typically available in practice.

With realistic sample sizes (50 to 500), the likelihood ratio test shows no evidence against the reduced model when components truly have similar shapes. This is strong justification for using the simpler model.

Sharp Boundaries

The paper pins down exactly how much heterogeneity it takes to trigger rejection:

Shape Deviation	Sample Size	LRT Decision
0.25	30,000	Fail to reject
0.50	1,000+	Reject
1.0	100+	Strong reject
3.0	50+	Very strong reject

Even modest deviations in a single component’s shape parameter provide evidence against the reduced model. The boundaries are clean.

Practical Guidance

Use the reduced model when:

Components come from similar manufacturing processes
Historical data suggests similar wear-out patterns
Sample sizes are moderate ($n < 500$)
You need a quick reliability assessment

Use the full model when:

Components have fundamentally different failure modes (infant mortality vs wear-out)
Large samples are available ($n > 1000$)
Precise component-level inference is critical
Preliminary studies suggest model inadequacy

This paper fits into a broader program on masked failure data:

Paper/Package	Focus
Master’s Thesis	Weibull MLE with masked data
expo-masked-fim	Closed-form FIM for exponential case
maskedcauses	R framework for masked data likelihood
mdrelax	Relaxed masking conditions
This paper	Model selection via LRT

The pieces address different aspects of the same problem:

Reliability Estimation in Series Systems: Maximum Likelihood Techniques for Right-Censored and Masked Failure Data

June 15, 2024

This is my master’s thesis in mathematics. The problem: you have a series system (fails when any component fails), you can observe system-level failure times, but you often can’t tell which component actually caused the failure. The failure cause is “masked.” On top of that, some systems are still running at the end of the study, so their lifetimes are right-censored. You want to estimate the reliability of individual components from this incomplete data.

The challenge

Estimating component reliability is hard when:

You only observe system-level failure data
The exact component cause of failure is ambiguous (masked)
System lifetimes are right-censored
Sample sizes are small

A series system fails when any component fails, so disentangling which components are weakest from system-level observations is a non-trivial inference problem.

Likelihood model for masked data

I developed a likelihood model that handles two types of incompleteness.

Right-censoring: the system is observed until time $\tau$, but may not have failed yet:

\[ S_i = \min\lbrace \tau_i, T_i\rbrace \]

\[ \delta_i = \mathbb{1}_{T_i < \tau_i} \]

Component cause masking: when the system fails, you observe a candidate set $\mathcal{C}_i$ containing the failed component, but can’t pinpoint the exact cause.

Under three conditions (which hold in many industrial settings), the likelihood contribution simplifies to:

\[ L_i(\theta) \propto \left[\prod_{j=1}^m R_j(s_i; \theta_j)\right] \times \left[\sum_{j \in \mathcal{C}_i} h_j(s_i; \theta_j)\right]^{\delta_i} \]

where $R_j$ is the reliability function and $h_j$ is the hazard function of component $j$. The three conditions are: the candidate set always contains the true failed component, masking probability is uniform across components in the candidate set, and masking probabilities don’t depend on the system parameters $\theta$.

Weibull series systems

I focused on components with Weibull lifetimes: $T_{ij} \sim \text{Weibull}(k_j, \lambda_j)$. The shape parameter $k_j$ tells you the failure behavior: $k < 1$ is infant mortality, $k = 1$ is random failures (exponential), $k > 1$ is wear-out.

System reliability when all components are Weibull:

\[ R_{T_i}(t; \theta) = \exp\left\lbrace -\sum_{j=1}^m \left(\frac{t}{\lambda_j}\right)^{k_j}\right\rbrace \]

The hazard function is additive:

\[ h_{T_i}(t; \theta) = \sum_{j=1}^m \frac{k_j}{\lambda_j}\left(\frac{t}{\lambda_j}\right)^{k_j-1} \]

Simulation studies

I ran extensive simulations varying three factors:

Right-censoring impact (q = 60% to 100%): Scale parameters showed positive bias with censoring. Shape parameters were more sensitive than scale parameters. The most reliable component was most affected by censoring. Convergence rate exceeded 95% for q >= 0.7.

Reliability Analysis and the Problem of Censored Data

August 14, 2019

One of the most interesting statistical problems I have encountered is reliability analysis with censored data: situations where you know something didn’t fail, but not when it will fail.

The Censoring Problem

Imagine testing light bulbs. You run them for 1000 hours. Some fail during the test. Others are still working when you stop.

For the survivors, you know:

They lasted at least 1000 hours
You do not know their actual lifetime

This is right censoring. The true value lies somewhere to the right of your observation. You have a lower bound, not a measurement.

Why This Matters

Censored data is everywhere:

Medical studies (patients still alive at study end)
Engineering tests (components that have not failed)
Customer retention (users still active)

The naive responses are both wrong. Ignoring censored observations wastes information. Treating them as failures introduces bias. You need a framework that uses the partial information you actually have.

Maximum Likelihood to the Rescue

The solution is maximum likelihood estimation with likelihood contributions that account for censoring:

Failure observations contribute the probability density $f(t)$. You observed the exact failure time, so you know the probability of failing at that time.
Censored observations contribute the survival probability $S(t)$. You know the unit survived to time $t$, so its contribution is the probability of surviving at least that long.

The likelihood for the whole sample is:

$$L = \prod_{i: \text{failed}} f(t_i) \prod_{j: \text{censored}} S(t_j)$$

This lets you extract information from both failed and surviving units. The censored observations pull the estimated reliability upward; the failures pull it downward. Maximum likelihood balances them.

Series Systems Complexity

It gets more interesting with series systems, systems that fail when any component fails. If you observe system failure but do not know which component caused it, you have masked failure data.

This is the problem I am most interested in: extracting component-level reliability from system-level failures when the cause is ambiguous. The masking adds a latent variable, and the likelihood becomes a mixture. You can handle it with EM algorithms or direct optimization, but the combinatorics grow quickly with system size.

This work is laying groundwork for what will become a major focus of my mathematical statistics degree.

mdrelax: When Masking Conditions Don't Hold

Background: The C1-C2-C3 Framework

The Problem

What mdrelax Does

Key Features

Relationship to Other Work

When to Use It

Model Selection for Weibull Series Systems: When Simpler Models Suffice

The Problem

Key Findings

Robustness of the Reduced Model

Sharp Boundaries

Practical Guidance

Connection to Related Work

Reliability Estimation in Series Systems: Maximum Likelihood Techniques for Right-Censored and Masked Failure Data

The challenge

Likelihood model for masked data

Weibull series systems

Simulation studies

Reliability Analysis and the Problem of Censored Data

The Censoring Problem

Why This Matters

Maximum Likelihood to the Rescue

Series Systems Complexity