Rethinking Evidence in Rare Disease Research: A Case Study Using Propensity Score Methods


March 31, 2026

Rare diseases pose unique challenges for researchers and clinicians. Due to small patient populations, conducting randomized controlled trials (RCTs) is often impractical or ethically difficult. As a result, observational data becomes a key source of evidence.

In the landscape of rare disease, data is both our most precious resource and our greatest challenge. For conditions like Infantile-Onset Pompe Disease (IOPD), the journey from the first life-saving Enzyme Replacement Therapy (ERT) to the next generation of optimized treatments is rarely a path free of challenges. It is a path marked by small patient populations, high clinical variability, and the heavy weight of every data point.

The difficulty in rare disease research often lies in the “how”: How do we prove a new therapy is truly superior when baseline functional levels vary so wildly? How do we ensure that a single data entry error doesn’t mask a breakthrough or suggest a false decline?

In this blog, we explore how propensity score methods can be used to estimate treatment effectiveness in a rare disease setting through a real world–inspired case study.

In this case study, we pull back the curtain on the analytical rigor required to compare motor function trajectories in IOPD. From Propensity Score Matching to “red-flag” data auditing, we explore how sophisticated analysis turns fragmented data into a clear roadmap for the future of neuromuscular treatment.

 

Case study: Advancing motor function outcomes in IOPD

The evolution from first-generation drug to next-generation drug

Infantile-Onset Pompe Disease (IOPD) is a rare, progressive neuromuscular disorder. While the first generation of ERT revolutionized survival, the quest for superior motor function remains the “North Star” for researchers. This study compares longitudinal motor outcomes between the First-Generation Drug and Next-Generation Drug cohorts using the Gross Motor Function Measure (GMFM-88).

 

The challenge: Comparing across clinical trials

Comparing results from different studies requires more than just looking at averages; it requires accounting for the inherent variability in how patients present at baseline. To test the hypothesis that the Next-Generation Drug offers a superior motor trajectory, we implemented a rigorous three-tier analytical approach.

 

A three-tier analytical approach

1. The power of precise matching

To ensure an “apples-to-apples” comparison, we restricted the analysis to patient pairs matched by both age and baseline functional level.

  • The criteria: Matches were strictly filtered to those within a +/- 13-point window of the GMFM-88 raw score (rather than a percentage).
  • The goal: By tightening these parameters, we eliminated “baseline noise,” allowing the true pharmacological impact of the treatment to surface in the longitudinal graphs.

 

2. Data integrity: Investigating the “jumps and drops”

In rare disease registries, a single data point can skew an entire trajectory. Our team conducted a “deep dive” into five specific patient profiles that exhibited extreme volatility — marked by sharp drops or vertical jumps in scores.

Expert insight: A drop to zero isn’t always a clinical decline; often, it’s a data entry artifact where a missing value was defaulted to ‘0.’ By identifying and correcting these anomalies, we ensure the motor trajectory reflects biology, not a spreadsheet error.

 

3. Sophisticated balancing: Propensity Score Matching (PSM)

Propensity score methods help simulate a randomized experiment by balancing observed characteristics between treated and untreated groups.

To further validate our findings, we moved beyond simple matching to Propensity Score Matching. This statistical technique allows us to predict a patient’s likelihood of being in a specific treatment group based on their baseline characteristics, effectively “balancing” the two groups.

 

Key covariates included:

  • Baseline status: Age and GMFM-88 total raw score.
  • Clinical history: Age at diagnosis and age at start of ERT.
  • Biological markers: CRIM status (Cross-Reactive Immunologic Material) and LVMI (Left Ventricular Mass Index) z-scores.
  • Treatment variables: Specific enzyme dosage levels.

 

Why this matters for the rare disease community

This case study demonstrates that in the world of rare diseases, how we analyze data is as important as the data itself. By correcting for entry errors and using high-fidelity matching, we can more clearly see if the next-generation drug truly provides the “superior trajectory” hypothesized.

 

Precision analytics as a catalyst for care

By applying high-fidelity matching and propensity score modelling, we move beyond “average” results to understand the true potential of new interventions. Furthermore, our dedication to data integrity — manually investigating anomalies and “red-arrow” outliers — ensures that our conclusions are built on a foundation of clinical reality rather than administrative error.

Ultimately, this study reinforces that in the fight against rare diseases, data is our most powerful ally. When we refine our lens through rigorous matching and clean data, the path toward better motor function and brighter futures for IOPD patients becomes clearer than ever.

Contact Us!
Subscribe to our newsletter

Jenny James

Principal Statistical Programmer

Jenny James is Principal Statistical Programmer at Cytel. Jenny has around 12 years of experience in the industry. Over the last 5 years, she has been extensively involved in rare diseases and rare blood disorders, specifically leading and driving post-hoc analysis requirements to support regulatory submissions and publication strategies for the client. She is currently leading the immunology team and working on the integration and analysis of pooled data.

Read full employee bio

Prajakta Chavan

Manager, FSP Operations

Prajakta Chavan is Manager, FSP Operations at Cytel. Prajakta has 14 years of experience in the industry, with last 8 years focused on medical affairs. She has substantial expertise in statistical analysis, complemented by strong proficiency in SAS programming, enabling effective analysis and reporting of clinical studies in compliance with sponsor and regulatory requirements. Currently, she manages a team and leading ISS and ISE data creation activities. She holds a Master of Science in Statistics from Pune University, India.

Read full employee bio

Claim your free 30-minute strategy session

Book a free, no-obligation strategy session with a Cytel expert to get advice on how to improve your drug’s probability of success and plot a clearer route to market.

glow-ring
glow-ring-second