Solutions
About Us
Insights
Careers
Contact us
Contact Us
Customer Support
Customer Support

Real-world persistence with sacubitril/valsartan in patients with heart failure in France: A claims database study. Hanon, O., Cohen, A., Jourdain, P., Leray, E., de Pouvourville, G., Goguillot, M., Hugon, G., Al-Mawla, R., Duret, S., Logeart, D., & Sacubitril/Valsartan Real-World Study Group (2025). Archives of cardiovascular diseases, S1875-2136(25)00844-7.

Clinical burden of pneumococcal disease among adults in France: A retrospective cohort study. Bailey, M. D., Farge, G., Mohanty, S., Breau-Brunel, M., Roy, G., de Pouvourville, G., de Wazieres, B., Janssen, C., Tauty, S., Bugnard, F., Goguillot, M., Bénard, S., & Johnson, K. D. (2025). Human vaccines & immunotherapeutics, 21(1), 2515760.

Assessing the economic impact and healthcare resource utilization of inpatient pneumococcal disease among adults: a French national claims database study. Bailey, M. D., Farge, G., Breau Brunel, M., Mohanty, S., Roy, G., de Pouvourville, G., de Wazieres, B., Janssen, C., Tauty, S., Bugnard, F., Goguillot, M., Bénard, S., & Johnson, K. D. (2025). Journal of medical economics, 28(1), 251–259.

It Is Time to Re-Think the Limit of Quantitation. Fossler, M. J., & Smith, B. (2025). Clinical pharmacology in drug development, 14(12), 900–902

Graph Based, Adaptive, Multiarm, Multiple Endpoint, Two-Stage Designs. Mehta, C., Mukhopadhyay, A., & Posch, M. (2025). Statistics in medicine, 44(28-30), e70237

Intersectoral action to transform health equity for women and girls globally. Sarri, G., Soriano Gabarró, M. S., Cheng, R. F., & Jhutti-Johal, J. (2025). Communications medicine, 6(1), 40.

2025 in Perspective: Reflections From Our Newest Colleagues

Every year brings new faces, fresh ideas, and inspiring stories to Cytel. In 2025, these colleagues joined us from across the globe, each bringing unique experiences and ambitions. As the year closes, we asked them to share what stood out, what they’ve learned, and how they see their work shaping something bigger. Their reflections tell a story of connection, growth, and purpose.

 

Joining Cytel: Memorable moments and settling in

For Kasum de Souza Mateus (Senior Biostatistician, FSP) the most memorable part of joining Cytel was simple yet meaningful: “Being able to meet colleagues and mentors in person.” That feeling of connection resonated with many new Cytelians, from Adish Jindal (Senior Recruiter), who described the joy of reconnecting with familiar faces, to Luke Hilliard (Event Manager), who fondly recalls a team meeting: “I really enjoyed the trip to Bruges. It was such a pleasure meeting everyone in person. We came away with some fantastic ideas that we’ve since put into action for our events.”

Others found their defining moments and success in challenges that brought people together. Kanchan Kulkarni (Manager, Accounting) stepped into her role during a major system transition: “One of my most memorable experiences has been leading the Global GL Accounting function across EMEA, APAC, and NA regions during our Oracle ERP transition. It wasn’t just about systems and numbers — it was about connecting people, aligning processes, and building something stronger together.” And for Scott Rogers (CFO), the most powerful moment came during a Town Hall: “I was very moved by the presentation where we heard directly from a patient and understood how our work helped him realize the benefits he was seeing.”

For Macarena Pazos Maidana (Senior Market & Business Development Manager) success came early: “During my third week, I successfully secured a key renewal with a major pharmaceutical client for the East Horizon™ platform. This achievement not only boosted my confidence but also reinforced my belief in the value our solutions bring to the industry.” And Hannes Engberg Raeder (Principal Biostatistician, FSP) found pride in collaboration: “I’m proud of having been able to support one of our partnerships through process improvements that helped strengthen collaboration and overall efficiency.”

 

Leaning on advice

Of course, starting something new means leaning on advice from colleagues or mentors, and some words of wisdom stuck. Nicole Sheridan (Manager, Talent Management) shared the famous mantra that shaped her approach: “’Do or do not, there is no try.’ It’s simple, but it completely changed how I think about my work and even life outside of work. I realized it’s not about being perfect but it’s about showing up, committing, and seeing things through. That mindset has really helped me take initiative, stay resilient, and turn ideas into results.

Damian Kowalski (Principal Statistical Programmer, FSP) emphasized collaboration: “Don’t be afraid to ask questions. Collaboration is our strength.” And Sydney Jenkins (Senior Employee Relations & Engagement Partner) shared a perspective that guides her work: “Trust your logic. That perspective reminds me to approach challenges with a clear, rational mindset, even under pressure!”

 

Growth and ambition

This year was not only about settling into their role for our new Cytelians, however. It also marked a year of growth and achievements. Adish honed his global recruitment expertise: “One skill I’m particularly proud of developing in 2025 is my ability to manage global recruitment processes more effectively.” Monica Chaudhari (Associate Director, Biostatistics, FSP) shared a technical milestone: “My first study that I got assigned to was already closed. To help myself support the team through database lock, review of final outputs and drafting of the CSR, I created a swimmers plot summarizing all important endpoints on each subject’s trajectory that helped identify major deviations.”

Valeria Duque Mora (Project Coordinator, Resource Management) reflected on teamwork: “My current team has made a real difference in my daily work. They are the foundation of our success, always supporting each other and sharing new information with kindness and collaboration throughout every process.” For Dominika Wisniewska (Senior Statistical Programmer, FSP), the impact was deeply personal: “I am grateful that Cytel gave me the opportunity to work directly for our client where I work on research within rare diseases and neurology diseases. I am particularly interested in neuro because of personal reasons, and I am happy to participate in maybe discovering new treatments.” And Sankhyajit Sengupta (Senior Statistical Programmer, FSP) embraced learning: “In this very short period of time (three months), I’ve had the opportunity to gain exposure to R programming in live studies and also completed required trainings on R, an important step as the industry is moving in this direction.”

Looking ahead, our new colleagues are already thinking about how to make an even bigger impact in 2026. Kanchan hopes to drive automation and efficiency, Luke dreams of organizing a standalone event, and Ye Miao (Associate Director, Biostatistics, FSP) plans to deepen expertise in R programming to contribute more effectively to data analysis and reporting tasks in his FSP role. Sydney aims to strengthen policy awareness and consistency across the organization, while Macarena is focused on enhancing client retention and satisfaction. Each goal reflects a commitment to making an even bigger impact in year two.

 

Connecting to the bigger picture

Every role at Cytel connects to our mission of improving patient lives. Adish summed it up well: “As a Global Senior Recruiter, I help bring in the talent that powers our mission. Every great hire strengthens our culture, drives innovation, and helps the company achieve its goals globally.” Wyatt Gotbetter (Senior Vice President, Global Head Evidence, Value and Access) described the EVA team’s role: “I like to describe the work of EVA as the essential ‘last mile’ in our client’s drug development journey — after decades of scientific discovery, animal and human trials, and regulatory approvals, we play a vital role in helping ensure patients get access to needed therapies.” And Damian reminded us of the impact behind the data: “Every dataset we program and validate helps ensure reliable insights for clinical trials. It’s amazing to know that our work plays a role in bringing life-saving therapies to patients worldwide.”

 

The voices of our newest colleagues remind us that Cytel is more than a workplace. It’s a community driven by purpose, collaboration, and innovation. Here’s to their continued success and to another year of making a difference together.

Beyond the Database: How Clinical Data Management Transforms Patient Care

When we think about clinical data management (CDM), it is often easy to picture databases, spreadsheets, and documents for days. However, being able to step into a clinic setting and witness how data-driven decisions shape patient care reveals the true impact of CDM.

Here, I share real-world examples of the impact of clinical data management on patients and what lies ahead for the field as technology advances.

 

From data to decisions: The impact of clinical data management in the clinical setting

Every piece of data collected during a clinical trial, be it lab results, procedure information, patient reported outcomes, or even adverse events, tells a story. During trials, these individual stories converge to guide treatment plans, ensure safety, and improve outcomes. Accuracy and speed are absolutely critical when it comes to data entry and processing as it allows clinicians to make informed decisions without delay, reducing risks for patients. Without this precision, even groundbreaking therapies can stumble due to incomplete or unreliable information.

 

Real-world examples of CDM impact

Spotting issues early

In an oncology trial, centralized monitoring picked up unusual liver enzyme levels across several patients. Because of that insight, clinicians were able to tweak treatment plans right away, preventing serious side effects and keeping patients safe.

 

Identifying dosing mistakes

During a diabetes study, data checks uncovered inconsistencies in insulin doses. Fixing those errors ensured patients got the right amount of medication, reducing the risk of hypoglycemia and keeping the study on track.

 

Keeping patients engaged

Real-time data review revealed a trend of missed visits in a cardiovascular trial. Sharing this with site teams led to proactive outreach, helping patients stay on schedule and reducing dropout rates.

 

Bridging science and care

Clinical data managers play a behind-the-scenes role, but their work directly influences what happens in the exam room. For example:

 

Keeping data consistent

Consistency ensures that trial results are reliable and can be applied to real-world care, not just on paper.

 

Building trust in the numbers

Data Integrity means clinicians can rely on the information when adjusting dosages or monitoring side effects. No second-guessing, just confidence.

 

Protecting patients and speeding up progress

Regulatory compliance isn’t just about ticking boxes — it keeps patients safe and helps move promising therapies from research to approval faster.

 

Better communication

Real-time data sharing helps patients stay informed about their progress, reducing uncertainty.

 

Fewer repeat visits

Catching errors early means patients avoid unnecessary trips back to the clinic, saving time and stress.

 

The human element — My perspective

As a Principal Clinical Data Manager, I’ve had the privilege of seeing this impact firsthand. One moment that stands out was during a rare disease trial where every day mattered for patients waiting for treatment. By streamlining data cleaning and resolving queries quickly, we helped lock the database ahead of schedule. Knowing that this effort contributed to patients receiving life-changing therapy sooner was incredibly rewarding.

It’s in these moments that the connection between data and human lives becomes crystal clear. Behind every query, every validation check, there’s a patient hoping for better health and that’s what drives our work. CDM is not just about compliance; it’s about compassion through precision.

 

Looking ahead

As technology advances, the integration of real-time data and AI-driven insights will make clinical data management even more impactful. The clinic will become a hub where data flows seamlessly, supporting personalized medicine and improving patient experiences. Predictive analytics could help identify risks before they occur, and automation will free up time for deeper analysis. The future of CDM isn’t just about managing data, it’s about transforming care.

In short, clinical data management isn’t just a technical process, it’s a human story where every detail matters.

 

Interested in learning more?

The Medical AI Superintelligence Test and NOHARM: A New Framework for Assessing Clinical Safety in AI Systems

Artificial intelligence has become an increasingly common tool in medical decision-making. Physicians consult large language models (LLMs) for diagnostic reasoning, documentation, and summarization; patients use them to interpret symptoms; and health systems continue to integrate them into clinical workflows. Yet a basic question remains insufficiently answered: How safe are these systems when their outputs influence real medical decisions?

A recent initiative under Arise AI, centered around the NOHARM benchmark, offers one of the most rigorous evaluations of clinical safety to date. Its findings, and the broader accountability framework behind it, have implications not only for direct patient care but also for clinical development, medical writing, pharmacovigilance, and regulatory documentation. Importantly, the study highlights patterns of AI failure that closely mirror risks encountered when using LLMs for complex scientific and regulatory work.

 

A benchmark designed around real patient harm

NOHARM evaluates LLMs using one hundred real physician-to-specialist consultation cases across ten specialties. Instead of relying on synthetic questions or knowledge tests, the benchmark measures whether AI-generated recommendations could expose patients to harm. More than 4,000 plausible medical actions were annotated by specialists for clinical appropriateness and potential harm, allowing the framework to assess both errors of commission (unsafe recommendations) and omission (failing to recommend necessary actions).

The benchmark sits within the broader MAST (Medical AI Superintelligence Test), initiative led by Harvard and Stanford, hosted on bench.arise-ai.org, which aims to provide ongoing public evaluation of LLMs used in healthcare settings. By publishing comparative and transparent performance metrics — including safety, completeness, precision, and harm rates — MAST serves as a standardized accountability structure for medical AI systems.

 

Key findings from the study

The results provide a nuanced view of current medical AI capabilities:

  • Harm remains a measurable risk. Some LLMs produced severely harmful recommendations in more than 20% of cases.
  • Omissions are the dominant failure mode. Over three-quarters of severe errors involved missing essential actions rather than giving incorrect ones.
  • Model “strength” does not predict safety. Size, recency, and performance on general AI benchmarks had limited correlation with clinical safety.
  • Top models can outperform physicians. In a subset of cases, the best LLMs demonstrated higher safety and completeness than generalist clinicians.
  • Hybrid systems improve outcomes. Multi-agent configurations — where one model critiques or revises another — showed materially lower harm rates.

Collectively, these findings emphasize that clinical safety must be evaluated directly; it cannot be inferred from general intelligence or linguistic fluency.

 

Relevance beyond clinical care: Implications for clinical development

Although NOHARM focuses on medical recommendations, its insights apply directly to workflows in clinical development, where LLMs are increasingly used for drafting protocols, summarizing analyses, generating safety narratives, and producing Clinical Study Reports (CSRs). The risk profile is different — regulators, rather than patients, are the primary audience — but the core failure mode identified in NOHARM is the same: AI systems frequently omit essential information while producing text that appears complete.

These omissions can lead to incomplete evidence packages, insufficient traceability, inconsistencies with statistical outputs, and regulatory challenges. The study therefore reinforces the need for structured validation processes when using LLMs in high-stakes regulatory environments.

 

The CSR example: Completeness as a safety criterion

A clinical study report requires comprehensive reporting: methodology, protocol deviations, statistical analyses, safety findings, and linked tables, figures, and listings. While LLMs can streamline drafting and improve clarity, they do not reliably identify which elements are required for regulatory compliance. As NOHARM demonstrates, even highly capable models often omit critical actions or fail to include context necessary for safety.

This parallels the risk in clinical documentation: a well-written but incomplete CSR is not simply inconvenient — it can delay submission timelines, trigger regulatory questions, or obscure important safety signals. Ensuring completeness therefore becomes a core safety requirement.

 

The necessity of human-in-the-loop systems

One of the clearest insights from the NOHARM study is that hybrid systems outperform both standalone AI models and standalone human reviewers. Multi-agent architectures reduce harmful outputs, and expert human oversight further ensures contextual accuracy, completeness, and regulatory fidelity. In clinical development, this means that LLMs should support — but not replace — experienced medical writers, clinical scientists, statisticians, and safety physicians.

A well-designed workflow leverages AI for efficiency while relying on human expertise for judgment, quality control, and risk mitigation. This aligns with the MAST vision of AI systems operating under ongoing, benchmarked evaluation rather than unmonitored deployment.

 

A path forward: Benchmark-aligned, hybrid AI for regulated medicine

The NOHARM study and the broader Arise AI benchmarking platform represent a shift toward transparent, safety-focused evaluation of medical AI. They show that:

  • Safety and completeness require explicit measurement.
  • Omission is a primary source of AI risk in both clinical and regulatory contexts.
  • Multi-agent and human-in-the-loop systems materially reduce harm.
  • Public, standardized benchmarking supports accountability and informed adoption.

For organizations exploring or deploying AI in clinical development, the message is straightforward: LLMs can accelerate work and improve consistency, but only when embedded within systems designed to detect and mitigate the very risks NOHARM identifies. With rigorous evaluation, hybrid architectures, and expert oversight, AI can be integrated into medical and regulatory workflows in a way that advances both efficiency and safety.

 

Interested in learning more?

Consult the preprint by David Wu, et al., “First, do NOHARM: Towards clinically safe large language models” and access the interactive NOHARM leaderboard to see model performance.

FDA Guidance on Assessing Overall Survival in Oncology Trials: A DMC Perspective

Overall survival (OS) is the ultimate endpoint — it is easily measured and of the utmost clinical relevance. However, it also takes the longest time to develop and may not be as sensitive to treatment effect as other endpoints. In many oncology studies, for example, alternative endpoints are used, such as progression-free survival (PFS). The hope is that these alternative endpoints are indeed clinically relevant and allow for quicker results with fewer numbers of events and/or subjects needed than a study powered to detect a difference in OS. However, there has been growing concern over the use of endpoints such as PFS.

As a result, the FDA has pondered guidance on how to prioritize endpoints: whether OS or an alternative endpoint should be primary, or both should be co-primary. The FDA has also pondered whether accelerated approval might be permitted using a quicker-developing alternative while waiting for the OS data to mature and (hopefully) substantiate the efficacious results from the co-primary endpoint.

To address this, the FDA has produced the draft guidance “Approaches to Assessment of Overall Survival in Oncology Clinical Trials.”

Here, we will review this document as it pertains to Data Monitoring Committees (DMCs), and provide our own thoughts based on our 30 years of work and thousands of studies.

 

FDA draft guidance: “Approaches to Assessment of Overall Survival in Oncology Clinical Trials”

A key section of the FDA guidance related to DMC activity is the following:

This text reinforces what Cytel has long advocated — more interim analyses, both for futility (and possibly harm) and benefit — and reinforces the independence of the DMC and the group directly facilitating the DMC’s work. Approaches to constructing and implementing interim analyses for OS can be flexible — for example, assessing futility at 50% information fraction of OS, and then assessing both futility and benefit at 75% information fraction.

In addition to the section above, there are other references within the draft guidance for how a DMC or other parties would use and interpret OS data. Some particular references are below:

 

Using and interpreting overall survival data

It is important the DMC understands the operating characteristics of these interim analyses. The analyses might be at a high level in the DMC charter, or more specifically described in the protocol, or study SAP or interim analysis plan (IAP). Translating statistical methodology informally into actual counts is helpful to ensure audiences, including the DMC, understand what the “tipping points” might be. For example, a stopping rule for futility at 50 deaths (with the stopping rule perhaps defined by O’Brien-Fleming or by Bayesian analysis) would helpfully be presented to the DMC that translates to clearly understandable events as a reality check — perhaps the study design and stopping rules might lead to the conclusion that 28 deaths on active vs. 22 deaths on placebo (using reasonable assumptions for timing of events and other aspects of the data) would be the tipping point to trigger futility. This could be informally presented to the DMC at their organizational meeting to ensure the DMC understands the high-level approach and how that might play out in real numbers.

It is also important for the group creating the analyses to communicate to the DMC the uncertainty of early results so that the DMC does not overinterpret. This is especially true in the (theoretic) presence of non-proportional hazards.

 

Overall survival data: A key safety parameter

OS is a critical variable to help the DMC assess the study even outside of formal interim analyses — both as the DMC interprets safety (more deaths on the active arm) and efficacy (fewer deaths on the active arm). Together, this provides a more complete way to assess risk-benefit in the context of other safety concerns. While some sponsors prefer a “safety only” DMC and are hesitant to provide endpoint data, it is typically well understood that, as a key safety parameter, OS data is

Therefore, we typically present a death table to the DMC, and likely two: one summary of deaths for the safety population and one for the randomized population. This is particularly relevant for open-label studies where there might be an imbalance in subjects not treated and therefore not included in the safety population. Note that, traditionally, not all deaths are included in the adverse event (AE) dataset. In many studies, AEs occurring after treatment has finished and/or due to underlying disease might not be captured in the AE dataset.

Cytel also may tabulate investigator-assessed cause of death. (We do note to DMCs the hazards of interpreting investigator-reported causality though. That is part of the reason OS is appealing, as it is immune to the potential biases of ascertaining relationship.) Two treatment arms might have the same number of deaths, but the interpretation of the DMC could be different if there is an excess of deaths due to disease progression on one arm, relative to an excess of deaths due to serious adverse events (SAEs)/toxicity on another arm. It is traditional to also show timing in a table (perhaps deaths <30 days after last dose vs. deaths ≥ 30 days after last dose). The DMC may drill down into the fatal SAEs/toxicity to look for patterns and perhaps make recommendations to mitigate that risk. We have seen examples of excess fatal interstitial lung disease (ILD), fatal COVID-19, and fatal infections on active arms compared to control arms. DMCs in these situations have thought hard about mitigation strategies, as they would for any concerning imbalance in a major safety domain.

A Kaplan-Meier figure of time to death (possibly without any inferential statistics) is also typically presented to the DMC — typically based on the randomized population. It can be important for the DMC to understand the timing of the deaths to help answer questions such as whether most deaths occur early in the study or if there is a differential pattern over time (i.e., crossing curves implying non-proportional hazards) between the treatment arms. The definitions used for programming the Kaplan-Meier figure might not match the exact definitions at end of study if used in a non-inferential way — the censoring rules might be flexible based on the data available (e.g., the choice of whether to censor subjects still alive and in study follow-up at their last known contact, or whether to censor at the data cut-off date). In some situations, especially where OS is the primary endpoint, the sponsor might be hesitant to even show a non-inferential Kaplan-Meier figure of time to death to the DMC outside of formal analyses (if those exist). Nonetheless, the DMC could argue these are needed and, at minimum, are available upon request by the group facilitating the DMC’s discussion.

For formal interim analyses of OS that show inferential statistics (e.g., p-values, hazard ratio, confidence intervals), there would be appropriate effort in advance to ensure that the precise censoring algorithm and inferential statistics specified in in the study SAP are put in place for the DMC review.

 

Oversight on study integrity and interpretability

The DMC has oversight on study integrity and interpretability in conjunction with the sponsor, but it is important to have the independent thoughts of the DMC. In this domain, the DMC may well voice concern if an excessive rate of subjects is being lost to long-term survival surveillance. That would be particularly important if an imbalance in arms is developing for subjects lost to follow-up for long-term survival (as could happen in open-label studies). The DMC might not explicitly state that observation externally, but an imbalanced rate of lost to follow-up would be particularly concerning to the interpretation of OS at the end of the study.

And the DMC may voice concern if OS is the primary endpoint or it is critical to have a certain number of deaths for secondary analysis, but the overall rate of deaths is appreciably lower than expected, which seems likely to extend the duration of the study by years – bringing into question whether the study will be relevant and funded for the additional years needed. The DMC may request current projections (with computed confidence interval) for when in calendar time the expected number of deaths will occur. This can be complicated if the minimum number of deaths is required in a subgroup, and particularly so if the minimum numbers of deaths is in a blinded subgroup (a blinded biomarker population, or in a subgroup of treatments for studies with more than two treatment groups).

The DMC should understand that OS data early in a study is statistically unreliable. And there might even be some expectation for excess deaths early on (if late benefit is expected, in the presence of early toxicity). There might even be the expectation of no OS benefit — there might still be overall value to the new treatment even if there is no difference in OS — if the treatment is less toxic or more easily administered or cheaper.

 

Increasing the confidence of subgroup analysis results

The DMC may rightfully be concerned upon seeing an excess of deaths on the active arm. There are options for the DMC, as for any potential safety concern. One of the first steps traditionally undertaken is to investigate which subgroup of patients is most at risk for the excess death — but with the full understanding of the hazards of subgroup analysis. It is easy for an unwary reviewer to overinterpret the results of subgroup analyses. The DMC should consider factors that increase the confidence of subgroup results when looking for consistency of signal or trying to identify a subgroup that could specifically have some risk mitigation plan implemented:

  • Biologic rationale
  • Larger sample size
  • Consistent findings in other trials
  • Included as a stratification factor

 

Ad hoc interim analyses for OS

If OS is the primary endpoint, and no statistical futility in place but the DMC sees worrisome or neutral trends in OS, the DMC might consider ad hoc analysis of OS for futility. This is usually in the context of excess Grade 3+ AEs or SAEs as well as this OS result. The DMC might see if a lower limit of a confidence interval of hazard ratio (suitably adjusted for the interim nature by using information fraction and perhaps O’Brien-Fleming boundary) excludes 0.9 or lower. Or perhaps the DMC will request the supporting CRO to compute conditional power. We have seen DMC recommend termination or major change based on the totality of the data — largely influenced by neutral or negative OS results. Obviously, it is preferred to simply have pre-planned looks for OS futility.

The DMC might also see amazing benefit of OS, but without any formal interim analysis planned to assess benefit. The DMC could decide that the scientific question has been answered and that it is unethical to continue and therefore alert the sponsor. We have seen this occur and after intense focused discussion between the DMC and sponsor and regulatory agencies the decision was made to move forward towards regulatory approval, which accelerated the regulatory process by years. Clearly, it is preferred to simply have pre-planned looks for OS benefit.

A recent situation involved a relatively high toxicity observed, but the intervention was expected to provide a long-term OS benefit. The DMC decided to recommend halting enrollment in the population with the lowest baseline disease severity. These subjects were expected to have the same rate of toxicity as all groups, but with minimal expected absolute OS benefit.

 

Difficult recommendations

DMCs must make difficult recommendations as PFS and OS data emerge, if PFS is a primary or co-primary endpoint. The DMC should ask for and be provided with (perhaps non-inferential) summaries of both PFS and OS. DMC recommendations are challenging if results are discrepant. Most common would be an interim analysis of PFS for benefit that crosses a boundary for benefit, but OS results are extremely immature or neutral or perhaps even in the negative direction. The DMC hopefully has the flexibility to explain the context of the situation to a senior liaison at the sponsor for discussion if the study should continue and obtain additional valuable OS data to help answer questions about OS more precisely. This is particularly problematic if there is a discrepancy between investigator-provided PFS and blinded independent review committee (BICR) PFS, or a delay or unavailability of BICR data to the DMC.

Controversies exist when the co-primary endpoint achieves full statistical information before OS does and deciding what the DMC’s role is at the point, and after that point in time. In some situations, the DMC’s obligations for oversight of the study conclude once PFS has completed. The sponsor (perhaps the full study team, or a subset of the study team, or a separate team) becomes unblinded to final PFS (and likely interim OS), and oversight for safety of ongoing patients during continued OS surveillance is undertaken by the sponsor. The assumption is that unblinding and knowledge of PFS results will not bias the ongoing OS collection. However, if results of PFS are impressive and that knowledge becomes public, that might impact behavior that affects future collection of OS — especially in an open-label study.

Therefore, many DMCs have argued that the DMC should still be involved in oversight of OS even after final analysis of PFS — perhaps in conjunction with making sure that the final PFS results are handled very securely and led by DMC and not sponsor personnel. The value of this approach is that the continued OS is not impacted by patients, sites, or general sponsor team knowledge of the PFS results or interim OS results. The DMC can take the lead on communicating final PFS results and perhaps interim OS results to a small group within the sponsor or perhaps (with agreement from all parties) directly to regulatory agencies.

 

Final takeaways

We believe, and the FDA agrees, that DMC access to overall survival results can be critical to the DMC’s remit. Hopefully, DMC members will insist on receiving the outputs that meet their needs, which could include either non-inferential or inferential analyses of overall survival; CROs will be able to create the outputs needed in an accurate and timely way; and sponsors will trust DMCs to act responsibly with the outputs provided to them.

 

Interested in learning more?

Download our white paper, “DMCs for Oncology Studies”: