Solutions
About Us
Insights
Careers

Accelerating Database Lock Timelines Without Sacrificing Data Quality

Database Lock (DBL) is a critical milestone in the clinical trial lifecycle. A final step of clinical data management, database lock indicates the completion of data collection, cleaning, and validation, readying the data for statistical analysis. This milestone typically occurs 4–6 weeks after Last Patient Last Visit (LPLV). However, if a more challenging timeline (like 1-3 weeks) for DBL is required — due perhaps to expedited regulatory submissions and pressing business or scientific requirements — it creates a high-pressure scenario for all stakeholders.

As gatekeepers of data integrity and quality in clinical trials, the Clinical Data Manager (CDM) plays an important role in ensuring DBL is achieved on time without sacrificing data quality.

Here, I share best practices for achieving accelerated database lock timelines.

 

Successful database lock depends on early planning

To ensure the database lock is successful, meticulous planning and key stakeholder involvement are vital from the start. Key stakeholders may include Clinical Data Managers (CDM), Clinical Research Associates (CRA), Medical Monitors, Site Staff (Investigators, Coordinators), and Biostatisticians.

 

Stakeholder involvement

Different domains have different perspectives when looking at data, although we share the same goal. Since the biostatisticians are the ones who process and analyze the data, it is important to involve them early on so that our perspectives are aligned.

 

Stakeholder expectations

It’s important to align expectations, responsibilities, and timelines with all key stakeholders early in the planning process, ensuring all parties are on the same page. This will help to identify potential risks, evaluate the likelihood and impact of risks to determine their severity, and allow for contingency planning.

 

Accelerate database lock with continuous data cleaning

Adopting a strategy of continuous data cleaning throughout the trial significantly accelerates DBL. This involves performing regular, structured data review and correction of accumulated trial data.

 

Locking data in groups

Locking data periodically throughout the trial reduces the volume of data that needs to be finalized at the end. Grouping data for locking, verification, and cleaning must be completed before locking can take place.

 

Timelines for each group of data locking

Collaborate with stakeholders on how data could be grouped together for locking and agree on realistic timelines for each group, having specific needs of each stakeholder in mind as some tasks are dependent on one another. These timelines could include last data entry, last query sent, last query resolved, investigator sign-off, and lock date.

For example, a grouping strategy includes:

  1. Looking at the participant recruitment plan
  2. Identifying the number of participants expected to complete the last visit or specific visit in a certain number of months
  3. Grouping these data together to form a batch
  4. Defining timelines for data cleaning activities before performing a lock on these data

When pre-defining timelines, it is important to take into consideration source data verification (SDV) intervals, and the feasible aspects to minimize data unlocking after the lock. For example, if the locking group contains a substantial volume of data, then the timeline for each activity typically needs to be longer. Aiming for a smaller volume of data when nearer to LPLV is essential to facilitate shorter data cleaning turnaround time.

 

Identify issues early with clinical data manager oversight

The CDM should closely monitor data and perform trend analyses to detect common data entry discrepancies, lagging query resolutions, unexpectedly high open queries, or pending SDVs, and alert stakeholders to address the issues promptly. This significantly helps to identify issues early and optimize data quality, which minimize costly delays. The CDM should also monitor the unlocking rates of previously locked data. If the unlocking rate is high, consider revising the data locking plan with more realistic timelines.

 

Delay in Investigator sign-off

Investigator sign-off of Electronic Case Report Forms (eCRFs) is a foundational regulatory requirement serving as documented evidence for the accuracy, completeness, and integrity of the data submitted. It is frequently delayed due to a combination of high investigator workloads, technical complexities, or cumbersome processes. Early discussion on this critical milestone and including the timeline in the Data Locking Plan contribute significantly to expedited DBL.

 

Avoid bottlenecks caused by data outside electronic data capture

External data that are not part of electronic data capture (EDC) often become the bottleneck in DBL due to complexity and time-intensive processes. Early proactive discussions with vendors regarding timelines for data delivery are critical to avoid jeopardizing an accelerated DBL timeline.

 

Final takeaways

Adopting continuous data cleaning approach is essential for organizations aiming to shorten the timeline between LPLV and DBL. With strong attention to planning, timelines, and ongoing stakeholders’ engagement, DBL can be achieved on an accelerated timeline.

Why “More Data” Isn’t Helping You Run Better Trials

Clinical Operations teams are being asked to let go of traditional approaches and do more than ever before:

Deliver more complex trials, faster — with fewer resources — and higher confidence in outcomes.

And how has the industry responded?

With a proliferation of data access, tools, and dashboards.  But does a dashboard really help navigate complexity with speed and well-managed risk?  No.

Let’s discuss the methods and tools that help turn this complexity into clarity.

 

The problem isn’t just complexity — It’s information overload

Clinical trials have changed dramatically:

  • 7x increase in data points
  • 4x increase in data sources
  • Increasing reliance on external data, RWE, and predictive modeling

Yet often you’re still expected to manage across multiple systems, in spreadsheets and trackers: CTMS, EDC, RBQM dashboards, query reports, enrollment trackers, deviation logs, and monitoring reports.

None of these disparate sources of information tell the whole story, and every critical study execution decision you make is plagued with data gaps, inconsistencies or discrepancies, and latency issues.

How then can we consolidate and automate our use of the data to make timely decisions that we trust?  There are certainly technology stacks that large organizations license and deploy.  But what happens when you can’t afford them?  You partner with a data management and biometrics specialty provider who understands what you are up against, what is needed to successfully deliver a study, who understands the data and what is required, and who offers critical solutions to help heads of clinical operations gain control at a price that they can afford.

Tools that actually make a difference offer:

  • Actionable insights, not static reports
  • Continuous visibility, not retrospective analysis
  • Aligned teams, not handoffs

 

Central statistical monitoring: Detecting emerging risks early

Early intervention is key to managing trial risks and ensuring reliable results. As clinical trials grow in complexity, data quality and patient safety can no longer be ensured within system reports. And with evolving regulatory expectations, trial budget pressures, and the need for earlier, more objective insights into emerging risks, central statistical monitoring (CSM) has become a critical component of modern trial oversight.

Tools, such as Cytel’s Cytelytics, can leverage statistics to identify trends, detect risks, and optimize source data verification efforts.

Regulatory agencies now treat audit trail data with the same level of scrutiny as clinical data, and expect proactive, ongoing reviews. Relying on outdated or manual approaches is a risk you can’t afford.

Additionally, regulatory agencies emphasize the need for proactive and ongoing audit trail reviews, treating audit trail data with the same level of scrutiny as clinical data. Manual approaches are time sinks and can introduce unnecessary risk. Tools like Cytel’s Audit Detective enhance compliance and data integrity by identifying inconsistencies, unauthorized access, and unusual activity patterns in audit trails.

 

Better data visualization: Driving decisions, not just reporting

Traditional reports tell you what happened. Modern visualization:

  • Links operational metrics to clinical outcomes
  • Allows drill-down from summary to patient level
  • Highlights where intervention changes the outcome

Tools like Cytel’s ClinCytesDV provides interactive graphs, tables, and listings, layering data together to tell a richer story.

 

Data management: Operating environments that drive speed and quality

Data ingestion, cleaning, reconciliation, and reporting should not operate in lock step. A modern approach:

  • Automates data ingestion across sources (EDC, RWD, wearables)
  • Standardizes data structures (CDISC, OMOP, FHIR)
  • Enables real-time cleaning and review

The result is better data processing, reduced site burden, faster lock — and less firefighting. This is the difference between oversight and control.

 

Final takeaways

The answer isn’t more dashboards, systems, or data, but rather the methods and tools that result in fewer reconciliations across systems, earlier visibility into risks, faster decisions with higher confidence, and ultimately, that allow you to spend less time managing the process — and more time managing the study.

Clinical Data Management’s Next Evolution: From Data Stewardship to Data Intelligence

Clinical Data Management (CDM) is undergoing a fundamental transformation. What was once primarily a function focused on data collection, validation, and cleaning is now emerging as a strategic, technology-driven discipline at the heart of modern clinical research.

Today’s trials generate unprecedented volumes of complex data. A recent Tufts Center for the Study of Drug Development survey found a 7x increase in data points and 4x increase in data sources. Here at Cytel we have seen studies with over 20 data sources. Beyond traditional electronic data capture (EDC), clinical studies increasingly incorporate electronic health records (EHRs), wearable devices, mobile applications, genomics, imaging, and real-world evidence (RWE). While these data sources create enormous potential for deeper insight, they also introduce new challenges that conventional CDM approaches were never designed to handle.

To unlock the value of this expanding data universe, clinical organizations must rethink not only their tools, but also their talent, workflows, and mindset.

 

The rise of new roles in clinical data management

This evolution has created demand for new, specialized roles that bridge clinical knowledge, data science, and technology:

 

Clinical Data Scientist (CDS)

Clinical Data Scientists focus on extracting insight from complex medical data. They apply advanced analytics, visualization, and domain expertise to uncover trends, assess data quality risks, and support clinical and operational decision-making.

 

Clinical Data Engineer (CDE)

Clinical Data Engineers design and maintain the data infrastructure that makes modern analytics possible. They build robust, compliant data pipelines, integrate diverse data sources, and ensure data is reliable, traceable, and analysis-ready across the clinical trial ecosystem.

 

Together, these roles move CDM beyond data stewardship toward true data enablement.

 

The expanding complexity of clinical data

Modern clinical trials are no longer linear or siloed. Data flows continuously from multiple sources, often in near real time, and in formats that vary widely in structure, granularity, and reliability. Managing this complexity requires more than rule-based checks and manual reviews. Organizations need scalable data architecture, advanced analytics, and intelligent monitoring approaches that can adapt as data volume, velocity, and variety increase. This shift marks a move away from reactive data cleaning toward proactive data intelligence.

 

Why data visualization matters more than ever

As data points multiply, traditional listings and static reports quickly become unmanageable. Data visualization is no longer a “nice to have,” it is essential. Advanced visual analytics enable clinical teams to identify patterns, compare data across sites, and detect emerging issues early, before they compromise data quality or timelines. By transforming complex datasets into intuitive visual insights, teams can move faster, ask better questions, and focus attention where it matters most.

 

Figure 1: Early Detection of Data Quality Risks through Data Visualization Use Case

Systemic audit trail analysis and regulatory expectations

Regulatory expectations are also evolving alongside data complexity. The 2023 EMA guidance places increased emphasis on audit trail review, signaling a shift from point-in-time checks to systemic analysis. Manual audit trail reviews are no longer sufficient at scale. Instead, sponsors and CROs must adopt analytical approaches that continuously monitor audit trail activity while identifying unusual patterns. This will support site fraud detections, risk-based quality management, and inspection readiness. Analytics-driven audit trail review not only improves compliance, but it also strengthens overall data integrity and operational oversight. In short, the audit trail data needs to be treated similarly to clinical data. In 2025, Cytel was made aware of multiple sponsors being asked to provide evidence of a systematic review of the audit trail data by regulatory authorities.

 

Figure 2: Systemic Audit Trail Analysis Use Case

From comprehensive reviews to trend and outlier detection

In a world of big data, reviewing everything is neither practical nor effective. The future of data cleaning lies in intelligent prioritization. By leveraging statistical methods and trend analysis, CDMs can shift from exhaustive data review to targeted investigation focusing on outliers, inconsistencies, and meaningful deviations. This will reduce manual effort while improving data quality outcomes, aligning with risk-based monitoring principles, and enabling faster, more confident decision-making throughout the trial lifecycle. This is accomplished by statistically analyzing the data variability similar to how statistics are used to evaluate for safety and efficacy and assigning risk levels to the various checks that are performed. An overall risk level is also created and based on the analysis targeted data checks are performed.

 

Figure 3: Risk-Based Data Cleaning Use Case

Building insight-ready clinical data ecosystems

The future of clinical data management is not defined by a single tool or technology, but by an ecosystem; one that combines modern platforms, advanced analytics, and specialized talent.

Organizations that invest in insight-ready data architectures and deploy the right expertise will be better positioned to improve data quality, accelerate timelines, and generate deeper insights from increasingly complex datasets. As clinical research continues to evolve, CDM’s role is expanding from managing data to unlocking its full strategic value.

 

Interested in learning more?

William Baker and Jenn Sustin will be hosting the webinar “Enabling the Shift to Clinical Data Science and Engineering for Modern Trials” on February 18 at 10 am ET:

Beyond the Database: How Clinical Data Management Transforms Patient Care

When we think about clinical data management (CDM), it is often easy to picture databases, spreadsheets, and documents for days. However, being able to step into a clinic setting and witness how data-driven decisions shape patient care reveals the true impact of CDM.

Here, I share real-world examples of the impact of clinical data management on patients and what lies ahead for the field as technology advances.

 

From data to decisions: The impact of clinical data management in the clinical setting

Every piece of data collected during a clinical trial, be it lab results, procedure information, patient reported outcomes, or even adverse events, tells a story. During trials, these individual stories converge to guide treatment plans, ensure safety, and improve outcomes. Accuracy and speed are absolutely critical when it comes to data entry and processing as it allows clinicians to make informed decisions without delay, reducing risks for patients. Without this precision, even groundbreaking therapies can stumble due to incomplete or unreliable information.

 

Real-world examples of CDM impact

Spotting issues early

In an oncology trial, centralized monitoring picked up unusual liver enzyme levels across several patients. Because of that insight, clinicians were able to tweak treatment plans right away, preventing serious side effects and keeping patients safe.

 

Identifying dosing mistakes

During a diabetes study, data checks uncovered inconsistencies in insulin doses. Fixing those errors ensured patients got the right amount of medication, reducing the risk of hypoglycemia and keeping the study on track.

 

Keeping patients engaged

Real-time data review revealed a trend of missed visits in a cardiovascular trial. Sharing this with site teams led to proactive outreach, helping patients stay on schedule and reducing dropout rates.

 

Bridging science and care

Clinical data managers play a behind-the-scenes role, but their work directly influences what happens in the exam room. For example:

 

Keeping data consistent

Consistency ensures that trial results are reliable and can be applied to real-world care, not just on paper.

 

Building trust in the numbers

Data Integrity means clinicians can rely on the information when adjusting dosages or monitoring side effects. No second-guessing, just confidence.

 

Protecting patients and speeding up progress

Regulatory compliance isn’t just about ticking boxes — it keeps patients safe and helps move promising therapies from research to approval faster.

 

Better communication

Real-time data sharing helps patients stay informed about their progress, reducing uncertainty.

 

Fewer repeat visits

Catching errors early means patients avoid unnecessary trips back to the clinic, saving time and stress.

 

The human element — My perspective

As a Principal Clinical Data Manager, I’ve had the privilege of seeing this impact firsthand. One moment that stands out was during a rare disease trial where every day mattered for patients waiting for treatment. By streamlining data cleaning and resolving queries quickly, we helped lock the database ahead of schedule. Knowing that this effort contributed to patients receiving life-changing therapy sooner was incredibly rewarding.

It’s in these moments that the connection between data and human lives becomes crystal clear. Behind every query, every validation check, there’s a patient hoping for better health and that’s what drives our work. CDM is not just about compliance; it’s about compassion through precision.

 

Looking ahead

As technology advances, the integration of real-time data and AI-driven insights will make clinical data management even more impactful. The clinic will become a hub where data flows seamlessly, supporting personalized medicine and improving patient experiences. Predictive analytics could help identify risks before they occur, and automation will free up time for deeper analysis. The future of CDM isn’t just about managing data, it’s about transforming care.

In short, clinical data management isn’t just a technical process, it’s a human story where every detail matters.

 

Interested in learning more?

Career Perspectives: A Conversation with Naydene Slabbert

In this edition of our Career Perspectives series, we are delighted to feature Naydene Slabbert, Principal Clinical Data Manager, at Cytel. Naydene shares insights from her career journey, discusses the critical role of early-stage clinical trial setup in ensuring the delivery of high-quality, actionable data, and reflects on the evolving role of data managers in clinical trials.

 

Can you give us a little background on your career so far? What led you to clinical data management, and how has your path evolved over the years?

My journey in clinical data management started over 23 years ago, and it’s been such a rewarding experience filled with growth, learning, and a lot of exciting challenges. I began my career at Quintiles (now IQVIA), where I started as an Assistant Data Coordinator and eventually became a Data Team Lead. Those early years gave me a solid foundation in clinical trial operations and sparked my interest in data quality and process improvement.

In 2021, I moved to DF/Net Research, where I led several high-profile studies and contributed to infrastructure and software development. That role helped me expand my technical and strategic skills, especially in managing complex, multi-site trials.

Now, I’m proud to be part of Cytel as a Principal Clinical Data Manager. My focus is on enhancing end-to-end data management processes, working closely with cross-functional teams, and making sure our systems support both scientific excellence and regulatory success. Over the years, my role has evolved from hands-on data work to strategic leadership, and I continue to be inspired by the impact that well-managed clinical data can have on public health and patient outcomes.

 

You’ve been supporting the lead on a major study that went live in September. What did your day-to-day work look like at this stage of the project?

During the go-live phase of the study I’m working on, my daily focus was to make sure our data management systems and processes were running smoothly and in sync across teams. It’s a crucial time where accuracy, quick thinking, and strong teamwork really matter.

I partnered closely with the study lead and various cross-functional teams to validate the Electronic Data Capture (EDC) system, double-checking that all edit checks and Case Report Forms (CRFs) were working as expected. We held daily huddles and status meetings to keep everyone aligned and moving forward, which made it easier to spot and tackle any issues early on.

This stage demanded a lot of agility, collaboration, and attention to detail — all with the goal of setting the study up for long-term success.

 

From preparing documents to getting the database ready for data collection — how do these early tasks set the foundation for a successful study?

The early stages of a clinical study really lay the groundwork for everything that follows. It’s where we take the scientific goals outlined in the protocol and turn them into practical, workable data processes. Getting this part right is key to the trial’s overall success.

A big part of this involves preparing core documents like the Data Management Plan, validation guidelines, and Standard Operating Procedures (SOPs). These aren’t just paperwork — they’re the playbook that keeps everyone aligned on exactly how data will be collected, reviewed, and reported. They help ensure consistency, compliance, and quality from start to finish.

At the same time, building and testing the database, from CRF design to edit checks and system integrations, is just as critical. This is where we make sure the tools for capturing data are user-friendly, accurate, and fully aligned with the protocol. A well-designed database helps reduce errors, speeds up query resolution, and supports faster decision-making.

By putting in time and care upfront, we’re able to minimize potential risks, boost efficiency, and set the stage for a study that’s not only regulatory-ready but also delivers high-quality, actionable data. In my experience, a strong launch phase really sets the tone for everything that follows.

 

Now the study has gone live, you’re overseeing the data. What does that oversight involve, and how do you ensure data quality and consistency throughout the trial?

Once a study goes live, my role shifts into a proactive oversight phase where the focus is on maintaining data integrity, consistency, and compliance across all participating sites and systems.

Ultimately, my goal is to create a system of continuous quality assurance. By fostering strong communication, leveraging technology for real-time insights, and maintaining rigorous documentation, I help ensure that the data collected is accurate, timely, and meaningful. This supports both scientific outcomes and regulatory success, and ultimately, the patients.

 

What do you like best about your role, and about working at Cytel?

What I enjoy most about my role is the opportunity to lead complex studies that have real-world impact, while collaborating with talented teams across disciplines. I thrive on problem-solving and ensuring data quality from start to finish, and I appreciate the autonomy and trust I’m given to manage projects effectively.

As for Cytel, I value the supportive culture and global collaboration. The company encourages continuous learning and innovation, and I’ve found the environment to be both respectful and intellectually stimulating. It’s rewarding to be part of an organization that’s committed to advancing clinical research through data-driven solutions.

 

Is there a particular project or initiative you’ve worked on recently that you’re especially proud of?

One project I’m especially proud of is the trial I mentioned earlier, which went live recently. It’s a high-profile study with complex data requirements, and I’ve been deeply involved from the early planning stages through to go-live. I helped translate the protocol into robust data collection tools, oversaw database setup and testing, and now manage ongoing data oversight. What makes this project stand out is the level of collaboration and precision required. It’s been incredibly rewarding to see our preparation pay off in a smooth launch!

 

You’ve held leadership roles across several organizations. What’s one piece of career advice you wish you had received earlier?

If I could go back and give myself one piece of advice early in my career, it would be: “Don’t shy away from getting your hands dirty.” I used to think leadership was mostly about strategy and oversight, but some of the most valuable lessons, and the biggest impacts made, came from jumping into the details.

Whether it’s troubleshooting a tricky data issue, reviewing CRFs, or helping build out a database, being hands-on keeps you sharp and connected to the work. It also builds trust with your team. They can see you’re not just directing from the sidelines, but genuinely in it with them. That kind of involvement helps you lead with more empathy, insight, and credibility.

 

How has your approach to managing clinical data changed over time, especially as you’ve moved into more strategic roles?

Over time, my approach to managing clinical data has shifted from task execution to strategic oversight. Early in my career, I focused on operational details such as CRF design, data cleaning, and query resolution. As I moved into leadership roles, I began shaping data strategies, aligning them with protocol goals, regulatory requirements, and sponsor expectations. I now prioritize proactive planning, cross-functional collaboration, and system optimization to ensure data quality and efficiency across the entire study lifecycle.

Now, my approach is focused on seeing the bigger picture and guiding teams toward smarter, scalable solutions.

 

Clinical trials can be complex, especially when managing data across different regions and systems. What are some of the biggest challenges you’ve faced in data management, and how did you tackle them?

One of the biggest challenges in clinical data management is keeping data consistent and reliable across multiple regions, especially in large, global studies. Each site often has its own workflows, varying levels of experience, and different infrastructure, which can lead to inconsistencies in how data is captured and handled.

To manage this, I focus on creating clear, well-structured documentation and providing centralized training to ensure everyone is on the same page. I also put strong validation processes in place to catch issues early. Working closely with vendors and site teams is key — it allows us to resolve problems in real time and keep the data aligned across systems.

Strategic planning and open communication play a big role too. By staying connected with all stakeholders and anticipating potential challenges, we’re able to maintain high-quality, harmonized data throughout the trial. It’s all about building trust, being proactive, and keeping the bigger picture in mind.

 

The field is evolving quickly. How do you see the role of data managers changing with the rise of AI, machine learning, and decentralized trials?

The role of data managers is indeed evolving rapidly with the rise of AI, machine learning, and decentralized trials. We’re moving from purely operational roles to more strategic ones, where we not only manage data but also help shape how it’s collected, interpreted, and used.

One trend I’m particularly excited about is the integration of AI and machine learning into data cleaning and query management. These tools help us move from reactive to proactive data oversight, identifying patterns and anomalies much earlier in the process. Decentralized trials are also reshaping how we collect and manage data — requiring more flexible systems and real-time validation strategies. As a data manager, I now focus more on system integration, data governance, and ensuring that new technologies align with regulatory standards and study goals.

These innovations are pushing us to become more strategic, tech-savvy, and collaborative, which I find both challenging and energizing. It’s an exciting shift that requires both adaptability and a strong foundation in data quality principles.

 

What skills do you think will be essential for future data managers entering the field?

I think future professionals in this space will need a mix of technical know-how, strategic thinking, and flexibility to really thrive.

For starters, being comfortable with data, understanding how to interpret it, analyze it, and use tools that support automation and predictive insights, is going to be key. With AI, machine learning, and real-time data becoming more common, data managers will need to be confident working with more complex systems and datasets.

Technical skills will always be important. You’ll still need to work with EDC platforms, understand coding and data standards, and know how to manage data integrations. But we’re also seeing a growing need to understand APIs, interoperability, and data governance, especially as decentralized trials become more widespread.

Just as important are the soft skills. Strong communication, collaboration, and leadership are essential because data managers often act as the link between clinical, statistical, and operational teams. Being able to bring people together and keep everyone aligned makes a huge difference.

And finally, I’d say curiosity and a willingness to keep learning are vital. The field is changing fast, and those who stay open to new ideas and keep building their skills will be best positioned to lead the way.

 

As a remote employee, how do you maintain a healthy work-life balance? What strategies work for you, and do you feel supported by Cytel in this regard?

Working remotely definitely has its perks, but maintaining a healthy work-life balance takes a bit of intention. For me, it starts with having a clear plan for the day. I like to set goals, block out time for focused work, and make sure I take regular breaks. I also try to stick to a consistent “log-off” time, which helps me mentally switch from work mode to personal time.

One thing that’s really helped is having a dedicated workspace that’s separate from my living space. It makes it easier to stay focused during the day and disconnect in the evenings. I also make time for walks, family, and activities that help me recharge as those are just as important as meetings and deadlines.

Cytel has been incredibly supportive when it comes to flexibility and balance. There’s a lot of trust and autonomy, and the culture really respects personal time. Leadership encourages us to take care of ourselves, which makes remote work not only manageable but genuinely enjoyable.

 

You have been with Cytel for around 6 months now. What aspects of Cytel’s culture stood out to you when you joined?

What really stood out for me when I joined Cytel was how collaborative and welcoming the culture is. From day one, I felt like part of a team. People are generous with their time, open to new ideas, and genuinely invested in working together to achieve shared goals. It’s not just about getting the job done; it’s about how we support each other along the way.

I also really appreciate the company’s focus on quality and innovation. There’s a strong drive for continuous improvement, and strategic thinking is encouraged. That’s something I value deeply in my own work, especially when it comes to refining processes and contributing to cross-functional initiatives.

Another thing that impressed me is how well remote employees are supported. Even though I’m based in South Africa, I’ve felt fully connected to the global team. Communication is seamless, and there’s a real effort to make sure remote staff feel included and empowered.

Overall, Cytel fosters a culture that supports both professional growth and personal well-being, and that’s something I truly appreciate.

 

Finally, what are your main interests outside of work? What helps you recharge and stay inspired?

When I’m not working, you’ll probably find me out in the beautiful South African bushveld, book in hand, or enjoying coffee in the sun — my personal reset button. I love getting creative in the kitchen (even if some dinners end up as “learning experiences”) and tackling home improvement projects just for the fun of it.

I’m also a mom to teenagers, which means my life is a mix of deep chats, dramatic eye rolls, and trying to keep up with slang that changes weekly. They keep me laughing, grounded, and constantly on my toes.

Spending time with family and friends is what really recharges me. It’s the fuel that keeps everything else running smoothly.

Thank you, Naydene, for sharing your experience with us!

Naydene Slabbert

Leading Across Generations: The Millennial Manager Perspective

“It was the best of times, it was the worst of times…”

Charles Dickens’s iconic opening line from A Tale of Two Cities captures an experience familiar to many millennial managers in clinical data management today. We are at a rewarding stage of our careers, having worked hard to reach leadership roles where we are trusted to guide teams. This often presents challenges, particularly in leading younger colleagues whose professional values, expectations, and communication styles differ significantly from those we encountered early in our own journeys.

 

The generational context

Within our field, three generations now work side by side:

Gen X: Our senior colleagues and mentors, who built much of the foundation of clinical data management as we know it. They helped standardize processes, develop systems, and establish the professional culture we inherited.

Millennials: We bridge the gap having learned Clinical Data Management during a time when CRFs were moving to eCRFs but were often still paper. We had our favorite pen, that wrote just right and didn’t bleed through the fields. We remember crowded meetings in conference rooms and answering a landline phone from our cubicles. We learned how to balance traditional processes with emerging technologies, bridging tried and true processes with emerging digital tools.

Gen Z: The newest members of our teams, entering the field during the era of AI, big data, and complex system integrations. They are digital natives, fluent in technology, and bring fresh perspectives about work-life balance, collaboration, and communication. They are also the Clinical Data Scientists who will lead the innovations of tomorrow.

Great Data Managers come from every generation however, generational diversity adds both strength and complexity to our teams. Gen Z colleagues bring technological fluency, adaptability, and fresh perspectives. At the same time, their expectations for workplace culture, work-life balance, and communication can differ from the norms millennials and Gen X grew up with. For millennial managers, the challenge is to find innovative ways to mentor effectively while honoring those differences.

 

Building trust through empowerment

One of the most critical factors in leading Gen Z professionals is building trust. Traditional management doesn’t work for this group while micromanaging only breeds frustration and disengagement. Instead, our younger colleagues thrive when given responsibility, space, and opportunities to grow. I’ve found the “see one, do one, teach one” method incredibly effective at building trust and confidence early in the relationship. Here’s how it works.

See one: allow team members to observe the process in action, maybe walk them through how you approach reviewing new data.

Do one: provide the tools and empower your team member to try it themselves. The first attempt may not be perfect — in fact, the first few tries might be a little rough — but that’s part of the learning process. Be supportive, offering encouragement while also giving constructive feedback and gentle corrections. Remember, we’ve all been in their shoes at some point. Teach them with the same patience and guidance you wish you had received.

Teach one: once they have done a few on their own encourage them to share their understanding by training a colleague or explaining the process back to you. It’s easy to simply nod along when working side by side or to stumble through the steps on their own, but having them articulate the process step by step as they complete it ensures genuine comprehension.

This model does more than transfer skills. It reinforces confidence, demonstrates trust, and creates natural opportunities for delegation. As managers, this method also ensures knowledge transfer is not unidirectional, it cultivates a team culture where everyone learns and contributes.

 

Embracing work-life balance

Another defining characteristic of Gen Z professionals is their expectation of balance. These team members are less willing to conform to the traditional model of remaining at a desk for eight consecutive hours. Rather than resisting this shift, managers can embrace it as an opportunity to foster healthier and more sustainable work habits across the team.

Encouraging short breaks for walking, stretching, or resetting after extended meetings benefits everyone. It promotes productivity, reduces burnout, and models an organizational culture that values employee well-being. For millennial managers, who often grew up in a culture of “always on” availability, adopting these practices can be a healthy correction that gives us an opportunity to model healthier practices for our team.

 

Adapting communication with care

Communication is an area where generational differences can be most visible. Gen Z professionals often write messages in a style that mirrors casual digital communication, which can seem too informal in a professional context. Instead of correcting tone in a prescriptive way, effective leaders provide guidance with flexibility.

I’ve found success in offering alternative phrasings that may be better suited to different situations. While “Warm Regards” may not resonate with this generation, we can certainly guide team members toward signoffs that are more professional than “I’m out” or “G2G.” Share your own experiences of learning professional communication, emphasizing that style naturally evolves with practice and experience. Encourage individuals to select wording that feels authentic to them while still meeting professional standards. This approach respects individuality while fostering growth, and it frames communication coaching not as criticism, but as an opportunity for development.

 

The reciprocal nature of leadership

Leading across generations is not simply about teaching; it is also about learning. Gen Z’s comfort with emerging technologies, openness to change, and insistence on balance can challenge established norms in constructive ways. Millennial managers, positioned between Gen X’s wealth of experience and Gen Z’s innovation, are uniquely suited to translate across generations.

By approaching leadership as a reciprocal process, managers can both strengthen their teams and enrich their own skills in the process.

 

From the “best of times” to a better future

As Dickens suggested, every era brings both opportunities and challenges. For millennial managers in clinical data management, the current moment requires balancing tradition and innovation, structure and flexibility, guidance and trust.

By focusing on empowerment, balance, and communication, we can help Gen Z professionals grow into the leaders and Clinical Data Scientists of tomorrow. At the same time, we strengthen our own leadership by embracing their fresh perspectives.

Ultimately, bridging generations is not about choosing the best of times over the worst — it is about ensuring that the best defines the future we create together.

 

Join us at SCDM 2025 in Baltimore, MD, where Jennifer Sustin will be presenting “Generations and Culture: How Prepared Are We to Welcome the Next Gen Workforce?” Stop by Booth 528 to connect with our team, explore live demos, and learn how we’re shaping the future of clinical data management.

Streamlining Data Management and Improving Statistical Accuracy in Clinical Trials with AI

As clinical trials grow increasingly complex, the need for smarter, faster, and more efficient data processes and analysis is in demand. Artificial intelligence (AI) emerges as a powerful tool, especially in programming and data management. For clinical trial professionals, AI offers the promise of streamlining workflows, improving data quality, and reducing time to database lock.

 

The evolving role of AI in clinical data programming

AI is not replacing clinical programmers; it’s augmenting them. AI should be considered a tool to use within clinical trials, just as EDC and SAS are commonly used tools. Automation tools driven by machine learning can now handle routine, rules-based programming tasks such as edit check generation, derivation logic, and data transformation. This allows programmers to focus on more strategic activities like validating statistical code or optimizing data pipelines. AI needs the expertise of our clinical trial professionals.

Natural Language Processing (NLP) is also making great progress. For instance, NLP can interpret free-text protocol documents to auto-generate specifications, electronic case report form (eCRF) templates, or even suggest initial SDTM mappings, significantly reducing manual effort.

 

AI in data cleaning and quality oversight

Traditionally, data cleaning has been labor-intensive, with data managers manually reviewing queries, data listings, and edit checks across multiple data sources and systems. AI tools can now proactively flag anomalies or data trends that human review might miss, such as unexpected patterns in lab values, inconsistencies across visits, or possible fraudulent data across participants and sites.

Predictive models can help identify study participants at high risk of dropout or noncompliance, enabling earlier intervention. This not only improves data completeness but also enhances trial efficiency and participant retention. The effort and cost of replacing clinical trial participants is significant and felt across all stakeholders. Improving the patient’s experience would be a significant way to save time, money, and accelerating progress.

 

AI in statistical programming: From code automation to advanced insights

Statistical programming is central to clinical trial analysis from producing tables, listings, and figures (TLFs) to preparing submission-ready datasets. Traditionally reliant on manual coding in SAS or R, this work is now gaining speed, consistency, and quality through AI augmentation.

 

Where AI adds value in statistical programming

  • Automated code generation: AI models trained on historical programming logic can produce initial SAS macros or R scripts for common TLFs and dataset derivations. These drafts accelerate development by up to 40–60%, freeing programmers and biostatisticians to focus on complex analyses and interpretation.
  • Code review and validation: AI-assisted tools can scan code for logic errors, inefficiencies, redundant steps, and deviations from programming standards. Acting as a “second reviewer,” they flag potential issues early and suggest optimizations.
  • Dynamic statistical modeling: AI algorithms can rapidly explore large trial datasets to detect subgroup effects, anomalies, or emerging trends. When guided by statistical oversight, these insights can refine analysis plans and support adaptive trial decisions.

The aim is not to replace human judgment, but to boost productivity, reproducibility, and the speed of insight generation, without compromising scientific rigor.

 

AI in biostatistics: Powering smarter, more adaptive clinical trials

 Biostatistics remains the foundation of evidence generation in clinical trials, providing the methodological rigor to transform raw data into reliable conclusions. In the context of AI, biostatisticians play a dual role: safeguarding scientific validity while leveraging new computational tools to enhance insight generation. This requires a careful balance between deep domain knowledge and technical proficiency in emerging AI-driven methodologies. From applying knowledge graphs (KGs) to map complex biomedical relationships, to developing predictive models that anticipate trial outcomes, biostatistics is evolving into a more dynamic and interconnected discipline.

 

Where AI adds value in biostatistics

  • Balanced expertise: Integrating statistical theory with AI/ML techniques to ensure robust, interpretable results.
  • Knowledge graph applications: Using KGs to uncover hidden relationships between biomarkers, treatments, and outcomes, supporting hypothesis generation and trial design.
  • Early prediction tools: Building predictive models for recruitment success, dropout risk, and endpoint achievement.
  • Segmentation and personalization: Identifying patient subgroups most likely to benefit from a therapy, improving trial efficiency and precision medicine strategies.
  • Support for registrational trials: Leveraging AI to optimize trial design, stratify patient populations, and run simulations that ensure the study is powered and structured for regulatory success.

 

Regulatory readiness and caution

Despite its promise, AI must be implemented thoughtfully. Regulatory agencies like the FDA are increasingly open to the use of advanced technologies but expect transparency, traceability, and validation. AI-based tools must be auditable and explainable, especially when used in clinical data workflows that feed into regulatory submissions.

 

What’s next?

As AI becomes more embedded in clinical trial ecosystems, we can expect increased integration with EDC systems, CDISC standards, and statistical programming tools. The goal isn’t to eliminate human oversight but to enhance it, allowing clinical data professionals to make faster, better-informed decisions.

 

Final takeaways

AI is reshaping programming and data management in clinical trials. For clinical trial professionals, now is the time to become familiar with these tools, understand their capabilities and limitations, and engage with cross-functional teams to ensure responsible and impactful implementation. Ultimately our goal is to shorten drug development timelines and improve patient outcomes. With AI, we can be part of the solution to provide improved treatments for patients.

 

Interested in learning more?

Join Steven Thacker, Sheree King, Kunal Sanghavi, and Juan Pablo Garcia Martinez for their upcoming webinar, “How AI Enhances Biometrics Services: Streamlining Data Management and Improving Statistical Accuracy in Clinical Trials” on Thursday, August 28 at 10 am ET:

How CDISC and CDASH (CRF Standards) Streamline Clinical Trials

In today’s global research landscape, clear and consistent communication is more than a necessity — it’s a strategic advantage. It is particularly critical in clinical trials, where data must speak a universal language across teams, geographies, and regulatory frameworks.

The CDISC (Clinical Data Interchange Standards Consortium) and CRF (Case Report Form) standards serve as the universal language of clinical trials, ensuring consistency, clarity, and collaboration across the entire study lifecycle. By implementing these essential frameworks, organizations can optimize data collection, management, and submission — driving cost efficiency and accelerating medical advancements.

Here, we discuss CDISC and CRF standards and how they support the design, execution, and analysis of clinical trials.

 

The need for standardization

Overall, ensuring consistent and reliable data across multiple clinical studies requires the standardization of processes, procedures, and data collection methods. This uniformity can improve data quality, facilitate data sharing and analysis, and ultimately enhance the efficiency and validity of clinical research.

There are many benefits to utilizing CDISC and CRF standards, such as:

  • Improved data quality and reliability
  • Enhanced data sharing and integration
  • Increase efficiency
  • Improved communication and collaboration
  • Support for regulatory compliance
  • Scalability and repeatability

Let’s take a closer look at how CDISC and CDASH standards help create a foundation for data collection, presentation, and submission in clinical trials.

 

CDISC Foundational Standards

CDISC (Clinical Data Interchange Standards Consortium), a global non-profit organization, develops and promotes standards for data exchange in clinical research. The CDISC Foundational Standards support end-to-end clinical and non-clinical research processes, focusing on the core principles for defining data standards, and include models, domains, and specifications for data representation.

 

FDA guidance on CDISC standards

In recent years, the FDA has clearly stated its preference for receiving both clinical and analysis data formatted in compliance with CDISC standards. This has been communicated through a series of guidance documents, correspondence with sponsors, and presentations at conferences. As a result, CDISC models have become the de facto standard for submitting data to the FDA.

As of today, the FDA requires the following CDISC standards:

  • Controlled terminology
  • SEND
  • SDTM
  • ADaM
  • Define-XML

 

CDASH: Maximizing data quality

CDASH (Clinical Data Acquisition Standards Harmonization), a foundational standard developed by CDISC, focuses on harmonizing data collection in clinical trials, providing guidance on how to design and populate case report forms (CRFs) to ensure consistent data collection across studies. These standards help maximize data quality in order to streamline processes across the entire spectrum of medical research, from crafting clinical research protocols to reporting and regulatory submissions.

CDASH Model v1.3 — the latest version — was released in September 2023.

 

Key features of CDASH

  • Provides guidance on designing and populating CRFs/eCRFs, covering all therapeutic areas and phases of clinical trials
  • Specifies standard field names, meanings, and how to fill them
  • Characterizes fields as highly recommended, conditional, or optional
  • Includes a CDASH Model and CDASH Implementation Guide

 

The benefits of CDASH

Instead of following bespoke standards, CDASH’s guidelines for CRFs/eCRFs help sponsors collect data consistently across studies. This further aids in producing data in SDTM format for submission purposes and allows regulators to review data submission packages more accurately and efficiently, identifying concerns or making approvals faster. In addition, you can remove the duplication of trials and post-marketing evaluation, improving patient centricity.

CDASH standards also provide guidance for the development of data collection tools, which are clear, understandable, and precise. Following CDASH standards ensures traceability of trial data from the time the data is collected at the site until the data is ready for final analysis and regulatory submission. This maintains the integrity of source data to support the trial’s outcome/findings.

Sponsors can further save on time required for setting up new studies following the CDASH standards as most of the data collection and associated programming can be standardized across studies.

 

CRF libraries

A Case Report Form (CRF) library in clinical trials is a collection of standardized, reusable CRFs designed to streamline data collection and management. These libraries, whether electronic or physical, offer templates and guidelines for collecting data across different trials and therapeutic areas. They ensure uniformity, accuracy, and efficiency in data collection, ultimately benefiting trial conduct and analysis.

CRF libraries can reduce the cost and time budgeted for the clinical trial database preparation by:

  • Streamlining processes
  • Reducing training
  • Accelerating clinical trials
  • Using resources more efficiently
  • Improving adaptability and consistency
  • Focusing on design

 

Final Takeaways

CDISC standards, including CDASH and CRF standards, have revolutionized the way clinical data is managed, presented, and submitted, enhancing its integrity and efficiency in clinical research and drug development. Conformance to these standards is thus a critical aspect of clinical studies to ensure uniform data collection and submission processes, ultimately bringing quality treatments to patients faster.

 

Interested in learning more?

Watch our on-demand webinar, “Boosting Efficiency with CRF and CDISC Standards”:

Best Practices for Ensuring Data Quality in Clinical Trials

Good data is essential for successful clinical trials. It helps ensure accurate analysis, guides important decisions, and supports the approval and safe use of new treatments. As trials become more complex with remote setups, many data sources, and stricter rules, keeping data quality high is more important than ever.

In this post, we’ll look at simple, effective ways to protect the accuracy and trustworthiness of data in clinical trials.

 

Create a strong data management plan

A good Data Management Plan (DMP) is the first step to quality data. It explains how data will be collected, checked, cleaned, and stored during the trial. It also helps everyone involved know their role.

A strong DMP should include:

  • Clear roles and responsibilities
  • Information about study set up including the electronic data capture used and the audit trail.
  • Step-by-step instructions for entering and handling data
  • Data cleaning process and details
  • Management of Serious Adverse Event (SAE) reconciliation and medical coding within the study

If you start your DMP early and keep it up to date, it will help avoid confusion and keep the trial consistent.

How to create a strong data management plan infographic

Use standardized data collection methods

Data collection should follow a consistent approach. It starts with designing a smart Case Report Form (CRF) that only asks for the necessary information and matches the trial goals. Using standard forms (like CDASH) across studies makes data easier to manage and review.

Other ways to keep data collection consistent:

  • Use standard medical terms (e.g., MedDRA, WHO Drug Dictionary)
  • Train staff on correct data entry
  • Use reliable electronic systems with built-in checks to catch errors
  • Avoid the comment or text field as much as possible
  • Do not collect data twice (duplicated data)

These strategies will reduce mistakes and save time fixing issues later.

 

Monitor data actively

To keep data quality high, you need to prevent problems and catch them early. Active monitoring either remotely or based on risk can help spot problems before they get worse.

Examples of active monitoring:

  • Dashboards that show missing or unusual data
  • Review of top priority data for the primary analysis
  • Regular review of key items like side effects or medication use
  • Focus monitoring on high-risk sites and processes

Finding and fixing issues early keeps your data reliable. Moreover, fixing problems as early as possible enables the site to avoid recurring issues.

 

Handle queries quickly and clearly

Resolving data queries (questions or issues) takes time, so it’s important to manage this well.

Tips for efficient query handling:

  • Use automated checks to catch simple issues
  • Focus manual review on complex or safety related data
  • Keep clear records of how each query is resolved by adding a comment
  • Pay attention to queries opened for several days to check with the sites on the reason why

Good query management keeps the trial moving and ensures the data is clean and complete.

 

Combine data from different sources carefully

Today’s trials often use data from many places like labs, apps, devices, and imaging systems. Keeping this data consistent is key.

Best practices include:

  • Creating a Data Transfer Agreement (DTA) detailing data transfer specifications, like for instance how the data will be transmitted, the frequency of the transfer, and the data to be transmitted
  • Checking and validating all incoming data
  • Setting up checks to make sure sources agree (e.g., comparing lab data with system data)

Good data integration helps you understand results more clearly and trust the final data.

 

Follow regulatory guidelines

High data quality also means following the rules. Agencies like the FDA and EMA expect clean, traceable, and well-documented data.

To be compliant:

  • Have clear procedures and test your systems
  • Run regular data audits
  • Make sure your data follows ALCOA+ principles (Attributable, Legible, Contemporaneous, Original, Accurate, and more)

Meeting these rules protects your study and shows the data is trustworthy.

 

Staff training and communication

Even with great tools, skilled people are essential. Train all team members regularly so they understand their role in protecting data quality. Make sure communication is clear between everyone’s sites, teams, and vendors. Write eCRF Completion Guidelines and perform a training video to train the sites and the investigator, explaining how the system works and how to perform data entry.

Sharing knowledge and working together helps build a culture where quality comes first.

 

Final thoughts

Keeping data quality high in clinical trials takes planning, careful checks, and teamwork. By following best practices like clear data collection, active monitoring, and smart integration, you can ensure your data is accurate and ready for review.

As clinical trials continue to evolve, one thing stays the same: quality data is key to faster approvals and better treatments for patients.

The Use of AI in Clinical Trial Data Management

Clinical data managers play a key role in clinical trials, ensuring the integrity of the clinical data and bearing ultimate responsibility for preparing the data for statistical analysis. As clinical studies evolve, data management is becoming more complex with the use of multiple data sources, as well as the increased volume of data through case report forms, patient reported outcomes, laboratory data, electronic health records, imaging reports, and more.

Here, we discuss various ways artificial intelligence has the potential to accelerate clinical trial data management as well as some of the benefits and challenges of using these groundbreaking tools.

 

Transforming data cleaning in clinical trials

The traditional method of data cleaning involves manual checks and review of data listings. Data that falls outside the expected results is queried, which leads to time-consuming communications that are often prone to error. Additionally, a significant amount of time is spent programming and validating data checks and listings.

AI has the potential to transform data cleaning: AI tools can quickly spot outliers, inconsistencies, and errors in datasets that may be missed with traditional methods. An example of this is an exception listing, which compares elevated laboratory parameters with adverse events. This listing would seek to strike a correlation between laboratory values and concurrent adverse events. AI can also detect possible missing or duplicate data. Ultimately, AI can lead to faster data availability by improving clinical trial data analysis and cleaning.

Furthermore, AI can be used to detect fraudulent data. As data managers review one patient data at a time, AI tools can look at data at a site collectively and look for potential fraudulent data (for example, dosing for all patients at the site is on the same day and time).

 

Improving database development in clinical trials

The use of AI is becoming more prevalent in the software industry: electronic data capture (EDC) companies are using AI technology to translate plain text for edit check requirements (e.g., temperature should be within 36–40 Celsius) into the programmed edit checks. This will significantly improve the timelines for database development within the clinical trial space.

 

Shortening database lock timelines

AI can also shorten database lock timelines. Data managers are often tasked with manually reviewing data to ensure data accuracy as well as verifying whether there is missing data. An example of this is considering a study participant with colon cancer, but there is no prior therapy within the timeframe of the condition. Data managers require a significant amount of time to perform such reviews. AI can examine the data holistically, flag possible discrepancies, and reduce manual effort and human error, allowing for a cleaner database at a much faster rate.

 

The need for human involvement

While AI can process large amounts of data, identify suspicious patterns, and analyze images, human involvement is required to validate the AI functionality. Humans will also be required to perform ongoing reviews of the output and adjust the AI tools as the amount of clinical trial data increases. With the use of AI, individuals involved in the clinical trial can focus their skills and time on evaluating complex factors and decision-making.

 

Benefits and challenges of AI

The use of AI in clinical trial data management and EDC development has several benefits:

  • AI can automate time-consuming manual tasks for a faster and more accurate database.
  • AI can reduce the possibility of human error while continuously monitoring clinical trial data. This allows data issues to be detected quickly, which can improve data accuracy and safety for trial participants.

However, challenges remain:

  • Maintaining participant confidentiality while using AI tools can be a significant challenge.
  • AI systems must be configured and monitored to ensure there is no risk of accessing unauthorized or unblinded data.
  • Large amounts of data are required for the AI to be properly trained to deliver high-quality results.

Ultimately, the decision must be made of whether the benefits of AI outweigh the challenges.

The pharmaceutical industry is a heavily regulated industry. Validity of AI tools must be established before these tools are deployed for database development and data cleaning. This will ensure the sensitivity and specificity with which the AI tool is expected to perform with a negligible error rate.

 

Final takeaways

AI will play an important role in many aspects of clinical trials, including and beyond data management. From identifying potential compounds to automating routine tasks, innovating statistical programming, streamlining medical writing, and creating digital twins, we will only continue to see advancements in AI tools in the coming years. AI can be a groundbreaking tool to shorten drug development timelines and improve patient outcomes.