|Articles|June 30, 2022

Metrics in Clinical Medicine and Clinical Trials Should Look Beyond the P Value

From evaluating clinical outcomes to determining payment for services, measurements are important in health care.

From evaluating clinical outcomes to determining payment for services, measurements are important in health care. Today, it would be difficult to envision a scenario where this would not be the case. Although experience is highly relevant in the clinical domain—from a surgeon’s decision to remove or not remove a patient’s large vascular tumorous mass to a medical oncologist’s recommendation that a patient to try 1 more antineoplastic regimen—the clinical value of these critical but subjective individual determinations must also be examined based on the objective results. For example, did the patients highlighted above suffer excessive measurable adverse effects (AEs) vs the time they spent out of the hospital? What was the ultimate duration of their measured survival?

One can rigorously debate the specific role the patient/family vs the health care system/payor should have in the assessment of “value.” In the opinion of this commentator, it is difficult to argue against the hypothesis that such an evaluation should be rationally based, at least in part, on an examination of objectively measured outcomes.

However, when moving beyond the individual, a discussion of the role of metrics becomes even more complex and potentially problematic. How does a claimed “statistically superior or inferior” result of a clinical trial examining a diagnostic or therapeutic treatment approach in carefully selected patients with cancer relate to an individual seen in clinical practice? For example, factors such as age, ethnic background, known or potentially unknown relevant comorbidities, physiologic measurements, life experiences, nutrition, family support, psychological profile, and willingness to assume and understand risk might have an influence on the relevance of a trial result for an individual or for the providers advising their patients.

It also must be acknowledged that observed metrics in a clinical trial setting can be intentionally or unintentionally manipulated to achieve a superior outcome. This commonly occurs through the exclusion of particular groups of individuals from clinical trials using criteria such as age, prior therapy, comorbidities, and medication use, among others.

The anticipated claim for such actions by study sponsors and regulators is to say that they are attempting to protect participants from serious AEs. These decisions to exclude individuals provide the opportunity for a treatment strategy to be approved by a regulatory agency and/or have results published in the peer-reviewed literature before it is objectively known if this regimen is safe for individuals with clinical features that were not permitted in the study population.

This concern is far from uncommon in oncology studies, which routinely exclude or subtly discourage admission of individuals with, for example, a history of cardiac events or mild-to-moderate renal dysfunction, or those requiring a variety of pharmaceutical agents for chronic nononcologic related illnesses.

Further, despite the recognized value and need for randomized trials in oncology, they are the most susceptible to the concerns highlighted above. This is in part because of the requirement to have a population of patients that is as homogeneous as possible to isolate the therapeutic effects, such as time to disease progression or overall survival, and the AE profile of the approach being evaluated. Imbalances within the randomly assigned populations can complicate the objective interpretation of a study end point.

The protocol for randomized trials is appreciated, but it must not be forgotten that the actual aim of such trials should be to evaluate the potential benefit and risk of therapy in patients who are reflective of the real-world setting, after the strategy leaves the well-contained universe of a clinical trial. How does the community of patients with cancer benefit if large populations of individuals are denied access to potentially effective strategies? Should providers make educated guesses regarding efficacy vs safety and appropriate dosing because of a lack of essential trial-based data?

A final question concerns the legitimacy of the authority for randomized trials as the gold standard for objectively determining clinical relevance, using what has been quite arbitrarily labeled statistical significance. For example, there has been considerable discussion in the medical literature regarding the lack of understanding, misinterpretation, and inadequate reporting of P values in such efforts.¹

As one commentator pointedly noted: “Fundamentally, statistical inference using P values involves mathematical attempts to facilitate the development of explanatory theory in the context of random error. However, P values provide only a particular mathematical description of a specific data set and not a comprehensive scientific explanation of cause-and-effect relationships in a target population.”²

A second commentator was even more direct in criticizing common misconceptions regarding this statistical test: “A P value of 0.05 does not mean that there is a 95% chance that a given hypothesis is correct. Instead, it signifies that if the null hypothesis is true, and all other assumptions made are valid, there is a 5% chance of obtaining a result at least as extreme as the one observed. And a P value cannot indicate the importance of a finding; for instance, a drug can have a statistically significant effect on a patient’s blood glucose levels without having a therapeutic effect.”³

The danger of relying on a P value to determine statistical significance and to subsequently convert this number to define clinical benefit was highlighted in a recent report evaluating the results of randomized controlled trials (RCTs) for various interventions for COVID-19.⁴ The investigators found that “a relatively small number of events (a median of 4) would be required to change the results of COVID-19 RCTs from statistically significant to not significant,” calling into question the reliability of any single study to define both statistical significance and clinical utility.

The potential absurdity of employing P values to define clinical relevance in oncology is highlighted by a relatively recent article in a high-impact medical journal. The retrospective review sought to compare 258 patients who received complementary medicine (0.0136% of the total population) with 1,901,557 patients in the control group to reach its claim of statistical significance and impressive P values (< .001). The authors wrote, “patients who received CM [complementary medicine] were more likely to refuse additional CCT [conventional cancer treatment] and had a higher risk of death.”⁴

There is much to criticize regarding this report, including the definition of complementary medicine employed by the authors. Apparently, this paper’s reviewers and the journal editor were impressed with this P value and ignored the profoundly small, highly biased sample size and patient population and the highly debatable conclusions.

Whether it was Mark Twain or Benjamin Disraeli who said it first, there remains much truth to the expression, “There are three kinds of lies: lies, damned lies, and statistics.”

References

Chavalarias D, Wallach JD, Li AHT, Ioannidis JPA. Evolution of reporting P values in the biomedical literature, 1990-2015. JAMA. 2016;315(11):1141-1148. doi:10.1001/jama.2016.1952
Kyriacou DN. The enduring evolution of the P value. JAMA. 2016;315(11):1113-1115. doi:10.1001/jama.2016.2152
Baker M. Statisticians issue warning on P values. Nature. 2016;531(7593):151. doi:10.1038/nature.2016.19503
Johnson SB, Park HS, Gross CP, Yu JB. Complementary medicine, refusal of conventional cancer therapy, and survival among patients with curable cancers. JAMA Oncol. 2018;4(10):1375-1381. doi:10.1001/jamaon-col2018.2487

Stay up to date on the most recent and practice-changing oncology data

Subscribe Now!

Metrics in Clinical Medicine and Clinical Trials Should Look Beyond the P Value

References

Newsletter

Related Content

The OncFive: Top Oncology Articles for the Week of 2/1

Outcomes With Bridging Therapy Correlate With Cilta-Cel Efficacy, Safety in Multiple Myeloma

Real-World Data Support Clinical Benefit With Lifileucel in Previously Treated Advanced Melanoma

Single-Center, Retrospective Data Show Low Rate of Lifileucel Infusion Following Referral in Advanced Melanoma

Long-Term Cilta-Cel Data Show Low Rates of PFS Events in Standard-Risk R/R Myeloma

Latest CME

Community Oncology Connections™: Beyond Primary End Points—Digging Into Randomized and Real-World Data to Guide Challenging Treatment Decisions in HR+/HER2− Metastatic Breast Cancer | Washington State Medical Oncology Society

A Breath of Strength: Managing Cancer Associated LEMS and Lung Cancer as One

Striking the Right Nerve: Managing Cancer Associated LEMS in Lung Cancer Patients

Show Me the Data™: Bridging Clinical Gaps Along the Continuum From Resectable, Early Stage to Advanced Gastric/Gastroesophageal Junction Cancers

Community Oncology Connections™: Beyond Primary End Points—Digging Into Randomized and Real-World Data to Guide Challenging Treatment Decisions in HR+/HER2− Metastatic Breast Cancer | Kentucky Society of Clinical Oncology

Community Oncology Connections™: Beyond Primary End Points—Digging Into Randomized and Real-World Data to Guide Challenging Treatment Decisions in HR+/HER2− Metastatic Breast Cancer | Indiana Oncology Society

19th Annual New York GU Cancers Congress™

Medical Crossfire®: Expert Interpretations of the Latest Data in CLL Management – Understanding the Impact of Optimal Treatment Selection on Patient Outcomes

Virtual Testing Board: Digging Deeper on Your Testing Reports to Elevate Patient Outcomes in Advanced Non–Small Cell Lung Cancer

Medical Crossfire® – From Diagnostic Dilemmas to Potential Treatment Breakthroughs: Exploring Novel Targets for Extrapulmonary Neuroendocrine Carcinomas

Addressing Unmet Needs in HER2+ Metastatic BTC

Community Practice Connections™: Tailored Treatment Approaches for Older Patients With Advanced HR+/HER2– Breast Cancer

Community Practice Connections™: Empowering Interventional Radiologists in the Emerging Era of Oncolytic Immunotherapies for Melanoma

GI Tumor Board—Applying Recent Advances in Biomarker Testing and Treatment in Metastatic Colorectal Cancer

Medical Crossfire®: Harnessing the Power of Modern Therapies in Newly Diagnosed Multiple Myeloma

Medical Crossfire®: Expert Perspectives on Targeting c-Met Overexpression and 𝘔𝘌𝘛 Genomic Alterations in NSCLC – Unveiling the Complexities of 𝘔𝘌𝘛 Dysregulation

PER Tumor Board®: Applying Recent Advances to Transform the Treatment Paradigm in SCLC—Expert Perspectives on New Approvals and Emerging Strategies

Cases & Conversations™: Transforming AML Care—Precision Strategies, Evolving Therapies, and Clinical Insights

Medical Crossfire®: Precision Medicine in Glioma Treatment — Integration of Molecular Profiling to Inform Targeted Therapies

Medical Crossfire®: Integrating Next-Generation Endocrine Targeting Therapies to Improve Outcomes for Patients With HR+/HER2- Breast Cancer

Cases and Conversations™: Sorting Through the Expanding Treatment Options for Patients with Relapsed/Refractory Multiple Myeloma

Medical Crossfire®: Improving Patient Outcomes in Myeloproliferative Neoplasms With Novel Therapeutic Approaches

Community Oncology Connections™: Optimizing SCLC Treatment Strategies and Managing Adverse Events Across Disease Stages | South Carolina

Personalized Management in NSCLC: Strategies for Early Detection, Molecular Testing, and Targeted Therapies | Kansas

Personalized Management in NSCLC: Strategies for Early Detection, Molecular Testing, and Targeted Therapies | Wyoming and Montana

Personalized Management in NSCLC: Strategies for Early Detection, Molecular Testing, and Targeted Therapies | New Mexico

Community Oncology Connections™: Optimizing SCLC Treatment Strategies and Managing Adverse Events Across Disease Stages | North Carolina

Live Tumor Board: Squamous Cell Carcinoma of the Head & Neck – Post-CRT Decisions in the Locally Advanced Setting

Community Practice Connections™: Optimizing Treatment Outcomes and Preserving Fertility in Premenopausal HR+ Breast Cancer

From Bench to Bedside: Paradigm Shifts in HER2+ Metastatic BTC Treatment

Proactive Adverse Event Management for HER2+ BTC Treatments

A Case-Guided Discussion on Managing Immune Thrombocytopenic Purpura (ITP)

Tumor Board: Expert Insights on Managing Classical 𝘌𝘎𝘍𝘙 Mutations, 𝘌𝘎𝘍𝘙 Exon 20 Insertions, and Atypical 𝘌𝘎𝘍𝘙 Mutations in Metastatic NSCLC

Evolving Treatment Strategies in Pancreatic Cancer: Current Standards, Emerging Targets, and the Role of Molecular Testing

Breast Cancer Tumor Board: Targeting TROP2 – Innovations in Triple-Negative Breast Cancer Treatment

Trending on OncLive

Long-Term Cilta-Cel Data Show Low Rates of PFS Events in Standard-Risk R/R Myeloma

Single-Center, Retrospective Data Show Low Rate of Lifileucel Infusion Following Referral in Advanced Melanoma

Real-World Data Support Clinical Benefit With Lifileucel in Previously Treated Advanced Melanoma

Dr Riedell on the Long-Term Efficacy of Tisa-Cel in R/R Follicular Lymphoma

FDA Updates Axi-Cel Label to Remove Limitation of Use in R/R PCNSL