CODEX Digest - 5.21.26

Banner

Want this delivered straight to your inbox every Thursday? Subscribe now.

This week's digest features a review analyzing the cost-effectiveness of diagnostic AI in radiology (#3), a review on the risks of overdiagnosis with wearables (#5), and an interview study of diagnostic overshadowing for individuals with ill mental health (#11). 

Titles link to the PubMed record or free-to-access sites with full text availability.

1) Premature closure underlies bias in medical diagnosis in students: A randomised controlled experiment.

Al Essa A, Schmidt HG, Mamede S, et al. Med Educ. Epub 2026 Apr 28.

Affected cognition is known to bias information and contribute to diagnostic error. This study examines how introducing distracting, believable information influences diagnostic accuracy in UAE medical students. When such information was provided early in the decision process, students made more diagnostic errors and processed information less thoroughly, indicating premature closure. The results find reduced processing time only when biased information appeared early in the decision making, highlighting its impact on error rates.

2) Performance of a large language model on the reasoning tasks of a physician.

Brodeur PG, Buckley TA, Kanjee Z, et al. Science. 2026;392(6797):524-527.

Clinical reasoning is a useful benchmark for LLM task performance. This study shows that OpenAI's o1-preview model outperforms other LLMs in diagnostic and management reasoning, particularly generating differential diagnoses. The study included a real-world comparison of the LLM and human clinicians in emergency department patients. The LLMs performance in probabilistic reasoning and triage remains similar to previous models, suggesting strengths in complex critical thinking but no improvement in probability-based tasks. Based on these results, the authors advocate for prospective clinical trials to measure the impact of LLMs on diagnostic errors and other patient-centered outcomes.

Read more from this article inThe San Francisco Chronicle featuring CODEX Director Sumant Ranji, MD. 

3) Economic evaluations of AI applications in radiology: a systematic review.

Gregory L, Lock F, Harvey H, et al. Eur Radiol. Epub 2026 Feb 20.

AI in radiology has potential to improve diagnostic excellence, but limited cost-effectiveness data slows adoption. This review analyzes economic evaluations of radiology AI, highlighting inconsistent outcome measures and calling for standard metrics to enhance comparability, research, and responsible implementation in imaging services.

4) Clinicians' rationale for editing ambient AI-drafted clinical notes: persistent challenges and implications for improvement.

Guo Y, Hu D, Yang Z, et al. J Am Med Inform Assoc. Epub 2026 Apr 27.

Ambient AI documentation tools create initial clinical note drafts from patient-provider conversations, which clinicians review and edit before adding to the EHR. Understanding clinicians' modifications is important for refining AI and improving efficiency. This study finds that changes were mainly made to enhance clinical accuracy, diagnostic assessment, specificity and history, specialty precision, and comply with medico-legal, billing, and documentation requirements.

5) Overdiagnosis in atrial fibrillation screening with wearables.

Haase CB, Jensen AE, Modin FA, et al. Scand J Prim Health Care. 2026;44(1):2656694.

Wearable technologies are increasingly used to detect atrial fibrillation (AF) outside clinical settings. This review finds that continuous screening with wearables nearly tripled AF diagnoses in asymptomatic, high-risk individuals, leading to more treatment but no clear improvement in outcomes. The findings suggest overdiagnosis and emphasize the need for careful integration of wearable technologies in primary care.

6) What shapes patient experience of diagnosis in the emergency department? Findings from a mixed-methods study.

Kuhn D, Dukhanin V, Gleason KT. Diagnosis (Berl). Epub 2026 Apr 23.

Patients often report better diagnostic experiences when imaging is performed, though the reasons remain unclear. This mixed-methods study uses the PRIME-ED (Patient-Reported Measure of Diagnostic Excellence in the Emergency Department) tool to assess patient perceptions of diagnosis and communication in one academic emergency department. Most patients valued thorough evaluations, while negative feedback centered on unresolved symptoms and a desire for additional testing.

7) Barriers to leveraging patient safety event reviews for organizational learning, improvement, and diagnostic error reduction: results of a single academic center survey study.

Luty JT, Corby S, Jungbauer R, et al. J Gen Intern Med. Epub 2026 Apr 29.

Reducing diagnostic error may require more standardized safety event reviews and better data sharing across health systems. This survey finds that diagnostic errors were commonly identified in safety reporting databases, but less consistently across other event reviews, with few systems tracking lessons across categories. The authors suggest more consistent review practices and shared databases could improve learning and reduce patient harm.

8) Patient portal usability gaps: a heuristic evaluation of two major EHR systems.

Milicia A, Spaar PA, Bocknek LS, et al. Appl Clin Inform. Epub Apr 18.

Patient portals give patients access to diagnoses, test results, and clinical communication, but they often inherit EHR usability issues that hinder navigation and interpretation. This assessment of two portals found gaps in critical functions like diagnosis display and provider messaging—flaws that may reduce patient understanding and engagement. While most issues were minor, their accumulation in high-risk functions could threaten diagnostic safety.

9) Patient-centered, family-based approach to disclosure of cancer diagnosis--a culturally responsive model.

Mori M, Suh S-Y, Cheng S-Y. JAMA Netw Open. 2026;9(4):e269959.

Cancer diagnosis disclosure is a process over time, and involving families in care is crucial. This commentary reviews a Chinese study with global relevance, revealing the complexities of nondisclosure as families and clinicians decide what to share or withhold. Individualized care, cultural humility, contextual awareness, and understanding of patient and family perspectives are vital. Interdisciplinary collaboration helps explore illness perceptions and preferences about cancer diagnosis communication.

10) Risk stratified breast cancer screening: early outcomes and psychological impact from PRSONAL - a randomized clinical trial.

Pedersen LH, Bigaard J, Vejborg I, et al. Breast. 2026;86:104743.

Risk-stratified cancer screening can reduce overdiagnosis in low-risk women, improve early detection in high-risk groups, and lower costs, though implementation is difficult. This trial shows that automated risk-based screening is feasible and acceptable with minimal impact on short-term psychological well-being, but these early results should be viewed cautiously since the main outcome will be measured after an additional 17 months of follow-up.

11) "Although I'm mentally ill, that doesn't mean that I'm not also physically ill" - barriers, facilitators and diagnostic overshadowing in healthcare for individuals with lived experience of mental ill-health.

Schäfer K, Mües HM, Wrzesińska MA, et al. Front Public Health. 2026;14:1739409.

People with lived experience of mental ill-health (PWLE) encounter health care inequities, including stigma, discrimination, and diagnostic overshadowing. This interview study examines factors affecting the experience of PWLE across four European health systems. Three key themes emerged from the analysis highlighting systemic, personal, and diagnostic shadowing influences affecting PWLE care. Across all countries represented, patients reported not being taken seriously and facing stigma and discrimination in healthcare settings.

12) Laypeople's perceptions of clinician performance metrics based on cancer screening attendance.

Scherer LD, Lewis CL, Cappella JN, et al. JAMA Health Forum. 2026;7(4):e260714.

Patients’ views of clinician screening metrics can influence their willingness to be screened, follow up, or discuss results, affecting diagnosis. Survey results show half disagreement with metrics based on screening attendance, two-thirds opposed financial incentives, and most preferred physician scores tied to patient-centered discussions over care receipt.

About the CODEX Digest

Stay current with the CODEX Digest, which cuts through the noise to bring you a list of recent must-read publications handpicked by the Learning Hub team. Each edition features timely, relevant, and impactful journal articles, books, reports, studies, reviews, and more selected from the broader CODEX Collection—so you can spend less time searching and more time learning.

Get the latest in diagnostic excellence, curated and delivered straight to your inbox every week:

Subscribe Now

See past digests here