Projects per year
Abstract
Background: Phenotyping, the process of systematically identifying and classifying conditions within clinical data, is a crucial first step in any data science work involving Electronic Health Records (EHRs). Traditional approaches require extensive manual annotation efforts and face challenges with scalability. Methods: We investigated the use of Large Language Models (LLMs) for zero-shot phenotyping of 20 prevalent chronic conditions based on synthetic patient summaries generated from real structured EHRs codes. We evaluated the performance of multiple LLMs, including GPT-4o, GPT-3.5, and LLaMA 3 models with 8-billion, 70-billion, and 405-billion parameters, comparing them against traditional rule-based methods. For the analysis we used a dataset of 1,000 patients from Hospital da Luz Lisboa. Results: GPT-4o outperformed both traditional rule-based methods and alternative LLMs, achieving superior recall (0.97) and macro-F1 score (0.92). Rule-based phenotyping, while highly precise (0.92), showed lower recall (0.36). The integration of rule-based methods with LLMs optimized phenotyping accuracy by targeting manual annotation efforts on discordant cases. Conclusion: Zero-shot learning with LLMs, particularly GPT-4o, offers a powerful and efficient approach for phenotyping chronic conditions from EHRs, significantly reducing the need for extensive labeled datasets while maintaining high accuracy and interpretability.
Original language | English |
---|---|
Article number | 110181 |
Number of pages | 17 |
Journal | Computers in Biology and Medicine |
Volume | 192 |
DOIs | |
Publication status | Published - Jun 2025 |
Keywords
- Large language models
- Multimorbidity
- Phenotyping
- Zero-shot learning
Fingerprint
Dive into the research topics of 'Zero-shot learning for clinical phenotyping: comparing LLMs and rule-based methods'. Together they form a unique fingerprint.Projects
- 1 Finished
-
CIIS: Center for Interdisciplinary Research in Health
Barros, M. (PI), Rosa, N. (Researcher), Correia, M. J. (Researcher), Caldas, A. C. (Researcher), Amado, J. C. (Researcher), Figueiredo, A. S. (Researcher), Esteves, A. C. (Researcher), Mineiro, A. (Researcher), Abreu, A. M. (Researcher), Duarte, A. S. (Researcher), Almeida, S. F. (Researcher), Correia, A. (Researcher), Moura, A. (Researcher), Almeida, A. (Researcher), Araújo, B. (Researcher), Moura-Netto, C. (Researcher), Ferrito, C. (Researcher), Pais-Vieira, C. (Researcher), Festas, C. (Researcher), Marques-Vieira, C. (Researcher), Catré, D. (Researcher), Nunes, E. (Researcher), Jesus, É. (Researcher), Ribeiro, F. (Researcher), Rosário, F. (Researcher), Fernandes, G. (Researcher), Rato, J. R. (Scholarship holder), Salgado, J. R. (Researcher), Neves-Amado, J. (Researcher), Amendoeira, J. (Researcher), Sá, L. (Researcher), Capelas, M. L. (Researcher), Vieira, M. M. (Researcher), Nunes, M. V. S. (Researcher), Cardoso, M. (Researcher), Veiga, N. J. (Researcher), Fonseca, P. (Researcher), Correia, P. N. (Researcher), Couto, P. (Researcher), Pontífice-Sousa, P. (Researcher), Ravasco, P. (Researcher), Carvalho, P. V. D. (Researcher), Alves, P. (Researcher), Melo, P. (Researcher), Silva, R. (Researcher), Canaipa, R. (Researcher), Noites, R. (Researcher), Rio, R. (Researcher), Almeida, S. (Researcher), Deodato, S. (Researcher), Caldeira, S. (Researcher), Silva, S. (Researcher), Borges, T. (Researcher), Silva, V. (Researcher), Charepe, Z. (Researcher), Rodrigues-Pires, F. (Volunteer), Veludo, F. (Researcher), Carmo, H. (Scholarship holder), Romeiro, J. (Student), Melo, M. (Researcher), Braga, M. (Researcher), Amaral, T. (Researcher), Moreira, M. A. (Researcher), Guerra, N. (Student), Santos, P. (Researcher), Paço, S. (Student), Lynce, S. (Scholarship holder), Miguel, S. (Student), Costa, T. (Researcher), Silva-Neves, V. (Researcher), Silva, A. (Researcher), Carvalho, A. R. (Researcher), Almeida, B. (Researcher), Figueiredo, C. (Researcher), Esteves, E. (Scholarship holder), Araújo, F. M. (Researcher), Garcia, J. G. (Researcher), Santos, L. (Researcher), Santos, N. M. D. (Researcher), Lopes, P. (Researcher), Bornes, R. (Researcher), Silva, R. (Researcher), Costa, S. (Researcher), Silva, S. M. (Researcher), Marques, T. (Researcher), Almeida, A. (Researcher), Santos, M. (Scholarship holder), Santos, P. (Researcher), Miguel, S. (Researcher), Mendes, K. (Researcher), Gomes, A. P. (Researcher), Henriques, J. M. P. (Researcher), Costa, H. A. F. P. (Researcher) & Ribeiro, P. (Researcher)
Fundação para a Ciência e a Tecnologia
1/01/20 → 31/12/24
Project: Research