TY - GEN
T1 - A multi-modal deep learning method for classifying chest radiology exams
AU - Nunes, Nelson
AU - Martins, Bruno
AU - Silva, Nuno André da
AU - Leite, Francisca
AU - Silva, Mário J.
N1 - Funding Information:
Authors from INESC-ID were partially supported through Fundação para a Ciência e Tecnologia (FCT), specifically through the INESC-ID multi-annual funding from the PIDDAC programme (UID/CEC/50021/2019). We gratefully acknowledge the support of NVIDIA Corporation, with the donation of the Titan Xp GPU used in the experiments.
Publisher Copyright:
© Springer Nature Switzerland AG 2019.
PY - 2019/8/30
Y1 - 2019/8/30
N2 - Non-invasive medical imaging techniques, such as radiography or computed tomography, are extensively used in hospitals and clinics for the diagnosis of diverse injuries or diseases. However, the interpretation of these images, which often results in a free-text radiology report and/or a classification, requires specialized medical professionals, leading to high labor costs and waiting lists. Automatic inference of thoracic diseases from the results of chest radiography exams, e.g. for the purpose of indexing these documents, is still a challenging task, even if combining images with the free-text reports. Deep neural architectures can contribute to a more efficient indexing of radiology exams (e.g., associating the data to diagnostic codes), providing interpretable classification results that can guide the domain experts. This work proposes a novel multi-modal approach, combining a dual path convolutional neural network for processing images with a bidirectional recurrent neural network for processing text, enhanced with attention mechanisms and leveraging pre-trained clinical word embeddings. The experimental results show interesting patterns, e.g. validating the high performance of the individual components, and showing promising results for the multi-modal processing of radiology examination data, particularly when pre-training the components of the model with large pre-existing datasets (i.e., a 10% increase in terms of the average value for the areas under the receiver operating characteristic curves).
AB - Non-invasive medical imaging techniques, such as radiography or computed tomography, are extensively used in hospitals and clinics for the diagnosis of diverse injuries or diseases. However, the interpretation of these images, which often results in a free-text radiology report and/or a classification, requires specialized medical professionals, leading to high labor costs and waiting lists. Automatic inference of thoracic diseases from the results of chest radiography exams, e.g. for the purpose of indexing these documents, is still a challenging task, even if combining images with the free-text reports. Deep neural architectures can contribute to a more efficient indexing of radiology exams (e.g., associating the data to diagnostic codes), providing interpretable classification results that can guide the domain experts. This work proposes a novel multi-modal approach, combining a dual path convolutional neural network for processing images with a bidirectional recurrent neural network for processing text, enhanced with attention mechanisms and leveraging pre-trained clinical word embeddings. The experimental results show interesting patterns, e.g. validating the high performance of the individual components, and showing promising results for the multi-modal processing of radiology examination data, particularly when pre-training the components of the model with large pre-existing datasets (i.e., a 10% increase in terms of the average value for the areas under the receiver operating characteristic curves).
KW - Classification of radiology exams
KW - Deep learning
KW - Learning from multi-modal data
KW - Machine learning in medicine
UR - http://www.scopus.com/inward/record.url?scp=85072899138&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-30241-2_28
DO - 10.1007/978-3-030-30241-2_28
M3 - Conference contribution
AN - SCOPUS:85072899138
SN - 9783030302405
SN - 9783030302412
VL - 1
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 323
EP - 335
BT - Progress in artificial intelligence
A2 - Oliveira, Paulo Moura
A2 - Novais, Paulo
A2 - Reis, Luís Paulo
PB - Springer Verlag
T2 - 19th EPIA Conference on Artificial Intelligence, EPIA 2019
Y2 - 3 September 2019 through 6 September 2019
ER -