Natural Language Processing to Extract Information from Portuguese-Language Medical Records

da Rocha, Naila Camila [UNESP]; Barbosa, Abner Macola Pacheco [UNESP]; Schnr, Yaron Oliveira [UNESP]; Machado-Rugolo, Juliana [UNESP]; de Andrade, Luis Gustavo Modelli [UNESP]; Corrente, José Eduardo; de Arruda Silveira, Liciana Vaz [UNESP]

Date

2023-01-01

Abstract

Studies that use medical records are often impeded because the information is recorded in narrative fields. However, recent studies have used artificial intelligence to extract and process secondary health data from electronic medical records. The aim of this study was to develop a neural network that uses data from unstructured medical records to capture information regarding symptoms, diagnoses, medications, conditions, exams, and treatment. Data from 30,000 medical records of patients hospitalized in the Clinical Hospital of the Botucatu Medical School (HCFMB), São Paulo, Brazil, were obtained, creating a corpus of 1,200 clinical texts. A natural language algorithm was used for text extraction and convolutional neural networks for pattern recognition, and the model was evaluated with goodness-of-fit indices. The results showed good accuracy, considering the complexity of the model, with an F-score of 63.9% and a precision of 72.7%. The patient condition class reached a precision of 90.3% and the medication class reached 87.5%. The proposed neural network will facilitate the detection of relationships between diseases and symptoms and of prevalence and incidence, as well as the identification of clinical conditions, disease evolution, and the effects of prescribed medications.
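
To illustrate the evaluation figures quoted in the abstract, the following is a minimal sketch of how entity-level precision, recall, and F-score are commonly computed for named entity recognition. It is not the authors' code: the Entity type, the label names, and the toy annotations are hypothetical, and exact span-and-label matching is an assumption, since the abstract does not state how matches were counted.

```python
from typing import NamedTuple, Set, Tuple


class Entity(NamedTuple):
    """A labeled text span (hypothetical representation, not from the paper)."""
    start: int
    end: int
    label: str


def precision_recall_f1(gold: Set[Entity], predicted: Set[Entity]) -> Tuple[float, float, float]:
    """Score predictions against gold annotations using exact span+label matches."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # Balanced F-score: harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Made-up annotations for a single clinical text (illustrative only).
gold = {
    Entity(0, 5, "SYMPTOM"),
    Entity(10, 21, "MEDICATION"),
    Entity(30, 38, "DIAGNOSIS"),
    Entity(40, 45, "EXAM"),
}
predicted = {
    Entity(0, 5, "SYMPTOM"),
    Entity(10, 21, "MEDICATION"),
    Entity(30, 38, "CONDITION"),  # right span, wrong label: counted as an error
}

precision, recall, f1 = precision_recall_f1(gold, predicted)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

In this toy case two of the three predicted entities are correct and two of the four gold entities are found, so precision exceeds recall and the F-score falls between the two. A precision that is higher than the F-score, as reported in the abstract, is the same pattern.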

Keywords

medical records, named entity recognition, neural networks

How to cite

Data, v. 8, n. 1, 2023.

URI

http://hdl.handle.net/11449/246711

Collections

Articles
