|
" Patients Harm Classification and Prediction Through Natural Language Processing in a Cancer Medical Center "
Salazar Reyna, Roberto Jesus
Khasawneh, Mohammad T.
Document Type
|
:
|
Latin Dissertation
|
Language of Document
|
:
|
English
|
Record Number
|
:
|
1054612
|
Doc. No
|
:
|
TL53729
|
Main Entry
|
:
|
Salazar Reyna, Roberto Jesus
|
Title & Author
|
:
|
Patients Harm Classification and Prediction Through Natural Language Processing in a Cancer Medical Center\ Salazar Reyna, Roberto JesusKhasawneh, Mohammad T.
|
College
|
:
|
State University of New York at Binghamton
|
Date
|
:
|
2020
|
Degree
|
:
|
M.Eng.
|
student score
|
:
|
2020
|
Note
|
:
|
106 p.
|
Abstract
|
:
|
This research addresses the problem of predicting adverse events and patient harm in cancer medical centers. Unintentional harm on patients has negative consequences for multiple parties, including patients and their families, healthcare providers, hospitals, and healthcare organizations. Therefore, identifying in advance patients that are at higher risk of suffering an adverse event is necessary to prevent harm, avoid unnecessary costs and provide safer care services. This can be achieved by employing natural language processing and machine learning algorithms to predict harm on patients based on structured and unstructured medical data. Most previous research works that addressed this problem have built and evaluated a single machine learning model using a single source of medical text. For this purpose, this research builds, evaluates and compares the performance of multiple machine learning models using three different sources of medical text (admission notes, final nursing assessment, and discharge notes) to determine the most reliable source to predict harm. Over 27,000 records were used to build the classification models. The experimental results suggested the final nursing assessment as the best source of medical text to predict harm. The best prediction model implemented 10 folds cross-validation, TF-IDF bag of words, chi-square feature selection, ADASYN over sampling technique, and a random forest classifier. This model outperformed the others by achieving 85% accuracy, 95% precision, 75% recall and 85% F1-scores. Results suggests that predicting adverse event and harm is possible through the use of natural language processing and machine learning algorithms.
|
Descriptor
|
:
|
Artificial intelligence
|
|
:
|
Computer engineering
|
|
:
|
Health care management
|
|
:
|
Health sciences
|
|
:
|
Industrial engineering
|
|
:
|
Medical ethics
|
|
:
|
Technical communication
|
Added Entry
|
:
|
Khasawneh, Mohammad T.
|
Added Entry
|
:
|
State University of New York at Binghamton
|
| |