TY - JOUR
T1 - Characterizing shared and distinct symptom clusters in common chronic conditions through natural language processing of nursing notes
AU - Koleck, Theresa A.
AU - Topaz, Maxim
AU - Tatonetti, Nicholas P.
AU - George, Maureen
AU - Miaskowski, Christine
AU - Smaldone, Arlene
AU - Bakken, Suzanne
N1 - Publisher Copyright:
© 2021 Wiley Periodicals LLC
PY - 2021/12
Y1 - 2021/12
N2 - Data-driven characterization of symptom clusters in chronic conditions is essential for shared cluster detection and physiological mechanism discovery. This study aims to computationally describe symptom documentation from electronic nursing notes and compare symptom clusters among patients diagnosed with four chronic conditions–chronic obstructive pulmonary disease (COPD), heart failure, type 2 diabetes mellitus, and cancer. Nursing notes (N = 504,395; 133,977 patients) were obtained for the 2016 calendar year from a single medical center. We used NimbleMiner, a natural language processing application, to identify the presence of 56 symptoms. We calculated symptom documentation prevalence by note and patient for the corpus. Then, we visually compared documentation for a subset of patients (N = 22,657) diagnosed with COPD (n = 3339), heart failure (n = 6587), diabetes (n = 12,139), and cancer (n = 7269) and conducted multiple correspondence analysis and hierarchical clustering to discover underlying groups of patients who have similar symptom profiles (i.e., symptom clusters) for each condition. As expected, pain was the most frequently documented symptom. All conditions had a group of patients characterized by no symptoms. Shared clusters included cardiovascular symptoms for heart failure and diabetes; pain and other symptoms for COPD, diabetes, and cancer; and a newly-identified cognitive and neurological symptom cluster for heart failure, diabetes, and cancer. Cancer (gastrointestinal symptoms and fatigue) and COPD (mental health symptoms) each contained a unique cluster. In summary, we report both shared and distinct, as well as established and novel, symptom clusters across chronic conditions. Findings support the use of electronic health record-derived notes and NLP methods to study symptoms and symptom clusters to advance symptom science.
AB - Data-driven characterization of symptom clusters in chronic conditions is essential for shared cluster detection and physiological mechanism discovery. This study aims to computationally describe symptom documentation from electronic nursing notes and compare symptom clusters among patients diagnosed with four chronic conditions–chronic obstructive pulmonary disease (COPD), heart failure, type 2 diabetes mellitus, and cancer. Nursing notes (N = 504,395; 133,977 patients) were obtained for the 2016 calendar year from a single medical center. We used NimbleMiner, a natural language processing application, to identify the presence of 56 symptoms. We calculated symptom documentation prevalence by note and patient for the corpus. Then, we visually compared documentation for a subset of patients (N = 22,657) diagnosed with COPD (n = 3339), heart failure (n = 6587), diabetes (n = 12,139), and cancer (n = 7269) and conducted multiple correspondence analysis and hierarchical clustering to discover underlying groups of patients who have similar symptom profiles (i.e., symptom clusters) for each condition. As expected, pain was the most frequently documented symptom. All conditions had a group of patients characterized by no symptoms. Shared clusters included cardiovascular symptoms for heart failure and diabetes; pain and other symptoms for COPD, diabetes, and cancer; and a newly-identified cognitive and neurological symptom cluster for heart failure, diabetes, and cancer. Cancer (gastrointestinal symptoms and fatigue) and COPD (mental health symptoms) each contained a unique cluster. In summary, we report both shared and distinct, as well as established and novel, symptom clusters across chronic conditions. Findings support the use of electronic health record-derived notes and NLP methods to study symptoms and symptom clusters to advance symptom science.
UR - http://www.scopus.com/inward/record.url?scp=85116880099&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85116880099&partnerID=8YFLogxK
U2 - 10.1002/nur.22190
DO - 10.1002/nur.22190
M3 - Article
C2 - 34637147
AN - SCOPUS:85116880099
SN - 0160-6891
VL - 44
SP - 906
EP - 919
JO - Research in Nursing and Health
JF - Research in Nursing and Health
IS - 6
ER -