Detalles del proyecto
Description
The goal of this project is to leverage deep-learning algorithms on Electronic Health Records (EHRs) to
improve the early detection of pancreatic ductal adenocarcinoma (PDAC), a malignancy with high mortality
and morbidity. Compared to other major types of cancer in the U.S. (e.g., colorectal, prostate, breast, lung),
PDAC has a uniquely high mortality with a 5-year survival of only 3%, largely due to the late stage of diagnosis
and the aggressiveness of the malignancy. In this K25 application, we aim to develop novel structured
methodologies for systematically incorporating human expert domain knowledge into the training procedure of
deep-learning algorithms (“Human-in-the-Loop” approach) for improving the early detection of PDAC. The
overarching hypothesis for this study, which has already been demonstrated in numerous other contexts, is
that the “Human-in-the-Loop” approach imbues the deep learning in the PDAC prediction model with expert
domain knowledge (e.g., clinical work-flows, statistical knowledge) to result in improved model performance as
well as interpretability of results.
The proposed research will accomplish three aims. In Aim 1, we will build preprocessing pipelines that
are generalizable for analyzing multimodal data from different data collection systems (national, state, and
institutional). The resultant pipelines will provide multimodal EHR deep embeddings optimized for deep
learning applications in Aim 2 & 3. In Aim 2, we will investigate feature grouping strategies relying on
information from clinical workflows and incorporate them into the deep learning prediction model. The
proposed model will provide new clinical predictors represented by single or composite variables according to
the grouping strategies. Examples in the literature show that such grouped predictors consistently have
superior predictive power compared to their individual components. In Aim 3 we will study causal relationships
between patient variables (including composite variables discovered in Aim 2) and the PDAC risk. We will use
the Human-in-the-Loop approach where the possible causal relationships suggested from the deep learning
model will be evaluated and corrected by human experts (e.g., clinicians, statisticians), to construct faithful
Causal Bayesian Networks (CBNs) visualizing causal pathways from patient variables to PDAC risk. The
resultant CBNs will be used as a framework for developing a risk assessment questionnaire to collect Patient-
Generated Health Data (PGHD), which will be further evaluated and optimized in my future R01 focused on the
development of a mobile survey application to efficiently collect PGHD and improve the early detection of
PDAC.
This proposal can potentially lead to new criteria for identifying high-risk patients for PDAC and inform
targeted screening practices, that will likely be generalizable to other types of cancer.
Estado | Finalizado |
---|---|
Fecha de inicio/Fecha fin | 5/1/23 → 4/30/24 |
Keywords
- Investigación sobre el cáncer
- Inteligencia artificial
- Oncología
Huella digital
Explore los temas de investigación que se abordan en este proyecto. Estas etiquetas se generan con base en las adjudicaciones/concesiones subyacentes. Juntos, forma una huella digital única.