Quantitative disease risk scores for common diseases, with applications to eMERGE

Project: Research project

Project Details

Description

Summary Labeling clinical data from electronic health records (EHR) in health systems requires extensive knowledge of human expert, is time-consuming, and leads to inconsistencies in case de nitions across di erent phenotyping algorithms. There is increased recognition that common diseases are not discrete entities but rather reside on a continuum. We propose here to take advantage of rich phenotype data in electronic health records, and propose quantitative disease risk scores based on unsupervised methods that require minimal input from clinicians. We will implement the proposed methods into R packages to be made available to the scienti c community. Fur- thermore, we propose applications to phenotypic and genomic data on approximately 100,000 individuals in the eMERGE network, and 500,000 individuals in the UK biobank. We will design a website containing the results of these analyses, including summary statistics from the GWAS analyses for these phenotypes. We believe the proposed research is very timely and novel, and has the potential to facilitate genomic research using rich phenotype data in electronic health records in general.
StatusFinished
Effective start/end date9/8/218/31/23

Funding

  • National Human Genome Research Institute: US$445,500.00

ASJC Scopus Subject Areas

  • Health Informatics

Fingerprint

Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.