Predict Likelihood of Completion for Future Lifestyle Medicine Program

From REU@MU
Jump to: navigation, search

Student Researcher: Jennifer Sailor

Mentor: Ms. Olga Kozlova & Dr. Praveen Madiraju

Project Description :

The United States is in a dubious position: it spends more on per capita health care than any other country, has the highest burden of adults with chronic disease, and ranks first in avoidable deaths. Ninety percent of its annual healthcare dollars are spent in treatment of chronic diseases and their complications. Proactive disease prevention and wellness is essential to reversing these healthcare trends and to improving the health outcomes of our communities. This project will be based on the future Lifestyle Medicine Program at one of the largest integrated health systems in the Midwest. Lifestyle Medicine is a multidisciplinary approach to Health and Wellness that enables individuals to cardinally change their lifestyle to improve or even eliminate a chronic condition and enhance their wellbeing long-term.

Lifestyle Medicine targets specific cohorts of patient population: individuals with the current history of diabetes mellitus, hyperlipidemia, hypertension, BMI of 25-40, and prediabetes. In addition to the data on medical conditions, this project will use simulated data of psycho-demographic variables known to play a role in a) likelihood for self-selection into comparable programs; b) likelihood for completion of the program in its entirety. The project will rely on the ample published literature and structured interviews with subject matter experts such as physicians certified in Lifestyle Medicine.

Project Goals :

  • Gain understanding of Lifestyle Medicine and how it works to prevent or reverse target conditions.
  • Study relevant literature to create a set of psycho-demographic variables that can serve as reliable predictors for self-selection for and completion of the Program.
  • Conduct interviews with SMEs to validate simulated data.
  • Create simulated data on the basis of the literature review and SME interviews.
  • Perform data cleaning, wrangling, and feature engineering on the dataset.
  • Complete exploratory data analysis and data visualization on the dataset.
  • Implement unsupervised? machine learning models to predict self-selection and completion.
  • Evaluate and compare the machine learning models using quantitative measures: accuracy, precision, recall, area under the curve (AUC).
  • Make conclusions about how the model can be used in the future Lifestyle Medicine Program to target individuals for enrollment.

Tentative Milestones and Goals

Week Description
Week 1: Orientation
  • Attend REU orientation
  • Attend Data Science Boot Camp
  • Discuss the project with mentors
  • Set milestones and goals for the project duration
Week 2: Study Relevant Work
  • Take course on Responsible Research Conduct
  • Take CITI training for healthcare data
  • Study literature to create a set of psycho-demographic variables
Week 3: Study Relevant Work
  • Study literature to create a set of psycho-demographic variables to use as reliable predictors
  • Start to prepare final paper
Week 4: Simulate Data
  • Conduct interviews with SMEs to validate simulated data
  • Create simulated data
  • Start to prepare midway presentation
Week 5: Presentation
  • Give midway presentation
Week 6: Start Writing Paper
  • Complete the first part of the paper
  • Start to develop poster
  • Perform Data Cleaning on dataset
Week 7: Cleaning and Visualization
  • Perform Data Cleaning, Wrangling, and feature engineering on the dataset
  • Data Visualization and Data Analysis of dataset
Week 8: Machine Learning Models
  • Implement machine learning models
Week 9: Conclusions
  • Evaluate and Compare machine learning modules
  • Make conclusions
Week 10: Presenting Research
  • Present at poster session
  • Prepare and give oral presentation
  • Finish and submit final paper