1- # Asclepios AI: Optimizing Substance Use Disorder Treatment
1+ # ** Asclepios AI: Data-Driven Treatment Optimization for Substance Use Disorder (SUD) **
22
3- Welcome to the Asclepios AI project! This repository contains all the work
4- done by our team to develop an AI-powered platform that helps substance use
5- disorder (SUD) treatment facilities optimize patient care and resource
6- allocation.
3+ Asclepios AI is a data-driven exploration into how machine learning,
4+ domain knowledge, and thoughtful analysis can help improve outcomes for
5+ individuals undergoing Substance Use Disorder (SUD) treatment. Using real
6+ admissions and treatment episode data, our goal is to identify patterns
7+ that can help facilities personalize treatment duration, improve completion
8+ rates, and reduce relapse/readmission.
79
8- ## 🌟 What We're Building
10+ ## Our Team
11+
12+ This project is being developed by a collaborative team:
13+
14+ - ** Caesar Ghazi** ([ @CaesarGhazi ] ( https://github.com/CaesarGhazi ) )
15+ - ** Moe Alwathiq** ([ Moe-phantom] ( https://github.com/Moe-phantom ) )
16+ - ** Rafaa Ali** ([ @RafaaAli ] ( https://github.com/RafaaAli ) )
17+ - ** Wuor Bhang** ([ @WuorBhang ] ( https://github.com/WuorBhang ) )
18+
19+ ## ** Primary Research Question**
20+
21+ How accurately can patient demographics, history, and treatment factors
22+ predict an optimal treatment duration that minimizes relapse risk in Substance
23+ Use Disorder treatment?
24+
25+ ## Problem Statement
26+
27+ Despite the critical importance of completing adequate treatment for
28+ sustained recovery, SUD facilities lack systematic approaches to personalizing
29+ treatment length based on patient characteristics and predicted outcomes.
30+ Approximately ** 30% of patients are readmitted within one year** , suggesting
31+ many individuals either receive insufficient treatment or discontinue prematurely.
32+
33+ At the same time, treatment facilities struggle with capacity planning. They
34+ often cannot effectively match the timing and volume of new admissions with
35+ available beds and staff. This mismatch leads to under-used treatment slots,
36+ prolonged waitlists, and reduced access to care.
37+
38+ Asclepios AI aims to use data-driven insights to address these challenges.
39+
40+ ## What We're Building
941
1042We're creating a smart system that helps treatment centers:
1143
@@ -17,16 +49,7 @@ Think of it like a personalized recommendation system - just like how Netflix
1749recommends movies you might like, our system recommends the optimal treatment
1850plan for each patient based on their unique situation.
1951
20- ## 👥 Our Team
21-
22- This project is being developed by a collaborative team:
23-
24- - ** Caesar Ghazi** ([ @CaesarGhazi ] ( https://github.com/CaesarGhazi ) )
25- - ** Moe Alwathiq** ([ Moe-phantom] ( https://github.com/Moe-phantom ) )
26- - ** Rafaa Ali** ([ @RafaaAli ] ( https://github.com/RafaaAli ) )
27- - ** Wuor Bhang** ([ @WuorBhang ] ( https://github.com/WuorBhang ) )
28-
29- ## 🎯 Why This Matters
52+ ## Why This Matters
3053
3154Substance use disorders affect millions of people worldwide and cost society
3255billions of dollars each year. Unfortunately:
@@ -39,61 +62,129 @@ billions of dollars each year. Unfortunately:
3962Our AI system aims to solve these problems by using data to make better
4063predictions about what each patient needs.
4164
42- ## 📁 Repository Structure
65+ ## Data Overview
4366
44- Our work is organized in a step-by-step approach:
67+ We use publicly available SUD admissions and treatment episode datasets,
68+ which include variables such as:
4569
46- ``` /
47- ├── 0_domain_study/ # Research about SUD treatment and AI applications
48- ├── 1_datasets/ # The data we're using to train our AI models
49- ├── 2_data_preparation/ # Cleaning and organizing the data
50- ├── 3_data_exploration/ # Understanding patterns in the data
51- ├── 4_data_analysis/ # Building and testing our AI models
52- ├── 5_communication_strategy/ # How we'll share our findings with others
53- ├── 6_final_presentation/ # Our final presentation materials
54- └── collaboration/ # Team agreements, schedules, and reflections
70+ - Patient demographics
71+ - Substance use patterns
72+ - Co-occurring issues
73+ - Treatment history
74+ - Treatment length
75+ - Completion status
76+ - Readmission / relapse factors
5577
56- ```
78+ ### Repository Data Folders
5779
58- ## 📊 Data Sources
80+ - 📁 ** Raw Data:** [ ` 1_datasets/raw/ ` ] ( ./1_datasets/raw/ )
81+ - 📁 ** Processed Data:** [ ` 1_datasets/processed/ ` ] ( ./1_datasets/processed/ )
82+ - 📁 ** Sample Data:** [ ` 1_datasets/sample/ ` ] ( ./1_datasets/sample/ )
83+ - 📄 ** Data Documentation:** [ ` 1_datasets/README.md ` ] ( ./1_datasets/README.md )
5984
60- We're using publicly available datasets from reputable sources:
85+ ---
86+
87+ ## Data Preparation
88+
89+ This step includes:
90+
91+ - Cleaning and formatting raw datasets
92+ - Handling missing or inconsistent data
93+ - Encoding categorical variables
94+ - Normalizing and transforming features
95+ - Structuring datasets for modeling
96+
97+ 📁 Folder:
98+ ** [ ` 2_data_preparation/ ` ] ( ./2_data_preparation/ ) **
99+
100+ ---
61101
62- - ** Treatment Episode Data Set (TEDS)** : Information about admissions to
63- substance abuse treatment facilities
64- - ** National Survey on Drug Use and Health (NSDUH)** : Annual survey data about
65- drug use and mental health in the U.S.
102+ ## Exploratory Data Analysis (EDA)
66103
67- All data is handled ethically and with respect for patient privacy.
104+ We explore:
68105
69- ## 🚀 How to Use This Repository
106+ - Relationships between treatment factors and outcomes
107+ - Duration patterns across substance types
108+ - Demographic and behavioral trends
109+ - Variables most correlated with relapse or completion
110+ - Facility-level patterns affecting patient success
70111
71- If you're interested in our work, you can:
112+ 📁 Folder:
113+ ** [ ` 3_data_exploration/ ` ] ( ./3_data_exploration/ ) **
72114
73- 1 . ** Explore our research** in the [ 0_domain_study] ( ./0_domain_study/ ) folder
74- 2 . ** See our data** in the [ 1_datasets] ( ./1_datasets/ ) folder
75- 3 . ** Learn about our approach** by reading the README files in each folder
76- 4 . ** Follow our progress** through our collaboration documents
115+ ---
77116
78- Each folder contains a README.md file that explains what's in that section in
79- plain language.
117+ ## Modeling & Analysis
80118
81- ## 🤝 Want to Contribute?
119+ We will build models to:
82120
83- While this is primarily a student project, we welcome feedback and
84- suggestions! Feel free to:
121+ - Predict relapse likelihood
122+ - Identify high-risk patients
123+ - Predict facility resource demand
124+ - Evaluate model fairness and robustness
85125
86- - Open an issue if you have questions or ideas
87- - Fork the repository to experiment with our approach
88- - Contact any of our team members with questions
126+ Analysis includes:
89127
90- ## 📚 Learn More
128+ - Feature engineering
129+ - Training and validation
130+ - Model comparison
131+ - Interpretability
91132
92- - Read about our research approach in [ 0_domain_study/README.md] ( ./0_domain_study/README.md )
93- - See what datasets we're using in [ 1_datasets/README.md] ( ./1_datasets/README.md )
94- - Check out our team agreements in the [ collaboration] ( ./collaboration/ ) folder
133+ 📁 Folder:
134+ ** [ ` 4_data_analysis/ ` ] ( ./4_data_analysis/ ) **
95135
96136---
97137
98- * This project is part of the Emerging Talent 6 Collaborative Data Science
99- Project*
138+ ## Communicating Results
139+
140+ Our communication will include:
141+
142+ - Insight summaries
143+ - Visual dashboards or graphs
144+ - Interpretations for clinicians and decision-makers
145+ - Recommendations based on model findings
146+ - Limitations and ethical considerations
147+
148+ 📁 Folder:
149+ ** [ ` 5_communication_strategy/ ` ] ( ./5_communication_strategy/ ) **
150+
151+ ---
152+
153+ ## Final Presentation
154+
155+ Our final deliverables will recap the entire analysis with:
156+
157+ - A structured summary of findings
158+ - Visualizations
159+ - Model results
160+ - Recommendations for treatment facilities
161+ - Reflections and next steps
162+
163+ 📁 Folder:
164+ ** [ ` 6_final_presentation/ ` ] ( ./6_final_presentation/ ) **
165+
166+ ---
167+
168+ ## Repository Structure
169+
170+ Below is the full layout of the project repository:
171+
172+ ``` text
173+ Asclepios_Ai/
174+ │
175+ ├── 0_domain_study/
176+ ├── 1_datasets/
177+ │ ├── processed/
178+ │ ├── sample/
179+ │ └── raw/
180+ ├── 2_data_preparation/
181+ ├── 3_data_exploration/
182+ ├── 4_data_analysis/
183+ ├── 5_communication_strategy/
184+ └── 6_final_presentation/
185+ ```
186+
187+ ### License
188+
189+ This project is licensed under the ** MIT License** .
190+ 📄 [ View License] ( ./LICENSE )
0 commit comments