Skip to content

Commit 0112010

Browse files
committed
the initial draft of the main readme
1 parent 77c5417 commit 0112010

1 file changed

Lines changed: 146 additions & 55 deletions

File tree

README.md

Lines changed: 146 additions & 55 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,43 @@
1-
# Asclepios AI: Optimizing Substance Use Disorder Treatment
1+
# **Asclepios AI: Data-Driven Treatment Optimization for Substance Use Disorder (SUD)**
22

3-
Welcome to the Asclepios AI project! This repository contains all the work
4-
done by our team to develop an AI-powered platform that helps substance use
5-
disorder (SUD) treatment facilities optimize patient care and resource
6-
allocation.
3+
Asclepios AI is a data-driven exploration into how machine learning,
4+
domain knowledge, and thoughtful analysis can help improve outcomes for
5+
individuals undergoing Substance Use Disorder (SUD) treatment. Using real
6+
admissions and treatment episode data, our goal is to identify patterns
7+
that can help facilities personalize treatment duration, improve completion
8+
rates, and reduce relapse/readmission.
79

8-
## 🌟 What We're Building
10+
## Our Team
11+
12+
This project is being developed by a collaborative team:
13+
14+
- **Caesar Ghazi** ([@CaesarGhazi](https://github.com/CaesarGhazi))
15+
- **Moe Alwathiq** ([Moe-phantom](https://github.com/Moe-phantom))
16+
- **Rafaa Ali** ([@RafaaAli](https://github.com/RafaaAli))
17+
- **Wuor Bhang** ([@WuorBhang](https://github.com/WuorBhang))
18+
19+
## **Primary Research Question**
20+
21+
How accurately can patient demographics, history, and treatment factors
22+
predict an optimal treatment duration that minimizes relapse risk in Substance
23+
Use Disorder treatment?
24+
25+
## Problem Statement
26+
27+
Despite the critical importance of completing adequate treatment for
28+
sustained recovery, SUD facilities lack systematic approaches to personalizing
29+
treatment length based on patient characteristics and predicted outcomes.
30+
Approximately **30% of patients are readmitted within one year**, suggesting
31+
many individuals either receive insufficient treatment or discontinue prematurely.
32+
33+
At the same time, treatment facilities struggle with capacity planning. They
34+
often cannot effectively match the timing and volume of new admissions with
35+
available beds and staff. This mismatch leads to under-used treatment slots,
36+
prolonged waitlists, and reduced access to care.
37+
38+
Asclepios AI aims to use data-driven insights to address these challenges.
39+
40+
## What We're Building
941

1042
We're creating a smart system that helps treatment centers:
1143

@@ -17,16 +49,7 @@ Think of it like a personalized recommendation system - just like how Netflix
1749
recommends movies you might like, our system recommends the optimal treatment
1850
plan for each patient based on their unique situation.
1951

20-
## 👥 Our Team
21-
22-
This project is being developed by a collaborative team:
23-
24-
- **Caesar Ghazi** ([@CaesarGhazi](https://github.com/CaesarGhazi))
25-
- **Moe Alwathiq** ([Moe-phantom](https://github.com/Moe-phantom))
26-
- **Rafaa Ali** ([@RafaaAli](https://github.com/RafaaAli))
27-
- **Wuor Bhang** ([@WuorBhang](https://github.com/WuorBhang))
28-
29-
## 🎯 Why This Matters
52+
## Why This Matters
3053

3154
Substance use disorders affect millions of people worldwide and cost society
3255
billions of dollars each year. Unfortunately:
@@ -39,61 +62,129 @@ billions of dollars each year. Unfortunately:
3962
Our AI system aims to solve these problems by using data to make better
4063
predictions about what each patient needs.
4164

42-
## 📁 Repository Structure
65+
## Data Overview
4366

44-
Our work is organized in a step-by-step approach:
67+
We use publicly available SUD admissions and treatment episode datasets,
68+
which include variables such as:
4569

46-
```/
47-
├── 0_domain_study/ # Research about SUD treatment and AI applications
48-
├── 1_datasets/ # The data we're using to train our AI models
49-
├── 2_data_preparation/ # Cleaning and organizing the data
50-
├── 3_data_exploration/ # Understanding patterns in the data
51-
├── 4_data_analysis/ # Building and testing our AI models
52-
├── 5_communication_strategy/ # How we'll share our findings with others
53-
├── 6_final_presentation/ # Our final presentation materials
54-
└── collaboration/ # Team agreements, schedules, and reflections
70+
- Patient demographics
71+
- Substance use patterns
72+
- Co-occurring issues
73+
- Treatment history
74+
- Treatment length
75+
- Completion status
76+
- Readmission / relapse factors
5577

56-
```
78+
### Repository Data Folders
5779

58-
## 📊 Data Sources
80+
- 📁 **Raw Data:** [`1_datasets/raw/`](./1_datasets/raw/)
81+
- 📁 **Processed Data:** [`1_datasets/processed/`](./1_datasets/processed/)
82+
- 📁 **Sample Data:** [`1_datasets/sample/`](./1_datasets/sample/)
83+
- 📄 **Data Documentation:** [`1_datasets/README.md`](./1_datasets/README.md)
5984

60-
We're using publicly available datasets from reputable sources:
85+
---
86+
87+
## Data Preparation
88+
89+
This step includes:
90+
91+
- Cleaning and formatting raw datasets
92+
- Handling missing or inconsistent data
93+
- Encoding categorical variables
94+
- Normalizing and transforming features
95+
- Structuring datasets for modeling
96+
97+
📁 Folder:
98+
**[`2_data_preparation/`](./2_data_preparation/)**
99+
100+
---
61101

62-
- **Treatment Episode Data Set (TEDS)**: Information about admissions to
63-
substance abuse treatment facilities
64-
- **National Survey on Drug Use and Health (NSDUH)**: Annual survey data about
65-
drug use and mental health in the U.S.
102+
## Exploratory Data Analysis (EDA)
66103

67-
All data is handled ethically and with respect for patient privacy.
104+
We explore:
68105

69-
## 🚀 How to Use This Repository
106+
- Relationships between treatment factors and outcomes
107+
- Duration patterns across substance types
108+
- Demographic and behavioral trends
109+
- Variables most correlated with relapse or completion
110+
- Facility-level patterns affecting patient success
70111

71-
If you're interested in our work, you can:
112+
📁 Folder:
113+
**[`3_data_exploration/`](./3_data_exploration/)**
72114

73-
1. **Explore our research** in the [0_domain_study](./0_domain_study/) folder
74-
2. **See our data** in the [1_datasets](./1_datasets/) folder
75-
3. **Learn about our approach** by reading the README files in each folder
76-
4. **Follow our progress** through our collaboration documents
115+
---
77116

78-
Each folder contains a README.md file that explains what's in that section in
79-
plain language.
117+
## Modeling & Analysis
80118

81-
## 🤝 Want to Contribute?
119+
We will build models to:
82120

83-
While this is primarily a student project, we welcome feedback and
84-
suggestions! Feel free to:
121+
- Predict relapse likelihood
122+
- Identify high-risk patients
123+
- Predict facility resource demand
124+
- Evaluate model fairness and robustness
85125

86-
- Open an issue if you have questions or ideas
87-
- Fork the repository to experiment with our approach
88-
- Contact any of our team members with questions
126+
Analysis includes:
89127

90-
## 📚 Learn More
128+
- Feature engineering
129+
- Training and validation
130+
- Model comparison
131+
- Interpretability
91132

92-
- Read about our research approach in [0_domain_study/README.md](./0_domain_study/README.md)
93-
- See what datasets we're using in [1_datasets/README.md](./1_datasets/README.md)
94-
- Check out our team agreements in the [collaboration](./collaboration/) folder
133+
📁 Folder:
134+
**[`4_data_analysis/`](./4_data_analysis/)**
95135

96136
---
97137

98-
*This project is part of the Emerging Talent 6 Collaborative Data Science
99-
Project*
138+
## Communicating Results
139+
140+
Our communication will include:
141+
142+
- Insight summaries
143+
- Visual dashboards or graphs
144+
- Interpretations for clinicians and decision-makers
145+
- Recommendations based on model findings
146+
- Limitations and ethical considerations
147+
148+
📁 Folder:
149+
**[`5_communication_strategy/`](./5_communication_strategy/)**
150+
151+
---
152+
153+
## Final Presentation
154+
155+
Our final deliverables will recap the entire analysis with:
156+
157+
- A structured summary of findings
158+
- Visualizations
159+
- Model results
160+
- Recommendations for treatment facilities
161+
- Reflections and next steps
162+
163+
📁 Folder:
164+
**[`6_final_presentation/`](./6_final_presentation/)**
165+
166+
---
167+
168+
## Repository Structure
169+
170+
Below is the full layout of the project repository:
171+
172+
```text
173+
Asclepios_Ai/
174+
175+
├── 0_domain_study/
176+
├── 1_datasets/
177+
│ ├── processed/
178+
│ ├── sample/
179+
│ └── raw/
180+
├── 2_data_preparation/
181+
├── 3_data_exploration/
182+
├── 4_data_analysis/
183+
├── 5_communication_strategy/
184+
└── 6_final_presentation/
185+
```
186+
187+
### License
188+
189+
This project is licensed under the **MIT License**.
190+
📄 [View License](./LICENSE)

0 commit comments

Comments
 (0)