Description
The pipeline is currently tested only on synthetic or controlled data.
To assess how robust it is under realistic conditions, we want to integrate and test it with real-world data.
Goals
- Validate the pipeline on realistic data
- Identify limitations and assumptions
- Test scalability and data compatibility
Possible Data Sources
- Public datasets (e.g. from Kaggle)
- Time series demand datasets
- Transportation / logistics network datasets
Proposed Steps
- Identify a suitable dataset
- Adapt the ingestion layer if needed
- Run the full pipeline end-to-end
- Analyze results and document findings
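As a rough illustration of the "adapt the ingestion layer" step, the sketch below maps a public dataset's columns onto the fields the pipeline might expect and sets aside rows that fail basic validation. Everything named here is an assumption for illustration: the source columns (`Date`, `Store`, `Sales`), the target fields (`timestamp`, `location`, `demand`), the date format, and the `adapt_rows` helper are hypothetical and would need to match the actual dataset and ingestion interface.

```python
import csv
import io
from datetime import datetime

# Hypothetical mapping from a public dataset's column names to the
# field names the pipeline is assumed to expect.
COLUMN_MAP = {"Date": "timestamp", "Store": "location", "Sales": "demand"}


def adapt_rows(csv_text, column_map=COLUMN_MAP):
    """Rename columns, parse types, and collect rows that fail validation."""
    valid, rejected = [], []
    reader = csv.DictReader(io.StringIO(csv_text))
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        try:
            record = {target: row[source] for source, target in column_map.items()}
            # Assumed date format; real datasets may differ.
            record["timestamp"] = datetime.strptime(record["timestamp"], "%Y-%m-%d")
            record["demand"] = float(record["demand"])
            if record["demand"] < 0:
                raise ValueError("negative demand")
            valid.append(record)
        except (KeyError, ValueError, TypeError) as exc:
            # Keep the line number so rejected rows can be documented later.
            rejected.append((line_no, str(exc)))
    return valid, rejected


if __name__ == "__main__":
    sample = "Date,Store,Sales\n2023-01-01,A,12.5\n2023-01-02,B,\nbad-date,C,3\n"
    valid, rejected = adapt_rows(sample)
    print(len(valid), "valid rows;", len(rejected), "rejected")
```

Keeping the rejected rows, rather than dropping them silently, gives a starting point for the findings and limitations that the last step asks to document.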
Acceptance Criteria
- At least one real dataset successfully integrated
- Pipeline runs end-to-end without errors
- Basic analysis of results documented
- Key limitations and assumptions clearly stated
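One possible way to check the "runs end-to-end without errors" criterion is a small smoke-test harness like the sketch below. It assumes the pipeline can be called as a function that takes a list of records and yields numeric outputs. `run_pipeline`, the record shape, and the report fields are hypothetical placeholders, not the project's actual interface.

```python
import statistics
import traceback


def smoke_test(run_pipeline, records):
    """Run the pipeline once and report whether it completed, plus basic stats."""
    report = {"input_rows": len(records), "completed": False, "error": None}
    try:
        outputs = list(run_pipeline(records))
    except Exception:
        # Capture the traceback so the failure can be documented.
        report["error"] = traceback.format_exc()
        return report
    report["completed"] = True
    report["output_rows"] = len(outputs)
    if outputs:
        # Assumes numeric outputs; adjust for the real output type.
        report["output_mean"] = statistics.fmean(outputs)
    return report


if __name__ == "__main__":
    # Placeholder pipeline standing in for the real one.
    def fake_pipeline(rows):
        return (row["demand"] * 2 for row in rows)

    print(smoke_test(fake_pipeline, [{"demand": 1.0}, {"demand": 2.0}]))
```

The returned report could serve as the starting point for the basic analysis, with limitations and assumptions recorded alongside it.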