@@ -84,19 +84,21 @@ The only CI workflow is **CodeQL Advanced** (`.github/workflows/codeql.yml`), wh
8484- Built on [ Trame] ( https://kitware.github.io/trame/ ) — a Python framework for interactive web applications.
8585- Uses the ** manager pattern** : each feature area has a dedicated ` *_manager.py ` class that handles its UI components and business logic.
8686- ` state_manager.py ` manages the global Trame server, state, and controller.
87- - Data flows through MongoDB (PyMongo) for experiment/simulation data and ML models.
87+ - Data flows through MongoDB (PyMongo) for experiment and simulation data.
88+ - Data flows through MLflow for ML models.
8889- NERSC Superfacility API integration is in ` sfapi_manager.py ` .
8990
9091### ML Training
9192
9293- ` train_model.py ` supports three model types: Gaussian Process (GP), Neural Network (NN), and Ensemble.
9394- Uses PyTorch, BoTorch, and GPyTorch for model training.
9495- CUDA is auto-detected for GPU acceleration.
95- - Models are serialized and stored in MongoDB .
96+ - Models are serialized and stored in an MLflow tracking server .
9697
9798### Data Storage
9899
99- - ** MongoDB** is used for all persistent data (experiments, simulation data, ML models).
100+ - ** MongoDB** is used for persistent data from experiments and simulations.
101+ - ** MLflow** is used for persistent data from ML models.
100102- Database access requires SSH tunneling to NERSC when running locally.
101103- Environment variables: ` SF_DB_HOST ` , ` SF_DB_READONLY_PASSWORD ` (dashboard), ` SF_DB_ADMIN_PASSWORD ` (ML training).
102104
0 commit comments