HOSER/docs/ABNORMAL_OD_WORKFLOW_GUIDE.md at main · matercomus/HOSER

... [Previous content remains unchanged until Phase 6 section]

Phase 6: LM-TAD Teacher Evaluation (Optional)

Evaluate the LM-TAD teacher model on both real and generated trajectories to enable direct performance comparison with student models.

When to run:

Establishing teacher baseline performance
Direct teacher-student comparison on abnormal patterns
Validating compression-performance tradeoff

Configuration:

# config/evaluation.yaml
lmtad:
  # Required for Phase 6
  enabled: true  # Enable LM-TAD teacher evaluation
  repo: /home/mka299/LMTAD  # Path to LM-TAD repository
  checkpoint: /path/to/weights_only.pt  # Teacher checkpoint
  grid_size: 0.002  # Grid size (Beijing: 0.002, Porto: 0.001)
  device: cuda:0  # Device for evaluation

evaluation:
  # Abnormal OD parameters
  max_pairs_per_category: 20
  num_trajectories_per_od: 50
  seed: 42

Programmatic Usage:

from pathlib import Path
from tools.run_abnormal_od_workflow import run_abnormal_od_workflow

# Run complete workflow including Phase 6
analysis_dir = run_abnormal_od_workflow(
    eval_dir=Path("abnormal/Beijing"),
    dataset="Beijing",
    real_data_dir=Path("data/Beijing"),
    num_trajectories=50,
    max_pairs_per_category=20,
    skip_detection=True  # If detection already exists
)

CLI:

uv run python tools/run_abnormal_od_workflow.py \\
  --eval-dir abnormal/Beijing \\
  --dataset Beijing \\
  --real-data-dir data/Beijing \\
  --skip-detection \\
  --num-traj 50 \\
  --max-pairs 20

Output Structure:

eval_lmtad/Beijing/
├── real_data/
│   ├── evaluation_results.tsv   # Per-trajectory perplexity scores
│   └── outlier_stats.json      # Summary statistics (mean, std, threshold)
├── generated/
│   ├── model1/
│   │   ├── trajectories_lmtad_format.csv  # Grid-tokenized trajectories
│   │   ├── evaluation_results.tsv         # Perplexity scores
│   │   └── outlier_stats.json            # Summary statistics
│   └── model2/...
└── comparison_summary.json     # Teacher vs student comparison

Metrics Evaluated:

Real Data Baseline:
- Mean perplexity and standard deviation
- Outlier threshold (auto-computed)
- Outlier rate in real trajectories
Generated Trajectories:
- Per-model perplexity distributions
- Outlier rates compared to real baseline
- Performance retention vs teacher
Teacher vs Student Comparison:
- Detection F1 score (teacher)
- Abnormality reproduction rate (student)
- Compression-performance tradeoff metrics

Required Files:

LM-TAD Repository:
- Located at path specified in config
- Contains model implementation and utils
Teacher Checkpoint:
- Trained LM-TAD model weights
- Format: PyTorch state dict (.pt)
Pre-converted Real Data:
- Grid-tokenized real trajectories
- Matching vocabulary files
- Used for baseline evaluation

Integration with Previous Phases:

Uses abnormal OD pairs from Phase 3
Evaluates trajectories generated in Phase 4
Complements Wang analysis from Phase 5

Error Resolution:

"LM-TAD real data not found":
- Check pre-converted data paths
- Ensure grid tokenization matches teacher
"Checkpoint not found":
- Verify checkpoint path in config
- Check file permissions

"CUDA out of memory":

# Reduce batch size in config
lmtad:
  batch_size: 64  # Default: 128

Best Practices:

Grid Size Consistency:
- Use same grid as teacher training
- Beijing: 0.002, Porto: 0.001
Resource Management:
- Enable AMP (automatic mixed precision)
- Monitor GPU memory usage
- Adjust batch sizes if needed
Result Validation:
- Compare with published baselines
- Check perplexity distributions
- Validate outlier thresholds

Visualization: Results can be visualized using the plotting module:

uv run python tools/plot_lmtad_evaluation.py \\
  --eval-dir eval_lmtad/Beijing \\
  --output-dir figures/lmtad \\
  --dataset Beijing

Generated plots include:

Perplexity distribution comparison
Outlier rate analysis
Teacher-student performance gap
Compression ratio visualization

See docs/results/ABNORMAL_OD_TEACHER_STUDENT_BRIDGE.md for interpretation guidelines.

Phase 7: LM-TAD Spatial Abnormality Evaluation (Optional)

Evaluate how well HOSER models reproduce spatial abnormalities (route switches and detours) identified by the LM-TAD teacher model. This complements the Wang temporal abnormality detection by focusing on spatial route deviations.

When to run:

Evaluating spatial pattern reproduction
Comparing models on route switch vs detour generation
Comprehensive abnormal trajectory analysis (temporal + spatial)
Validating distillation impact on spatial behavior

Prerequisites:

LM-TAD source evaluation completed
LM-TAD checkpoint available
Phase 6 (LM-TAD evaluation) optional but recommended

Configuration:

# config/evaluation.yaml
run_lmtad_spatial_detection: true  # Enable LM-TAD spatial evaluation
lmtad_spatial_config: null  # Optional: path to config file
lmtad_source_eval_dir: null  # Optional: auto-detect from LMTAD repo

CLI:

# Integrated in main pipeline
uv run python python_pipeline.py \
  --eval-dir eval_dir \
  --only lmtad_spatial_abnormality \
  --run-lmtad-spatial

# Or standalone pipeline
uv run python tools/run_lmtad_spatial_pipeline.py \
  --eval-dir eval_dir \
  --dataset porto_hoser \
  --lmtad-source-eval-dir /path/to/lmtad/eval \
  --lmtad-checkpoint /path/to/ckpt_best.pt

Output Structure:

{eval_dir}/
├── abnormal_od_pairs_lmtad_spatial_{dataset}.json
├── gene_abnormal_lmtad_spatial/{dataset}/seed{seed}/
│   └── {model}_spatial_abnormal.csv
├── eval_lmtad_spatial/{dataset}/
│   └── {model}_spatial_evaluation.json
├── analysis_abnormal/{dataset}/
│   ├── lmtad_spatial_results_aggregated.json
│   └── COMBINED_ABNORMAL_TRAJECTORY_ANALYSIS_REPORT.md
└── figures/lmtad_spatial_abnormality/{dataset}/
    └── [5 visualization plots]

Metrics Evaluated:

Spatial Abnormality Rates: Route switch, detour, and overall rates
Statistical Comparisons: Chi-square tests, effect sizes, confidence intervals
Model Rankings: Ranked by deviation from real spatial abnormality rate

Integration with Previous Phases:

Extracts spatial abnormal OD pairs from LM-TAD source evaluation
Generates trajectories similar to Phase 4
Evaluates with LM-TAD similar to Phase 6
Complements Wang temporal analysis from Phase 5
Can combine results with Wang for comprehensive analysis

See docs/guides/RUN_LMTAD_SPATIAL_ABNORMALITY_ANALYSIS.md for detailed execution guide.

[Rest of the document remains unchanged]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Phase 6: LM-TAD Teacher Evaluation (Optional)

Phase 7: LM-TAD Spatial Abnormality Evaluation (Optional)

FilesExpand file tree

ABNORMAL_OD_WORKFLOW_GUIDE.md

Latest commit

History

ABNORMAL_OD_WORKFLOW_GUIDE.md

File metadata and controls

Phase 6: LM-TAD Teacher Evaluation (Optional)

Phase 7: LM-TAD Spatial Abnormality Evaluation (Optional)