Raising the Bar on ML Model Deployment Safety
Uber describes enhancements to its Michelangelo ML platform that raise deployment safety across the ML lifecycle: explicit data and feature validation, standardized model reports, mandatory backtesting and shadow testing, controlled rollouts with automatic rollback, and continuous monitoring via the Hue observability stack. The platform also includes a safety-scoring system integrated with CI/CD to track adoption; future work covers GenAI-assisted code checks, semantic drift detection for embeddings, and expanded truthfulness/bias monitoring.