Back to Article

MLOps Frameworks for Reliable Model Deployment in Cloud Data Platforms

Journal of Artificial Intelligence and Big Data | Vol 3, Issue 1

Table 1. Observed Reliability and Performance Metricsfor ML Inference Services

MetricValue
Availability0.96875
Latency p50 (ms)45.60916859994713
Latency p95 (ms)83.27928987399044
Latency p99 (ms)112.2915089738782
Avg throughput (req/min)872.2756944444444