Back to Article
MLOps Frameworks for Reliable Model Deployment in Cloud Data Platforms
Journal of Artificial Intelligence and Big Data
| Vol 3, Issue 1
Table 1. Observed Reliability and Performance Metricsfor ML Inference Services
| Metric | Value |
| Availability | 0.96875 |
| Latency p50 (ms) | 45.60916859994713 |
| Latency p95 (ms) | 83.27928987399044 |
| Latency p99 (ms) | 112.2915089738782 |
| Avg throughput (req/min) | 872.2756944444444 |