BOS
Acme Corp
Production
Monitoring
System health, alerts & incident tracking
Active Alerts: 4 (rule firing)
Open Incidents: 1 (requires action)
Uptime (30d): 99.92% (SLA: 99.9%)
Avg Response: 182ms (p99: 420ms)
System Overview
CPU (avg): 42% ↑
Memory: 61% →
Disk: 54% ↑
Network I/O: 2.1 GB/s ↑
Service Health
Ingestion Service: healthy
Storage Layer: healthy
Query Engine: healthy
Streaming Kafka: degraded
API Gateway: healthy
AI/ML Runtime: healthy
LLM Studio: healthy
Governance Engine: healthy
Recent Incidents
API Latency Spike (2026-04-11 06:31): resolved
Storage Mount Failure (2026-04-08 03:14): resolved
CPU Spike - Ingest Cluster (2026-04-10 14:22): resolved
Error Rate Spike (2026-04-09 12:10): investigating