Skip to content

Observability

Huddle utilizes a unified observability stack to monitor performance and diagnose distributed issues across microservices.

All services are instrumented using OpenTelemetry. This provides a vendor-neutral standard for collecting telemetry data.

We leverage the Grafana Labs ecosystem for data visualization and storage:

Prometheus

Metrics: Time-series database for service performance.

Loki

Logs: High-availability log aggregation system.

Tempo

Traces: Distributed tracing backend.

Grafana

Dashboards: Centralized alerting and visualization.

  • Metrics: Real-time monitoring of request latency, error rates, and resource utilization.
  • Distributed Tracing: Visualization of request paths across the API, App, and Notification services.
  • Logging: Structured logging for detailed post-mortem analysis and audit verification.