Observability in DevOps: Evaluating Tools for Cloud-Native Monitoring and Logging

Authors

  • Sai Kiran Meda, Tech Lead Platform Engineering, StockX LLC
  • Samara Morapally, Site Reliability Engineer, Inceptio Technology

Keywords:

OpenTelemetry, Datadog, ELK Stack, Grafana, Prometheus, Logging, Cloud-Native Monitoring, DevOps, Observability

Abstract

Observability is a core value in DevOps, and its importance has grown with the emergence of cloud-native infrastructures built on microservices, containers, and dynamic workloads. This article takes a detailed look at observability in DevOps and argues that it should be the primary approach to monitoring and tracing systems. It evaluates leading cloud-native monitoring and logging tools, including Prometheus, Grafana, Elasticsearch, Datadog, and OpenTelemetry. Scalability, integration, visualization, and cost are the key factors organizations should weigh when deciding which solution fits their environment. Finally, the article offers recommendations to Amazon Web Services (AWS) teams on techniques for building observability that serves both present and future needs, and discusses best practices and emerging trends such as AI-driven and multi-cloud observability.

Published

30-06-2021

How to Cite

Sai Kiran Meda, & Samara Morapally. (2021). Observability in DevOps: Evaluating Tools for Cloud-Native Monitoring and Logging. Well Testing Journal, 30(1), 71–95. Retrieved from https://welltestingjournal.com/index.php/WT/article/view/71-95

Section

Research Articles