Causely is a new player in modern DevOps. Its pitch is simple: observability systems are too noisy, and it takes too long to identify root causes among all the alerts and dashboards. The rise of AI is only making these problems worse.
The company’s founder, Shmuel Kliger, has been thinking about IT operations and complex systems for decades, and he holds some contrarian opinions about how best to manage them.
For one thing, he doesn’t believe in the “collect all the data” approach that many modern observability platforms promote. Storing data is getting cheaper, but there is also far more of it. More important than the cost of collecting everything, in his view, is that 80% of that data will never be needed. In other words, teams are accepting a lot of noise in the name of total visibility.
When it’s time to actually address an alert or a customer complaint, all of that extra data gets in the way, and finding the actionable root cause becomes like looking for a needle in a haystack. Instead, Kliger believes companies should take a “top-down” approach, starting with the likely root causes and then drilling down into the necessary data. The platform he has built, called Causely, uses causal reasoning to map observability signals to likely root causes.
Kliger is also very direct about the fact that in the not-so-distant future, humans will be out of the loop when it comes to managing IT Operations. “Planes can fly themselves—what makes us think IT Operations can’t be autonomous?” he asks us to consider. He does, however, acknowledge that it’s a journey and we aren’t quite there yet—but more and more we should expect machines to take over identifying performance, security, and reliability issues.
A step in that direction is the company’s latest integration with Grafana. Grafana Labs is the observability company behind some of the world’s most widely used open dashboards, with over 25 million users around the world. By embedding Causely’s intelligence directly into a Grafana dashboard, engineers can see the “why” behind performance issues in the context of their services, significantly cutting resolution time when an alert needs to be addressed.
Causely also plugs into Grafana Alertmanager, enriching existing alerts with real-time, continuously updated root-cause intelligence. This AI-powered capability goes beyond signaling that something is wrong: within the incident response workflow, it shows where the problem originated and what to do next.
“Engineers are overwhelmed by too many tools, alerts, and data coming from their existing observability solutions,” said Kliger. “We are trying to make it as simple as possible to plug in our solution to your existing workflow and significantly reduce the manual toil required to identify root causes within complex modern applications.”
The post Getting Humans Out of the Observability Loop with Causely appeared first on International Business Times.