AI is all over the news and as a kid of the 80s I’m waiting for the headline that Skynet has become self-aware. So, is it time prep and build my backyard bunker for the dystopian future that lies ahead? Or maybe this future Skynet will just know what groceries I’m most likely to order next week? I suppose the changes AI will bring to our life fall somewhere in between. While we think about all the possibilities, something is for certain, the network needs AI. That’s why I’m at Augtera.
Not a week goes by without us learning about a preventive maintenance case detected by Augtera’s machine learning by a customer. Today’s short story is this: The customer received a ticket opened by Augtera in their ticketing system (ServiceNow) stating that there is an abnormal rate of environmental alerts found automatically with machine learning. That’s it: Only one ticket waiting for the operator to click for actionable information.
Augtera founder and CEO Rahul Aggarwal recently spoke about “AI Driven Automation of Network Operations” at MPLSSDAINETWORLD23. In this blog we will cover some of the key points from his presentation. Digital transformation has been a major focus in driving innovation and great change across entire enterprises and their IT infrastructure. One area of IT, however, that remains largely unaltered is network operations. While NetOps teams certainly have access to a number of network monitoring tools and solutions today, most of those tools still rely on highly reactive and manual processes.
As digital transformation takes hold across industries, networks become more complex. Modern technology has led to a proliferation of network operations tools to help manage that complexity, but the wide array of tools—combined with the ever-increasing demand for high network performance and near-instant availability of data and applications—has only served to complicate things even further. A major challenge for network operations teams today is keeping networks running optimally while also dealing with the “alert storms” generated by all the management tools designed to simplify NetOps. For instance, if a network switch fails, that failure could result in a ripple effect of network alerts that floods NetOps teams with often dozens or hundreds of notifications. Even powering down a device for scheduled maintenance could cause a flurry of alerts.
As more enterprises move data and applications out of the data center into hybrid or multicloud platforms, networks—and network operations—have become increasingly complicated. On-premises architecture was never exactly simple, but today IT must manage the data center as well as many other connected environments, including public and private clouds and do so in real-time. The technology that was developed to improve IT and aid digital transformation may have enabled enterprises to increase agility, improve scalability, and enhance application development, but it has also added much more complexity from an operational point of view.
A technology that can process billions of logs per hour with natural language processing (NLP) and will automatically find exceedingly rare messages that are operationally relevant and without noise. Sounds too good to be true, right? That is exactly what I thought when our engineering team was describing the new “Zero Day Log” ML capability that they were building.
Introduction Fans of science fiction books or movies have all experienced a sequence when someone who has not yet committed a crime is arrested because AI predicts they will in the coming hours or days. Can this impressive outcome be experienced today for networks?
I met Rahul Aggarwal in June of 2022 and took an immediate liking to the team. Early in a joint customer engagement, it became very clear that there was something special here – not just special but BIG. The customer knew it too. Little did I know it would lead to a dramatic personal change as well.
AIOps that is purpose-built for Networking, is a generational step forward, going well beyond monitoring and observability to achieve new outcomes: dramatic reduction in alert / trouble ticket fatigue, multi-layer incident root identification, cloud-native geo-diverse scaling & HA, automation playbooks, and more.
Every few years, the industry creates a new set of terms that end up creating endless debate on what they mean. One top of mind term is “observability.” Recently Enterprise Management Associates (EMA) surveyed 400 IT/Networking professionals, providing data that helps us dive into the subject. In this blog we look just at the responses from Network Operations professionals.