If you work in the IT department of a company with SLA that asks for 99.6 percent availability, here is a scenario that may sound familiar to you. Customer calls at odd hours with some failing transaction issue or an information being processed wrongly, you ask your specialist from Ops team to look into the system immediately. What happens afterwards is known to everyone - more than a hundred thousand messages logged at the set timeframe – a data set impossible for a human being to review line by line. A trial and error resolution starts. With concept of DevOps emerging in recent time this problem is getting into gigantic proportion. The dramatic pace in which the processes, technologies, and tools are changing, it has become quite problematic to cope with such scenarios. Moreover, the pressure business users have been putting on DevOps teams is manifold, expecting that everything should be solved within minutes. The big data generated out of the logs are not easy to crack with manual interventions.
So, what do we do with it and how sleepless nights can be overcome and find the critical triggering event without much of time pressure? Basically, we need something automatic,which can pinpoint the problem location in the plethora of logs and help in resolving the current issue as well as ‘Reducing the PBI cycle time’ for fixing it properly to avoid recurrence. This is where Analytics with artificial intelligence capability finds a foothold.Combining Big Data, AI, and Domain Knowledge; technologists have been able to create big breakthroughs and opportunities for the DevOps teams.
In summary, integrating AI with log management is way to go for DevOps and can bring efficiency, risk management, optimisation in data management and trouble shooting and in the end customer delight. The endless wait of solution and root cause analysis, daily chorus of ticket priorities and SLA breaching could be a history!
No comments:
Post a Comment