SRE: Monitoring and Observability
.MP4, AVC, 1280x720, 30 fps | English, AAC, 2 Ch | 2h 17m | 377 MB
Instructor: Elton Stoneman
.MP4, AVC, 1280x720, 30 fps | English, AAC, 2 Ch | 2h 17m | 377 MB
Instructor: Elton Stoneman
Learn how to design effective monitoring and alerting systems, implement SLIs, and explore AIOps tools to enhance system reliability through automation and observability practices.
What you'll learn
Monitoring and observability are the key tools for SRE teams to manage systems and keep them running smoothly. Collecting and analyzing data about application behavior enables SREs to find and fix issues quickly. In this course, SRE: Monitoring and Observability, you’ll learn about the components of the observability stack and the processes it enables by following an SRE team as they prepare to onboard a new system.
First, you’ll learn what data apps need to expose to feed into the monitoring systems. Next, you’ll explore service level indicators to see how to measure performance. Then, you’ll walk through alerting and automated responses to issues. Finally. you’ll look at new tools in the AIOps space.
When you’re finished with the course, you'll be able to define and implement an observability stack and use it to drive system reliability.