- Lab
- A Cloud Guru
Using Alertmanager with Prometheus
Prometheus does not just limit us to recording metrics. One of Prometheus's core functionalities is the ability to define and route alerts to any alert management endpoint we define — or, in the case of this hands-on lab, Prometheus's own sideproject, Alertmanager. Once we have our desired alerting thresholds defined, we need to set up our routes and receivers for the Alertmanager, ensuring our notifications are going to the right end user at the correct frequency and with the right information.
Path Info
Table of Contents
-
Challenge
Add a Rules File
-
Add a rules file configuration to the Prometheus config:
sudo $EDITOR /etc/prometheus/prometheus.yml rule_files: - "rules.yml"
-
Save and exit.
-
Create and open the
rules.yml
file:sudo $EDITOR /etc/prometheus/rules.yml
-
-
Challenge
Add an Alert to Track Uptime
-
Before creating the alert itself, create a recording of the desired metric:
groups: - name: uptime rules: - record: job:uptime:average:ft expr: avg without (instance) (up{job="forethought"})
-
Create an alert based on this recording:
groups: - name: uptime rules: - record: job:uptime:average:ft expr: avg without (instance) (up{job="forethought"}) - alert: ForethoughtApplicationDown expr: job:uptime:average:ft < .75 for: 30s labels: severity: page team: devops
-
Save and exit.
-
Restart Prometheus:
sudo systemctl restart prometheus sudo systemctl status prometheus
-
-
Challenge
Configure Alertmanager to Use an SMTP Smarthost
-
Open the Alertmanager configuration file:
sudo $EDITOR /etc/alertmanager/alertmanager.yml
-
Define the global settings:
global: resolve_timeout: 5m smtp_smarthost: 'localhost:25' smtp_from: 'prometheus'
-
-
Challenge
Set up Alertmanager Routing
-
Set up the backup route:
route: receiver: 'email_backup' group_by: ['alertname'] group_wait: 10s group_interval: 10s repeat_interval: 1m
-
Set up the route for critical alerts:
route: receiver: 'email_backup' group_by: ['alertname'] group_wait: 10s group_interval: 10s repeat_interval: 1m routes: - match: severity: 'critical' group_by: ['team'] receiver: 'email_pager'
-
Set up the route for team alerts:
route: receiver: 'email_backup' group_by: ['alertname'] group_wait: 10s group_interval: 10s repeat_interval: 1m routes: - match: severity: 'critical' group_by: ['team'] receiver: 'email_pager' routes: - match: team: devops receiver: 'email_devops' ``
-
-
Challenge
Create the Needed Receivers
-
Create the receivers:
receivers: - name: 'email_backup' email_configs: - to: '[email protected]' - name: 'email_pager' email_configs: - to: '[email protected]' - name: 'email_devops' email_configs: - to: '[email protected]'
-
Save and exit.
-
Restart Alertmanager:
sudo systemctl restart alertmanager
-
What's a lab?
Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.
Provided environment for hands-on practice
We will provide the credentials and environment necessary for you to practice right within your browser.
Guided walkthrough
Follow along with the author’s guided walkthrough and build something new in your provided environment!
Did you know?
On average, you retain 75% more of your learning if you get time for practice.