Introduction
The alert definition feature allows you to set alerts on a metric using a PromQL query. It is currently defined in the K8s ConfigMap named opsramp-alert-user-config
, which you can find in the OpsRamp agent installed namespace.
The Kubernetes 2.0 agent computes the alert metric value based on the PromQL expression and computes the alert state by comparing the thresholds in the alert definition.
The OpsRamp Agent sends the alert on the K8s resource based on labels generated in resultant metric streams after running promQL query. If no resource matches with labels then alert will be sent on the cluster.
Alert Definition Template
Sample template to define a single alert:
Explanation of template fields:
resourceType
: Specify the type of K8s resource (e.g., k8s_pod). Following are the possible values for resourceType:- k8s_pod
- k8s_node
- k8s_namespace
- k8s_service
- k8s_pv
- k8s_pvc
- k8s_deployment
- k8s_replicaset
- k8s_daemonset
- k8s_statefulset
rules
: A set of rules for the alert definition.name
: A unique name for the alert.interval
: Polling interval at which alert definition should run. The interval should given in time duration format (e.g., 1m, 5m, 1h).expr
: A valid PromQL query expression for calculating the metric.isAvailability
: Boolean indicating if the alert should consider resource availability.warnOperator
&criticalOperator
: Operators to compare and compute alert states. OpsRamp supports below operators for comparison:GREATER_THAN_EQUAL
GREATER_THAN
EQUAL
NOT_EQUAL
LESS_THAN_EQUAL
LESS_THAN
warnThreshold
&criticalThreshold
: Values for warning and critical thresholds.alertSub
&alertBody
: Content displayed for alerts, which can use macros for dynamic values. Below are macros that can be used while defining alert subject/body:- ${
severity
} - ${
metric.name
} - ${
component.name
} - ${
metric.value
} - ${
threshold
} - ${
resource.name
} - ${
resource.uniqueid
}
- ${
Configure Alert
OpsRamp provides basic alert definitions for resources like pods and nodes by default. Users can configure the alert definitions by editing the K8s ConfigMap below in the namespace where the agent is installed.
K8s ConfigMap Name: opsramp-alert-user-config
Access the ConfigMap:
To view or edit the ConfigMap, use the following command:
Example ConfigMap:
You can Add or Remove new alerts using standard PromQL expressions.
Configure Availability
To configure availability of a resource, define alert definition and make the key isAvailability
as true.
Same alert definition rule will be used to compute the availability of a resource. For example, if we want to define Pod availability based on pod memory usage, we can give alert definition as below with isAvailability
as true.
Here if k8s_pod_memory_usage_percent
is either warning or critical, availability will be considered as down otherwise up.
View Alerts in OpsRamp Portal
Users can view logs in the OpsRamp portal from Command Center > Alerts.
- Default Alerts Screen: View your alerts through the Alerts UI.
- Alerts Details: Click on any alert entry for detailed information.