Skip to main content

Alert

Overview

Alert integrates the “detecting anomalies - notifying personnel - quick resolution” process into a closed-loop system. It monitors critical metrics and events in real time through unified rules, routes alerts to designated personnel and channels based on policies, and employs automated noise reduction and convergence mechanisms to ensure precise, timely delivery. This approach effectively reduces Mean Time to Acknowledge (MTTA) and Mean Time to Repair (MTTR) while minimizing disruption from false alarms.

Concepts

1. Alert Subtypes

Alert subtypes include: Periodic Metric Alerts, Event Monitoring Alerts, and VIP Alerts.

Among event monitoring alerts, "New Crash Alerts" and "Long Request Alerts" do not have "Recovery" events and are one-time events. The next version will add an "Occurred" status.

Currently, considering the weekend cycle of events, "Auto Close" has been added for these two events, meaning these two types of events will automatically close 6 hours after occurrence by default.

2. Alert Status

  • Alerting: Alert events that have been generated but not yet recovered.
  • Recovered: Alert events whose metrics have changed from abnormal to normal status and meet recovery conditions are marked as Recovered.
  • Closed: Alert events in "Recovered" status will be considered truly resolved and their status set to "Closed" only after passing through an "observation window period" with no alert jitter. The default "observation window period" is currently 10 minutes.

3. Notification Configuration

  • Alert: Alerts that do not generate Problems send notifications using the alert-type notification template content.
  • Problem: A Problem contains 1 to n alerts. These alerts send notifications to customers using the "Problem" notification template by default. Contained alerts will not be notified independently.

4. Convergence Principle

AI Smart Convergence determines whether multiple alerts within a certain effective time window are caused by the same root cause. If the algorithm determines they are caused by the same root cause, these alerts are aggregated into one Problem.

Use Cases

Provides metric alerts and event monitoring alerts for entities monitored by the platform, routes events that meet alert conditions to relevant channels and responsible personnel through notification configuration, and uses notification templates to convey alert content.

Configuration Approach

  1. First, configure notification templates. The system provides default templates for email, SMS, WeChat Mini Program, and WeChat Official Account. Others can be customized.
  2. Second, configure notification settings, selecting different notification channels and responsible personnel based on the severity level and status of alerts or Problems.
  3. Finally, configure alert rules by selecting monitoring entities and metrics, setting alert thresholds, and defining trigger conditions to complete the configuration.

Prerequisites

  • If setting alert rules for VIP users, first create VIP users and VIP user groups. After configuration, the platform will automatically distribute rules to collectors, and collectors will process data based on VIP identifiers.
  • Users marked as VIP will not be "sampled" when APM and RUM collect trace information.

Alert List

As the presentation entry for alert events, it displays all events that are currently Alerting (not limited by "query time window") by default. You can search for alerts of interest through filter conditions.

Time Window

The time range window for querying alert events, typically used together with alert search.

By default, all events that are currently alerting are displayed, not limited by the "query time window".

The query time window only takes effect when "Recovered" or "Closed" is selected for status.

As shown below, the query time window is "Last 30 minutes":

Time Window Example

The system displays alerting event data by default, so the following alert IDs will be queried and displayed:

  • #703 (Alert event that occurred 30 days ago, status has been "Alerting" all along)
  • #1122 (Alert generated within the query time window)

Alert List

Alert List Example

Page AreaItemDescription
1 Filter BarStatusDisplays "Alerting" alert events by default.
Alert LevelCritical, High, Warning, and Info.
Alert CategoryAvailability, Error, Slow, Resource, Custom Event.
VIP AlertYes and No
Event MonitoringYes and No
2 Time WindowTime ControlFind alert information by selecting time controls.
3 Filter ConditionsAlert IDExact search
Entity ObjectFuzzy match
Alert RuleFuzzy match
4 Alert ListAlert ContentClick to navigate to the alert detail page.
Entity ObjectMonitoring entity object to which the alert event belongs.
StatusAlerting, Recovered, Closed.
Alert RuleClick to view details of the alert rule corresponding to this alert event.
Associated ProblemAfter alert convergence, alerts belong to Problems. Some alerts do not converge and therefore do not belong to Problems.
Alert CategoryAvailability, Error, Slow, Resource, Custom Event
Alert TimeThe last time point that met the alert condition. For example: if an alert is triggered when the threshold is reached 3 times within 5 minutes, the time after the third threshold is reached is used as the alert time.
DurationAlert close time - Alert time (alert already closed)
System current time - Alert time (alert not yet closed)
Recovery TimeAlert recovery may produce alert jitter, so an "observation period" is still required.
Close TimeThe time when the alert is truly "ended" after a short observation period starting from alert recovery.
Alert IDUnique identifier of the alert, used for alert queries.
5 Alert ActionsExport ExcelMaximum 1000 records limit.
Custom HeadersYou can flexibly select which column fields to display.
Add to DashboardYou can add the alert list to an existing dashboard.

Alert Details

Displays the entire lifecycle status process of the alert event, alert metric trend chart, and alert event attribute information.

Alert Details Example

Page AreaItemDescription
1 Alert TitleAlert LevelCritical, High, Warning, and Info.
Title DescriptionLatest alert event description.
2 Alert AttributesAlert IDUnique identifier of the alert, used for alert queries.
Alert CategoryAvailability, Error, Slow, Resource, Custom Event
Entity ObjectMonitoring object of the alert entity
Alert RuleClick to display details in the right drawer panel
3 Metric Trend ChartMetric NameName of the alert metric
Alert LineAlert line formed by alert thresholds
Trend ChartUnclosed alerts: Look forward 1x time + Alert duration
Closed alerts: Look forward 1x time + Alert duration + Look backward 1x time
Multidimensional ExplorationClick to enter the Multidimensional Exploration page to further explore issues revealed by this metric trend chart.
4 EventsAlert TimelineDisplays the latest alert event details from top to bottom
Alert DetailsDisplays basic attribute information of the alert event. Hover over the notifier to display specific notification channels, whether the notification API call was successful, and notification time.

Alert Configuration

Overview

Complete alert rule configuration by selecting monitoring entities and metrics, setting alert trigger conditions, and matching notification strategies to ensure alerts and notifications are generated when monitoring data triggers alert rule conditions.

Configuration Approach

  1. Select alert entities and metrics:
    1. Select "Configure by Entity Model" method.
    2. Select "Configure by Metric Model" method.
  2. Configure alert thresholds: Static Threshold, Dynamic Baseline, Smart AI Algorithm
  3. Event Generation: Event title, content, category, and level.
  4. Convergence Configuration: Can configure convergence or no convergence.
  5. Notification Strategy: Select existing strategies or create new strategies to complete.

Configuration Guide

Creating Alert Configuration

Parameter ConfigurationItemDescription
Basic InformationRule NameName the alert rule
Configure by Entity ModelEntity ModelTraditional alert configuration method. First select alert entities, then select metrics to configure based on alert entities.
Expiration TimeAlert events are not generated during expiration time.
No Data Resolution Wait TimeIf recovery has not occurred after waiting 1 day, the system automatically sends a recovery event.
Filter EntitiesTrigger ConditionMeet all of the following conditions (AND relationship), Meet any of the following conditions (OR relationship)
Based OnFilter entities by tags
FilterTag key
Condition ExpressionEqual to, Exists, In
AddAdd a filter condition
Configure by Metric ModelMetric ModelDefault "METRIC" model.
Query OffsetFor delayed metrics, configuring the minute offset of the sliding evaluation window can make event generation more accurate
VISUALIZESelect "Metric + Metric Statistical Function"
WHEREFilter metric data range through attributes of metric data
Static ThresholdTrigger MethodGreater than, Greater than or equal to, Less than, Less than or equal to, Equal to, Not equal to
Static ThresholdInput alert threshold
Dynamic BaselineBaseline ConfigurationSelect baseline days as the baseline value to compare with current metric values
Baseline AlgorithmOffset baseline percentage
Offset baseline fixed value
Offset ThresholdPercentage value or specific fixed value
Trigger MethodGreater than baseline, Less than baseline, Deviate from baseline (including offset upper and lower limits)
Smart AI AlgorithmSensitivityBetween 1-100, larger values represent stricter anomaly detection
Trigger MethodGreater than upper threshold, Less than lower threshold, Exceed threshold boundaries
No Data ReportedNo Data ReportedAlert when metrics have no data reported
Event MonitoringEvent TypeNo Data Alert, No Heartbeat Alert, Request Duration Alert, New Crash Rate Alert
Event GenerationEvent TitleSystem provides default values. You can configure using "constants + variables"
Event DescriptionSystem provides default values. You can configure using "constants + variables"
Event CategoryAvailability, Error, Slow, Resource, Custom Event
Event LevelCritical, High, Warning, and Info
Convergence ConfigurationNo ConvergenceOnly generate alerts, do not generate Problems
AI Smart ConvergenceGenerate both alerts and Problems
Notification StrategySelectIf the platform has matching strategies, they will be automatically associated and displayed. If not, click "Add Notification Strategy"
Add Notification StrategyDisplay the add notification strategy page in the right drawer panel
Page Save/CancelCancelDo not save current configuration
OKSave the configuration page

Alert Configuration List

Alert Configuration List Example

ItemDescription
Rule NameName of the alert configuration rule
Notification StrategyDisplay names of matched notification strategies, 1-n.
Rule EnabledUnique identifier of the alert, used for alert queries.
ActionsEdit: Modify existing alert configuration
Copy: Copy an existing alert configuration
Delete: Delete an existing alert configuration
List SearchSupports fuzzy matching search for alert rule names.
NewClick the "New" button. A floating window below provides two scenarios: "Alert Rule Creation" and "VIP Alert Rule Creation". Select the corresponding alert rule scenario to create based on your needs.
ExportClick the "Export" icon to export up to 1000 list records in Excel file format.

Notification Configuration

Overview

The notification configuration function aims to help you flexibly manage alert notifications, ensuring that critical events can quickly reach relevant personnel and improving problem response efficiency.

Configuration Approach

  1. First determine the push type: "Alert" or "Problem".
  2. Set push trigger conditions.
  3. Configure corresponding notification channels and recipients.
  4. Click Save to complete configuration.

Configuration Guide

Creating Notification Configuration

New Notification Configuration Example

Parameter ConfigurationItemDescription
Basic InformationStrategy NameName the notification strategy
Strategy InformationStrategy CategoryAlert, Problem
Filter notification issues by tags, entities, or rulesToggle, representing enable and disable.
Alert RuleSelect an alert rule to configure the corresponding notification strategy
Filter by TagsEntity objects with tags can be filtered
Filter by EntityEntity object attributes, filter through selection
Filter by Alert Rule GroupFor alert rules configured through "Metric Model Configuration", filter through the "Group" field
Rule Conditions - AlertCriticalHighest severity level of alerts
HighSecond highest severity level of alerts
WarningMedium severity level of alerts
InfoLowest severity level of alerts
ResolvedStatus corresponding to alert recovery
Rule Conditions - Problem - Problem Status ChangesNew Problem DetectedTrigger notification when a new problem is detected
Problem ResolvedTrigger notification when a problem is resolved
Problem ClosedTrigger notification when a problem is closed
Problem ReopenedTrigger notification when a problem is reopened. When a problem is in resolved status and new events are added to the problem, it becomes reopened status.
New Entity Added to ProblemWhen new alert event entities are added to the problem
Rule Conditions - Problem - Problem Root Cause ChangesRoot Cause ChangedTrigger notification when the root cause determination of a Problem changes
Rule Conditions - Problem - Problem Level ChangesProblem Level Changed to Critical (FATAL)Trigger notification when severity is critical
Problem Level Changed to Error (ERROR)Trigger notification when severity is error
Problem Level Changed to Warning (WARNING)Trigger notification when severity is warning
Problem Level Changed to Info (INFO)Trigger notification when severity is info
Response ActionsGeneral NotificationIncludes voice, SMS, email, Enterprise WeChat (personal), Enterprise WeChat (Mini Program).
Select notification channel, notification template, recipients/recipient groups, notification time range
Third-party NotificationNotification objects configured in third-party push.
Select notification channel, notification template, recipients/recipient groups, notification time range
Page Save/CancelCancelDo not save current configuration
OKSave the configuration page

Notification Configuration List

Notification Configuration List Example

Page AreaItemDescription
List HeaderStrategy NameCritical, High, Warning, and Info.
Strategy CategoryLatest alert event description.
Response ActionsUnique identifier of the alert, used for alert queries.
Created TimeAvailability, Error, Slow, Resource, Custom Event
Modified TimeMonitoring object of the alert entity
Enabled StatusEnable or disable notification configuration
List ActionsEditModify notification configuration strategy.
CopyQuickly copy existing notification configuration.
DeleteClick the New button in the upper right corner to configure notification strategy.
Search AreaSearch ConditionFuzzy match alert strategy name
NewClick to open a new configuration page in the right drawer panel
ExportClick the "Export" icon to export up to 1000 list records in Excel file format.

Notification Templates

Overview

The platform provides default alert notification templates for different notification channels (such as SMS, email, voice, etc.).

You can also manually add custom notification templates.

Note: For the SMS notification channel, you cannot manually create notification templates yourself because of operator restrictions. You can only use the platform's default templates.

Configuration Guide

Creating Notification Template

New Notification Template Example

Taking the email notification channel template as an example, the platform provides default template content. You can customize and modify based on this template.

Form elements:

ItemDescription
Template NameName the notification template.
Template TypeAlert, Problem. After clicking to switch, template content and variable descriptions on the right will change dynamically.
Email SubjectThe system provides a default subject content of "variables + constants". You can customize it.
Content FormatTXT (simple format), HTML (rich text format)
Include Short LinkYes, No
Template ContentThe system provides a default content of "variables + constants". You can customize it.
Variable DescriptionExplains and describes variables in the email subject and content
CancelDo not save current configuration
OKSave the configuration page

Template List

Template List Example

Page AreaItemDescription
List HeaderTemplate NameNotification template name.
Template TypeAlert, Problem.
Notification ChannelGeneral Notification and Third-party Notification.
Enabled StatusEnable or disable notification configuration
List ActionsEditModify notification template.
CopyQuickly copy existing notification template.
DeleteClick the New button in the upper right corner to configure notification template.
Search AreaSearch ConditionFuzzy match template name
NewClick to open a new configuration page in the right drawer panel
ExportClick the "Export" icon to export up to 1000 list records in Excel file format.

VIP Settings

Overview

VIP User and VIP User Group functions are used to focus monitoring and alert management on critical users or business objects, supporting the creation of exclusive user groups, customized alert strategies, and notification rules to ensure priority response to critical business events.

Use Cases

Monitor and alert VIP users. When VIP users experience situations such as user experience lag, slow response time, slow backend page service requests, etc., immediately notify the responsible person for VIP users to help troubleshoot and resolve performance experience issues as soon as possible.

Configuration Approach

  1. First, create VIP users and VIP user groups.
  2. The platform automatically distributes configured parameters to collectors.
  3. Collectors automatically tag data with VIP labels according to rules.
  4. Create VIP user alerts in alert rules.

Configuration Guide

Creating VIP User

New VIP User Example

ItemDescription
VIP User IdentifierUser of our platform. Enter user unique identifier
VIP User Information
EmailEmail address used for email alert notifications
Phone NumberUsed for SMS or voice alert notifications
Select VIP User GroupBelong to VIP user group

VIP User List

Manage by manually creating VIP users.

VIP User List Example

Page AreaItemDescription
List HeaderVIP User IdentifierNotification template name.
VIP User Information.
EmailEmail address of VIP user.
Phone NumberPhone number of VIP user.
List ActionsEditModify VIP user configuration.
CopyQuickly copy existing VIP user configuration.
DeleteClick the New button in the upper right corner to configure VIP users.
Search AreaSearch ConditionFuzzy match VIP user identifier and VIP user information
NewClick to open a new configuration page in the right drawer panel
ExportClick the "Export" icon to export up to 1000 list records in Excel file format.

Creating VIP User Group

New VIP User Group Example

ItemDescription
Group NameName the VIP user group
Filter MethodFilter matching VIP users based on drop-down list values.
Member ListMatch corresponding content according to filter method and complete by filling in.

VIP User Group List

VIP User Group List Example

Page AreaItemDescription
List HeaderGroup NameVIP user group name.
Filter MethodSpecify User ID, Contains, Not Contains, Equal to, Regular Expression, Starts With Prefix, Ends With Suffix
Member ListFilter member list according to filter method.
Member CountNumber of VIP users in this VIP user group.
List ActionsEditModify VIP user group configuration.
CopyQuickly copy existing VIP user group configuration.
DeleteClick the New button in the upper right corner to configure VIP user groups.
Search AreaSearch ConditionFuzzy match VIP user group name.
NewClick to open a new configuration page in the right drawer panel
ExportClick the "Export" icon to export up to 1000 list records in Excel file format.