# Highlight Alerting Mechanism

The mechanism works like a fuel gauge: whenever an error condition is encountered in any element, a counter is decreased; in all other circumstances, it is increased. When the counter falls below pre-set thresholds, Highlight signals first an amber condition and then a red one. The sections below list the conditions that decrease each counter, grouped under Stability, Load, and Health, which we refer to as level 3 metrics.
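
The fuel-gauge behaviour described above can be sketched roughly as follows. This is a minimal illustration, not Highlight's implementation: the class name, the starting level, the step size, the two threshold values, and the "green" label for the normal state are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class FuelGauge:
    """Illustrative counter; every numeric default here is an assumption."""
    level: int = 100        # assumed: the gauge starts full
    maximum: int = 100      # assumed upper bound
    amber_below: int = 60   # hypothetical amber threshold
    red_below: int = 30     # hypothetical red threshold
    step: int = 10          # hypothetical change per sample

    def update(self, error_seen: bool) -> str:
        """Decrease on an error condition, increase otherwise, then report the colour."""
        if error_seen:
            self.level = max(0, self.level - self.step)
        else:
            self.level = min(self.maximum, self.level + self.step)
        return self.colour()

    def colour(self) -> str:
        """Map the current level to a colour; the lowest threshold is checked first."""
        if self.level < self.red_below:
            return "red"
        if self.level < self.amber_below:
            return "amber"
        return "green"  # assumed name for the normal state
```

In this sketch a run of error samples drains the gauge through amber to red, and a run of clean samples refills it, so a single brief error does not immediately change the colour.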

{% hint style="info" %}

#### Custom-defined or configurable settings

Some thresholds are hard-coded in Highlight; others can be defined by Admin users. Targets for performance tests are defined during setup. Thresholds and targets that can be changed are labelled "configurable" below.
{% endhint %}

How quickly a heat tile changes colour is governed by the sensitivity settings. Please find out more about [changing the sensitivity of heat tiles](https://support.highlight.net/help/admin/heat_tiles_sensitivity) or [contact us](/getting-started/contact-us.md) for details.

{% hint style="info" %}

#### In maintenance  <picture><source srcset="/files/DCPpkgHz04jexgzIfnFF" media="(prefers-color-scheme: dark)"><img src="/files/JrKVRD5I67SN056HBH2v" alt="Maintenance indicator"></picture>

Watches in maintenance do not send alerts, change the colour of heat tiles, or decrement availability figures.
{% endhint %}

{% hint style="info" %}

#### Dormant  <picture><source srcset="/files/5SBR5jHWLjG8EvtTvowp" media="(prefers-color-scheme: dark)"><img src="/files/vFD1plJG5q4H74nKqNNV" alt="Dormant icon" data-size="original"></picture>

Dormant interfaces are not expected to pass traffic and send stability alerts only if the device is uncontactable.
{% endhint %}

#### Stability

The counter is decreased if any of the following conditions is met. We refer to these as level 2 metrics affecting stability:

* The device loses connection with either of the Highlight pollers
* The monitored interface is down, indicates a brief outage, has been taken out of service, or no longer exists
* There has been a device restart
* A switch port designated as critical is down
* **Performance tests: ICMP Ping, UDP Echo and TCP Open:** 100% packet loss of all tests in a sample
* **Performance test: MOS:** Application failure, MOS is less than 1.0
* **Performance test: Precision Delay:** 100% packet loss of all tests in a sample. The health index is also affected.

#### Load

The counter is decreased if any of these conditions is met. We refer to these as level 2 metrics affecting load:

* Link utilisation (traffic in or out) exceeds a threshold (default is 80% - configurable)
* Tunnel utilisation (traffic in or out) exceeds a threshold (default is 82%)
* Traffic in or out on a dormant watch exceeds a threshold (default is 1000 Kbps - configurable)
* CPU for a router exceeds 60%
* WiFi Client Count exceeds a threshold (default is 30 client devices - configurable)
* Wireless Utilisation exceeds a threshold (default is 50% - configurable)
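
As a rough illustration of how the load conditions above could be evaluated, the sketch below compares a set of readings against the defaults listed. The dictionary keys and the function name are hypothetical and are not Highlight identifiers; only the default values come from the list.

```python
# Default values come from the list above; the key names are hypothetical.
LOAD_DEFAULTS = {
    "link_utilisation_pct": 80,
    "tunnel_utilisation_pct": 82,
    "dormant_traffic_kbps": 1000,
    "router_cpu_pct": 60,
    "wifi_client_count": 30,
    "wireless_utilisation_pct": 50,
}


def load_breaches(readings, thresholds=LOAD_DEFAULTS):
    """Return the names of load metrics whose reading exceeds its threshold."""
    return [
        name
        for name, value in readings.items()
        if name in thresholds and value > thresholds[name]
    ]


# Example: link utilisation and client count exceed their defaults; CPU at exactly 60% does not.
breached = load_breaches(
    {"link_utilisation_pct": 85, "router_cpu_pct": 60, "wifi_client_count": 31}
)
```

The comparison is strict because the conditions are worded as "exceeds", so a reading equal to the threshold is not treated as a breach in this sketch.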

#### Health

The counter is decreased if any of these conditions is met. We refer to these as level 2 metrics affecting health:

* Link errors exceed a threshold (default is 1% or 10,000 packets per million - configurable)
* Link congestion occurs:
  * Queue length exceeds 0
  * Discards exceed a threshold (default is 1% or 10,000 packets per million - configurable)
  * Class drops exceed 0 (configurable)
* **Broadband Clarity:**
  * The connection speed of the broadband service drops below the speed threshold, which is auto-learned or manually set - configurable
* **Cellular Clarity:**
  * The signal strength score of the cellular service drops below the threshold, which may be set (default is 0 - configurable)
* **WiFi:**
  * Congestion (discards) exceeds a threshold (default is 1% or 10,000 packets per million - configurable) or
  * Signal Problems exceed a threshold (default is 25% - configurable)
* **Performance tests - ICMP Ping, UDP Echo and TCP Open:**

  * The response of any one test in a sample exceeds the target (configurable)
  * At least one test in a sample fails to respond (lost packet)

  Note: One sample can contain up to six test results. If all tests in a sample fail, both stability and health are affected.
* **Performance tests - Precision Delay and MOS:** (configurable)

  <table><thead><tr><th width="402">Condition</th><th>Precision Delay</th><th>MOS</th></tr></thead><tbody><tr><td>Average response from the burst exceeds response target</td><td>Yes</td><td>Yes</td></tr><tr><td>Percentage of lost packets exceeds packet loss target</td><td>Yes</td><td>Yes</td></tr><tr><td>Jitter measured over the burst exceeds jitter target</td><td>Yes</td><td>Yes</td></tr><tr><td>MOS Score is less than target</td><td>N/A</td><td>Yes</td></tr></tbody></table>
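
The rules above for ICMP Ping, UDP Echo and TCP Open samples can be sketched as a small classifier. This is an illustration under assumptions: the function name is hypothetical, a lost packet is represented as `None`, and responses are taken to be in milliseconds.

```python
from typing import Optional, Sequence, Set


def classify_sample(responses_ms: Sequence[Optional[float]], target_ms: float) -> Set[str]:
    """Return which counters ("stability", "health") one sample would decrease.

    Each entry is one test result; None stands for a lost packet (an assumption
    of this sketch, as is the millisecond unit).
    """
    affected: Set[str] = set()
    if not responses_ms:
        return affected

    lost = [r is None for r in responses_ms]
    if all(lost):
        # 100% packet loss in the sample affects both stability and health.
        affected.update({"stability", "health"})
    elif any(lost):
        # At least one lost packet affects health only.
        affected.add("health")

    # Any single response above the target also affects health.
    if any(r is not None and r > target_ms for r in responses_ms):
        affected.add("health")
    return affected
```

For example, a sample of six lost packets returns both counters, while a sample with one slow or one lost response returns only `"health"`.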

#### Alert generation

Each level 2 metric above can trigger its own alert. For example, you may get an alert when a heat tile goes red because of inbound link utilisation, and then another alert caused by outbound link utilisation even though the tile is already red. The tile colour represents the worst case of all level 2 metrics associated with it.
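
The relationship between per-metric alerts and the tile colour can be sketched as below. The function names, the metric names, and the "green" label for the normal state are assumptions for illustration; treating every worsening of a metric's own colour as an alert is also an assumption rather than documented behaviour.

```python
# Severity ordering; "green" is an assumed name for the normal state.
SEVERITY = {"green": 0, "amber": 1, "red": 2}


def tile_colour(metric_colours):
    """The tile shows the worst colour among its level 2 metrics."""
    return max(metric_colours.values(), key=SEVERITY.__getitem__, default="green")


def new_alerts(previous, current):
    """List metrics whose own colour worsened, regardless of the tile colour."""
    return [
        name
        for name, colour in current.items()
        if SEVERITY[colour] > SEVERITY[previous.get(name, "green")]
    ]


# Hypothetical metric names: the tile is already red from inbound utilisation
# when outbound utilisation also goes red.
before = {"inbound_link_utilisation": "red", "outbound_link_utilisation": "green"}
after = {"inbound_link_utilisation": "red", "outbound_link_utilisation": "red"}
```

Here the tile colour is red both before and after, yet `new_alerts(before, after)` still reports the outbound metric, which mirrors the example in the text.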


