
I’ll describe in this blog how to monitor vSAN events in Slack with VMC on AWS. Creating a Slack notification alert of vSAN threshold exceeding, using vRLIC (vRealize Log Insight Cloud).
Background
Why do you need to monitor on Slack vSAN threshold exceeding events in VMC on AWS?
Monitor vSAN events are one of the most common gotchas in VMC on AWS.
When the vSAN Cluster reaches 70% of its storage utilization, Additional Hosts have added automatically to the Cluster.
You cannot change this behavior through configurations. Therefore monitoring the vSAN storage utilization is key in production.
Until the September 2021 release, there was no default VMC notification on AWS before adding a new host. A new Alert introduced In September when the vSAN Cluster reaches 65% of capacity usage, VMC will send an email to organization users.
The Slack notification use case of monitoring the vSAN cluster usage before reaching 65% is helpful for other monitoring use cases. For instance, alert on noisy neighbor VM’s.
Monitoring VSAN cluster usage with a vCenter alert
Pre-requisite for Slack alert notification, vCenter Alert configuration.
First, open the vCenter in VMC on AWS and define a new datastore Alarm.



In the operator, select “if above.”
In the threshold enter threshold percentage, in my example, 65%.



Second part vRealize log insight(vRLIC) webhook and Alert configuration
Webhook Configuration


Afterward, click on New Webhook and then select Slack Endpoint and enter Destination URL.
You can find the slack side definition of the slack documentation here.
Alert Definitions configuration

Expand alerts, go to alert definitions, and click create a new alert on the left-hand menu.

Please make sure that the query text contains and matches the alert name as defined in vCenter? vRLIC will pick the name from the logs. Make sure you select under trigger condition 1, notification towards the slack endpoint we just created.


It may take several minutes before vRLIC propagates the alert creation correctly and triggers an alert when performing testing.

It helps monitor your infrastructure, and you can use it for additional use cases.
If you found this useful, please feel free to leave a comment or to reach out directly.
Back to the original site