Avoiding Bogus Alerts Using AWS-Based Proxies & Outsourced BGP for Distributed Monitoring
LogicMonitor is a SaaS-based performance and monitoring platform servicing clients across the world. Our customers installLogicMonitor “Collectors” within their data centers to gather data from devices and services utilizing a web application to analyze aggregated performance metrics, and to configure alerting and reporting. This means our entire operation (and therefore the monitoring our customers are dependent on) relies on ISPs to ensure that we efficiently and accurately receive billions of data points a day.
Detecting Major Outages
The logic for alerting on potential data center wide or major network issues is fairly simple: if we have not heard from a Collector after a few minutes, we consider the Collector to be down. This may indicate a data center wide power or network issue, and we have a special class of alert for these cases.
However, what happens if the network path between a Collector and our data center is blocked due to a transit ISP issue? This used to be a problematic issue for our more remote clients (e.g., those in Australia) where the paths to the U.S. are less resilient and prone to brief, periodic lapses in connectivity due to a multitude of transit ISPs.
Learn more, read the entire article here, Avoiding Bogus Alerts Using AWS-Based Proxies & Outsourced BGP for Distributed Monitoring