Skip to main content

Safous ZTNA monitoring schema

Broadly speaking, our monitored infrastructures are: Global Point of Presence (PoP) and App Gateway. Please refer to this article for details: General Architecture.

Since a lot of data are collected from many points around the world, we also set up monitoring proxies in different regions. This is to minimize latencies and pre-process collected load with greater flexibility.

Safous Internal Team will receive PoPs and App Gateway's alerts, while customer PIC will only receive alerts regarding their own App Gateway via the registered PIC tenant email.

Method Overview​

  • Web test: Is a scenario-based monitoring where agent performs a set of predefined HTTP requests. This can also be used to test the functionality of web application. Collected data which sent back to the monitoring host can be used to set up monitoring alert.
  • Active agent: In contrast with Passive Agent--whereby monitoring host request monitoring data from agent each time host reaches its update interval, Active Agent works the other way around, with the agent actively sending monitoring data to monitoring host every time agent reaches its update interval. Monitoring data can then be used directly to set up alert based on certain threshold without the need of pre-processing.
  • Internal check: In simplest term, this is a self-check. The monitoring host will ask the agent to collect the monitoring data of itself which then sent back to monitoring host--in a similar fashion to Active check, monitoring data is also ready-to-use.
  • HTTP agent: An agent is set up with the sole responsibility of answering request from authorized monitoring host with HTML web page filled with monitoring data as the content. Pre-processing of the return page is needed before monitoring data can be used in the host.
  • External check: The monitoring host will ask an agent to execute a script which supports custom parameters. The agent will simply send the return result of the script back to the monitoring host--in a similar fashion to Active check, monitoring data is also ready-to-use. 

PoP Monitoring​

From service side, Global PoPs are monitored using agent-based system. The monitored items includes:

Regional Availability​

ItemDescriptionIntervalThresholdMethod
Affinity checkMonitor PoP availability1mResponse code is not 200Web test

System Resources​

ItemDescriptionIntervalThresholdMethod
Load averageMonitor current PoP load avg (1m, 5m, 15m)1m> 1.5 CPU load avgActive agent
Memory usageMonitor currently available PoP memory in percentage1m> 90%Active agent
Disk usageMonitor currently used PoP disk space in percentage1m> 90%Active agent
Inodes usageMonitor currently available PoP disk inodes1m< 10 inodesActive agent

PoP Properties​

ItemDescriptionIntervalThresholdMethod
Agent availabilityMonitor agent availability1mResponse timeout = 0Internal
PoP NATSMonitor availability of PoP NATS function1mStatus code > 0Active agent
PoP routerMonitor availability of PoP router function1mStatus code > 0Active agent
PoP watchtowerMonitor availability of PoP watchtower function1mStatus code > 0Active agent

App Gateway Monitoring​

Meanwhile from customer side, App Gateway are monitored through agent-less HTTP request. The monitored items includes:

System Resources​

ItemDescriptionIntervalThresholdMethod
CPU numberTotal CPU on host1m-HTTP agent
Free diskFree space left on disk1m75%HTTP agent
Free memoryFree memory on host1m-HTTP agent
Host health checkMonitor host availability1mNo health check dataHTTP agent

App Gateway Properties​

ItemDescriptionIntervalThresholdMethod
SSL Cert. ValidityCheck SSL certificate expiration1d< 1dExternal check
App Gateway Num. CheckNumber of active App Gateway1m> #orderedHTTP agent
SSL Cert. Expiration (7d)Check SSL certificate expiration in 7 days1d< 7dExternal check
App Gateway Health CheckMonitor App Gateway availability1mResponse code is not 200Web test
Login Health CheckMonitor login page availability1mResponse code is not 200Web test
User Num. CheckNumber of active users6h> #orderedHTTP agent
License ExpirationCheck license status in 90 days12h< 90dHTTP agent