Monitoring Appliance Health

Overview

Appliance Health Monitoring helps you proactively manage and troubleshoot the performance of your Login Enterprise Virtual Appliance. By monitoring key metrics, you can ensure optimal operation and resource utilization, preventing minor issues from becoming major problems.

Benefits of Appliance Health Monitoring

Benefit

Explanation

Real-time health monitoring

Allows you to continuously monitor the health of your Appliance in real time.

Early issue detection

Helps identify and address potential problems before they escalate into major issues.

Disk space visibility

Provides clear insight into available disk space, helping prevent critical shortages.

Performance impact awareness

Enables observation of how heavy Tests affect Appliance performance.

Usage trend tracking

Tracks usage patterns over time to support better planning and analysis.

Informed resource allocation

Helps you make decisions about allocating resources based on collected data.

Key Metrics Monitored

  1. Disk utilization

    1. Description: Displays the percentage of total disk space currently in use.

    2. Importance: Helps you keep track of available disk space and avoid running out, which can lead to critical issues.

  2. Disk utilization (Inodes)

    1. Description: Shows the percentage of inodes used versus the total available. An inode represents a file or directory, so running out of inodes means you can’t create new files or directories even if there is free disk space.

    2. Importance: Useful for managing systems with a large number of small files, preventing inode exhaustion.

  3. CPU utilization

    1. Description: Indicates the percentage of CPU resources currently in use.

    2. Importance: Helps you monitor how efficiently your Appliance’s CPU is being used and identify potential performance bottlenecks.

  4. Memory utilization

    1. Description: Shows the percentage of RAM currently in use.

    2. Importance: Provides insight into memory usage, helping you understand if your Appliance needs more resources or if there are inefficiencies.

Technical Details on Metrics Monitoring

  • Disk space and inodes check: We check for disk space and inodes every 20 minutes.

  • Disk and Memory utilization: We monitor disk and RAM utilization every 20 seconds. The results are averaged per minute, which is then displayed in the daily graph view.

Metric Gathering Methods

  1. CPU metrics: We read the /proc/stat file at a 20-second interval and calculate the delta using the algorithm in c#. For details, see Gathering CPU utilization from /proc/stat.

  2. Memory metrics: We gather memory metrics from /proc/meminfo, where the calculation is performed as follows: UsedMemory = MemTotal - MemFree.

  3. Disk metrics: Disk usage is monitored using the df command. For details, see the df Command documentation.

  4. Inodes metrics: Inodes are monitored using the df command with the -i option.

Accessing Appliance Health Monitoring

  1. In the Login Enterprise sidebar menu, navigate to System > Appliance Health.

Frame 1980.png
  1. In System, select Appliance Health from the tab menu.

Frame 1981.png

Using Appliance Health Monitoring

  1. View metrics: The main screen displays the chart with 4 key metrics:

    1. CPU utilization

    2. Disk utilization (Inodes)

    3. Disk utilization

    4. Memory utilization

Frame 1982.png

Tip: Hover over the trend to see the metric details.

Frame 1983.png
  1. Analyze data: Review the metrics to understand your Appliance’s current health status. Check for any unusual trends or values that might indicate potential issues.

  2. Take action: Based on the data, you can take appropriate action, such as freeing up disk space, managing inode usage, or upgrading resources if necessary.

Downloading Charts as an Image

You can download charts as images for sharing or use in custom reports and presentations. Downloading a chart as an image applies to all charts: Load Test, Continuous Test, and Appliance Health. To download a chart:

  1. Locate the Download icon in the top right corner of the chart.

  2. Select image format: PNG, SVG, or JPG.

  3. Download the image. It will be saved to your Downloads folder, or the directory you've configured as your default Downloads location.

Frame 1984.png

Notifications

User Interface

The Appliance Health Monitoring interface includes persistent notifications that provide real-time alerts about system health. Currently, these notifications cannot be disabled and will always appear when certain thresholds are exceeded.

Key features:

  1. Threshold alert: A notification appears when disk usage exceeds 90%.

  2. Dismissal: You can dismiss the notification; however, this action is saved only for the current session. If the issue is not addressed and you relaunch the browser, the notification will reappear. It will not reappear for the same session after dismissal.

  3. Display timing: Notifications are triggered in 20-minute intervals. If you log in shortly after the check, you may not see the alert immediately, leading to a potential wait of up to 20 minutes for the notification to appear.

Frame 950.png

Email Notifications

The Appliance Health Monitoring system allows for customizable email notifications to keep you informed about critical issues.

Configuration options:

  1. Enable/Disable notifications: You can easily enable or disable email notifications based on your preferences.

  2. Recipient settings: Specify recipients to receive alerts when disk usage exceeds the 90% threshold.

  3. Cooldown period: The default cooldown period is set to 48 hours. This means that if a notification is sent due to high disk usage, the system will check again after 48 hours before sending another email. If the issue is resolved within that timeframe, no further emails will be sent.

Note: Email notifications are currently only available for disk usage alerts, triggered by the same threshold of over 90% disk capacity.

Frame 1985.png

Goals and Takeaways

  1. Proactive monitoring: Stay ahead of potential issues by keeping an eye on critical metrics.

  2. Informed decisions: Use the data to make decisions about resource allocation and maintenance.

  3. Efficient operation: Ensure that your Appliance runs smoothly and efficiently by addressing issues before they impact performance.