Operational monitoring
Operational monitoring for gauging system health is an important complement to monitoring for security and compliance. Proper operational monitoring can help you determine whether you need to fail over to an alternative storage or processing site. In addition, operational monitoring can help you determine whether operations have returned to normal after a system disruption. Operational metrics include measurements such as CPU usage, memory usage, and API response times.
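As a rough illustration of how such metrics can feed a failover or return-to-normal decision, the following Python sketch averages a window of samples and reports which metrics exceed a limit. The `Sample` type, the threshold values, and the function names are hypothetical and chosen only for this example. They are not part of any IBM Cloud API, and suitable limits depend on your workload.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Sample:
    """One observation of the operational metrics named above."""
    cpu_percent: float       # CPU usage, 0-100
    memory_percent: float    # memory usage, 0-100
    api_response_ms: float   # API response time in milliseconds


# Hypothetical limits (assumption): tune these for your own workload.
THRESHOLDS = Sample(cpu_percent=85.0, memory_percent=90.0, api_response_ms=500.0)


def breached_metrics(samples, thresholds=THRESHOLDS):
    """Return the names of metrics whose window average exceeds its limit."""
    averages = {
        "cpu_percent": mean(s.cpu_percent for s in samples),
        "memory_percent": mean(s.memory_percent for s in samples),
        "api_response_ms": mean(s.api_response_ms for s in samples),
    }
    return [name for name, value in averages.items()
            if value > getattr(thresholds, name)]


def is_healthy(samples, thresholds=THRESHOLDS):
    """True when no metric average is above its limit."""
    return not breached_metrics(samples, thresholds)
```

A check like `is_healthy(window)` could be evaluated during a disruption to inform a failover decision, and again afterward to confirm that operations have returned to normal. Averaging over a window rather than reacting to a single sample is one simple way to avoid acting on short spikes.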
Since IBM Cloud Monitoring is not Financial Services Validated, the next two sections describe "bring your own" software solutions for Red Hat OpenShift on IBM Cloud and Virtual Servers for VPC.
Generally speaking, you should strive to use only services that are Financial Services Validated in your solutions. However, depending on your circumstances, there might be exceptions. See the best practice Use only services that are IBM Cloud for Financial Services Validated for more details and potential exceptions.
Red Hat OpenShift on IBM Cloud
You need to install your own software solution for monitoring in Red Hat OpenShift on IBM Cloud within your own VPC. There are various ways to implement an operational monitoring solution. See Setting up an operational monitoring solution for one example that uses Prometheus and Grafana on Red Hat OpenShift on IBM Cloud.
Virtual Servers for VPC
You need to install your own software solution for monitoring in virtual server instances. See Setting up an operational monitoring solution for one example that uses Prometheus Node Exporter to send virtual server instance metrics to Prometheus and Grafana on Red Hat OpenShift on IBM Cloud.
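To give a sense of what Node Exporter metrics look like before they reach Prometheus, the following Python sketch reads the plain-text metrics that Node Exporter exposes and derives a memory usage percentage. It is a minimal sketch, not the setup from the linked example: the address in `NODE_EXPORTER_URL` is a hypothetical placeholder, the helper names are invented for illustration, and the parser deliberately handles only samples without labels. The metric names `node_memory_MemTotal_bytes` and `node_memory_MemAvailable_bytes` are the ones Node Exporter reports on Linux.

```python
import urllib.request

# Hypothetical address (assumption); Node Exporter listens on port 9100 by default.
NODE_EXPORTER_URL = "http://10.0.0.4:9100/metrics"


def parse_unlabelled_samples(text):
    """Parse label-free samples from Prometheus text exposition format."""
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines, HELP/TYPE comments, and labelled series,
        # which are out of scope for this sketch.
        if not line or line.startswith("#") or "{" in line:
            continue
        name, value = line.split()[:2]
        samples[name] = float(value)
    return samples


def memory_usage_percent(samples):
    """Derive memory usage from total and available bytes."""
    total = samples["node_memory_MemTotal_bytes"]
    available = samples["node_memory_MemAvailable_bytes"]
    return 100.0 * (total - available) / total


def fetch_metrics(url=NODE_EXPORTER_URL):
    """Download the raw metrics text from a Node Exporter endpoint."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8")
```

In a real deployment, Prometheus scrapes this endpoint on a schedule and Grafana queries Prometheus, so a script like this is mainly useful for spot checks, for example `memory_usage_percent(parse_unlabelled_samples(fetch_metrics()))` run from a host that can reach the virtual server instance.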
Related controls in IBM Cloud Framework for Financial Services
The following IBM Cloud Framework for Financial Services controls are most closely related to this guidance. However, in addition to following the guidance here, do your own due diligence to ensure that you meet the requirements.
Family | Control |
---|---|
Contingency Planning (CP) | CP-2 (3) Contingency Plan \| Resume Essential Missions / Business Functions |
Contingency Planning (CP) | CP-6 Alternate Storage Site |
Contingency Planning (CP) | CP-7 Alternate Processing Site |
Contingency Planning (CP) | CP-10 Information System Recovery and Reconstitution |