What are key aspects in designing and implementing a Cloud Monitoring Solution for Infrastructure? Few points -
- Identify a set of tools to monitor hardware (including servers, networks (L2/L3), storage (processors, drives etc..) and on the software front (tools, application etc).
- Choose a reporting method - For example: emails, dashboards etc.. Have necessary alerting in place, fine tune them (setting up thresholds, collection intervals etc..) to report events.
- Monitoring Dashboard for the group who receives/handles event. Design the dashboard to view, manage and triage events reported.
- Design and develop change management for each event processing/fix.
Please share your thoughts here.