35e6b139fc
ci / validate (push) Failing after 1m8s
Rework portfolio around Linux operations, Zabbix monitoring, migration validation, and ELK/Grafana log observability. Add AAP-style LVM resize workflow, Zabbix server/proxy/agent automation assets, Linux/AIX monitoring templates, and updated validation CI.
1.1 KiB
1.1 KiB
Incident Response Runbook
Filesystem Alert
- Confirm current usage and growth trend.
- Check whether the host is Linux or AIX and use the correct runbook.
- Validate application ownership of the filesystem.
- Clean known temporary paths or request LVM expansion when approved.
- Attach before/after evidence to the incident ticket.
Agent Unreachable
- Confirm whether data loss affects one host, one proxy, or one network segment.
- Check proxy queue and last seen timestamp.
- Validate agent service state and firewall path.
- For active checks, confirm
ServerActiveand hostname match.
Proxy Backlog
- Check server reachability from proxy.
- Check proxy DB filesystem usage.
- Confirm whether config sync recently changed.
- Reduce noise by temporarily disabling non-critical discovery rules if required.
Unsupported Items
- Identify affected template and item key.
- Check whether item is Linux-specific or AIX-specific.
- Validate agent version and custom user parameters.
- Roll back template change if canary host group is affected.