3.9 KiB
3.9 KiB
Changelog
[Unreleased]
Added
- Python tooling validation for operational scripts.
incident-log-summaryfor general incident log summarization.log-diff-checkerfor pre-change and post-change log comparison.auth-log-auditfor Linux authentication log review.jvm-log-analyzerfor JVM application log summaries.journal-analyzerfor exportedjournalctllog review.known-error-matcherwith JSON-based known error patterns.- Standalone Bash incident checks for CPU, memory/OOM, service restart loops, failed SSH logins, certificate expiry, DNS connectivity, NTP drift, read-only filesystems, inode usage, and JVM process diagnostics.
incident_triage_report.shfor L2 Markdown incident handover reports built from existing Bash incident checks.- Repository-level Codex guidance:
AGENTS.mddocs/codex/README.mddocs/codex/review-checklist.mddocs/codex/task-template.mddocs/codex/plans-template.md
- Lightweight validation helpers:
scripts/validate-repo.shscripts/check-bash.shscripts/check-ansible.shscripts/check-docs.sh
- Cross-repository operational documentation structure:
infra-run/docs/operations-cheatsheet.mdplatform-projects/docs/platform-cheatsheet.mdlabs/docs/lab-cheatsheet.md
- Production-oriented Linux/Unix operations reference with incident workflows, storage and networking checks, SSL/TLS notes, AIX commands, automation safety patterns, Ansible operational usage, and observability quick-reference.
- SELinux operational coverage for mode checks, context inspection, AVC audit review, persistent relabel workflow, booleans, and SELinux-specific incident response.
- Selected baseline Ansible hardening automation:
- RHEL 9 role and playbook.
- Debian 13 / Ubuntu 26.04 role and playbook.
- IBM AIX 7 role and playbook.
- Shared sanitized Ansible inventory defaults for Linux and AIX examples.
- Role-level task structure covering pre-checks, SSH, sudo, auditing, logging, services, filesystem controls, platform-specific settings, handlers, and post-check validation.
Changed
- Updated root,
infra-run, Bash, Ansible, platform, and lab README guidance for safety-first usage, validation, and future Codex-driven work. - Updated repository and
infra-runREADME files to surface the new documentation structure and operational cheatsheets. - Updated repository,
infra-run, and Ansible README files to describe the new hardening automation instead of placeholder-only Ansible structure. - Updated Python tooling documentation and repository roadmap.
- Integrated Python syntax validation into repository validation workflow and CI.
Notes
- Hardening content covers selected baseline controls and intended for portfolio/lab use; live use requires environment-specific review and validation.
[Initial Version]
Added
- Repository structure:
infra-runplatform-projectslabs
- Linux operations Bash toolkit under
infra-run/scripts/bash/os-healthcheck/:- healthcheck
- disk usage checks
- service checks
- system reporting
- Disk full incident toolkit:
- disk analysis
- large files detection
- deleted open files detection
- safe cleanup suggestions
- Network troubleshooting script under
infra-run/scripts/bash/os-healthcheck/:- interface, routing, DNS, connectivity checks
- Veritas storage toolkit:
- VxVM disk detection
- diskgroup extension
- volume/filesystem resize
- VCS freeze/unfreeze workflow
- GPFS storage toolkit:
- cluster validation
- NSD planning
- filesystem expansion
- rebalance
- Runbook-style structure and step-based execution.
Changed
- Moved Linux operations scripts into
infra-run/scripts/bash/os-healthcheck/to keep host health and troubleshooting checks grouped together.
Notes
- All scripts default to dry-run where change actions are present.
- Designed for safety and readability.
- No destructive actions without explicit confirmation.