What you'll do
- Maintain and improve our cloud infrastructure (VPS-based with Cloudflare in front).
- Own CI/CD, deployment pipelines, and release safety nets.
- Build observability — metrics, logging, tracing, alerting — that helps the team find issues quickly.
- Lead post-incident reviews and ship the changes that prevent recurrences.
- Partner with security on hardening — most recently after our April 2026 incident, where ops discipline made all the difference.