$2,175.00 Fixed
AlphaEdge Tech
Contract · Flexible hours
About the role
AlphaEdge Tech is seeking a Senior Site Reliability Engineer to lead the reliability and performance of a critical micro‑services platform. You will design, implement, and automate monitoring, alerting, and incident response processes to ensure 99.9% uptime for high‑traffic applications.
Key responsibilities
- Design and maintain scalable infrastructure on AWS using IaC tools.
- Implement robust CI/CD pipelines for automated deployments.
- Develop and fine‑tune observability stacks (Prometheus, Grafana, Loki).
- Manage container orchestration with Kubernetes, ensuring high availability.
- Automate incident detection, response, and post‑mortem analysis.
- Collaborate with development teams to embed reliability best practices.
Must-have skills
- Deep experience with Linux system administration.
- Advanced knowledge of Docker and Kubernetes.
- Proficiency in AWS services and networking.
- Strong scripting abilities (Python, Bash).
- Expertise in monitoring, logging, and alerting tools.
Nice to have
- Experience with Terraform or CloudFormation.
- Familiarity with chaos engineering practices.
- Proposal: 0
- Less than 3 month
Erik Freeman
,
Member since
Oct 28, 2025
Total Job