ZF277 - Site Reliability Engineering Manager (SRE)

OpenLoop

  • Lima
  • Permanent
  • Full-time
  • Posted 13 hours ago

Overview

OpenLoop is looking for a Site Reliability Engineering Manager (SRE) to join our team in Lima, Peru. This role will be part of the Engineering Team.

Responsibilities

- Build, lead, mentor, and grow a team of Site Reliability Engineers
- Conduct regular 1:1s, performance reviews, and career development planning
- Foster a culture of learning, collaboration, continuous improvement, sense of urgency, and clear communication
- Recruit, interview, and onboard new SRE team members
- Collaborate with engineering leadership on team planning and resource allocation
- Assess the current system landscape to build an SRE roadmap and determine headcount needs
- Define and implement SRE strategy aligned with business objectives
- Establish and maintain SLIs, SLOs, and error budgets across all services
- Drive incident response processes and post-mortem culture
- Lead capacity planning and infrastructure scaling initiatives
- Oversee monitoring, alerting, and observability implementations
- Champion automation and infrastructure-as-code practices
- Partner with engineering teams to improve system reliability and deployment practices
- Work with security teams to implement secure, compliant infrastructure
- Collaborate with the product team to balance feature velocity with reliability
- Engage with executive leadership on infrastructure strategy and planning
- Coordinate with vendors and external partners on critical infrastructure components
- Ensure 24/7 system availability and rapid incident response
- Implement and maintain disaster recovery and business continuity plans
- Drive cost optimization initiatives while maintaining reliability standards
- Establish and track key reliability metrics and KPIs
- Lead efforts to reduce toil and increase automation

Qualifications

- 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering
- 3+ years of people management experience, preferably in technical roles
- Proven track record of managing large-scale, distributed systems
- Experience with incident management and post-mortem processes
- Strong background in AWS and container orchestration
- Strong proficiency in at least one programming language (TypeScript, Python, Go, etc.)
- Deep understanding of Linux/Unix systems and networking
- Experience with Infrastructure as Code (AWS CDK)
- Proficiency with monitoring and observability tools (Prometheus, Grafana, ELK, etc.)
- Knowledge of CI/CD pipelines and deployment automation (GitHub Actions, Jenkins, etc.)
- Understanding of database systems and performance optimization
- Advanced English fluency (C1) and excellent verbal and written communication skills
- Experience leading technical discussions and presenting to stakeholders
- Ability to translate technical concepts to non-technical audiences
- Strong problem-solving and decision-making capabilities
- Experience with agile methodologies and project management

Desirable / Additional Experience

- Experience in high-growth startup environments
- Background in regulated industries (healthcare, finance, etc.)
- Experience with event-driven architecture, microservices and service mesh
- Knowledge of security best practices and compliance frameworks
- AWS Certified Solutions Architect or similar cloud certifications
- Experience with chaos engineering and fault injection
- Knowledge of performance testing and load testing frameworks
- Understanding of distributed tracing and application performance monitoring
- Experience with configuration management tools (Ansible, Chef, Puppet)

Benefits

- Contract on a Peruvian company payroll ("Planilla"). You will receive all the legal benefits in Peruvian soles (CTS, "Gratificaciones", etc.).
- Monday to Friday, full time (9 am to 6 pm).
- Unlimited Vacation Days - Yes! We want you to be able to relax and come back as happy and productive as ever.
- EPS healthcare covered 100% with RIMAC
- Oncology insurance covered 100% with RIMAC
- AFP retirement plan to help you save for the future.
- We’ll assign a computer in the office so you can have the best tools to do your job.
- You will have all the benefits of the coworking space located in Miraflores, Lima (free beverages, internal talks, bicycle parking, best view of the city).
