(HN232) - Site Reliability Engineering Manager (SRE)

  • Lima
  • Permanente
  • Tiempo completo
  • Hace 16 horas
OpenLoop is looking for a Site Reliability Engineering Manager (SRE) to join our team in Lima, Peru. This role will be a member of the Engineering Team.About OpenLoopOpenLoop was co-founded by CEO, Dr. Jon Lensing, and COO, Christian Williams, with the vision to bring healing anywhere. Our telehealth support solutions are thoughtfully designed to streamline and simplify go-to-market care delivery for companies offering meaningful virtual support to patients across an expansive array of specialties, in all 50 states.Our Company CultureWe have a relatively flat organizational structure here at OpenLoop. Everyone is encouraged to bring ideas to the table and make things happen. This fits in well with our core values of Autonomy, Competence and Belonging, as we want everyone to feel empowered and supported to do their best work.Job source: getonbrd.com.About the RoleTeam Leadership & Management- Build, lead, mentor, and grow a team of Site Reliability Engineers
- Conduct regular 1:1s, performance reviews, and career development planning
- Foster a culture of learning, collaboration, continuous improvement, sense of urgency and over communication.
- Recruit, interview, and onboard new SRE team members
- Collaborate with engineering leadership on team planning and resource allocationTechnical Strategy & Operations
- Address and evaluate current company situation regarding applications, systems and platforms to build an SRE roadmap as well as the team headcount
- Define and implement SRE strategy aligned with business objectives
- Establish and maintain SLIs, SLOs, and error budgets across all services
- Drive incident response processes and post-mortem culture
- Lead capacity planning and infrastructure scaling initiatives
- Oversee monitoring, alerting, and observability implementations
- Champion automation and infrastructure-as-code practicesCross-Functional Collaboration
- Partner with engineering teams to improve system reliability and deployment practices
- Work with security teams to implement secure, compliant infrastructure
- Collaborate with the product team to balance feature velocity with reliability
- Engage with executive leadership on infrastructure strategy and planning
- Coordinate with vendors and external partners on critical infrastructure componentsOperational Excellence- Ensure 24/7 system availability and rapid incident response
- Implement and maintain disaster recovery and business continuity plans
- Drive cost optimization initiatives while maintaining reliability standards
- Establish and track key reliability metrics and KPIs
- Lead efforts to reduce toil and increase automationRequirementsExperience- 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering
- 3+ years of people management experience, preferably in technical roles
- Proven track record of managing large-scale, distributed systems
- Experience with incident management and post-mortem processes
- Strong background in AWS and container orchestrationTechnical Skills- Strong proficiency in at least one programming language (Typescript, Python, Go, etc.)
- Deep understanding of Linux/Unix systems and networking
- Experience with Infrastructure as Code (AWS CDK)
- Proficiency with monitoring and observability tools (Prometheus, Grafana, ELK, etc.)
- Knowledge of CI/CD pipelines and deployment automation (Github Actions, Jenkins, etc)
- Understanding of database systems and performance optimizationLeadership & Communication
- Advanced english (C1) fluency
- Excellent verbal and written communication skills
- Experience leading technical discussions and presenting to stakeholders
- Ability to translate technical concepts to non-technical audiences
- Strong problem-solving and decision-making capabilities
- Experience with agile methodologies and project managementDesirable SkillsAdditional Experience- Experience in high-growth startup environments
- Background in regulated industries (healthcare, finance, etc.)
- Experience with event-driven architecture, microservices and service mesh
- Knowledge of security best practices and compliance frameworks
- AWS Certified Solutions Architect or similar cloud certificationsTechnical Depth
- Experience with chaos engineering and fault injection
- Knowledge of performance testing and load testing frameworks
- Understanding of distributed tracing and application performance monitoring
- Experience with configuration management tools (Ansible, Chef, Puppet)Our Benefits- Contract under a Peruvian company ID("Planilla"). You will receive all the legal benefits in Peruvian soles (CTS, "Gratificaciones", etc).
- Monday - Friday workdays, full time (9 am - 6 pm).
- Unlimited Vacation Days - Yes! We want you to be able to relax and come back as happy and productive as ever.
- EPS healthcare covered 100% with RIMAC --Because you, too, deserve access to great healthcare.
- Oncology insurance covered 100% with RIMAC
- AFP retirement plan-to help you save for the future.
- We'll assign a computer in the office so you can have the best tools to do your job.
- You will have all the benefits of the Coworking space located in Lima - Miraflores (Free beverage, internal talks, bicycle parking, best view of the city)GETONBRD Job ID: 55228Wellness program OpenLoop offers or subsidies mental and/or physical health activities.Life insurance OpenLoop pays or copays life insurance for employees.Paid sick days Sick leave is compensated (limits might apply).Partially remote You can work from your home some days a week.Bicycle parking You can park your bicycle for free inside the premises.Health coverage OpenLoop pays or copays health insurance for employees.Dental insurance OpenLoop pays or copays dental insurance for employees.Free car parking You can park your car for free at the premises.Computer provided OpenLoop provides a computer for your work.Informal dress code No dress code is enforced.Vacation over legal OpenLoop gives you paid vacations over the legal minimum.Beverages and snacks OpenLoop offers beverages and snacks for free consumption.Remote work policyHybridThis job is performed partly from home and partly at the office in Lima (Peru).Apply now- Share →› › ›About OpenLoopServicing all 42,000 zip codes nationwide, OpenLoop accelerates the delivery of patient care by matching our trusted community of certified clinicians and insurance partners with innovators in the digital health space. —Similar jobsLooking for SysAdmin / DevOps / QA jobs?Sign up for free and find jobs that truly match you.Site Reliability Engineering Manager (SRE)OpenLoop • LimaThis job is performed partly from home and partly at the office in: Lima (Hybrid)ⓘ Requires applying in EnglishShare this job ShareLog in and find the best jobs.components--modal#closeWithKeyboard externalClose@window-
components--modal#close' data-components--modal-target='container' id='base-modal'>

Kit Empleo

Empleos similares

  • Coordinador de Ingeniería / Site Manager

    Applus+

    • Lima
    Somos Applus+, una de las empresas líderes reconocido en el mercado en el sector de inspección, ensayos y certificación, que ayuda a sus clientes a potenciar la calidad y la seguri…
    • Hace 17 días
  • Site Manager

    Yara

    • Lima
    Yara es el proveedor líder mundial de soluciones de nutrición de cultivos, que combina el conocimiento global y local de los cultivos, una cartera de productos de alta calidad y un…
    • Hace 7 días
  • Site Manager

    • Lima
    Yara es el proveedor líder mundial de soluciones de nutrición de cultivos, que combina el conocimiento global y local de los cultivos, una cartera de productos de alta calidad y un…
    • Hace 8 días