Get access to exclusive discount on Golang courses up to 25% off
Last developer joined
2-Click Apply
Upload Your CV
Go to your Inbox & Confirm Your Application
Senior Site Reliability Engineer, CORE at Netflix - Los Gatos, California, United States
Salary $250,000 to $500,000 a year Company Websitehttps://netflix.com Published November 2024 Applicants This Week 3
Job Description
At Netflix, we strive to bring joy to people across the world through amazing stories. As we grow internationally, we are continually enhancing our cloud-based infrastructure to improve our performance, scalability, and reliability.
The SRE team's goal is to ensure customer joy by successfully managing risk and minimizing impact across Netflix. We do this through cross-functional engagement with other engineering teams, managing issues when they happen, as well as promoting reliability and resilience practices throughout the organization.
Outcomes
Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Increase our reliability through establishing guidance and methods of improvement
Form and maintain relationships with internal and external partners
Develop deeper insights and analysis into the quality of experience for our customers
We Value
Curiosity about how complex sociotechnical systems successfully operate at scale when failure is inevitable
People who see influence as their preferred tool for cultivating relationships
Collaboration and continuous improvement
A desire to learn and readiness to teach
Iteration as the path forward
Our Work
Drive incidents to resolution by coordinating with multiple engineering teams
Identify sources of instability in large-scale distributed systems and drive operational excellence
Analyze complex systems from a reliability and resilience perspective
Engage with product teams to diagnose operational surprises and carry forward improvements
Improve reliability and drive down the burden of toil with tooling and automation
Nice to Have
Experience with global, continuous delivery methods
Development with Python, Go, Java, or JavaScript/Node.js
Involvement with incident management and response
Knowledge of cloud platforms like AWS and microservices architecture