Golang SRE Jobs in United States

Hand-Picked Golang jobs • Apply directly to companies • Clear salary ranges

Browse 6 Golang SRE Jobs (6 new this month) in United States 🇺🇸 in April 2025 at companies like Rebellion Defense, TextNow and Digital Ocean with salaries from $100,000 to $500,000 working as a Site Reliability Engineer, Senior Site Reliability Engineer and Senior Engineer Tools & Platforms SRE. Last post 2 weeks ago

Hiring Golang Developers?

Create your profile to continue

48 direct messages sent by companies to developers on Golang Cafe in the last 30 days
58 developers joined Golang Cafe in the last 30 days
13,433 developer profiles page views in the last 30 days
Get access to our Salary Explorer
Get access to exclusive discount on Golang courses up to 25% off
Last developer joined 1 day ago

2-Click Apply

Upload Your CV
Go to your Inbox & Confirm Your Application

Upload Your CV (PDF file only, max 5MB)
Notify me about new job openings

6 of 6 SRE Jobs in United States 🇺🇸 • Sort by Date

Site Reliability Engineer
Rebellion Defense
Washington, DC / Chicago, Illinois, United States
$100,000 to $200,000 a year
November 2020
2 Applicants This Week
More Than 6 Months Old

Job Description

We are looking for a Site Reliability Engineer (SRE). As an SRE, you will be tasked with the reliability and operation of our production environments. SREs are tasked with ensuring teams within the company receive help maintaining software at scale, as well as help designing and developing software for scale. SREs are expected to engage with the product teams to ensure the delivery of our software is as seamless as possible.

These position is based out of our Washington D.C. or Chicago Illinois office locations. An active clearance or ability to obtain TS/SCI clearance will be required.

We look for a track record of the following:

Coming alongside high energy engineering teams to enable the adoption of best practices to enable the scalability and reliability of deployed software,
Defined architecture and built services at scale on public infrastructure such as AWS and Azure,
Experience designing, implementing, deploying, and operating high scale production services,
Experience facilitating the definition and implementation of SLIs and SLOs,
Understanding how to carefully spend error budget to handle regular deployment of large changes to production,
Deep experience in Linux operating systems, and systems engineering,
Comfort delivering critical software in Go and Python,
Willingness to debug problems across the stack,
Comfortability with working on underspecified problems and are capable of rapidly learning and iterating on solutions,
Experience building the wrong system enough times to avoid the common pitfalls, whether building something personally or advising others.

You might be a good fit if you:

5+ years of relevant SRE experience in the tech industry,
demonstrable knowledge of TCP/IP, HTTP, web application security and experience supporting web application architecture,
experience working with a variety of storage systems, application architectures, compute infrastructure and network management systems,
experience designing, implementing, deploying, and operating high scale production service,
defined architecture and built services at scale on public infrastructure such as AWS and Azure, proven knowledge at least one higher-level language (eg. Python and Golang),
The ability and desire to build and learn new systems with new technologies.

Rebellion is a well-capitalized technology start-up firm that is passionate about defining and delivering modern, life-changing software products to the US Department of Defense (DoD), the UK Ministry of Defence (MoD), and their allies. At Rebellion we believe in operating what we own, we deliver all of our products as managed services, this allows our product teams to maintain operational ownership across all deployments. Expect talented, motivated, intense, and interesting co-workers.

Compensation includes meaningful equity ownership, competitive salaries, full medical coverage, disability and life insurance, and transit reimbursement.

An Equal Opportunity Employer/Veterans/Disabled. Rebellion Defense is an equal opportunity employer and makes employment decisions on the basis of merit and business needs. Rebellion Defense does not discriminate against applicants on the basis of race, color, religion, sex, sexual orientation, gender, gender identity, national origin, veteran status, disability, or any other protected characteristic in accordance with federal, state, and local law.

Apply ⎘ Copy Link ↗ Visit Link

Senior Site Reliability Engineer
TextNow
Remote (United States)
$150,000 to $230,000 a year
October 2021
1 Applicants This Week
More Than 6 Months Old

This job posting is no longer available

Job Description

TextNow is based around a simple idea: Communication belongs to everyone. We work hard to help people stay connected by offering a solution that makes phone service free. At TextNow, we work together to solve complex and interesting problems that have a positive impact on our customers' lives.

Join us in our mission to help people stay connected with technology that is free (or as close to free as possible.)

TextNow is looking for motivated Site Reliability Engineers (SRE's) to own infrastructure, monitoring, logging, ci/cd, reliability and everything in between!

What You’ll Do:

Be responsible for maintaining and scaling production services and servers for complex and high throughput.
Improve scalability, service reliability, capacity, and performance.
Write automation code for provisioning and operating infrastructure at scale.
Build tools for internal use to support software engineering best practices.
You are not an operator; you’re an experienced software engineer focused on operations.
Work with development teams to make sure the applications fit nicely within the infrastructure and scalability/reliability/security is designed and implemented from the start.
Participate in on-call rotation, being responsible for uptime and support.
Roll up the sleeves to troubleshoot incidents, formulate and test your hypotheses, and narrow down possibilities to find the root cause.

Who You Are:

Creator of cool stuff with experience deploying web apps and distributed, service-oriented architectures.
Brilliant Collaborator with 8+ years of professional experience in an operationally focused role, preferably in DevOps or SRE, with a B.S., M.S., or PhD. in Computer Science (or equivalent).
Someone who takes action and ownership with proven ability to use automation tools.
Respectfully candid with the ability to motivate people to act and work on behalf of our customer.
A bold risk-taker and self-starter who loves to solve challenging problems.
Resourceful and scrappy with the ability to be strategic, roll up your sleeves and execute.

Other:

Strong knowledge of Linux and open source software
Understanding of modern web architecture (HTTPS, REST) and technology stacks
2+ years of experience with programming/scripting languages (Bash, Go, Python, Ruby, etc.)
Experience with deployment automation using Ansible, Puppet, and Terraform
Experience supporting various databases such as MariaDB, Redis, and various NOSQL engines
Experience deploying containers using Docker and Kubernetes
Experience working in the Amazon public cloud (AWS)
Experience supporting mobile applications (Android and iOS)
Experience in the telecommunications industry

#LI-SW1

Benefits:

· Strong work life blend

· Flexible work arrangements (wfh, remote)

· Employee Stock Options

· Unlimited vacation

· Competitive pay and benefits

· Parental leave

· Benefits for both physical and mental well being

Diversity and Inclusion:

At TextNow, our mission is built around inclusion and offering a service for EVERYONE, in an industry that traditionally only caters to the few who have the means to afford it. We believe that diversity of thought and inclusion of others promotes a greater feeling of belonging and higher levels of engagement. We know that if we work together, we can do amazing things, and that our differences are what make our product and company great.

For TextNow Candidates:

The People and Culture team is available to support you through the hiring process by providing reasonable accommodations to help enable a barrier-free interview experience. If you need assistance applying for a role due to a disability or special need, please let us know by completing this form. Once received our Equity, Diversity and Inclusion Specialist will reach out to you and assist with accommodations that you may require.

⎘ Copy Link ↗ Visit Link

Senior Engineer Tools & Platforms SRE
Digital Ocean
New York / Cambridge / Palo Alto, United States / Remote
$155,000 to $190,000 a year
July 2019
3 Applicants This Week
More Than 6 Months Old

Job Description

Do you ever wonder what happens inside the cloud?

Based in New York, DigitalOcean is a dynamic, high-growth technology company that serves a robust and passionate community of developers, teams, and businesses around the world. We believe that today’s entrepreneurs are changing the world through software. Our mission is to empower these entrepreneurs by bringing modern app development within reach for any developer, anywhere in the world.

We want people who are passionate about building the systems, culture, and processes that will improve the resiliency, reliability, scaling, and performance for cloud services.

We are looking for an experienced Site Reliability Engineer to work closely with our product engineering and infrastructure teams. Reporting to the Director of Platform Systems, the Site Reliability Engineer will be performing a mix of hands-on development, coaching, and collaborating with other teams and stakeholders to help bring DigitalOcean’s engineering systems and culture up to the next level.

This is a key opportunity to make a significant impact in DigitalOcean’s engineering and operational systems and influence future product designs and requirements. This role is essential to accelerate the improvement of the high expectations our customers have of DigitalOcean as we continue to grow and expand.

What You’ll Be Doing:

Performing hands on technical work to directly improve the reliability, resiliency, and scaling of our key platform systems
Working with stakeholders to develop and implement reliability and performance metrics
Facilitate DigitalOcean’s culture of learning by providing insight and recommendations for improvement
Coaching teams and individuals on reliability best practices and solutions
Working with other SREs and engineering leaders to define the architectures and practices that should be adopted in order to deliver on our engineering and operational goals
Establishing best practices for development, architecture, deployment, and operations
Working with peer SREs to improve services and processes (including architecture reviews, incident response, monitoring) in a cross-functional manner throughout the engineering organization

What We’ll Expect From You:

Distinguished track record as SRE (or similar role) with hands-on experience implementing reliability, process, and scaling solutions
History of fostering positive relationships with stakeholders and a track record of successful collaboration and coaching
Clear communication skills (both written and verbal) to document processes and architectures
Experience implementing disaster recovery best practices
Developing robust solutions that facilitate streamlined resolution of customer inquiries through use of technologies for automation, deflection, and issue management
Adept in Ruby and Go with a broad understanding of the full technology stack for a modern infrastructure
Advocate of effective development environments with the use of CI/CD tooling and configuration management technologies such as Chef or Ansible

Why You’ll Like Working for DigitalOcean:

We have amazing people. We can promise you will work with some of the smartest and most interesting people in the industry. We work hard but we always have fun doing it. We care deeply about each other and take our “no jerks” rule very seriously.
We value development. We are a high-performance organization that is always challenging ourselves to continuously grow. That means we maintain a growth mindset in everything we do and invest deeply in employee development. You’ll need to be great to get hired here and we promise you’ll get even better.
We care about you. We offer competitive health, dental, and vision benefits for employees and their dependents, a monthly gym reimbursement to support your physical health, and a monthly commute allowance to make your trips to and from work easier.
We invest in your future. We offer competitive compensation and a 401k plan with up to a 4% employer match. We also provide all employees with Kindles and reimbursement for relevant conferences, training, and education.
We want you to love where you work. We have great office spaces located in the heart of SoHo NYC and Cambridge and offer daily catered lunches to keep your hunger at bay. We’re also very remote-friendly—we use Slack to communicate across the company—and all remote employees have the opportunity to onboard in-office and take an all-expenses paid trip to our annual company offsite, Shark Week, to get quality in-person time with the team at least once a year. We also allow employees to customize their workstations to meet their needs—whether remote or in office.
We value diversity and inclusivity. We are an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Apply ⎘ Copy Link ↗ Visit Link

DevOps Engineer
nextmv
Remote (Europe, United States) / New York / Philadelphia
$100,000 to $140,000 a year
January 2021
2 Applicants This Week
More Than 6 Months Old

This job posting is no longer available

Job Description

nextmv (YC W20) is changing how companies automate and optimize their operations. We provide developers with the building blocks to create and test decision models, quickly. From logistics to healthcare to finance, every company can benefit from decision engineering using optimization and simulation. We’re looking for incredibly motivated people to help!

In a little over a year we have made substantial progress. We’re already landing enterprise clients. We’ve raised over $11 million from leading VC firms including Y Combinator, Firstmark Capital, Dynamo Ventures, and 2048 VC. And we’re just getting started.

We are looking for a DevOps Engineer II who is familiar with cloud platforms, container technology and loves automation. As the first dedicated hire supporting cloud infrastructure, internal tooling and automation you will have an impact on how we operate all our systems and services. In this role you will help build and maintain cloud infrastructure for our tools and products as well as assist with customer deployments ensuring we are following best practices and industry standards. You'll directly contribute to the success of our new hosted product by serving a hybrid DevOps / SRE function. This role will participate in our on-call rotation.

Requirements

3+ years as a software engineer, DevOps engineer, cloud engineer, site reliability engineer or systems administrator
Demonstrable experience administering AWS, especially VPCs, Lambda, RDS, S3 and IAM Roles & Policies
Experience with Infrastructure as Code (IAC) using Terraform
Excellent understanding of Docker & container technologies
Hands on experience with configuration management tools such as Ansible
Demonstrable understanding of modern software development practices including pair programming, peer reviews, Git-based workflows, continuous integration and delivery, and automated testing
Comfortable with Bash and Python
Familiarity with monitoring tools and services (DataDog)

Not required, but a plus:

Experience with Go or another statically typed and compiled language
Experience with serverless systems
Hands on experience with Kubernetes
Experience with software package management (RPM, APT, npm, Maven, Nexus, Artifactory, etc)
Ability to evaluate the benefits of using in-house vs off-the-shelf solutions
Software development experience
Familiarity with on-call / incident response practices
2+ years of remote work experience

These are some of your traits:

The idea of working in a fast-paced startup environment excites you
You thrive on automating everything and adding structure to processes and procedures
Working together as a team to accomplish goals is more important than working alone
You are eager to support our customers when they have DevOps or cloud engineering questions and researching technologies to find solutions
You value simplicity over complexity
You embrace challenging technical work
You thrive on discovering and documenting simple, pragmatic solutions
You’re not afraid to speak up when you have a point of view, but can “disagree and commit” once a final decision is reached
You just read this whole list and got more excited than concerned

How we work

We are remote first

We value amazing work and a strong work-life balance. The majority of our collaboration happens on Slack and Zoom. We get together quarterly for team offsites so we can get some facetime (Covid Pending).

Salary Transparency

We believe that financial transparency creates trust, and that teams with a high level of trust are able to execute more effectively. We view salary transparency as a way to challenge a rampant problem in our industry: the wage gap. The base salary for any two employees in the same role is the same. Performance in that role is the differentiator, not upfront negotiation.

Benefits

This is a salaried role. In addition, nextmv offers:

Health Care Plan (Medical, Dental & Vision)
Minimum Vacation Policy - (3 weeks minimum)
Stock Option Plan
401k
Home Office Stipend
Parental Leave

This role (and all roles at nextmv) is remote. That being said, all employees should be able to travel to company retreats quarterly (when COVID settles down).

About nextmv

nextmv helps companies automate and optimize even the most complicated operational decisions. The nextmv platform allows any developer to quickly build, test, and deploy models that automate routing, assignment, matching and scheduling.

Our Values

Our values are aspirational and affect everything we do. At nextmv, we hope to instill core attributes and practices into our daily lives. We will work toward these goals together, and help each other along the way.

Community
We act as a group of skilled contributors with diverse backgrounds and a common mission.
We listen to each other to actively instill empathy in ourselves.
We introspect about our actions and their impacts.

Candor
We share information, from company strategy to small insights and feedback.
We collaboratively review our decisions and code using the same process.
We own our mistakes and admit our vulnerabilities.

Focus
We are ambitious and value achievement over status.
We are innately driven to innovate and improve the world.
We apply our time and skills effectively to challenging problems.

Balance
We separate our work from our self-worth to view and improve it objectively.
We don't overwork, and take regular time away to encourage creativity.
We take care of ourselves so we can give our best to our team.

Also, we love animals.

⎘ Copy Link ↗ Visit Link

Senior Site Reliability Engineer, CORE
Netflix
Los Gatos, California, United States
$250,000 to $500,000 a year
May 2020
1 Applicants This Week
More Than 6 Months Old

Job Description

At Netflix, we strive to bring joy to people across the world through amazing stories. As we grow internationally, we are continually enhancing our cloud-based infrastructure to improve our performance, scalability, and reliability.

The SRE team's goal is to ensure customer joy by successfully managing risk and minimizing impact across Netflix. We do this through cross-functional engagement with other engineering teams, managing issues when they happen, as well as promoting reliability and resilience practices throughout the organization.

Outcomes

Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Increase our reliability through establishing guidance and methods of improvement
Form and maintain relationships with internal and external partners
Develop deeper insights and analysis into the quality of experience for our customers

We Value

Curiosity about how complex sociotechnical systems successfully operate at scale when failure is inevitable
People who see influence as their preferred tool for cultivating relationships
Collaboration and continuous improvement
A desire to learn and readiness to teach
Iteration as the path forward

Our Work

Drive incidents to resolution by coordinating with multiple engineering teams
Identify sources of instability in large-scale distributed systems and drive operational excellence
Analyze complex systems from a reliability and resilience perspective
Engage with product teams to diagnose operational surprises and carry forward improvements
Improve reliability and drive down the burden of toil with tooling and automation

Nice to Have

Experience with global, continuous delivery methods
Development with Python, Go, Java, or JavaScript/Node.js
Involvement with incident management and response
Knowledge of cloud platforms like AWS and microservices architecture
Deep network analysis
Linux systems engineering capability

Things that show how we think

Apply ⎘ Copy Link ↗ Visit Link

Site Reliability Engineer
Dollar Shave Club
Los Angeles, CA, United States
$120,000 to $150,000 a year
May 2019
1 Applicants This Week
More Than 6 Months Old

Job Description

For our fundamental philosophy please see our Medium article on the subject.

Work with and contribute to k8s-native infrastructure services to speed and stabilize software delivery and stability.
Write libraries to deliver “free” additions to our common software.
- For example, monitoring and logging built-ins, RPC wrapping and stats display within running binaries.
Maintain and contribute to shared infrastructure services.
- For example, Kafka, k8s clusters, service discovery and internal load balancing.
Write documentation, tutorials and blog posts (both public and internal).
Develop OSS to help define DSC’s technical brand to the open source community
- All systems should be designed at with open source in mind (within reason)
Contribute to DSC’s OSS products (See: https://github.com/dollarshaveclub/psst for an example of SRE developed OSS at DSC)

Perks & Benefits

Relocation assistance may be available
Weekly free lunches
Free DSC grooming products
Dog-friendly office
In-office haircuts, massage, car washes

Apply ⎘ Copy Link ↗ Visit Link

6 of 6 SRE jobs in United States 🇺🇸

Golang SRE Jobs in United States

Create your profile to continue

2-Click Apply

Job Description

Job Description

What You’ll Do:

Who You Are:

Job Description

Do you ever wonder what happens inside the cloud?

We want people who are passionate about building the systems, culture, and processes that will improve the resiliency, reliability, scaling, and performance for cloud services.

What You’ll Be Doing:

What We’ll Expect From You:

Why You’ll Like Working for DigitalOcean:

Job Description

Job Description

Outcomes

We Value

Our Work

Nice to Have

Things that show how we think

Job Description

Perks & Benefits

Get a weekly email with all new Golang jobs