Golang Site Reliability Jobs Paying 150,000 USD a Year
Hand-Picked Golang jobs • Apply directly to companies •
Clear salary ranges
Browse 33 Golang Site Reliability Jobs (6 new this week) in April 2025 at companies like DroneDeploy, Algorithmia and Swish paying at least 150,000 USD per year working as a Senior DevOps Engineer, Software Engineer and Senior Systems Engineer. Last post
Hiring Golang Developers?
Create your profile to continue
48 direct messages sent by companies to developers on Golang Cafe
in the last 30 days
55 developers joined Golang Cafe in the last 30 days
16,238 developer profiles page views in the last 30 days
Get access to exclusive discount on Golang courses up to 25% off
Last developer joined
2-Click Apply
Upload Your CV
Go to your Inbox & Confirm Your Application
10 of 33 Site Reliability Jobs paying at least
150,000 USD per year • Sort by
Date
Senior DevOps Engineer DroneDeploy San Francisco / Los Angeles / Portland, United States / Remote $130,000 to $180,000 a year
October 2018
2 Applicants This Week
More Than 6 Months Old
This job posting is no longer available
Job Description
DroneDeploy is the leading cloud software platform for commercial drones, making the power of aerial data accessible and productive for everyone. Trusted by businesses and individuals in over 140 countries worldwide, we are transforming the way drone users collect, manage and digest impactful data in a variety of industries, including agriculture, real estate, mining and construction. Simple by design and easy to use, DroneDeploy builds revolutionary software compatible with any drone. If you’re excited about drones and want to help us create a simple and seamless experience for drone users across the world, we’d love to hear from you!
The Challenge
The DevOps team is tasked with ensuring the reliability and security of our exponentially scaling platform, while serving as a force multiplier for the rest of the engineering organization. Other teams rely upon our expert guidance to design a product that earns the trust of our users, without slowing down the pace of development. We believe that automation and developer empowerment are the key to creating systems that are reliable and secure by default, while minimizing cycle times. We use a collection of SaaS, open source, and proprietary technologies; whichever provides the right solution and seamless integrations for that piece of the puzzle. Some of the key technologies we leverage include Docker (for code packaging and deployment), Kubernetes (for container orchestration), Ansible (for lightweight config management), and Terraform (to control our cloud infrastructure).
The Role
In this position you will be expected to:
-Have a mind for simplifying unnecessary complexity.
-Empathize with the people who use the systems you build.
-Excel at critical thinking and adapt to new situations.
-Anticipate future problems, without over-engineering the present.
-Share your expertise with others, but never stop learning new things.
We are looking for someone with:
-A depth of knowledge in at least one domain.
-Minimum of 2 years’ experience managing complex systems using software.
-Experience writing and maintaining software applications in languages such as Golang, Python, Ruby, Java, C#, JavaScript, C, C++, etc. (not just scripts, side projects ok).
-Available to work on-site within our San Francisco office, or work remotely on Pacific Standard Time hours.
-Familiarity with configuration management systems (e.g. Ansible, Puppet, Chef, Salt, Terraform, CloudFormation).
-Experience solving difficult problems with a scripting language (e.g. Bash, Ruby, Python) in a Linux environment.
Bonus points:
-Experience with container technology (Docker/cgroups/LXC/etc) and container orchestration (Kubernetes/Mesos/CloudFoundry/etc).
-Experience with major cloud providers (AWS, GCP, Azure, etc).
Life at DroneDeploy
We’re a team of star wars loving, hot sauce eating, tech enthusiasts with inspirational talents. Everyone is empowered to explore and implement new ideas and improvements. We enjoy our collaborative office environment and encourage each other to push boundaries. We host weekly Friday night BBQs on our rooftop deck, offer great salaries, generous equity,100% employee health coverage, unlimited vacation and delicious catered meals among other perks.
Software Engineer Algorithmia Seattle / San Francisco, United States / Vancouver, Canada / Remote $100,000 to $150,000 a year
August 2018
2 Applicants This Week
More Than 6 Months Old
This job posting is no longer available
Job Description
Software Engineer (Production & Deployment)
Seattle, Vancouver, NYC, or Remote
Empower large enterprise to run AI/ML at scale, leveraging the best in modern distributed systems and automation technology
Join a truly remote-friendly company - work anywhere in the US or Canada including your sofa, the beach, or our Seattle waterfront office
Experience rapid growth in the first AI startup to be funded by Google
Algorithmia automates, optimizes, and accelerates every step of the journey to deploying of AI/ML at scale. We allow anyone to run models on massively parallel infrastructure in minutes instead of months. In our cloud or your datacenter - all completely managed for maximum performance at minimum cost. Already trusted by over 60k developers and major enterprise customers, Algorithmia makes scalable Machine Learning fast, simple, and cost-effective for everyone.
Undergoing enormous customer growth, we’re rapidly scaling our Customer Operations team to meet demand. We’re looking for talented Software Engineers to join a passionate, distributed group that's driving the design, deployment, and optimization of Algorithmia with our Enterprise customers. This unique role is a broad mix of automation, DevOps, infrastructure engineering, and software development - offering an unparalleled opportunity to learn, grow, and impact the most important financial institutions, intelligence agencies, and private companies in the country.
As a Software Engineer on the Customer Operations team at Algorithmia, you will:
Deploy Algorithmia Enterprise into Fortune 500 and Government environments
Design, build, and maintain the automation and infrastructure needed to deliver Algorithmia effectively, and to help us achieve even greater scale
Work cross-team to ensure Algoritmia supports unique customer environments, and to design solutions to meet specific customer needs
Eventually automate your role out of existence - then join us in doing something even more amazing
Handle the highest-tier of engineering support for AI/ML leaders
Have a real career plan, with mentorship and fast-track opportunities to promotion, technical leadership, people management, or wherever your interests may be
Work from anywhere in the USA or Canada. We have teams in Seattle, NYC, Vancouver BC, Nova Scotia - or go 100% remote from home (Snuggie, bunny slippers, and all - no judgement!)
And we might make the perfect match if you:
Want to work with modern cloud technologies and large scale distributed systems
Have experience multiple languages (Java, Scala, Go, Python, Bash, etc.), deployment tools (Docker, Kubernetes, Ansible, Terraform, etc.), and cloud providers (AWS, Azure, GCP, OpenStack, etc.)
Are passionate about automation, and believe nothing should ever be done manually twice
Enjoy working with customers to deliver solutions that meet business need, empower engineers (and data scientists!), and solve real-world problems
Feel most comfortable in hybrid roles that blur the line between Developer, Site Reliability Engineer, Deployment Engineer, Solutions Architect, and Consultant
Bonus points for a love of data science, any kind of AI/ML experience, interesting public code, or the implementation of something cool on our AI marketplace (hint: free trial!)
As a Software Engineer at Algorithmia you’ll join a passionate team that’s changing the way everyone uses AI and ML. You’ll solve real problems, make an impact, and work in a flexible environment that encourages you to follow your own interests as well. You’ll be welcomed into an intelligent, quirky, and diverse group and gain access to fantastic perks beyond just salary, equity, and insurance benefits - all from the comfort of your own sofa (or our dog-friendly office).
If this sounds like you APPLY NOW, or learn more at algorithmia.com
Algorithmia is an equal opportunity employer and we value diversity at our core. We will never discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status and encourage everyone to apply.
Senior Systems Engineer Swish Toronto, Canada / San Francisco, United States / Remote $80,000 to $160,000 a year
October 2018
2 Applicants This Week
More Than 6 Months Old
This job posting is no longer available
Job Description
Swish is a world-class development studio whose teams have delivered products for Google, Microsoft, Kik, Nasdaq, Factset, and other major enterprises. The blockchain engineering team at Swish is looking for talented distributed systems engineers to optimize protocol transaction throughput and network reliability for blockchains in development.
Our mission is to bring the promise of a decentralized blockchain-based future to reality for clients.
As a systems engineer, you will work with our blockchain developers, protocol researchers and clients to implement and improve on a byzantine fault tolerant blockchain architecture based on the Tendermint consensus layer, by increasing the throughput, reliability and stability of the network. This role is ideal for engineers who have experience optimizing performance and robustness of distributed systems, and are excited to be working on the cutting edge of high-performance blockchain protocol development.
You might have experience as an Unix/Linux distributed systems engineer optimizing performance and reliability for large-scale cloud servers, and be relatively new to blockchain and distributed consensus protocols. Or you might be a blockchain engineer who is very familiar with distributed consensus protocols like delegated proof-of-stake, and newer to working on low-level performance optimizations. Experience with Tendermint is a huge plus. Either way, you are a great detective and passionate about pushing the performance of your infrastructure to its limits, without compromising on safety or stability.
We are also looking for:
Strong communication skills.
Experience with performance and load testing.
You should be motivated by a desire to solve the most important problems, obtain unprecedented results, and push your methods to their maximal performance.
Responsibilities
* Optimize Tendermint consensus protocol codebase for speed, reliability and performance, including making PRs as needed to the OSS Tendermint project
* Troubleshoot reliability issues of distributed systems, e. g. connection losses between Tendermint nodes under heavy load
* Monitor the infrastructure and blockchain performance to identify issues
* Measure and improve server response times in different conditions and environments
* Guide protocol design decisions
Requirements
* 1+ years experience with Golang, C or C++
* 4+ years of experience in a systems engineering role
* Deep experience with networking and concurrent computing
* Deep experience with Unix/Linux systems
* Experience with AWS/GCP
* Comfortable operating in dynamic environments
Bonus Points
* Background in networking or distributed systems
* Familiarity with Cosmos / Tendermint
* Proficiency in protocol-level blockchain development
* Contribution to open source software
* Degree in STEM field, especially software engineering or computer science related.
* Experience in small startup environments helping large enterprises.
* Experience with a distributed team
About Swish
Launched in February 2013, Swish is a fast-growing business with an innovative working culture and teams spanned across the world with teams in Toronto, San Francisco, Berlin, Auckland, Bruxelles, Medellin, and more.
We create products for successful business using cutting-edge technologies: Blockchain, Machine Learning, and Apps Dev. Working with Swish puts you in contact with prestigious brands, wherever your base is. We are a 100% remote-work company because we believe it is everyone’s choice to live and work the way they prefer.
Work is organized in sprints - 2 weeks periods to which, as a member of our talent community, you choose to commit. You always have the choice to accept or decline a sprint, or take-on multiple sprints simultaneously.
We let members choose what suits them best depending on their current situation: family, travel, studies, finance. We know life is not linear and we respect the humans behind the screens.
Our work ethic relies on six core values: Transparency, Directness, Meritocracy, Autonomy, Responsibility, Continuous Learning.
Ensuring a diverse and inclusive workplace where we learn from each other is core to our values. We welcome people of different backgrounds, experiences, abilities, and perspectives. We are an equal opportunity employer and a fun place to work.
Sr. Full-Stack Developer TV Time Santa Monica, United States $115,000 to $155,000 a year
October 2018
1 Applicants This Week
More Than 6 Months Old
This job posting is no longer available
Job Description
There's been a massive shift in television over the past several years, both in quality of content and the way we all consume it. TV Time is at the center of this transformation. Our product enables well over two million active users all over the world to track, discover and discuss their favorite shows, no matter what, when or how they’re watching. It has quickly become the go-to product for cord cutters, streamers, bingers and premium subscribers alike to organize and connect around their passion for television.
In addition to providing a valuable service to fans, TV Time is building an immense data business. The behavioral and sentiment information we collect has become invaluable to content producers, networks and advertisers. Our billions of first-party insights allow them to discover insights and trends they can’t from any other source, which is driving strategic decisions across their businesses.
If you’re the sort of person who can discuss your favorite TV shows for hours and have the passion to be a part of small, well-funded team that’s building something monumental, you just might have found what you’re looking for.
DESCRIPTION
As our Sr. Full Stack Engineer, you will direct the technology and implementation associated to our various web projects. These projects include a rebuild of our main site tvtime.com, a progressive web app to compliment our iOS and Android app, and the transition to microservices to support the aforementioned three api clients. We need a strong full stack engineer who has done it all and understands how to build reliable and robust systems that integrate seamlessly with one another.
WHAT WILL YOU DO?
- Lead the effort to rebuild our site, tvtime.com
- Build a new CMS system to support publishing content to our site and apps
- Build progressive web app to compliment our native iOS and Android apps
- Build microservices for all 3 platforms
- Implement best web development practices
WHAT DO YOU NEED?
- 3-5 years of full stack development experience with high traffic sites
- Backend technologies NodeJS, PHP, Python, or Golang
- Frontend technologies React, Angular, or VUE.JS
- CSS3, HTML5, Sass, Less, or Gulp experience
- JAM Stack or MEAN Stack experience
- NO-SQL and MySQL experience
- Caching layer using Redis, Memcache, Nginx, or Varnish
- AWS/Cloud Experience
- Lambda and Serverless Architecture
- Microservices Experience
- Extensive experience working in unit testing frameworks and proper testing
ADDITIONAL PERKS
- Stock Options
- Full Health Benefits
- Unlimited Vacation time
- Fully Stocked Kitchen
- Team Lunch Weekly And Special Events
- Tuition Reimbursement Program
- Free Fitness Classes In The Office
- 5-minute Walk From An Expo Line stop
Location: Remote (EU, UK, US, Canada, South America)
About us
At SlashID, we are rethinking the way companies manage identity and authentication, giving users a better experience while respecting their privacy and keeping their data safe.
At the core of our system are encrypted user identities, with API-based modules built on top, which accomplish tasks such as authentication, authorization, ID verification and many others.
SlashID’s products are on our customer’s critical path and most of them require 99.99% uptime, so reliability and security are key to our engineering culture.
Last but not least, we are a young startup. We work with tight deadlines, lean processes and ambitious roadmaps. We are a small, tight-knit team who strives to succeed in a competitive environment.
About the role
We’re looking for people with a strong technical background and a passion for building highly scalable and reliable systems. You’re a good fit if you are comfortable dealing with complex distributed systems, have exquisite attention to detail, and enjoy learning new technologies.
SlashID is remote-first and we offer flexible working arrangements to help our team manage their daily lives in the way that works best for them.
Please note: the exact level of the role (Senior or Principal) will depend on your experience and interview performance.
You will:
Design, build and maintain SlashID’s products, services and features
Be part of the engineering team working on our Authentication, Data Vault and User Management services
Use and adapt state-of-the-art cryptographic libraries and primitives
Build tooling to monitor and analyze SlashID’s services, both in terms of performance and security
Write technical documentation, blogs and guides
Work with other highly motivated engineers who all have an intrinsic drive to make things better
Use your passion for technology to ensure our platform operates flawlessly 24/7
Have broad exposure to our entire architecture
You'll use:
Go (Golang)
Hardware Security Modules (HSM)
Tink
GCP
Terraform
Docker
Redis
Postgres and MySQL
You are a good fit if you:
Have a strong understanding of reliability practices, distributed systems, and cloud native architectures
Have experience as a cloud or backend engineer for a multi-tenant large scale mission critical system
Have a thorough understanding of engineering best practices, including appropriate testing paradigms, effective peer code reviews, resilient architecture
Have a good understanding of multi-threading, concurrency, and parallel processing technologies
Have experience producing high-quality technical documentation for the products you develop
Love building secure software, leveraging the latest cryptographic technology and methodology
Thrive in a fast-paced, test-driven, collaborative, and iterative environment
Have a passion for reliable and performant systems, and care deeply about user experience
Enjoy working with a diverse group of people with different backgrounds and expertise
There are more than 700,000 active installations of Grafana around the globe, monitoring everything from beehives to climate change in the Alps. The instantly recognizable dashboards have been spotted everywhere from a SpaceX launch and Minecraft HQ to Wimbledon and the Tour de France. Grafana Labs also helps companies including Bloomberg, JPMorgan Chase, and eBay manage their observability strategies with full-stack offerings that can be run fully managed with Grafana Cloud, or self-managed with Grafana Enterprise Stack. The Grafana stack has grown to include two other open-source projects, Grafana Loki (for logs) and Grafana Tempo (for traces)
About Grafana Cloud:
Our Grafana Cloud pipeline moves millions of data points, log lines and traces per second from our customer's environments into a highly available, low-latency stack that processes and stores the data, and serves it to dashboards and alerting tools. We aim to grow this to hundreds of millions per second, and it's critical that as we grow, we improve our performance, increase our reliability and do it all more efficiently.
Backend engineering roles at Grafana require engineers with a passion for performance, reliability, and who enjoy taking projects from conception to production.
Since we deploy production services, we have on-call rotations to ensure the health of the system. We dogfood our own services, so being on call is an important way to understand our system and how to use the products we create.
Our culture is one of remote-first, and our engineering organization is largely remote. We provide guidance and meet regularly using video calls, and we need people who can work independently and can communicate well.
We care deeply about open source and the projects generally are open source, check them out: https://github.com/grafana.
We primarily use Go.
Requirements:
* You are familiar with programming languages like Go, C, C#, C++, Java or Rust
* You are able to write clean, robust and performant software
* You have experience with network programming or distributed systems development
Nice to haves:
Familiarity with operations/SRE
Experience with the monitoring space in general (metrics, logging, tracing, observability)
Familiarity with time-series applications and concepts, especially Graphite or Prometheus
Experience with Kubernetes / Kafka / Cassandra / Bigtable / syslog / opentracing or similar technologies.
Benefits:
Flexible hours
The equipment you need to get the job done
Generous vacation policy of 30 days per annum with national holidays in your country of residence on top
Grafana operates in 44+ countries. We try to operate as one team and focus on global benefits which our whole team can enjoy. Inevitably there are some regional variations and we discuss the benefits offered in your country of residence through our interview process.
We offer a competitive healthcare plan (Medical, Dental & Vision) for our US based employees via our co-employer JustWorks.
We offer a 4% employer contribution match on our 401K/pension plans or a one time 4% salary increase after 6 months tenure depending on your location
Our hiring process:
Video chat with one of our Talent Managers (30 mins)
Video chat with a Hiring Manager (30 mins)
Live Coding Interview with 2 Engineers (60 mins)
Systems Design focused interview (45 mins)
Equal Opportunity Employer- At Grafana Labs we’re building a company where a diverse mix of talented people want to come, stay, and do their best work. We know that our company runs on the hard work and the dedication of our passionate and creative employees.
We will recruit, train, compensate and promote regardless of race, religion, colour, national origin, gender, disability, age, veteran status, and all the other fascinating characteristics that make us different and unique. We believe that equality and diversity builds a strong organisation and we’re working hard to make sure that’s the foundation of our organisation as we grow.
Back-End Software Engineer Conductor New York, United States $100,000 to $150,000 a year
October 2019
3 Applicants This Week
More Than 6 Months Old
Job Description
We are looking for an experienced backend software engineer to join our exceptional team of distributed systems engineers. We have embarked on a mission to create highly-scalable and performant micro-services that handle tera-bytes of data and analytics that feed into beautifully designed custom data visualizations. You will be responsible for architecting, implementing and testing new and existing systems that power our flagship product and help humanize the way content marketers interact with customers. Most importantly, your leadership as a senior and experienced member of the team will become an example for others to follow, and you will help shape and define our engineering practices and craftsmanship.
Our Engineering Values
Collaboration: We believe that engineers do their best work when working together in cohesive teams.
Excellence: We believe in doing things the "right way" rather than the "fast way", and holding ourselves to a high standard of excellence.
Growth: We believe engineers do their best work when they are constantly growing, learning, and changing.
Communication: We believe in combining empathy with openness and honesty to set clear expectations and hold each other accountable.
Impact: We believe we're making the world a better place by empowering marketers to really help their customers rather than just sell stuff.
What you'll be doing
Writing beautiful API documentation for all the services you and the team builds for consumption by other developers (we use Swagger).
Design and architect scalable services that are both highly available and performant.
Breaking down product requirements into manageable stories and delegating work to team members.
Refactoring and evolving existing distributed systems to make legacy systems anew.
Collaborating with other functions of the business and engineering department to ensure successful delivery of software.
Who you are:
A bachelor's degree or higher in Computer Science or related field
8+ years working in backend technologies in addition to Java such as Scala, Python, Golang or a combination of
8+ years working in databases such as MySQL, Postgres or other RDBMS systems
8+ years working with cloud-hosting and deep understanding of AWS, Google Cloud or Azure
3-5+ years working with Docker containers and deep understanding of the internals of how containers work
3-5+ years working with non-relational databases such as Cassandra or Dynamo
3-5+ years building and deploying containers into managed clusters with Kubernetes or equivalent
Exceptional verbal and written communication skills
Proven professional experience working with non-technical business functions
You enjoy collaborating with product management teams to ensure the best quality product is shipped out the door
You have a keen appreciation for usability and understand your users come first when designing systems
Why You Should Join
At Conductor, we are looking for engaged and passionate engineers that can raise the bar. There is a tremendous opportunity here to have immediate impact in the day-to-day and affect the company's success and growth. We all share in the same values and push each other to meet these standards.
We are made up of a diverse group of people from all backgrounds and include a team of exceptional engineers in Kyiv as well. Wherever you are from, you will find a common ground here for continuing to push forward your career and make a difference in this industry.
Conductor, Inc. is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
About Conductor
Conductor’s search and content intelligence platform helps marketers create and optimize content to improve visibility online.
The technology generates customer intent insights that lead to compelling content, increased traffic, and higher organic marketing ROI. Customizable dashboards and workflows guide marketers through the content creation process, empowering them to measure, refine, and demonstrate the effectiveness of their SEO and content marketing efforts.
In addition to its SaaS platform, Conductor offers a suite of services and support including site audits, site migrations, and managed services that empower in-house marketing teams and digital marketing agencies to drive results and put their customers' needs first.
Conductor's forward-thinking customers include global and emerging enterprise brands like Citibank, Salesforce, ClassPass, and WeWork.
Chestnut, backed by a16z, is applying modern solutions to transform a $7B industry. Our customizable, easy-to-use producer management and compensation platform is purpose-built to address the insurance industry’s unique challenges.
We help customers break free from legacy constraints, static compensation models, and manual processes—turning commissions into a dynamic revenue driver that unlocks growth.
We’re tackling one of the hardest problems in insurance. If you love solving complex challenges and want to work directly with a founding team building something groundbreaking, let’s chat!
Engineering at Chestnut
Chestnut is, at its core, a technology company, and we are building the best team! We are looking for engineers who are excited to be part of our early story and who want to build a transformational company. We hire engineers who have a broad set of technical skills, are highly cross-functional, and are eager to solve a wide range of engineering challenges. Our ideal candidate has a strong sense of ownership and enjoys owning projects from inception to scaling it in production. We value people who take pride and ownership in their work and who show an aptitude for learning quickly. As an early employee, you will be working with a nimble team of committed and talented engineers and will have a large, long-term impact on technical design and engineering culture.
About Your Role
We are looking to expand our founding engineering team. Our team is inclusive, transparent, and takes large ownership in driving features from 0 to 1. We are looking for someone who is equally interested in developing robust APIs to stand the test of time as they are in developing innovative solutions to solving tough data modeling or UX challenges.
Your Responsibilities
Architect & Build – Design, develop, and deploy scalable software solutions from scratch
Full-Stack Development – Own backend and frontend systems, ensuring seamless performance
Technical Leadership – Drive engineering decisions, set best practices, and mentor as the team grows
Rapid Iteration – Prototype, ship, and refine features based on user feedback
Scalability & Performance – Ensure system reliability and efficiency as the platform scales
Infrastructure & DevOps – Manage cloud infrastructure, CI/CD pipelines, and security best practices
Cross-Functional Collaboration – Work closely with founders, product, and design to shape the roadmap
Your Qualifications
3+ years of experience as a backend and/or full-stack engineer
Strong understanding of data structures, algorithms, and software design principles
Expert-level knowledge of Golang programming language and ecosystem
Familiarity with containerization and orchestration technologies like Docker and Kubernetes
Experience working with Git and writing technical specs
Experience working with gRPC and Protocol Buffers
Bachelor’s and or Master’s degree in Computer Science or another STEM field (or equivalent work experience)
An entrepreneurial spirit - you have or have always wanted to start a company
Bonus Points
Worked at an early stage (Seed or Series A) company, and/or a company that services the insurance industry
Familiarity with TypeScript / React or similar frameworks
Experience managing ETL data pipelines
Experience with general ledgers and double entry accounting
Experience with Terraform or other IaC equivalent technologies
Benefits
Competitive salary and equity, with 10 year exercise window for stock options
Remote-first work culture
Quarterly offsites for all of us to bond
Unlimited PTO with 4 weeks recommended per year
Top notch health, dental, and vision insurance subsidized by us
Backend (Go) Engineer Fleet Remote (Americas timezones) $100,000 to $180,000 a year
January 2022
1 Applicants This Week
More Than 6 Months Old
Job Description
Let's start with why we exist. 📡
Ever wondered if your employer is monitoring your work computer?
At Fleet, we think it's time device management went open source.
Why should you join us? 🛸
Work from anywhere with good internet. (We're 100% remote. No office. No commute.) Everyone works remote, but you don't feel remote. There is no headquarters. You are free to travel and move.
Fleet can offer you a competitive salary, significant equity, and an independent, outsider-friendly culture. Work with helpful, kind, and motivated people who know what they're doing.
At Fleet, we value focus, iteration, and meaningful results – not 60 hour work weeks. We are non-judgmental and laser-focused on growing the company.
Work closely with experienced, well-funded founders and a great team, including the people who created osquery and Sails. We care about openness and transparency.
Work computers can be private and safe. Help make endpoint monitoring less intrusive and more transparent.
Protect the production servers and employee laptops of Earth's largest companies. Work on a product used by lots of people who care about what you do.
Fleet is growing quickly, with significant revenue from Fortune 1000 customers. You will have lots of opportunities to make decisions, learn, and try new things.
Responsibilities 🔭
Fleet’s server is written in Go with go-kit. Deployments range from single servers to over 100,000 osquery clients connected to horizontally scaled Fleet servers, handling tens of thousands of requests per minute. We aim to keep Fleet’s deployment as simple as possible to ease self-hosted deployment. MySQL and Redis are used for persistence and caching.
Profile and optimize the performance of the Fleet server (along with MySQL and Redis queries) to improve reliability and increase the upper limits of deployment sizes.
Work with Fleet’s product team, customers, and the wider open-source community to improve IT and security workflows.
Mid-level to senior engineering experience (4+ years) with backend or full-stack software engineering.
Experience building scalable, production quality servers.
Comfort with server and SQL performance profiling and optimization.
Experience with Redis and/or SQL databases. (Particularly MySQL or MariaDB.)
Experience building, deploying, and operating production web servers and APIs.
⏰ Your work hours have significant overlap with Americas time zones.
🗣️ You have great written and oral communication skills, especially in English.
🔩 You are competent with source control in Git. You use issue trackers and other worthwhile processes to get more meaningful work done.
You can mentor other developers and do code reviews. Maybe you managed open source projects before; maybe you collaborated closely with more junior engineers at work. You understand the importance of promoting a positive engineering culture.
Bonus: Experience programming with Go and go-kit.
Bonus: Experience working with Mobile Device Management (MDM) APIs.
Bonus: Experience deploying/monitoring/managing containers with Docker/K8s.
There are more than 700,000 active installations of Grafana around the globe, monitoring everything from beehives to climate change in the Alps. The instantly recognizable dashboards have been spotted everywhere from a SpaceX launch and Minecraft HQ to Wimbledon and the Tour de France. Grafana Labs also helps companies including Bloomberg, JPMorgan Chase, and eBay manage their observability strategies with full-stack offerings that can be run fully managed with Grafana Cloud, or self-managed with Grafana Enterprise Stack. The Grafana stack has grown to include two other open-source projects, Grafana Loki (for logs) and Grafana Tempo (for traces)
This is a remote position, and we are considering candidates in the North, Central & South Americas regions.
About Grafana Cloud:
Grafana Cloud is our composable observability platform that integrates metrics, logs, and traces with Grafana. It allows our customers to leverage the best open source observability software – including Prometheus, Cortex, Loki, and Tempo – without the overhead of installing, maintaining and scaling their own observability stack.
Our Grafana Cloud pipeline moves millions of data points, loglines, and traces per second from our customers’ environments into a highly available, low-latency stack that processes and stores the data, and serves it to dashboards and alerting tools. We aim to grow this to hundreds of millions per second, and it's critical that as we grow, we improve our performance, increase our reliability, and do it all more efficiently.
You would be joining one of our Cloud squads, whose responsibilities span from adapting and delivering our open-source offerings to a cloud environment that can support millions of users, writing software that allows those users to easily send data from within their infrastructure, or helping to build monitoring and alerting solutions.
Our tech stack is mostly made up of services written in Go, running on multiple Kubernetes clusters that leverage Google’s Cloud Platform.
Our culture is remote-first and our engineering organization is largely remote. We provide guidance and meet regularly using video calls, so an independent attitude and strong communication skills are a must
Within 1 month you will be able to:
Gain a deeper understanding of our cloud product and our customers
Get to know the codebase and contribute to our growing list of third-party integrations
Participate in ongoing design discussions that allow us to collaborate on and inform our technical decisions
Significantly contribute to a major initiative in our roadmap
Within 3 months you will be able to:
Take an active role in shaping our roadmap and your own career objectives
Drive a project from initial ideation all the way to operations once it is in the hands of customers
Embrace our open-source culture and contribute to other projects that may not directly fall within your team’s scope
Be a part of your team’s on-call rotations and take ownership of the services you’re running
Requirements:
You are familiar with programming languages like Go, C, C#, C++, or Rust
You are able to write clean, robust, and performant software
You have experience with network programming or distributed systems development
Nice to haves:
Familiarity with operations/SRE and the concept of infrastructure as code
Experience with the observability space in general (metrics, logging, tracing, monitoring, alerting)
Experience with Kubernetes / Kafka / Cassandra / Bigtable / syslog / opentracing or similar technologies
Familiarity with time-series applications and concepts, especially Graphite or Prometheus.
Equal Opportunity Employer- At Grafana Labs we’re building a company where a diverse mix of talented people want to come, stay, and do their best work. We know that our company runs on the hard work and the dedication of our passionate and creative employees.
We will recruit, train, compensate and promote regardless of race, religion, colour, national origin, gender, disability, age, veteran status, and all the other fascinating characteristics that make us different and unique. We believe that equality and diversity builds a strong organisation and we’re working hard to make sure that’s the foundation of our organisation as we grow.