Remote Site Reliability Engineer Jobs

Explore 51 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Latest Site Reliability Engineer Jobs (51)

IT Cloud Operations Engineering Specialist (Remote)

2 days ago
Full-time
United States
Key requirements: 5 years of experience, Cloud infrastructure management, AI integration in operations, VMware vSphere, Cost optimization in cloud environments, SaaS deployment, Automation scripting, IP Networking, Network Security, Linux and Windows application maintenance, Database Administration, Configuration management tools, High-availability environment troubleshooting, Data integrity monitoring, Cloud resource lifecycle management, Performance optimization dashboards
Geotab

Geotab is a Canadian technology company headquartered in Oakville, Ontario, specializing in B2B SaaS fleet management and telematics solutions for the global transportation and logistics markets.

Remote policy: Geotab operates under a flexible hybrid working model, allowing employees to work both in-person and remotely. While the company supports remote work, candidates must have US residency, as many roles are focused on North America.

Senior SRE Engineer

3 days ago
Full-time
United Kingdom, Spain, Poland, Portugal, Romania, Serbia, Lithuania, Georgia, Armenia
Key requirements: 4 years of experience, Kubernetes, ArgoCD, GitHub Actions, VictoriaMetrics, Loki, OpenTelemetry, HashiCorp Vault, Shell scripting, Python, Golang, B2 English proficiency
P2P

P2P.org is a leading fintech company specializing in cryptocurrency staking solutions, headquartered remotely, serving a global market with a focus on decentralized finance.

Remote policy: P2P.org is a fully remote company, hiring talented individuals from various regions around the world to foster a diverse and inclusive team.

DevOps/IT Engineer

7 days ago
Full-time
Europe
Key requirements: 5 years of experience, Azure DevOps, PaaS-based cloud infrastructure, Cloud security principles, Identity and access management (IAM), Bicep, CI/CD pipelines, Monitoring and observability platforms
Skedda

Skedda is a global B2B SaaS platform headquartered in an unspecified location, specializing in workplace management solutions for diverse sectors including IT, education, and finance.

Remote policy: Skedda hires remotely from various locations, with some roles, such as the QA Automation Engineer, specifically available within Europe (CET).

Staff Site Reliability Engineer

8 days ago
Full-time
United States, Canada, United Kingdom, Ireland, Finland, Singapore, India
$150,000 to $225,000 per year
Key requirements: 8 years of experience, SaaS systems at scale, Python or Go, AWS, GCP, or Azure, Kubernetes, Networking fundamentals, Monitoring & alerting (Prometheus, Grafana, Datadog, ELK), Advanced observability (OTEL, continuous profiling), Incident management, Troubleshooting across the full stack
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Intermediate Site Reliability Engineer, Database Operations

8 days ago
Full-time
Worldwide
Key requirements: PostgreSQL in high-growth environments, Infrastructure automation (Chef, Ansible, Puppet, Terraform), Observability stack development, SQL and PL/pgSQL, Large SaaS distributed systems experience, Data modeling and data structure design, Proactive problem-solving, Database infrastructure automation, Monitoring and alerting best practices, Self-service tools for engineers
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Staff Site Reliability Engineer-Federal, Security Clearance

8 days ago
Full-time
United States
$119,000 to $170,000 per year
Key requirements: 5 years of experience, Active Secret Security Clearance, Site Reliability Engineering in classified environments, Monitoring activities (vulnerability scanning, patch management), Proficiency in Linux administration, Automation tools (Ansible, Terraform), Python coding, Container-based architectures (AWS ECS, Kubernetes), Experience with air-gapped environments, High/Moderate FedRAMP authorization levels
Zscaler

Zscaler is a San Jose-based B2B cloud security company specializing in SaaS solutions for zero trust architecture and secure access service edge (SASE), serving enterprise customers globally.

Remote policy: Zscaler supports remote work and hires globally, with team members located in various regions, allowing for collaboration across time zones.

Senior Infrastructure Engineer (Core)

10 days ago
Full-time
Worldwide
Key requirements: 3 years of experience, Kubernetes management, Configuration languages (Ansible, Puppet, Terraform), CI/CD infrastructure design, Container technologies, Basic networking protocols (DNS, HTTP, TLS), Service discovery and service mesh architectures
Telnyx

Telnyx is a global telecommunications B2B company specializing in cloud-native VoIP solutions and connectivity applications, headquartered in an undisclosed location, serving businesses worldwide.

Remote policy: Telnyx supports remote work and hires from various regions globally, with team members located in over 145 countries. This allows for collaboration across different time zones.

Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy

10 days ago
Full-time
Worldwide
$90,000 to $120,000 per year
Key requirements: Kubernetes, Release processes, Deployment strategies, Application observability, Large scale systems, Automation tools, Product development mindset
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Senior Site Reliability Engineer, US Public Sector Services

11 days ago
Full-time
United States
$124,300 to $266,400 per year
Key requirements: Terraform, Kubernetes, Production-scale operations, Infrastructure automation, Cloud security best practices, GitLab platform proficiency, Incident response leadership, Multi-tenant infrastructure management
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Senior Engineer, Site Reliability Engineering, Digital Banking

11 days ago
Full-time
United States
$130,000 to $150,000 per year
Key requirements: 4 years of experience, AWS, Kubernetes, Kafka, Observability platforms, Python, Node.js, Linux administration, Chaos engineering, Distributed systems, SQL, Open Telemetry
Forbright Bank

Forbright Bank is a Chevy Chase, MD-based B2C full-service bank specializing in personal banking products with a strong commitment to sustainability and community support.

Remote policy: Forbright Bank offers flexible remote or hybrid work schedules for most positions, with team members located across various regions. Specific hiring locations are not defined, but the company supports a collaborative work culture.

Senior Site Reliability Engineer - Midnight

12 days ago
Full-time
Worldwide
Key requirements: AWS, Kubernetes, GitOps, CI/CD, Prometheus, Blockchain experience, Incident response, SLOs/SLIs, Problem-solving, Technical communication
IO Global

IO Global is a fully remote technology company specializing in blockchain research and development, focusing on decentralized solutions for a global market.

Remote policy: IO Global is a fully decentralized remote-first company hiring globally, with team members across more than 50 countries, fostering collaboration among diverse, distributed teams.

Senior Site Reliability Engineer

12 days ago
Full-time
Canada, Mexico
Key requirements: Kubernetes, GCP, AWS, Azure, Terraform, Github Actions, ArgoCD, Multi-cloud environments
Cyberhaven

Cyberhaven is a remote-based B2B cybersecurity company specializing in AI-powered data protection and advanced threat detection for organizations globally.

Remote policy: Cyberhaven supports remote work and hires from various regions, with team members located globally. The company offers flexibility for employees to work from home or the office, depending on their preference.

Senior Site Reliability Engineer (Security Clearance)

13 days ago
Full-time
United States
Key requirements: Active Security Clearance, Cloud Infrastructure (GCP, AWS, Azure), Infrastructure as Code (Terraform, Ansible, CloudFormation), Docker and Kubernetes, Python, Monitoring and Observability (Prometheus, Grafana, ELK stack), CI/CD Pipeline Development, SRE and DevOps Experience
Trase Systems

Trase Systems, Inc. is a Delaware-based B2B AI technology company specializing in deploying autonomous AI agents to optimize enterprise operations, primarily targeting healthcare and other high-value industries.

Remote policy: Trase Systems is a fully remote company, hiring globally from various locations, with team members working from 'Anywhere'. Some roles may require occasional travel.

Senior Site Reliability Engineer (Security Clearance)

13 days ago
Full-time
United States
$120,000 to $160,000 per year
Key requirements: Active Security Clearance, Cloud Infrastructure (GCP, AWS, Azure), Infrastructure as Code (Terraform, Ansible, CloudFormation), Docker and Kubernetes, Monitoring and Observability (Prometheus, Grafana, ELK stack), CI/CD Pipeline Development, Problem-Solving Skills, Communication Skills
Red Cell Partners

Red Cell Partners is a Los Angeles-based incubation firm specializing in building and scaling technology-led companies in healthcare, cyber, and national security, primarily serving B2B markets.

Remote policy: Red Cell Partners primarily operates with an on-site workspace model, hiring mainly from the United States, specifically in locations such as California and Washington.

Senior Site Reliability Engineer, Database Operations

14 days ago
Full-time
Worldwide
$120,000 to $180,000 per year
Key requirements: PostgreSQL at scale, Infrastructure automation, Large SaaS distributed systems, SQL and PL/pgSQL, Data modeling and data structure design, Automation of database infrastructure, Self-service tools for engineers, Proactive problem-solving attitude, Documentation of processes, Collaboration with DBREs and SREs
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Senior Cluster Site Reliability Engineer

17 days ago
Full-time
United States
$205,000 to $235,000 per year
Key requirements: 5 years of experience, HPC frameworks (Slurm, Kueue), Machine learning training systems (Kubeflow, MLflow), Infrastructure-as-code (Terraform, Ansible), Cloud infrastructure (AWS, GCP), Observability stacks (Prometheus, Grafana), Distributed storage technologies (Lustre, Ceph), System engineer mindset, Containerization (Docker, Singularity)
The Voleon Group

The Voleon Group is a Berkeley-based quantitative investment management firm specializing in AI-driven trading strategies for institutional investors in the finance industry.

Remote policy: The Voleon Group offers a hybrid work model, allowing for both in-office and remote work, and hires from various regions including the United States, Canada, and the United Kingdom.

Site Reliability Engineer (SRE) – Cloud Ops Focus (Mexico Only)

17 days ago
Full-time
Mexico
Key requirements: 2 years of experience, AWS, Scripting (Python, Bash), CI/CD pipelines, Containerization (Docker/K8s), Production incident management, Advanced English
Varicent

Varicent is a global B2B SaaS provider of sales and revenue performance management solutions, headquartered in an undisclosed location, empowering revenue leaders across various industries.

Senior CloudOps Engineer

19 days ago
Full-time
United States, Ireland, Czechia
Key requirements: Kubernetes, Zabbix, Prometheus, Graylog, Elasticsearch, Linux administration, Infrastructure as Code, AWS, GCP, Python, Bash
Wrike

Wrike is a San Jose-based B2B SaaS platform specializing in AI-powered enterprise work and project management solutions for professional services, marketing, and software development industries, serving over 20,000 organizations globally.

Remote policy: Wrike supports remote work globally and promotes a hybrid model for team members near office hubs in locations such as San Diego, Prague, Dublin, Nicosia, and Tallinn, with 2–3 in-office days per week.

Site Reliability Engineer

19 days ago
Full-time
United States
Key requirements: 5 years of experience, AWS or GCP, Infrastructure as Code, Docker, Kubernetes, Monitoring tools, Python or Bash, CI/CD pipelines, Technical leadership, Problem-solving mindset
edX Boot Camps

edX Boot Camps, part of the edX platform owned by 2U, Inc., is a global online education provider headquartered in Lanham, Maryland, offering B2C intensive boot camp programs in technology and data science to help individuals upskill for high-demand careers.

Remote policy: Tekmetric supports remote work with flexibility for employees, while showing a strong preference for candidates located in the Houston, Texas area.

Senior Site Reliability Engineer

19 days ago
Full-time
United States
Key requirements: Kubernetes, Terraform, Cloud services (AWS, GCP), Scripting (Bash, Python), Incident management framework, Distributed systems design, Cloud native tools (Prometheus, Istio)
Stack AV

Stack AV is a Pittsburgh-based B2B company specializing in AI-powered autonomous trucking solutions for the freight transportation industry, focusing on safety and efficiency.

Remote policy: Stack AV supports remote work through an innovative collaboration model, primarily hiring from various locations within the United States, including 15 states. Team members may be required to travel to the Pittsburgh headquarters periodically.

Site Reliability Engineer (SRE) (Bilingual)

19 days ago
Full-time
Japan
Key requirements: 5 years of experience, Kubernetes, AWS, Python, Java, Go, Observability tools, RDS, NoSQL, Distributed TiDB, Bilingual in English and Japanese, Microservice architecture tuning, High availability solutions, Incident management
PayPay

PayPay Corporation is a Japan-based fintech company specializing in mobile payment solutions, operating a leading app with over 70 million users in the B2C market.

Remote policy: PayPay Corporation supports a hybrid workstyle, allowing for flexible working arrangements that include both remote and office work. Hiring is primarily focused on candidates eligible to work in Japan.

Senior Staff Engineer - Kafka

19 days ago
Full-time
United States
Key requirements: Kafka expertise, Large-scale messaging systems, AWS infrastructure management, High-throughput data processing, Asynchronous communication patterns
Nubank

Nubank is a São Paulo-based neobank and the largest fintech in Latin America, offering a fully digital financial services platform to over 100 million customers in Brazil, Mexico, and Colombia.

Remote policy: Nubank supports remote work arrangements, with team members primarily located in Brazil, Mexico, and Colombia. While remote roles are available, employees may be expected to travel to São Paulo quarterly for team collaboration.

Staff Cloud Engineer - Observability

19 days ago
Full-time
Worldwide
$155,000 to $160,000 per year
Key requirements: 10 years of experience, Datadog, OpenTelemetry, AWS CloudTrail, CloudWatch, Observability strategy, Metrics interpretation, SRE experience, Cloud infrastructure experience, AWS services proficiency, CI/CD pipelines familiarity
NetDocuments

NetDocuments is a US-based B2B SaaS provider of cloud-native document management solutions, primarily serving the legal industry while also catering to financial services and manufacturing sectors.

Remote policy: NetDocuments is a hybrid, remote-friendly workplace, hiring from various regions to support a diverse team. Team members are encouraged to work inspired each day, regardless of their location.

Senior Site Reliability Engineer

19 days ago
Full-time
United States
$102,693 to $215,935 per year
Key requirements: 6 years of experience, Terraform, AWS Fargate, Kubernetes, Multi-tenant architecture security, Observability principles, Prometheus, Grafana, Automated compliance monitoring, Mentoring DevOps engineers
Miris

Miris is a cutting-edge technology company specializing in AI-assisted 3D content streaming, headquartered globally, operating in the media tech industry with a B2B model targeting developers and content creators.

Remote policy: Miris is a remote-first company with a globally distributed team, hiring from various locations around the world to support collaboration across time zones.

Sr. IT Operations Engineer

19 days ago
Full-time
Worldwide
$100,000 to $125,000 per year
Key requirements: 4 years of experience, Okta administration, GCP, AWS, SAML, SCIM, OAuth, RBAC design, AI integration, Okta Workflows, Zapier, JAMF, Google Workspace, Atlassian
LogicGate

LogicGate is a global leader in SaaS-based governance, risk, and compliance (GRC) solutions, offering the Risk Cloud® platform to enterprises in the B2B sector.

Remote policy: LogicGate operates a hybrid workplace, allowing for flexibility based on role responsibilities. The company hires from various regions, supporting a diverse team across different locations.

Senior Security Engineer

19 days ago
Full-time
United States
$120,000 to $175,000 per year
Key requirements: 8 years of experience, DevSecOps, GCP, AWS, CI/CD, Terraform, Pulumi, Kubernetes, SOC 2, GDPR, ISO 27001, Static/Dynamic analysis, Container scanning, Security automation, Cross-functional communication, Security-first mindset
LearnLux

LearnLux is a remote-first fintech B2B SaaS provider specializing in workplace financial wellbeing, offering personalized financial planning and education to enhance employee financial health.

Remote policy: LearnLux is a remote-first company that supports hiring from various locations, with team members currently working remotely across the United States and potentially other regions.

SRE Support Engineer

19 days ago
Full-time
North America
Key requirements: 5 years of experience, AWS, Kubernetes, Docker, Troubleshooting complex distributed systems, Linux administration, Bash, Python, Networking concepts, Technical customer communication, Operational excellence, Empathy under pressure
Virtasant Inc.

Virtasant is a Texas-based B2B technology services company specializing in cloud and AI solutions, operating fully remote with a global workforce across 130+ countries.

Remote policy: Virtasant is a fully remote company with a globally distributed team across 100+ countries, allowing flexibility in work hours. We welcome applicants from various regions, fostering a diverse and inclusive work environment.

L3 Support Engineer

19 days ago
Full-time
Asia, Oceania
Key requirements: 5 years of experience, Azure Cloud Native Services, AWS Cloud Native Services, Kubernetes, Docker, Ansible, Terraform, Entra ID, Okta, ServiceNow, Observability tools, CI/CD Pipelines
Virtasant Inc.

Virtasant is a Texas-based B2B technology services company specializing in cloud and AI solutions, operating fully remote with a global workforce across 130+ countries.

Remote policy: Virtasant is a fully remote company with a globally distributed team across 100+ countries, allowing flexibility in work hours. We welcome applicants from various regions, fostering a diverse and inclusive work environment.

Senior Cloud Site Reliability Engineer

19 days ago
Full-time
United States, Canada
$110,000 to $140,000 per year
Key requirements: 5 years of experience, AWS, GCP, IaC (Pulumi), Container orchestration (ECS, EKS), CI/CD (GitLab CI), Observability (DataDog), Incident response, Linux systems administration, Scripting (Python, Bash, etc.), AWS GovCloud compliance frameworks
Radicle Health

Radicle Health is a B2B SaaS provider headquartered in the U.S. and Canada, offering a suite of human services software products designed to empower organizations in social services, mental health, and poverty relief.

Remote policy: Radicle Health supports remote work and is hiring from various locations, primarily focusing on candidates in the U.S. and Canada.

Staff Software Engineer, Site Reliability

21 days ago
Full-time
United States, Canada
CAD $180,000 to $225,000 per year
Key requirements: 8 years of experience, Terraform, AWS, Docker, Kubernetes, CI systems (CircleCI, Jenkins, GitHub Actions), Monitoring tools (Datadog, Sentry, PagerDuty), Cloud-native systems design, On-call management best practices, Troubleshooting and debugging, High-traffic consumer-facing websites experience
Babylist

Babylist is a leading Los Angeles-based B2C e-commerce platform offering a universal baby registry service for growing families, allowing users to add products from any retailer and providing a comprehensive ecosystem for new parents.

Remote policy: Babylist is a remote-first company with team members located across the U.S. and Canada, hiring from various regions to support collaboration and innovation.

Staff Software Engineer, Site Reliability

21 days ago
Full-time
United States, Canada
$199,200 to $239,040 per year
Key requirements: 8 years of experience, Terraform, AWS, Docker, Kubernetes, High-traffic website support, Cloud-native systems design, CI systems (CircleCI, Jenkins, GitHub Actions), Monitoring and alerting (Datadog, Sentry), On-call management best practices, Strong troubleshooting skills
Babylist

Babylist is a leading Los Angeles-based B2C e-commerce platform offering a universal baby registry service for growing families, allowing users to add products from any retailer and providing a comprehensive ecosystem for new parents.

Remote policy: Babylist is a remote-first company with team members located across the U.S. and Canada, hiring from various regions to support collaboration and innovation.

Senior Infrastructure Engineer

21 days ago
Full-time
United States
$160,000 to $205,000 per year
Key requirements: 5 years of experience, Golang, AWS, Kubernetes, Control-loop style operators, Infrastructure as code, CI/CD pipelines, Custom Kubernetes controllers, Developer tooling
Angi

Angi Inc. is a Denver-based B2C internet services company specializing in connecting homeowners with local home service professionals through a digital marketplace for home improvement and maintenance.

Senior Site Reliability Engineer

22 days ago
Full-time
Worldwide
$158,440 to $188,147.50 per year
Key requirements: 7 years of experience, SRE principles, AWS (EKS, ECS, VPC, S3, ELB), Kubernetes, Terraform, GitHub Actions, Prometheus, Decentralized architecture, Incident management, Infrastructure as Code (IaC)
Ava Labs

Ava Labs is a New York City-based blockchain technology company specializing in the Avalanche Layer-1 platform for Web3 applications and decentralized finance, primarily serving businesses and developers.

Remote policy: Ava Labs supports a global remote work environment, hiring from various regions including Latin America, while also maintaining offices in New York City and Miami. Team members collaborate across time zones, ensuring flexibility in work arrangements.

SRE

22 days ago
Full-time
North America, Europe
Key requirements: 5 years of experience, AWS services, Infrastructure as Code, Terraform, Python, High volume production systems
Motive

Motive is a USA-based B2B fleet management platform that empowers businesses with tools to enhance safety, productivity, and profitability across various industries, including transportation and logistics.

Remote policy: Motive supports remote work and has a global presence, with team members located in various regions, including North America and Europe. We welcome applications from candidates across multiple time zones.

Senior Security and IT Operations Engineer

22 days ago
Full-time
Croatia
Key requirements: 8 years of experience, Cloud infrastructure management, Identity access management, Mobile device management, VPN administration, Antivirus system management, Google Workspace, Microsoft Office, MacOS support, Jira
Glooko

Glooko is a global healthtech company providing a B2B and B2C diabetes management platform that integrates data from over 200 devices to enhance patient care and improve health outcomes.

Site Reliability Engineer Expression of Interest Form

22 days ago
Full-time
Worldwide
Key requirements: AI integration, DevSecOps familiarity, Remote work adaptability, High-performance culture alignment
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

DevOps/Site Reliability Engineer (SRE)

22 days ago
Full-time
United States
Key requirements: 5 years of experience, SaaS platform experience, DevOps best practices, Cloud platforms (AWS, GCP), IaC tools, Configuration management systems, Monitoring and alerting platforms, Containerized micro-services, Linux proficiency, Startup environment adaptability
Elligint Health

Elligint Health is a New Jersey-based B2B healthcare technology company that provides a data-driven precision healthcare platform for payers and value-based care organizations.

Remote policy: Elligint Health supports flexible remote work arrangements and is hiring from various locations, with a requirement for candidates to reside in the U.S. for all positions.

Infrastructure Engineer/SRE - Canada

22 days ago
Full-time
Canada
Key requirements: 5 years of experience, Kubernetes, Terraform, Golang, Python, GitOps, CI/CD, Container security, AWS, PostgreSQL, Infrastructure-as-code
Cresta

Cresta is a B2B SaaS platform headquartered in an unspecified location, specializing in AI-driven customer experience solutions for contact centers, serving a global market.

Site Reliability Engineer III

22 days ago
Full-time
Spain
€0 to €52,000 per year
Key requirements: Kubernetes (AWS EKS), Time-series monitoring (Cortex), Distributed tracing (OTLP + Tempo), Logging platform (Loki), Self-service infrastructure components, SLIs, SLOs, SLAs definition, Unix knowledge, Networking stack, Containers and schedulers, Observability implementations, Automation mindset, Effective asynchronous communication, Simplicity over complexity (KISS principle)
Cabify

Cabify is a Madrid-based transportation and mobility app offering rideshare, delivery, and corporate services, operating primarily in the B2C sector across Spain and Latin America.

Remote policy: Cabify supports flexible work arrangements, allowing for full remote or partially onsite roles, primarily hiring in various countries across Latin America and Spain.

Senior Site Reliability Engineer

22 days ago
Full-time
Spain
€60,000 to €75,000 per year
Key requirements: Kubernetes, AWS EKS, Cortex, Distributed tracing, Loki, GitLab, Unix, Networking stack, Observability, SLIs, SLOs, SLA, Automation, Effective communication, Diversity and inclusion, KISS principle
Cabify

Cabify is a Madrid-based transportation and mobility app offering rideshare, delivery, and corporate services, operating primarily in the B2C sector across Spain and Latin America.

Remote policy: Cabify supports flexible work arrangements, allowing for full remote or partially onsite roles, primarily hiring in various countries across Latin America and Spain.

Staff Software Engineer - SRE, Backend (Reliability Engineering)

22 days ago
Full-time
United States
$200,000 to $275,000 per year
Key requirements: 7 years of experience, AWS, Kubernetes, Python, Kotlin, Distributed systems, Site Reliability Engineering, Capacity management, Automation, Observability, Change Management, Incident Management
Affirm

Affirm is a U.S.-based fintech company offering a buy now, pay later (BNPL) service that allows consumers to make purchases in installments, primarily targeting the retail sector with both B2C and B2B business models.

Remote policy: Affirm is a remote-first company, allowing employees to work from various locations within their country of employment, including Poland, where the majority of roles can be remote.

Senior Software Engineer - SRE, Backend (Reliability Engineering)

22 days ago
Full-time
Canada
$150,000 to $200,000 per year
Key requirements: 4 years of experience, Python, Kotlin, AWS, MySQL, Kubernetes, Site Reliability Engineering, Distributed systems, Capacity management, Automation, Observability, Configuration management, Technical planning, Code quality, Communication skills
Affirm

Affirm is a U.S.-based fintech company offering a buy now, pay later (BNPL) service that allows consumers to make purchases in installments, primarily targeting the retail sector with both B2C and B2B business models.

Remote policy: Affirm is a remote-first company, allowing employees to work from various locations within their country of employment, including Poland, where the majority of roles can be remote.

Staff Site Reliability Engineer

22 days ago
Full-time
United States, United Kingdom, United Arab Emirates, India
$144,000 to $225,000 per year
Key requirements: 7 years of experience, Kubernetes, Terraform, AWS, Advanced scripting (Python, Bash), Linux internals (Ubuntu), CI/CD pipelines (Jenkins, ArgoCD), Infrastructure as Code (IaC), Monitoring tools (Prometheus, Grafana), Containerization (k8s), Financial services exposure
Addepar

Addepar is a Mountain View, CA-based B2B wealth management technology platform that provides cloud-based investment portfolio management solutions for high-net-worth clients and investment professionals globally.

Remote policy: Addepar embraces a flexible workforce model and hires from various locations, with team members in cities such as New York, Salt Lake City, Chicago, London, Edinburgh, Pune, and Dubai. This allows for collaboration across different time zones.

Site Reliability Engineer, Contract

22 days ago
Contract
Worldwide
Key requirements: 3 years of experience, Google Cloud, Terraform, Python, Kubernetes, Incident response, Error budgets negotiation, Service reliability metrics, Agile methodologies
66degrees

66degrees is a global B2B consulting firm specializing in AI, data, and cloud solutions, helping businesses transform challenges into opportunities.

Remote policy: 66degrees embraces a flexible remote work policy, hiring from various regions globally to support its diverse delivery team.

Platform Engineer

22 days ago
Full-time
United States
Key requirements: 5 years of experience, AWS, Terraform, Docker, Kubernetes, CI/CD Pipelines, Infrastructure as Code, DevSecOps, Monitoring tools, Networking (Nginx, Vault), Scripting (Bash, Python), Federal government experience
PingWind

PingWind is an Annandale, Virginia-based SDVOSB specializing in B2B IT services, focusing on cybersecurity, cloud computing, and digital transformation for federal government clients.

Remote policy: PingWind supports remote work and hires employees across the continental United States (CONUS), with team members located in various states. While some positions are on-site or hybrid, there are fully remote opportunities available for U.S.-based candidates.

Site Reliability Engineer for Jira Data Center

22 days ago
Contract
United States
Key requirements: Atlassian Jira and Confluence experience, Datadog, Infrastructure setup, Clustering and load balancing, API integrations, Linux/Windows server proficiency, Database management (PostgreSQL, MySQL, Oracle), Web application performance tuning, Large-scale enterprise environment experience, Excellent communication skills
ServiceRocket

ServiceRocket is a Palo Alto-based global tech-enabled services company specializing in Atlassian products and ITSM/ESM solutions for the B2B market across various industries.

Remote policy: ServiceRocket supports remote work with fully remote roles primarily for candidates based in the United States, while also offering hybrid positions in various regions such as Malaysia, the Philippines, and Australia.

Senior Site Reliability Engineer (Node.js & Javascript), Trading Technologies

26 days ago
Full-time
South America, Europe, Asia
Key requirements: 5 years of experience, Node.js, JavaScript, Monitoring large-scale production systems, Cloud platforms (AWS or GCP), Distributed systems, Linux environment, Excellent communication skills in English
Binance

Binance is a global fintech B2C platform headquartered in Malta, operating the largest cryptocurrency exchange by trading volume, offering a wide range of digital asset products and services to users in over 180 countries.

Remote policy: Binance supports flexible remote work arrangements, with team members located in various regions globally, though specific policies may vary by team and role.

Senior Cloud Performance Engineer

30 days ago
Full-time
United States
$120,000 to $180,000 per year
Key requirements: 6 years of experience, Distributed systems performance engineering, Database benchmarking, Cloud infrastructure services, Kubernetes, Go, C/C++, Java, Concurrency and multithreading, Performance analysis, Capacity management, Chaos Engineering techniques, Production debugging
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Cloud Performance Engineer

30 days ago
Full-time
United States
$120,000 to $180,000 per year
Key requirements: 6 years of experience, Distributed systems performance engineering, Database benchmarking, Test automation, Capacity management, Go, C/C++, Java, Kubernetes, AWS, GCP, Azure, Chaos Engineering, Production debugging
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Database Reliability Engineer - Core Team

30 days ago
Full-time
United Kingdom, Germany, Netherlands
Key requirements: 5 years of experience, ClickHouse, SQL databases, Distributed database internals, Shell scripting, Python, Cloud platforms (AWS, Azure, GCP), Production debugging, Incident response processes
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Database Reliability Engineer - Core Team

30 days ago
Full-time
United Kingdom, Germany, Netherlands
Key requirements: 5 years of experience, ClickHouse, SQL databases, Distributed database internals, Shell scripting, Python, C++ reading, AWS, Azure, Google Cloud Platform, Incident response processes, Post-mortem analysis
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.