Remote Site Reliability Engineer Jobs

Explore 69 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Latest Site Reliability Engineer Jobs (69) - Page 2

Cloud & AI Operations Administrator

15 days ago
Full-time
Canada
$67,000 to $82,000 per year
Key requirements: 5 years of experience, Azure, PowerShell, Azure CLI, Azure Active Directory, AI services management, CCNA, Microsoft Teams, Office 365, Cisco Meraki Solutions
Heart & Stroke

Heart & Stroke is a Canadian non-profit organization headquartered in Ontario, dedicated to preventing heart disease and stroke through research, advocacy, and public education, targeting diverse communities across Canada.

Remote policy: Heart & Stroke supports remote work for candidates residing in Canada, with the requirement to travel to the Toronto office when requested.

General Application for Engineering

15 days ago
Contract
Brazil
Key requirements: Python, Go, TypeScript, ETL, AWS, Kubernetes, Terraform, Agile, Learning Agility, Collaborative Spirit, Ownership
Loadsmart

Loadsmart is a Chicago-based logistics technology company specializing in innovative freight management solutions for the B2B market.

Remote policy: Loadsmart supports remote work and has a globally distributed team, currently hiring for remote positions in Brazil.

Senior Systems Software Engineer, AI Infrastructure

16 days ago
Full-time
United States
$184,000 to $287,500 per year
Key requirements: 5 years of experience, Python, C/C++ or Go or Perl or Ruby, Linux or Windows systems engineering, AWS or Azure or GCP or OCI, SRE principles, Terraform CDK, observability platforms, CI/CD systems, AI training and inferencing, deep learning frameworks, distributed systems, cloud or hardware health monitoring
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Software Engineer - Infrastructure

17 days ago
Full-time
United States
Key requirements: 8 years of experience, AWS infrastructure at scale, Infrastructure as Code (Terraform), Designing and operating distributed systems, CI/CD systems and automation pipelines, Containerized environments (Docker), Observability and performance tuning, Experience architecting infrastructure for multiple payment rails, Understanding of payment rail mechanics, Experience scaling high-volume financial workloads, Familiarity with security and risk standards for money movement
Modern Treasury

Modern Treasury is a San Francisco-based fintech B2B company specializing in payment operations tools, offering APIs and dashboards for automating money movement for enterprises.

Remote policy: Modern Treasury supports remote work for certain roles and primarily hires from the United States, with team members located in cities such as San Francisco and New York.

Intermediate Site Reliability Engineer, Environment Automation

17 days ago
Full-time
Worldwide
Key requirements: Golang, Kubernetes, Terraform, Ansible, Infrastructure as Code, SaaS experience, Multi-tenant environments, Observability stack, Automation mindset, Git-based workflows
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. The company embraces flexible scheduling to accommodate various time zones.

AI Platform Engineer

18 days ago
Full-time
United States
$168,000 to $322,000 per year
Key requirements: 10 years of experience, Python, Kubernetes, AI/ML platforms, Distributed systems, Observability design, AI-assisted development tools, Automation-first approach, AI-native infrastructure roadmaps
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer

18 days ago
Full-time
United States, Canada
Key requirements: 10 years of experience, Cloud-native infrastructure, Kubernetes deployment, Terraform, Ansible, IaC / GitOps tooling, On-prem compute knowledge, Platform optimization, Distributed systems troubleshooting, Python, Bash, CUDA-based GPU programs, Security in sensitive environments
Planet

Planet Labs is a San Francisco-based B2B company specializing in satellite imagery and earth data analytics for applications in agriculture, environmental monitoring, and disaster response.

Remote policy: Planet hires remotely from the United States, supporting team members to work from anywhere within the country.

Site Reliability Engineer

18 days ago
Full-time
United States, Canada
$160,600 to $200,800 per year
Key requirements: 10 years of experience, Cloud-native infrastructure, Kubernetes deployment, Terraform, Ansible, IaC / GitOps tooling, On-prem compute knowledge, Platform optimization, Distributed systems troubleshooting, Python, Bash, CUDA-based GPU programs, Security in sensitive environments
Planet

Planet Labs is a San Francisco-based B2B company specializing in satellite imagery and earth data analytics for applications in agriculture, environmental monitoring, and disaster response.

Remote policy: Planet hires remotely from the United States, supporting team members to work from anywhere within the country.

Senior Infrastructure Engineer

18 days ago
Full-time
United States, Canada
$200,700 to $250,900 per year
Key requirements: 5 years of experience, AWS, Terraform, Prometheus, Grafana, OpenTelemetry, Technical writing, Infrastructure project leadership, Regulated environments experience, Large-scale Terraform implementations, Coding experience
Mercury

Mercury Bank is a B2B financial institution offering simplified business banking services, targeting businesses primarily in the North American market.

Remote policy: Mercury offers remote work opportunities for various positions, with current roles available for candidates in Canada and the United States. For specific hiring regions and remote work policies, please refer to the company's careers page.

Senior Site Reliability Engineer, EU or UK

19 days ago
Full-time
Europe
Key requirements: Linux systems management, AWS or Azure or Google Cloud, Docker, CI/CD pipeline, Prometheus or OpenTelemetry or eBPF, Cloud security and IAM policies, Python, Automation and API coding
Auros

Auros is a Hong Kong-based B2B cryptocurrency market making firm specializing in high-frequency trading and liquidity provision services for the global digital asset market.

Remote policy: Auros Global embraces a hybrid work model, allowing remote and flexible work arrangements while hiring from various regions globally, including the UK and EU.

Senior Platform Engineer

19 days ago
Full-time
Worldwide
$100,000 to $150,000 per year
Key requirements: AWS, Kubernetes, Terraform, CI/CD pipelines, Go, SRE principles, AI-assisted engineering tools, Cloud security, Production observability technologies
Trust Wallet

Trust Wallet is a leading B2C multi-chain, non-custodial cryptocurrency wallet that lets users manage over 10 million digital assets. The company operates remotely and serves a global user base.

Remote policy: Trust Wallet operates as a fully remote company, hiring globally with team members working from various countries. Candidates must have the right to work in their respective locations.

Senior Site Reliability Engineer

19 days ago
Full-time
Worldwide
$113,082 to $175,725 per year
Key requirements: 6 years of experience, Puppet, Kubernetes, Python, Linux troubleshooting, Distributed caching systems, TCP/IP, HTTP, TLS, DNS, Incident response, Automation of tasks, Monitoring tools (Prometheus, Grafana)
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.

Escalation Engineer

19 days ago
Full-time
India
Key requirements: 4 years of experience, Strong networking fundamentals, Expert troubleshooting skills, Linux, Troubleshooting tools (IXIA, tcpdump, Wireshark, iPerf3), Python, Terraform or other IaC tools, AWS/Azure/GCP Cloud networking, Cloud-native architecture, Strong leadership ability, Ability to learn new technologies
Alkira, Inc.

Alkira, Inc. is a California-based B2B cloud networking company specializing in Network Infrastructure as a Service (NaaS) for enterprises, offering hybrid and multi-cloud connectivity solutions globally.

Senior Software Engineer, Infrastructure Automation and Distributed Systems

20 days ago
Full-time
North America
$224,000 to $431,250 per year
Key requirements: 12 years of experience, Infrastructure automation, Distributed systems design, Python, Go, Perl, Ruby, Linux, Networking, Storage, Containers, Multi-cloud infrastructure, Kubernetes, OpenStack, Docker, Slurm, NVIDIA Collective Communication Library (NCCL)
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Software Engineer - SRE (Remote)

20 days ago
Full-time
United States
Key requirements: 8 years of experience, Site Reliability Engineering, DevOps, Kubernetes, AWS, On-call experience, Monitoring and alerting, Collaboration across teams
Rula

Rula Health is a remote-first B2C telehealth SaaS platform based in the U.S., specializing in online therapy and psychiatry services for individuals aged 5 and older, addressing over 90 mental health conditions.

Remote policy: Rula Health is a 100% remote-first company, hiring primarily in the United States, with the exception of Hawaii.

Senior Site Reliability Engineer

20 days ago
Full-time
India
Key requirements: 5 years of experience, AWS, Kubernetes, Terraform, Docker, CI/CD, Bash, Python, Authentication technologies, Monitoring tools, Data pipelines with Databricks, Infrastructure cost management
Teikametrics

Teikametrics is a US-based B2B SaaS platform specializing in AI-driven marketplace optimization for e-commerce brands, helping them maximize profitability on platforms like Amazon and Walmart.

Remote policy: Teikametrics embraces a remote-first culture, hiring talented individuals across 25 states in the USA, as well as in China and India, allowing flexibility for employees to work when they are most productive.

Senior Site Reliability Engineer

20 days ago
Full-time
India
Key requirements: 3 years of experience, AWS, Kubernetes, Terraform, Docker, CI/CD, Bash, Python, Authentication technologies, Monitoring tools, DevOps best practices, On-call support
Teikametrics

Teikametrics is a US-based B2B SaaS platform specializing in AI-driven marketplace optimization for e-commerce brands, helping them maximize profitability on platforms like Amazon and Walmart.

Remote policy: Teikametrics embraces a remote-first culture, hiring talented individuals across 25 states in the USA, as well as in China and India, allowing flexibility for employees to work when they are most productive.

FBS AIOps Engineer

20 days ago
Full-time
Worldwide
Key requirements: 2 years of experience, Dynatrace, Copilot Studio, AIOps platform design, Event correlation, Runbook automation, ITSM integration, Anomaly detection, Consultative partnership
Capgemini

Capgemini is a Paris-based B2B IT services and consulting company specializing in digital transformation, technology services, and consulting, serving diverse industries globally.

Remote policy: Capgemini supports flexible work arrangements, including remote and office-based options, and operates globally across more than 40 countries, welcoming applicants from various regions.

Senior Production Engineer

21 days ago
Full-time
United States
$165,000 to $195,000 per year
Key requirements: 5 years of experience, AWS, Kubernetes (EKS), Terraform, Go, Python, CI/CD systems, Observability tools, SLIs/SLOs implementation, GenAI tools
Legion

Legion is a remote B2B SaaS provider specializing in intelligent automation workforce management solutions for labor-intensive industries, headquartered in the United States.

Sr Cloud Engineer (Contract-to-Hire)

21 days ago
Contract
United States
$140,000 to $165,000 per year
Key requirements: 5 years of experience, Cloud-native solutions, HITRUST & SOC2 compliance, Infrastructure automation, Containerization (Docker, Kubernetes), DevSecOps principles, CI/CD pipelines (Azure DevOps), Observability platforms (Datadog), Multi-cloud deployment, Stateful database infrastructure, Microservices architecture
Lirio

Lirio is a U.S.-based healthtech B2B SaaS company specializing in behavioral health interventions and personalized care navigation through its AI-driven platform.

Remote policy: Lirio supports remote work with opportunities for hybrid arrangements for candidates located in Tennessee. Currently, hiring is focused on candidates authorized to work in the US.

Application Support Engineer

21 days ago
Full-time
United States
$100,000 to $120,000 per year
Key requirements: 2 years of experience, Microservices architectures, Full-stack troubleshooting, Log analysis tools, Incident management methodologies, FinTech experience, Jira, Confluence, Scripting skills, Customer satisfaction focus, High-pressure problem-solving, Excellent communication skills
Lumin Digital

Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.

Remote policy: Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.

Site Reliability Engineer - India

22 days ago
Full-time
India
Key requirements: Kubernetes, Docker, Java, Python, Continuous Delivery tools, Unix, Infrastructure components, Datadog monitoring, Automation of operational work, Self-healing patterns, Resiliency patterns
Zimperium

Zimperium, Inc. is a Dallas-based B2B cybersecurity company specializing in mobile security solutions for enterprises, offering real-time protection against mobile threats on iOS and Android devices.

Remote policy: Zimperium supports remote work and is currently hiring for various roles, including remote positions in regions such as India. For specific hiring locations and remote work details, please refer to the company's official careers page.

Software Reliability Engineer - LPU Hardware DataFlow

23 days ago
Full-time
Europe
Key requirements: 8 years of experience, Reliability engineering, Hardware testing, Driver testing, Functional programming (Haskell, Nix), System programming (C++, Rust, Java), Linux scripting (Python, Shell), Automated test pipelines, CI/CD experience, GPU reliability testing, Hardware durability testing, Driver development, Kernel debugging, Reliability standards knowledge
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior DevOps / SRE Engineer

23 days ago
Full-time
United States
$120,000 to $150,000 per year
Key requirements: Kubernetes (EKS), Blockchain reliability, Zero-downtime operations, CI/CD pipeline development, Observability tooling, Real-time systems, Infrastructure as Code (IaC), Incident leadership, Security focus
MLabs

MLabs is a remote fintech company specializing in DeFi solutions, providing a unified API for financial institutions to access on-chain liquidity across major blockchains.

Remote policy: MLabs supports remote work for positions located within the EMEA region, allowing for flexible hours and a remote-first environment.

Senior DevOps/SRE Engineer

24 days ago
Full-time
Worldwide
$100,000 to $150,000 per year
Key requirements: 6 years of experience, AWS services, Kubernetes, Terraform, GitLab CI, VictoriaMetrics, Prometheus, Grafana, Apache Kafka, Bash, Python, Go
Capital.com

Capital.com is a Cyprus-based fintech B2C online trading platform specializing in CFDs and spread betting across over 3,000 global financial markets.

Remote policy: Capital.com offers remote work opportunities, including the flexibility to work from various locations, with team members enjoying benefits such as 30 extra days to work remotely from anywhere in the world.

Senior Site Reliability Engineer - Remote

25 days ago
Full-time
United States
Key requirements: 8 years of experience, Go, Python, AWS, Azure, Google Cloud, Kubernetes, Ansible, Terraform, Distributed databases, ClickHouse, Incident management, Post-mortem analysis, Problem solving, Accountability
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Site Reliability Engineer - Remote

25 days ago
Full-time
Worldwide
$141,000 to $208,000 per year
Key requirements: 8 years of experience, Go, Python, AWS, Azure, Google Cloud Platform, Kubernetes, Docker Swarm, Ansible, Terraform, Puppet, Distributed databases, SQL, ClickHouse
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Cloud Network Engineer (US Remote)

26 days ago
Full-time
United States
$110,000 to $140,000 per year
Key requirements: 7 years of experience, AWS network security, Palo Alto NGFW, Multi-cloud network management, CloudFormation, Terraform, IP overlap and static routing, Public Cloud architecture (Azure/AWS), Network performance testing methodologies, Centralized cloud connectivity design, Automation methodologies
First Advantage

First Advantage is an Atlanta-based HR Tech B2B SaaS provider specializing in global background screening and compliance solutions for various industries.

Remote policy: First Advantage offers flexibility with the possibility to work remotely, supporting a global workforce across various regions. Team members are located in 17 countries, allowing for collaboration across time zones.

Site Reliability Engineer

26 days ago
Full-time
Worldwide
Key requirements: 3 years of experience, SLIs/SLOs definition, Multi-tenant SaaS platforms, Datadog, Grafana, Elastic Stack, Kubernetes, High-availability architectures, Incident response leadership, Automation and process improvement, Cloud experience (Azure preferred), Capacity planning and load testing
HostPapa

HostPapa is a Canada-based web hosting company offering B2B and B2C solutions, including shared, reseller, and VPS hosting services, with a focus on small businesses and a global presence.

Remote policy: HostPapa offers remote work opportunities and hires from various locations, with team members and customers in 39 countries around the globe.

Senior HPC Site Reliability Engineer

27 days ago
Full-time
Asia, Israel
Key requirements: 8 years of experience, HPC infrastructure design, Large scale compute architecture, Job schedulers (LSF, SGE, SLURM), Cluster configuration management (Ansible, Puppet), Public cloud services (AWS, Azure, Google Cloud), Script-writing (Python, Bash, Perl), PaaS microservices (Docker, Kubernetes), Distributed storage solutions, Linux performance optimization, Kubernetes deployment management
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.