Remote Site Reliability Engineer Jobs

Explore 64 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Latest Site Reliability Engineer Jobs (64)

Site Reliability Engineer

about 8 hours ago
Full-time
United States
$110,000 to $175,000 per year
Key requirements: 8 years of experience, Linux administration, Python, Cloud platforms (OCI, AWS, GCP), Configuration management (Ansible, Puppet), Database administration (MySQL, MongoDB, PostgreSQL), Production support for large-scale environments, Advanced scripting (Perl, Bash), DevOps tools (Docker, K8s, Gitlab CICD, Jenkins, Terraform), Monitoring best practices (ELK stack, Prometheus, Nagios, Grafana), Technical project leadership
Ooma, INC

Ooma, Inc. is a Sunnyvale-based telecommunications company offering cloud-based VoIP and unified communications services as a SaaS provider, targeting both B2B and B2C markets across the US and Canada.

Software Reliability Engineer - LPU Hardware DataFlow

about 10 hours ago
Full-time
Europe
Key requirements: 8 years of experience, Reliability engineering, Hardware testing, Driver testing, Functional programming (Haskell, Nix), System programming (C++, Rust, Java), Linux scripting (Python, Shell), Automated test pipelines, CI/CD experience, GPU reliability testing, Hardware durability testing, Driver development, Kernel debugging, Reliability standards knowledge
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Network Site Reliability Engineer

about 10 hours ago
Full-time
Asia, Israel
Key requirements: 8 years of experience, Network automation, Prometheus, Grafana, Python, Go, TCP/UDP, BGP, VPN, L2 switching, Firewalls, Load Balancers, SNMP, Syslog, Streaming Telemetry, Mellanox/Cumulus Linux, Palo Alto firewalls, Netscalers, F5 load balancers
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Cloud Infrastructure & IT Compliance Engineer

2 days ago
Full-time
South America, Europe, Africa, Middle East
$100,000 to $130,000 per year
Key requirements: 7 years of experience, Azure DevOps, PaaS cloud infrastructure, Cloud security principles, Identity and access management (IAM), Bicep, CI/CD pipelines, Monitoring and observability platforms
Skedda

Skedda is a global B2B SaaS platform headquartered in an unspecified location, specializing in workplace management solutions for diverse sectors including IT, education, and finance.

Remote policy: Skedda hires remotely from various locations, with some roles, such as the QA Automation Engineer, specifically available within Europe (CET).

Senior Site Reliability Engineer- Remote

3 days ago
Full-time
United States
Key requirements: 8 years of experience, Go, Python, AWS, Azure, Google Cloud, Kubernetes, Ansible, Terraform, Distributed databases, ClickHouse, Incident management, Post-mortem analysis, Problem solving, Accountability
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Site Reliability Engineer- Remote

3 days ago
Full-time
Worldwide
$141,000 to $208,000 per year
Key requirements: 8 years of experience, Go, Python, AWS, Azure, Google Cloud Platform, Kubernetes, Docker Swarm, Ansible, Terraform, Puppet, Distributed databases, SQL, ClickHouse
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Cloud Network Engineer (US Remote)

4 days ago
Full-time
United States
$110,000 to $140,000 per year
Key requirements: 7 years of experience, AWS network security, Palo Alto NGFW, Multi-cloud network management, CloudFormation, Terraform, IP overlap and static routing, Public Cloud architecture (Azure/AWS), Network performance testing methodologies, Centralized cloud connectivity design, Automation methodologies
First Advantage

First Advantage is an Atlanta-based HR Tech B2B SaaS provider specializing in global background screening and compliance solutions for various industries.

Remote policy: First Advantage offers flexibility with the possibility to work remotely, supporting a global workforce across various regions. Team members are located in 17 countries, allowing for collaboration across time zones.

Site Reliability Engineer

4 days ago
Full-time
Worldwide
Key requirements: 3 years of experience, SLIs/SLOs definition, Multi-tenant SaaS platforms, Datadog, Grafana, Elastic Stack, Kubernetes, High-availability architectures, Incident response leadership, Automation and process improvement, Cloud experience (Azure preferred), Capacity planning and load testing
HostPapa

HostPapa is a Canadian-based web hosting company offering B2B and B2C solutions, including shared, reseller, and VPS hosting services, with a focus on small businesses and a global presence.

Remote policy: HostPapa offers remote work opportunities and hires from various locations, with team members and customers in 39 countries around the globe.

Senior HPC Site Reliability Engineer

4 days ago
Full-time
Asia, Israel
Key requirements: 8 years of experience, HPC infrastructure design, Large scale compute architecture, Job schedulers (LSF, SGE, SLURM), Cluster configuration management (Ansible, Puppet), Public cloud services (AWS, Azure, Google Cloud), Script-writing (Python, Bash, Perl), PaaS microservices (Docker, Kubernetes), Distributed storage solutions, Linux performance optimization, Kubernetes deployment management
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Systems Engineer

4 days ago
Full-time
United States
$78,900 to $116,760 per year
Key requirements: 5 years of experience, Windows server administration, Linux server administration, SRE methodologies, DevOps methodology, VMware/Nutanix administration, Incident management, Automation focus, Flexible working hours willingness
rockstargames

Rockstar Games is a New York City-based video game publisher specializing in action-adventure and racing games, operating primarily as a B2C company with a global reach.

Senior Infrastructure Automation Engineer - SCM and HPC AI

5 days ago
Full-time
India
Key requirements: 4 years of experience, Baremetal provisioning automation, Distributed systems architecture, CI/CD systems configuration, Go, Python, Ansible, Linux system administration
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Site Reliability Engineer

5 days ago
Full-time
Worldwide
$120,000 to $180,000 per year
Key requirements: 8 years of experience, Kubernetes, Multi-cloud experience, Terraform, CI/CD processes, Stateful software on Kubernetes, Kubernetes best practices, Debugging Kubernetes clusters, Scripting (bash, Python, Go)
Diagrid

Diagrid is a technology company providing a B2B SaaS platform for workflow orchestration and AI agent development, headquartered in an unspecified location, serving various industries including financial services and healthcare.

Remote policy: Diagrid operates with a fully remote and flexible work environment, supporting collaboration across various regions, including the United States and Europe.

HPC Operations Engineer

5 days ago
Full-time
United States
$124,000 to $241,500 per year
Key requirements: 2 years of experience, Linux systems administration, Workload schedulers (LSF, Slurm), HPC support, Scripting (Bash, Python), Network computing (NFS, LDAP), Technical support experience
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior HPC and LSF Operations Engineer

5 days ago
Full-time
United States
$152,000 to $287,500 per year
Key requirements: 5 years of experience, HPC scheduling systems, LSF, Slurm, Linux systems administration, Reliability engineering practices, Observability systems, Container technologies
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Cloud Operations Engineer

5 days ago
Full-time
United States
$110,000 to $125,000 per year
Key requirements: 3 years of experience, Cloudwatch, AWS, CI/CD pipelines, Incident triage, SSL/TLS management, Automation/orchestration tools, Cross-functional collaboration, Exceptional communication skills
Lumin Digital

Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.

Remote policy: Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.

Site Reliability Engineer - AI & ML Infrastructure (Kubernetes, AWS & Terraform)

6 days ago
Full-time
United States
Key requirements: 5 years of experience, Kubernetes, Terraform, Slurm, High-performance computing, Bare metal infrastructure management, Python, AWS
Deepgram

Deepgram is a San Francisco-based B2B SaaS provider specializing in Voice AI solutions, offering speech-to-text and text-to-speech APIs for developers and enterprises in various sectors.

Site Reliability Engineer - AI & ML Infrastructure (Kubernetes, AWS & Terraform)

6 days ago
Full-time
United States
Key requirements: 5 years of experience, Kubernetes, Terraform, AWS, Slurm, Bare metal infrastructure, High-performance computing, Scripting (Python, Go, Bash)
deepgram

Deepgram is a San Francisco-based B2B AI company specializing in speech-to-text (STT) and text-to-speech (TTS) technologies, providing real-time APIs for developers in the Voice AI industry.

Sr. Site Reliability Engineer

6 days ago
Full-time
United States
$140,000 to $180,000 per year
Key requirements: 10 years of experience, Infrastructure assessments, Hybrid hosting environments, Infrastructure stabilization, Performance engineering, System resilience strategies, Citrix dependency analysis, Operational continuity strategies, Technical documentation skills, Stakeholder communication
Element Solutions

Element Solutions Inc is a US-based specialty chemicals company specializing in manufacturing chemical products for electronics and industrial applications, operating primarily in a B2B model with a global presence.

Remote policy: Element Solutions Inc is a remote-first company, primarily hiring candidates who reside in the Continental US, with team members collaborating across various time zones.

IT Operations Engineer I, Remote

6 days ago
Full-time
United States
Key requirements: 6 years of experience, PowerShell, Python, Azure, ITGC, SOX, SOC II Type II, Identity migrations, VDI management, Incident management, Security monitoring
Aledade

Aledade is an Ashburn, VA-based B2B healthcare company specializing in helping independent primary care practices and health centers build and manage Accountable Care Organizations (ACOs) to enhance value-based care.

Remote policy: Aledade supports flexible work schedules and remote work for many roles, operating across various states in the United States, with team members collaborating nationwide.

Senior Site Reliability Engineer

6 days ago
Full-time
United States
$172,614 per year
Key requirements: 2 years of experience, Kubernetes, Infrastructure as Code, Python, Go, JavaScript, Shell script, Observability platform, SLI/SLO/SLAs, Security risk assessments
Loadsmart

Loadsmart is a Chicago-based logistics technology company specializing in innovative freight management solutions for the B2B market.

Remote policy: Loadsmart supports remote work and has a globally distributed team, currently hiring for remote positions in Brazil.

Senior Infrastructure & Security Engineer

8 days ago
Full-time
United States
$160,000 to $170,000 per year
Key requirements: 6 years of experience, AWS CDK, HIPAA compliance, SOC 2 compliance, AWS Lambda, AWS ECS, AWS S3, AWS CloudWatch, AWS IAM, AWS VPC, AWS WAF, Monitoring and incident management, TypeScript, CI/CD pipelines, Multi-region AWS architecture, Healthcare industry experience
Koda Health

Koda Health is a healthcare technology company offering a cloud-based SaaS platform for advance care planning, headquartered in the US, targeting both B2B partnerships with health systems and direct B2C access for patients.

Remote policy: Koda Health offers fully remote positions for U.S.-based candidates, supporting a flexible work environment across various time zones.

Infrastructure Engineer

9 days ago
Full-time
United States
$163,000 to $263,670 per year
Key requirements: 4 years of experience, AWS, Terraform, Automate infrastructure provisioning, Build internal tools, On-call rotation participation, Code to solve problems, Curiosity for learning new tools, High standards for production systems, Value code quality and testing
LaunchDarkly

LaunchDarkly is an Oakland-based B2B SaaS platform specializing in feature flagging and experimentation for software development teams, enabling safe and controlled software releases for enterprises globally.

Site Reliability Engineer II, Data Platforms

10 days ago
Full-time
India
Key requirements: 4 years of experience, PostgreSQL, MongoDB, Puppet, Linux, Automation, Observability, Metrics, Alerting, Git, CI/CD, Scripting (Python, Go, Shell)
OpenTable

OpenTable is a San Francisco-based SaaS platform providing an online restaurant-reservation service for diners and restaurants, operating in the hospitality industry with a global reach.

Senior Infrastructure Engineer

10 days ago
Full-time
South America, United States, Canada, Australia
$172,000 to $212,000 per year
Key requirements: 5 years of experience, AWS, GCP, Terraform, SRE principles, CI/CD, Kubernetes, Microservices, Monitoring tools, Java, Python, TypeScript
Flex

Flex is a global manufacturing partner specializing in electrical engineering and critical power solutions, headquartered in an unspecified location, operating in the B2B sector.

Remote policy: Flex does not have a defined remote work policy, and many roles may require on-site presence, particularly in engineering and manufacturing. Hiring is flexible, with opportunities available in various regions, including Europe.

Senior Site Reliability Engineer, Environment Automation

10 days ago
Full-time
United States
$124,300 to $266,400 per year
Key requirements: Terraform, Kubernetes, Production-scale experience, Infrastructure-as-code, Observability tools, Incident response leadership, Multi-tenant infrastructure management, GitLab proficiency
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Operations Engineer- Philippines (Contract)

11 days ago
Contract
Philippines
Key requirements: 3 years of experience, Linux server administration, SIP troubleshooting, FreeSWITCH, Kamailio, Zabbix, OpsGenie, VoIP support, SLA-driven support model
Ooma, INC

Ooma, Inc. is a Sunnyvale-based telecommunications company offering cloud-based VoIP and unified communications services as a SaaS provider, targeting both B2B and B2C markets across the US and Canada.

Customer SIEM Engineer

11 days ago
Full-time
United States
$120,000 to $210,000 per year
Key requirements: 5 years of experience, Linux Power User, SIEM Expertise, Detection Mindset, Scripting & Automation, Log Mastery, Architect Data Pipelines, Project Leadership
Gravwell

Gravwell is a remote-first cybersecurity B2B platform specializing in full-stack security and observability solutions for enterprises, with a focus on data analytics.

Remote policy: Gravwell offers a flexible remote work setup, allowing employees to work from home anywhere within the United States, fostering a collaborative and supportive work environment.

Site Reliability Engineer

12 days ago
Contract
United States
Key requirements: 2 years of experience, Linux management, Load balancers, High-availability technologies, Configuration management tools, Troubleshooting skills, Go, Python, Rust, CI/CD pipelines, Open-source monitoring tools, Blockchain infrastructure
Asymmetric Research

Asymmetric Research is a remote B2B security firm specializing in blockchain security for Layer 1 and Layer 2 protocols and DeFi projects.

Remote policy: Asymmetric Research is a fully remote organization, hiring from various locations globally to support a diverse team.

Site Reliability Engineer

12 days ago
Full-time
United States
$125,000 to $155,000 per year
Key requirements: 2 years of experience, Kubernetes, Infrastructure as Code, AWS, GCP, Python, Go, CI/CD systems, GitOps, Observability tools, Cloud-native ecosystem, Security interest
Virtru

Virtru is a Washington, D.C.-based cybersecurity B2B provider specializing in data encryption and digital privacy solutions for email and file sharing, targeting compliance-focused industries globally.

Staff Site Reliability Engineer

12 days ago
Full-time
United States
$136,301 to $170,000 per year
Key requirements: Go, Kubernetes, Docker, CI/CD pipelines, scalable distributed systems, cloud platforms, networking, Identity and Access Management (IAM)
Ping Identity

Ping Identity is a Denver-based B2B software company specializing in identity and access management (IAM) solutions for enterprise customers across technology, finance, healthcare, and government sectors.