Remote Site Reliability Engineer Jobs

Explore 51 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Latest Site Reliability Engineer Jobs (51)

Senior Cloud Support Engineer

1 day ago
Full-time
United States, United Kingdom, Singapore, India
$100,000 to $120,000 per year
Key requirements: 3 years of experience, AWS, Kubernetes, GraphQL, CLI tools, Python, JavaScript, Multi-agent systems, Prometheus, Grafana, Infrastructure as Code, Search Technologies
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Infrastructure Engineer

1 day ago
Full-time
United States
$200,000 to $250,000 per year
Key requirements: AWS CDK, ECS, RDS Aurora, Datadog, Infrastructure as Code, Cloud security, Cost optimization, Observability, Golang
Future

Future is a San Francisco-based digital personal training platform that provides personalized coaching and fitness solutions, targeting the US market.

Remote policy: Future is a remote-first company, hiring employees located anywhere in the continental US, with no travel required.

Staff Infrastructure Security Engineer (APAC, EMEA)

2 days ago
Full-time
Europe, Asia
$120,000 to $180,000 per year
Key requirements: Cloud infrastructure security (AWS/GCP/Azure), Kubernetes, Security tooling (Go, Python, Ruby), Infrastructure-as-Code security (Terraform, Ansible, CloudFormation), AI in security workflows, Leading multi-team technical initiatives, Security certifications (FedRAMP, ISO 27001, SOC 2, PCI-DSS)
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Engineer, Observability (Remote)

3 days ago
Full-time
United States
$87,000 to $107,000 per year
Key requirements: 4 years of experience, Session replay tools, Front-end web development, HTML, CSS, JavaScript, Customer struggle analysis, Technical liaison, Data literacy, Website KPIs, Collaboration with engineering teams, Proactive communication
Abercrombie and Fitch Co.

Abercrombie & Fitch Co. is a New Albany, Ohio-based global omnichannel specialty retailer of lifestyle apparel and accessories, primarily targeting preteens to millennials across North America, Europe, Asia, and the Middle East.

Senior Site Reliability Engineer

3 days ago
Full-time
United States, Canada
$145,000 to $185,000 per year
Key requirements: 5 years of experience, Terraform, AWS (VPC, IAM, EKS, S3, CloudWatch), Kubernetes, CI/CD (GitHub Actions, ArgoCD), Networking (CIDR, DNS, load balancing), Observability (Prometheus, Grafana), Python, Bash, Windows workloads on Kubernetes, GPU scheduling
Parallel Domain

Parallel Domain is a San Francisco-based B2B SaaS provider specializing in synthetic data generation and simulation tools for autonomous systems, including self-driving vehicles and robotics.

Senior Infrastructure Software Systems Engineer

4 days ago
Full-time
India
Key requirements: 9 years of experience, Distributed systems design, Building scalable infrastructure, API and data model definition, Job orchestration, Performance optimization, Python, C++, Go, Linux systems, Developer productivity improvement, Chip-design understanding
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Service Reliability Engineer - EDA Infrastructure

4 days ago
Full-time
India
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Linux administration, Ansible, Python, Kubernetes, SLURM, large-scale cluster management, observability tools, incident management tools, AWS, Azure, GCP
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Intermediate Site Reliability Engineer, Cloud Cost Utilization

5 days ago
Full-time
Worldwide
$90,000 to $130,000 per year
Key requirements: Cloud cost management, GCP, AWS, FinOps FOCUS, Resource tagging strategies, Infrastructure as code, Terraform, Ansible, Observability tooling, Grafana, Cross-functional collaboration
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Junior NOC Engineer

6 days ago
Full-time
United States
Key requirements: Network monitoring, Cisco, Juniper, SolarWinds, CCNA, CCNP, NOC experience, DoD Secret clearance
True Zero Technologies

True Zero Technologies is a veteran-owned cybersecurity consulting firm headquartered in Fairfax, VA, specializing in B2B services for federal agencies and the public sector.

Remote policy: True Zero Technologies offers remote positions, including roles like the BigID Engineer, and is open to hiring from various locations, primarily focusing on the U.S. market.

Senior Cloud Infrastructure Engineer - GeForce Now

7 days ago
Full-time
United States
$168,000 to $322,000 per year
Key requirements: 8 years of experience, AWS, Kubernetes, Ansible, HashiCorp Vault, AI-assisted coding tools, Cassandra or DynamoDB, CI/CD pipelines, Zero Trust networks, Python, Bash, Go
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

NVIDIA GPU Cloud Infrastructure - Platform Engineer

7 days ago
Full-time
Germany
Key requirements: 5 years of experience, Linux, Windows, Server hardware diagnostics, Network protocols, Basic switch setup, Cloud infrastructure experience, Automation and provisioning knowledge, Relationship building with IT teams
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Sr. Platform Engineer I

7 days ago
Full-time
United States
$115,000 to $165,000 per year
Key requirements: 4 years of experience, AWS (EC2, EKS, VPC, IAM, S3, RDS, Lambda, CloudWatch), Kubernetes administration, Terraform, Istio or similar service mesh, GitOps tools (ArgoCD, Flux), CI/CD tools (GitHub Actions, Jenkins, GitLab CI), HashiCorp Vault, Python, Bash, or Go scripting proficiency, Oncall experience, Incident response capabilities
Smarsh

Smarsh Inc. is a Portland-based B2B SaaS provider specializing in digital communications governance and compliance solutions for regulated industries.

Remote policy: Smarsh supports remote work for various roles, including positions available for candidates in the United States. The company values a diverse workforce and encourages applications from individuals across different regions.

Senior System Reliability Engineer

8 days ago
Full-time
Taiwan
Key requirements: 5 years of experience, Reliability testing methodologies, FMEA, DoE, Thermal Cycling, Mechanical Shock and Vibration, ALT/HALT/HASS, Burn-in, Ongoing Reliability Testing (ORT), Statistical analysis, Reliability modeling, Life data analysis, Project management, Fluency in Chinese and English
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Infrastructure Engineer

8 days ago
Full-time
Worldwide
$133,500 to $212,000 per year
Key requirements: 5 years of experience, Kubernetes, AWS, Infrastructure-as-Code (Terraform), Java, Python, Mentoring, Cross-team initiatives
Iterable

Iterable is a San Francisco-based B2B SaaS platform specializing in AI-driven cross-channel customer engagement solutions for brands worldwide.

Remote policy: Iterable embraces a flexible remote work culture, welcoming candidates from various regions worldwide, including those with remote employees across multiple countries. Team members collaborate across time zones to foster a diverse and inclusive workplace.

Senior Platform and EngOps Engineer - Cluster Operations

9 days ago
Full-time
India
Key requirements: 5 years of experience, Ansible, Python, Shell Scripting, NVLink, InfiniBand, Slurm, GPU-focused hardware, Compute Clusters, Metrics collection, Large scale networking
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Intermediate Site Reliability Engineer, Environment Automation

9 days ago
Full-time
Worldwide
$103,600 to $222,000 per year
Key requirements: Terraform, Ansible, Kubernetes, Golang, Infrastructure as Code, SaaS experience, Observability stack, Multi-tenant environments, Automation mindset
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Infrastructure Engineer

9 days ago
Full-time
North America, United Kingdom
$163,000 to $204,000 per year
Key requirements: CI/CD, Infrastructure as code, Observability, Distributed systems, AWS, Kubernetes, Go
Tailscale

Tailscale Inc. is a Toronto-based B2B software company specializing in a zero-config mesh VPN solution using the WireGuard protocol, targeting enterprise and developer markets.

Remote policy: Tailscale offers an all-remote work environment, allowing team members to work from anywhere, with a global hiring approach that includes various regions such as Canada, the US, and the UK.

Site Reliability Engineer

9 days ago
Full-time
United States
$150,000 to $200,000 per year
Key requirements: 5 years of experience, Linux systems expertise, SLIs/SLOs management, Containerized production systems, Distributed systems understanding, Incident response leadership, Scripting skills, Monitoring and alerting systems, GPU infrastructure experience
RunPod

RunPod is a Mt. Laurel, New Jersey-based B2B cloud computing platform specializing in GPU infrastructure for AI and machine learning applications, serving a global market of developers and enterprises.

Remote policy: RunPod operates as a remote-first organization, welcoming candidates from various locations, primarily focusing on those eligible to work in the United States.

Senior Site Reliability Engineer

9 days ago
Full-time
Canada
$125,200 to $132,500 per year
Key requirements: 7 years of experience, AWS (EKS, Lambda, CloudWatch), AI workload reliability, Terraform, GitOps, Datadog, LLM observability, CI/CD pipeline design, Agent loop observability, Blast radius management, Service catalog ownership, Incident response leadership
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

Senior Site Reliability Engineer (Remote Canada)

9 days ago
Full-time
Canada
$125,200 to $132,500 per year
Key requirements: 7 years of experience, AWS (EKS, Lambda, CloudWatch), AI workload reliability, Terraform, GitOps, Datadog, CI/CD pipeline design (Bitbucket Pipelines, GitHub Actions), Agent loop observability, Blast radius management, LLM infrastructure reliability, Experience in semiconductor or data-intensive platforms
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

Sr. Platform Engineer II

9 days ago
Full-time
United States
$134,000 to $180,000 per year
Key requirements: 8 years of experience, AWS cloud networking, Hybrid connectivity, FedRAMP compliance, Terraform, Infrastructure as Code, Network architecture design, Multi-tenant isolation, Network security controls, DataDog, Prometheus, Kubernetes, Zero-trust networking
Smarsh

Smarsh Inc. is a Portland-based B2B SaaS provider specializing in digital communications governance and compliance solutions for regulated industries.

Remote policy: Smarsh supports remote work for various roles, including positions available for candidates in the United States. The company values a diverse workforce and encourages applications from individuals across different regions.

Senior Service Reliability Engineer

10 days ago
Full-time
United States
$172,100 to $258,100 per year
Key requirements: 7 years of experience, Linux Production Systems Engineer, Python, Kubernetes, AWS, Distributed data storage (Hadoop, Ceph), NoSQL (MongoDB, Redis, Cassandra), Monitoring & Alerting (Prometheus, Grafana), Incident Management toolsets, Software Distribution, Configuration Management (ansible, saltstack, puppet, chef)
sonyinteractiveentertainmentglobal

Sony Interactive Entertainment is a San Mateo-based global video game and digital entertainment company, primarily B2C, known for the PlayStation brand and its innovative gaming hardware, software, and network services.

Remote policy: Sony Interactive Entertainment supports flexible remote work arrangements, hiring from various regions globally, including locations such as the USA, UK, and Japan.

Engineer, Observability (Remote)

10 days ago
Full-time
United States
$87,000 to $107,000 per year
Key requirements: 4 years of experience, Session replay tools, Front-end web development, HTML, CSS, JavaScript, Customer struggle analysis, Technical liaison, Data literacy, Collaboration with engineering teams, Proactive communication, Mentoring others
Abercrombie and Fitch Co.

Abercrombie & Fitch Co. is a New Albany, Ohio-based global omnichannel specialty retailer of lifestyle apparel and accessories, primarily targeting preteens to millennials across North America, Europe, Asia, and the Middle East.

Senior Site Reliability Engineer

10 days ago
Full-time
United States
$237,630 to $287,952 per year
Key requirements: GCP, Kubernetes, Golang, Python, Infrastructure as code, Cloud networking, Datadog, Distributed systems, High-availability infrastructure, Observability best practices
Calendly

Calendly is a leading SaaS platform headquartered remotely, specializing in scheduling automation for B2B and B2C markets, with a strong focus on enhancing productivity and communication across various industries.

Remote policy: Calendly operates as a fully remote company without an official headquarters, hiring from various locations. However, candidates must be authorized to work in the United States, and certain states are excluded from eligibility.

Site Reliability Engineer

10 days ago
Full-time
Malaysia
Key requirements: 3 years of experience, SLIs/SLOs definition, Multi-tenant SaaS platforms, Datadog, Grafana, Kubernetes, Python, Incident response, High-availability architectures, Observability stack design, Automation and process improvement, Capacity planning
HostPapa

HostPapa is a Canadian-based web hosting company offering B2B and B2C solutions, including shared, reseller, and VPS hosting services, with a focus on small businesses and a global presence.

Remote policy: HostPapa offers remote work opportunities and hires from various locations, with team members and customers in 39 countries around the globe.

Site Reliability Engineer

11 days ago
Full-time
Germany
Key requirements: 5 years of experience, Linux Production Systems Engineer, Python, Kubernetes, AWS, Distributed data storage, NoSQL, Monitoring & Alerting, Configuration Management, Incident Management
sonyinteractiveentertainmentglobal

Sony Interactive Entertainment is a San Mateo-based global video game and digital entertainment company, primarily B2C, known for the PlayStation brand and its innovative gaming hardware, software, and network services.

Remote policy: Sony Interactive Entertainment supports flexible remote work arrangements, hiring from various regions globally, including locations such as the USA, UK, and Japan.

Platform Engineer - Product Reliability (Mid Level)

11 days ago
Full-time
Japan
Key requirements: AWS, Terraform, Kubernetes, Observability tooling, Python, SaaS experience, Site Reliability Engineering experience, Large scale service management, Incident response experience, Proactive learning mindset
Kraken

Kraken is a global fintech B2B and B2C cryptocurrency exchange platform, dedicated to accelerating crypto adoption and headquartered remotely with a presence in over 70 countries.

Remote policy: Kraken is a fully remote company with team members in over 70 countries, hiring globally to support a diverse workforce across various regions.

Senior Site Reliability Engineer

12 days ago
Full-time
United States
$100,000 to $120,000 per year
Key requirements: 6 years of experience, Cloud infrastructure (AWS, GCP, Azure), Observability systems design, Incident management, Automation and tooling, Ownership of reliability outcomes, Mentorship of engineers
ujet.cx

UJET is a B2B SaaS provider headquartered in an unspecified location, specializing in AI-powered Cloud Contact Center solutions to enhance customer experience for enterprises globally.

Infrastructure Engineer

12 days ago
Full-time
United States
Key requirements: Terraform, AWS, Kubernetes, CI/CD, Go, TypeScript, Rust, Observability, Database performance optimization, Security hardening, Financial technology experience
Bastion

Bastion is a fintech B2B platform headquartered in New York City, specializing in Stablecoin-as-a-Service for financial institutions and enterprises.

Member of Technical Staff (Infrastructure): World Models

12 days ago
Full-time
United States, Canada, United Kingdom
Key requirements: Linux-native, GPU infrastructure, Kubernetes, Slurm, Distributed systems, Production discipline, ML familiarity, Resource-constrained thinking
Moonvalley AI

Moonvalley AI is a Los Angeles-based generative AI company specializing in text-to-video tools for the entertainment industry, operating in both B2B and B2C markets.

Remote policy: Moonvalley supports remote work and hires globally, welcoming candidates from various regions, including the UK and Europe, with a fully remote culture that accommodates collaboration across time zones.