Remote Site Reliability Engineer Jobs

Explore 48 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Latest Site Reliability Engineer Jobs (48)

Infrastructure Engineer

about 10 hours ago
Full-time
Worldwide
Key requirements: 5 years of experience, Kubernetes, Cloud security fundamentals, Production infrastructure in DevOps/DevSecOps, Full-stack engineering, Major cloud providers, Packaging workloads for air-gapped environments, Ability to obtain SECRET clearance
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Infrastructure Engineer (India)

4 days ago
Full-time
India
Key requirements: 7 years of experience, AWS, Azure, GCP, Terraform, CloudFormation, Ansible, Kubernetes, Docker Swarm, Prometheus, Grafana, ELK Stack, Python, Node.js, Bash, Go, Ruby, CI/CD pipelines, Mentoring, Security best practices, Performance optimization
Articul8 AI

Articul8 AI is a generative AI software company specializing in enterprise-scale AI solutions for regulated industries, headquartered in an unspecified location, operating on a B2B model targeting sectors like aerospace, oil and gas, and manufacturing.

Infrastructure Engineer (Brazil)

4 days ago
Full-time
Brazil
Key requirements: 7 years of experience, AWS, Azure, GCP, Terraform, Ansible, Kubernetes, Docker Swarm, Prometheus, Grafana, Python, Node.js, Bash, Go, Ruby
Articul8 AI

Articul8 AI is a generative AI software company specializing in enterprise-scale AI solutions for regulated industries, headquartered in an unspecified location, operating on a B2B model targeting sectors like aerospace, oil and gas, and manufacturing.

Senior Site Reliability Engineer - Observability and Telemetry Platform

5 days ago
Full-time
United States
$176,000 to $333,500 per year
Key requirements: 8 years of experience, Infrastructure automation, Distributed systems design, Python, Kubernetes, OpenStack, Grafana, Observability platforms, Systematic problem-solving, Strong communication skills
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Solutions Architect, OEM AI Factory Infrastructure

5 days ago
Full-time
United States
$152,000 to $241,500 per year
Key requirements: 5 years of experience, NVIDIA GPUs, Python, Cluster Administration, HPC/AI workloads, DevOps, Slurm, Data Science, Computer Architecture
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Cloud Security Engineer

5 days ago
Full-time
United States
$158,900 to $238,300 per year
Key requirements: Service mesh expertise (Istio, Kong), API gateway management (Kong, Amazon API Gateway), mTLS deployment in Kubernetes, PKI and certificate lifecycle management, Zero-trust architecture experience, Kubernetes security hardening, Scripting in Python and Go, Multi-cloud experience (AWS, GCP), Security governance frameworks, Automation of security processes, Monitoring tools (Grafana, Datadog)
sonyinteractiveentertainmentglobal

Sony Interactive Entertainment is a San Mateo-based global video game and digital entertainment company, primarily B2C, known for the PlayStation brand and its innovative gaming hardware, software, and network services.

Remote policy: Sony Interactive Entertainment supports flexible remote work arrangements, hiring from various regions globally, including locations such as the USA, UK, and Japan.

Senior Site Reliability Engineer (Remote Poland)

5 days ago
Full-time
Poland
18,800 to 20,000 PLN per month
Key requirements: 7 years of experience, AWS (EKS, Lambda, CloudWatch), AI workload reliability, Terraform, GitOps, Datadog, LLM observability, CI/CD pipeline design, Agent loop observability, Blast radius management, Service catalog ownership, Incident response leadership, Mentorship of junior engineers, Internal Developer Platform (IDP)
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

Principal Platform Infrastructure Engineer (Containers)

6 days ago
Full-time
Canada
$141,000 to $249,000 per year
Key requirements: Kubernetes expertise, Terraform, Google Cloud Platform, GitOps methodologies, Python, Bash, Go, Network protocols, Observability solutions
Menlo Security

Menlo Security is a Mountain View, CA-based B2B cybersecurity company specializing in secure enterprise browser solutions that protect against phishing and malware for government agencies and global enterprises.

Remote policy: Menlo Security supports remote work and hires from various regions globally, with team members located in places such as India and the United States, allowing for collaboration across time zones.

Senior Software Engineer - SRE

6 days ago
Full-time
Worldwide
Key requirements: AWS expertise, Kubernetes fundamentals, Terraform at scale, Production-quality code in Go/Python, CI/CD pipeline experience, GitHub Actions, ArgoCD, Observability principles, Datadog or similar tool, SLIs/SLOs for reliability
Socure

Socure is a U.S.-based B2B SaaS provider specializing in AI-driven identity verification and fraud prevention solutions for enterprises across financial services, e-commerce, and government sectors.

Remote policy: Socure is a fully remote organization, supporting team members across various locations, with some roles requiring in-person engagement in specific regions such as Washington, D.C.

Senior Site Reliability Engineer (Remote UK)

6 days ago
Full-time
United Kingdom
£77,600 to £82,200 per year
Key requirements: 7 years of experience, AWS (EKS, Lambda, CloudWatch), AI workload reliability, Terraform, GitOps, Datadog, CI/CD pipeline design, Agent loop observability, Blast radius management, LLM infrastructure reliability, Service catalog ownership, Incident response leadership, Observability for AI workloads, Mentorship of junior engineers
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

NCX Engineer, AI Accelerator

7 days ago
Full-time
United States
$184,000 to $287,500 per year
Key requirements: 8 years of experience, Linux systems, Distributed computing, Kubernetes, GPU scheduling, AI/ML experience, Python, Go, PyTorch, TensorFlow, MLOps, Infrastructure as code, NVIDIA ecosystem, Collaboration with NVIDIA Cloud Partners, Integration with enterprise systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Site Reliability Engineer

7 days ago
Full-time
United States
$168,000 to $270,250 per year
Key requirements: 8 years of experience, Incident Commander experience, Distributed systems understanding, Automation for incident management, AI/ML application in operations, Reducing MTTD and MTTR, Building incident management platforms, Real-time incident leadership
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Site Reliability Engineer

7 days ago
Full-time
Worldwide
$100,000 to $150,000 per year
Key requirements: Kubernetes, Helm, Google Cloud Platform, Microsoft Azure, Linux, Python, MySQL, MongoDB, RabbitMQ, Expert-level troubleshooting
Appspace

Appspace is a B2B SaaS company specializing in workplace experience software that integrates communication, space management, and digital signage to enhance hybrid work environments for enterprises globally.

Remote policy: Appspace supports a hybrid work environment, allowing employees to work from home, the office, or other locations. The company is hiring from various regions, promoting a flexible work culture.

Distinguished Site Reliability Engineer - Cloud

8 days ago
Full-time
North America
$320,000 to $488,750 per year
Key requirements: 16 years of experience, Kubernetes, OpenStack, Python, Go, Perl, Ruby, Infrastructure automation, Distributed systems design, Linux, Networking, Containers
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Site Reliability Engineer

10 days ago
Full-time
United States
Key requirements: 10 years of experience, Distributed systems, AWS, SLO-driven engineering, Observability ecosystems, Infrastructure as Code (Terraform), Incident management, Technical leadership, Cross-functional influence, Performance engineering, Resilience testing
Fieldguide

Fieldguide is a San Francisco-based B2B SaaS platform that automates workflows for audit and advisory firms, enhancing productivity and client relationships in the assurance industry.

Remote policy: Fieldguide is a remote-first company that primarily hires candidates based in the United States for remote roles, with team members working across various time zones (UTC-8 to UTC-5).

Senior Site Reliability Engineer

10 days ago
Full-time
United States
Key requirements: 5 years of experience, AWS, Distributed systems, Observability platforms, SLOs/SLIs, Terraform, System performance, Incident response, Automation scripting, Collaboration skills
Fieldguide

Fieldguide is a San Francisco-based B2B SaaS platform that automates workflows for audit and advisory firms, enhancing productivity and client relationships in the assurance industry.

Remote policy: Fieldguide is a remote-first company that primarily hires candidates based in the United States for remote roles, with team members working across various time zones (UTC-8 to UTC-5).

Sr Cloud Engineer | Europe remote

10 days ago
Full-time
Europe
Key requirements: Azure, Infrastructure design, Terraform, Kubernetes, Security-first mindset, Scale experience, Architectural judgement, Growth mindset
n8n

n8n is a Germany-based B2B SaaS workflow automation platform that integrates over 400 tools and services, targeting industries like IT and fintech with a focus on low-code and AI functionalities.

Remote policy: n8n is a remote-first company, allowing team members to work from anywhere within Europe, with regular off-sites to foster team bonding. We welcome applications from diverse locations across Europe.

Senior Cloud Support Engineer

12 days ago
Full-time
United States, United Kingdom, Singapore, India
$100,000 to $120,000 per year
Key requirements: 3 years of experience, AWS, Kubernetes, GraphQL, CLI tools, Python, JavaScript, Multi-agent systems, Prometheus, Grafana, Infrastructure as Code, Search Technologies
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Staff Infrastructure Security Engineer (APAC, EMEA)

13 days ago
Full-time
Europe, Asia
$120,000 to $180,000 per year
Key requirements: Cloud infrastructure security (AWS/GCP/Azure), Kubernetes, Security tooling (Go, Python, Ruby), Infrastructure-as-Code security (Terraform, Ansible, CloudFormation), AI in security workflows, Leading multi-team technical initiatives, Security certifications (FedRAMP, ISO 27001, SOC 2, PCI-DSS)
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Engineer, Observability (Remote)

14 days ago
Full-time
United States
$87,000 to $107,000 per year
Key requirements: 4 years of experience, Session replay tools, Front-end web development, HTML, CSS, JavaScript, Customer struggle analysis, Technical liaison, Data literacy, Website KPIs, Collaboration with engineering teams, Proactive communication
Abercrombie and Fitch Co.

Abercrombie & Fitch Co. is a New Albany, Ohio-based global omnichannel specialty retailer of lifestyle apparel and accessories, primarily targeting preteens to millennials across North America, Europe, Asia, and the Middle East.

Senior Site Reliability Engineer

14 days ago
Full-time
United States, Canada
$145,000 to $185,000 per year
Key requirements: 5 years of experience, Terraform, AWS (VPC, IAM, EKS, S3, CloudWatch), Kubernetes, CI/CD (GitHub Actions, ArgoCD), Networking (CIDR, DNS, load balancing), Observability (Prometheus, Grafana), Python, Bash, Windows workloads on Kubernetes, GPU scheduling
Parallel Domain

Parallel Domain is a San Francisco-based B2B SaaS provider specializing in synthetic data generation and simulation tools for autonomous systems, including self-driving vehicles and robotics.

Senior Infrastructure Software Systems Engineer

15 days ago
Full-time
India
Key requirements: 9 years of experience, Distributed systems design, Building scalable infrastructure, API and data model definition, Job orchestration, Performance optimization, Python, C++, Go, Linux systems, Developer productivity improvement, Chip-design understanding
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Service Reliability Engineer - EDA Infrastructure

15 days ago
Full-time
India
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Linux administration, Ansible, Python, Kubernetes, SLURM, large-scale cluster management, observability tools, incident management tools, AWS, Azure, GCP
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Intermediate Site Reliability Engineer, Cloud Cost Utilization

16 days ago
Full-time
Worldwide
$90,000 to $130,000 per year
Key requirements: Cloud cost management, GCP, AWS, FinOps FOCUS, Resource tagging strategies, Infrastructure as code, Terraform, Ansible, Observability tooling, Grafana, Cross-functional collaboration
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Senior Cloud Infrastructure Engineer - GeForce Now

18 days ago
Full-time
United States
$168,000 to $322,000 per year
Key requirements: 8 years of experience, AWS, Kubernetes, Ansible, HashiCorp Vault, AI-assisted coding tools, Cassandra or DynamoDB, CI/CD pipelines, Zero Trust networks, Python, Bash, Go
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

NVIDIA GPU Cloud Infrastructure - Platform Engineer

18 days ago
Full-time
Germany
Key requirements: 5 years of experience, Linux, Windows, Server hardware diagnostics, Network protocols, Basic switch setup, Cloud infrastructure experience, Automation and provisioning knowledge, Relationship building with IT teams
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Sr. Platform Engineer I

18 days ago
Full-time
United States
$115,000 to $165,000 per year
Key requirements: 4 years of experience, AWS (EC2, EKS, VPC, IAM, S3, RDS, Lambda, CloudWatch), Kubernetes administration, Terraform, Istio or similar service mesh, GitOps tools (ArgoCD, Flux), CI/CD tools (GitHub Actions, Jenkins, GitLab CI), HashiCorp Vault, Python, Bash, or Go scripting proficiency, Oncall experience, Incident response capabilities
Smarsh

Smarsh Inc. is a Portland-based B2B SaaS provider specializing in digital communications governance and compliance solutions for regulated industries.

Remote policy: Smarsh supports remote work for various roles, including positions available for candidates in the United States. The company values a diverse workforce and encourages applications from individuals across different regions.

Senior System Reliability Engineer

19 days ago
Full-time
Taiwan
Key requirements: 5 years of experience, Reliability testing methodologies, FMEA, DoE, Thermal Cycling, Mechanical Shock and Vibration, ALT/HALT/HASS, Burn-in, Ongoing Reliability Testing (ORT), Statistical analysis, Reliability modeling, Life data analysis, Project management, Fluency in Chinese and English
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Infrastructure Engineer

19 days ago
Full-time
Worldwide
$133,500 to $212,000 per year
Key requirements: 5 years of experience, Kubernetes, AWS, Infrastructure-as-Code (Terraform), Java, Python, Mentoring, Cross-team initiatives
Iterable

Iterable is a San Francisco-based B2B SaaS platform specializing in AI-driven cross-channel customer engagement solutions for brands worldwide.

Remote policy: Iterable embraces a flexible remote work culture, welcoming candidates from various regions worldwide, including those with remote employees across multiple countries. Team members collaborate across time zones to foster a diverse and inclusive workplace.

Senior Platform and EngOps Engineer - Cluster Operations

20 days ago
Full-time
India
Key requirements: 5 years of experience, Ansible, Python, Shell Scripting, NVLink, InfiniBand, Slurm, GPU-focused hardware, Compute Clusters, Metrics collection, Large scale networking
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.