Remote Site Reliability Engineer Jobs

Explore 62 recent remote Site Reliability Engineer jobs, whether you work from home or from anywhere in the world.

Latest Site Reliability Engineer Jobs (62) - Page 2

Cloud Reliability & Recovery Engineer

16 days ago
Full-time
United States, Canada, United Kingdom, Singapore, India, Ireland, Finland
$100,000 to $150,000 per year
Key requirements: 5 years of experience, AWS expertise, Disaster Recovery architecture, Multi-region failover, Terraform, Kubernetes, CI/CD pipelines, Python scripting, AWS Backup administration, Chaos engineering, Business Continuity Planning
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Staff Database Reliability Engineer

17 days ago
Full-time
United States
$200,000 to $250,000 per year
Key requirements: PostgreSQL, Django ORM, AWS DMS, pganalyze, CloudWatch, Honeycomb, AI coding tools, OpenSearch, Redis, SQS, RabbitMQ, Python, Terraform, Cross-team leadership, Automation
Scribe

Scribe is a San Francisco-based B2B SaaS platform specializing in workflow documentation and optimization, serving over 5 million users across 600,000 businesses globally.

Senior Infrastructure Engineer, Government Systems

17 days ago
Full-time
North America, Middle East
Key requirements: Kubernetes, Terraform, AWS, CI/CD, GitOps, Linux administration, Operational mindset, Security compliance
Chainalysis

Chainalysis is a New York City-based B2B blockchain analysis firm specializing in compliance and investigation software for the cryptocurrency and financial sectors, serving clients globally.

Remote policy: Chainalysis supports remote work and is open to hiring from various regions, including North America and the Middle East, with team members located across multiple countries.

Senior Software Engineer, AV Mapping Infrastructure

18 days ago
Full-time
United States
$152,000 to $287,500 per year
Key requirements: 5 years of experience, AWS, Kubernetes, Cloud services management, Application containers, Monitoring systems (Prometheus, Datadog), Middleware systems (Redis, MongoDB, Kafka, HBase, Postgres, ElasticSearch), CI/CD deployment strategies, Networking fundamentals, Linux proficiency
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer II

18 days ago
Full-time
Mexico
Key requirements: 3 years of experience, Python, Go, Distributed Systems Expertise, Reliability Engineering Mindset, Observability & Incident Response, Cross-functional Communication, Operational Tooling & AI Fluency, Leadership & Mentorship
EarnIn

EarnIn is a fintech company headquartered in the US, specializing in earned wage access (EWA) through a mobile app that provides financial tools for hourly workers.

Senior Site Reliability Engineer

18 days ago
Full-time
Mexico
$100,000 to $150,000 per year
Key requirements: 4 years of experience, Python, Go, Distributed Systems Expertise, Reliability Engineering Mindset, Observability & Incident Response, Cross-functional Communication, Operational Tooling & AI Fluency, Leadership & Mentorship
EarnIn

EarnIn is a fintech company headquartered in the US, specializing in earned wage access (EWA) through a mobile app that provides financial tools for hourly workers.

Senior Site Reliability Engineer, Infrastructure Foundations

18 days ago
Full-time
Worldwide
$113,082 to $175,725 per year
Key requirements: 6 years of experience, Puppet, Kubernetes, Python, Linux system-level troubleshooting, Infrastructure security management, Incident response leadership, Automation of tasks and processes, Monitoring and logging infrastructure (Prometheus, Grafana), Open source software contribution, Security incident technical response
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.

Infrastructure Engineer

18 days ago
Full-time
United States
Key requirements: AWS, Terraform, Kubernetes, CI/CD, Go, TypeScript, Rust, Postgres, Redis, Kafka, Datadog, Grafana, Sentry, CloudWatch, AWS Nitro Enclaves
Bastion

Bastion is a fintech B2B platform headquartered in New York City, specializing in Stablecoin-as-a-Service for financial institutions and enterprises.

Platform Engineer (Database Reliability) - Remote Canada

18 days ago
Full-time
Canada
Key requirements: 5 years of experience, MySQL management, Cloud infrastructure (GCP), Kubernetes, Terraform, Linux systems administration, Monitoring and observability, Scripting (Bash, Go, Python, JavaScript), Incident response, Operational best practices
Bold Commerce

Bold Commerce is a Winnipeg-based B2B SaaS provider specializing in e-commerce applications for Shopify merchants, focusing on tools that enhance online store performance and sales.

Remote policy: Bold Commerce supports remote work from anywhere in Canada and the United States, allowing for flexible work arrangements across these regions.

Platform Engineer (Database Reliability) - Remote Canada

18 days ago
Full-time
Canada
Key requirements: 5 years of experience, MySQL management, Cloud infrastructure (GCP), Kubernetes, Terraform, Linux systems administration, Monitoring and observability, Scripting (Bash, Go, Python, JavaScript), Incident response, Configuration management (Ansible)
Bold Commerce

Bold Commerce is a Winnipeg-based B2B SaaS provider specializing in e-commerce applications for Shopify merchants, focusing on tools that enhance online store performance and sales.

Remote policy: Bold Commerce supports remote work from anywhere in Canada and the United States, allowing for flexible work arrangements across these regions.

Senior Site Reliability Engineer, Observability

19 days ago
Full-time
Argentina
Key requirements: 5 years of experience, OpenTelemetry, Datadog, SLOs/SLIs, AWS, GCP, Docker, Kubernetes, Terraform, TypeScript, Node, Go, AI-powered automation, Distributed systems, Observability practices
Webflow

Webflow is a San Francisco-based SaaS company providing a Website Experience Platform (WXP) that empowers B2B marketing teams to design, manage, and optimize custom websites for a global audience.

Remote policy: Webflow operates on a remote-first model, primarily hiring from the United States and Canada, including British Columbia and Ontario, with team members collaborating across various time zones.

Senior System Software Engineer - DevOps and Infrastructure Automation

20 days ago
Full-time
United States
$224,000 to $356,500 per year
Key requirements: 7 years of experience, Kubernetes expertise, CI/CD (GitLab CI, GitHub Actions), IaC (Terraform, Ansible, Helm, Crossplane), Observability stacks (Prometheus, Grafana, Loki), MLOps experience, GPU software stacks (CUDA, cuDNN, TensorRT), Debugging complex issues (kernel modules, container runtimes)
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Cloud Operations Engineer

20 days ago
Full-time
United States
$110,000 to $127,000 per year
Key requirements: 7 years of experience, Microsoft Azure, AWS, Linux, Office 365, Intune, AWS GovCloud, Cloud security tools, System Administration, Networking (TCP/IP, SSH, VPN), Automation tools, Analytical skills, Interpersonal communication, Project management, Government cloud experience
CyberSheath

CyberSheath is a managed security services provider specializing in cybersecurity compliance for the U.S. Defense Industrial Base (DIB), operating as a B2B entity focused on DoD contractors.

Infrastructure Engineer

21 days ago
Full-time
Worldwide
Key requirements: 5 years of experience, Kubernetes, Cloud security fundamentals, Production infrastructure in DevOps/DevSecOps, Full-stack engineering, Major cloud providers, Packaging workloads for air-gapped environments, Ability to obtain SECRET clearance
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Site Reliability Engineer

21 days ago
Contract
United States, Canada
Key requirements: 3 years of experience, Terraform, Prometheus, Grafana, CI/CD, Incident Response, NIST SP 800-53, Python, Docker, Kubernetes, AWS, Azure, GCP
Arctiq

Arctiq is a Toronto-based B2B DevOps and cloud solution integrator specializing in professional IT services and managed services for enterprise organizations across North America.

Senior Site Reliability Engineer

22 days ago
Full-time
United States
Key requirements: Python, Java, C++, Go, Linux systems, AWS, GCP, Azure, Docker, Kubernetes, Terraform, CI/CD pipelines, Observability tools, SLIs, SLOs, AI-augmented development tools
PlayOn

PlayOn is a B2B platform specializing in high school sports streaming, digital ticketing, and community engagement solutions for schools across the US.

Senior Site Reliability Engineer

22 days ago
Contract
United States
Key requirements: 7 years of experience, SRE or DevOps, Go, Python, Java, Linux internals, Kubernetes, Cloud architectures, SLO definition, Government RMF processes, Security-as-code integration, Mentorship
Arctiq

Arctiq is a Toronto-based B2B DevOps and cloud solution integrator specializing in professional IT services and managed services for enterprise organizations across North America.

Infrastructure Engineer (India)

25 days ago
Full-time
India
Key requirements: 7 years of experience, AWS, Azure, GCP, Terraform, CloudFormation, Ansible, Kubernetes, Docker Swarm, Prometheus, Grafana, ELK Stack, Python, Node.js, Bash, Go, Ruby, CI/CD pipelines, Mentoring, Security best practices, Performance optimization
Articul8 AI

Articul8 AI is a generative AI software company specializing in enterprise-scale AI solutions for regulated industries, operating on a B2B model targeting sectors like aerospace, oil and gas, and manufacturing.

Infrastructure Engineer (Brazil)

25 days ago
Full-time
Brazil
Key requirements: 7 years of experience, AWS, Azure, GCP, Terraform, Ansible, Kubernetes, Docker Swarm, Prometheus, Grafana, Python, Node.js, Bash, Go, Ruby
Articul8 AI

Articul8 AI is a generative AI software company specializing in enterprise-scale AI solutions for regulated industries, operating on a B2B model targeting sectors like aerospace, oil and gas, and manufacturing.

Senior DevOps / Platform Reliability Engineer

25 days ago
Full-time
Worldwide
$120,000 to $160,000 per year
Key requirements: 5 years of experience, GitHub Actions, Terraform, Kubernetes (EKS), Cloudflare, Prometheus, Grafana, OpenTelemetry, AWS networking, CI/CD pipelines, AI-native DevOps, Lambda, Kafka/MSK, Security best practices, Auto-remediation agents, Model Context Protocol (MCP)
Zingtree

Zingtree is an AI-powered B2B SaaS platform specializing in interactive decision trees for customer experience management and workflow automation in the customer service industry.

Remote policy: Zingtree supports flexible remote work, allowing employees to work from anywhere, although specific hiring locations are not detailed.

Senior Site Reliability Engineer (Remote USA)

25 days ago
Full-time
United States
$149,100 to $157,800 per year
Key requirements: 7 years of experience, AWS (EKS, Lambda, CloudWatch), AI workload reliability, SLOs, SLIs, error budgets, Terraform, GitOps, Datadog, CI/CD pipeline design, Agent loop observability, Blast radius management, LLM infrastructure reliability, Observability for AI workloads, Internal Developer Platform (IDP), Mentorship of junior engineers
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

Senior Site Reliability Engineer - Observability and Telemetry Platform

25 days ago
Full-time
United States
$176,000 to $333,500 per year
Key requirements: 8 years of experience, Infrastructure automation, Distributed systems design, Python, Kubernetes, OpenStack, Grafana, Observability platforms, Systematic problem-solving, Strong communication skills
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Solutions Architect, OEM AI Factory Infrastructure

25 days ago
Full-time
United States
$152,000 to $241,500 per year
Key requirements: 5 years of experience, NVIDIA GPUs, Python, Cluster Administration, HPC/AI workloads, DevOps, Slurm, Data Science, Computer Architecture
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Cloud Security Engineer

26 days ago
Full-time
United States
$158,900 to $238,300 per year
Key requirements: Service mesh expertise (Istio, Kong), API gateway management (Kong, Amazon API Gateway), mTLS deployment in Kubernetes, PKI and certificate lifecycle management, Zero-trust architecture experience, Kubernetes security hardening, Scripting in Python and Go, Multi-cloud experience (AWS, GCP), Security governance frameworks, Automation of security processes, Monitoring tools (Grafana, Datadog)
Sony Interactive Entertainment

Sony Interactive Entertainment is a San Mateo-based global video game and digital entertainment company, primarily B2C, known for the PlayStation brand and its innovative gaming hardware, software, and network services.

Remote policy: Sony Interactive Entertainment supports flexible remote work arrangements, hiring from various regions globally, including locations such as the USA, UK, and Japan.

Senior Site Reliability Engineer (Remote Poland)

26 days ago
Full-time
Poland
18,800 to 20,000 PLN per month
Key requirements: 7 years of experience, AWS (EKS, Lambda, CloudWatch), AI workload reliability, Terraform, GitOps, Datadog, LLM observability, CI/CD pipeline design, Agent loop observability, Blast radius management, Service catalog ownership, Incident response leadership, Mentorship of junior engineers, Internal Developer Platform (IDP)
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

Principal Platform Infrastructure Engineer (Containers)

27 days ago
Full-time
Canada
$141,000 to $249,000 per year
Key requirements: Kubernetes expertise, Terraform, Google Cloud Platform, GitOps methodologies, Python, Bash, Go, Network protocols, Observability solutions
Menlo Security

Menlo Security is a Mountain View, CA-based B2B cybersecurity company specializing in secure enterprise browser solutions that protect against phishing and malware for government agencies and global enterprises.

Remote policy: Menlo Security supports remote work and hires from various regions globally, with team members located in places such as India and the United States, allowing for collaboration across time zones.

Senior Software Engineer - SRE

27 days ago
Full-time
Worldwide
Key requirements: AWS expertise, Kubernetes fundamentals, Terraform at scale, Production-quality code in Go/Python, CI/CD pipeline experience, GitHub Actions, ArgoCD, Observability principles, Datadog or similar tool, SLIs/SLOs for reliability
Socure

Socure is a U.S.-based B2B SaaS provider specializing in AI-driven identity verification and fraud prevention solutions for enterprises across financial services, e-commerce, and government sectors.

Remote policy: Socure is a fully remote organization, supporting team members across various locations, with some roles requiring in-person engagement in specific regions such as Washington, D.C.

Senior Site Reliability Engineer (Remote UK)

27 days ago
Full-time
United Kingdom
£77,600 to £82,200 per year
Key requirements: 7 years of experience, AWS (EKS, Lambda, CloudWatch), AI workload reliability, Terraform, GitOps, Datadog, CI/CD pipeline design, Agent loop observability, Blast radius management, LLM infrastructure reliability, Service catalog ownership, Incident response leadership, Observability for AI workloads, Mentorship of junior engineers
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

NCX Engineer, AI Accelerator

27 days ago
Full-time
United States
$184,000 to $287,500 per year
Key requirements: 8 years of experience, Linux systems, Distributed computing, Kubernetes, GPU scheduling, AI/ML experience, Python, Go, PyTorch, TensorFlow, MLOps, Infrastructure as code, NVIDIA ecosystem, Collaboration with NVIDIA Cloud Partners, Integration with enterprise systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Site Reliability Engineer

28 days ago
Full-time
United States
$168,000 to $270,250 per year
Key requirements: 8 years of experience, Incident Commander experience, Distributed systems understanding, Automation for incident management, AI/ML application in operations, Reducing MTTD and MTTR, Building incident management platforms, Real-time incident leadership
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.