Remote Site Reliability Engineer Jobs

Explore 69 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Latest Site Reliability Engineer Jobs (69)

AI Infrastructure Operations Engineer

1 day ago
Full-time
United States
$120,000 to $140,000 per year
Key requirements: 5 years of experience, Kubernetes operations, Azure AKS, Observability engineering, Incident response, Operational security, AI platform reliability, Cloud-native infrastructure, SRE mindset, Monitoring and logging practices, HIPAA compliance
Private Health Management

Private Health Management is a remote-first B2C healthcare navigation service specializing in personalized patient advocacy and support for complex medical conditions, headquartered in the United States.

Remote policy: Private Health Management (PHM) supports fully remote work and is hiring from various locations, including the United States, allowing team members to work from wherever they call home.

Staff Infrastructure Engineer

2 days ago
Full-time
Worldwide
Key requirements: 5 years of experience, Kubernetes, Cloud security fundamentals, Production infrastructure in DevOps/DevSecOps, Full-stack engineering, Major cloud providers, Packaging workloads for air-gapped environments, Ability to obtain SECRET clearance
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Principal Infrastructure Engineer

2 days ago
Full-time
Worldwide
Key requirements: 8 years of experience, Kubernetes, Cloud security fundamentals, Production infrastructure in DevOps/DevSecOps, Full-stack engineering, Major cloud providers, Packaging workloads for air-gapped environments, Leadership/mentorship, Ability to obtain SECRET clearance
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Staff Cloud Operations Engineer

2 days ago
Full-time
United States
$185,000 to $200,000 per year
Key requirements: 5 years of experience, GCP, Cloud infrastructure engineering, Observability tools, Python, Docker, Kubernetes, AI tools usage, 24/7 on-call availability
Branch

Branch is a remote-first fintech company specializing in workforce payment solutions, headquartered in the mid-Atlantic region, targeting working Americans with B2B/B2C financial services.

Remote policy: Branch is a remote-first company with employees located throughout the USA, emphasizing collaboration and transparency across teams.

Cloud Operations Engineer

2 days ago
Full-time
United States
$135,000 to $150,000 per year
Key requirements: 3 years of experience, GCP, Cloud infrastructure engineering, Observability tools, Python, Docker, Kubernetes, Network security practices, AI tools for automation, 24/7 on-call rotation
Branch

Branch is a remote-first fintech company specializing in workforce payment solutions, headquartered in the mid-Atlantic region, targeting working Americans with B2B/B2C financial services.

Remote policy: Branch is a remote-first company with employees located throughout the USA, emphasizing collaboration and transparency across teams.

Senior AI Infrastructure Engineer - DGX Cloud

3 days ago
Full-time
United States
$152,000 to $287,500 per year
Key requirements: 5 years of experience, Kubernetes, OpenStack, Infrastructure automation, Distributed systems architecture, Python, Go, C/C++, Java, Linux, Networking, Storage, Containers, Infrastructure as Code (IaC), Terraform, Large-scale cloud systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Security Engineer, Platform

3 days ago
Full-time
Europe
$120,000 to $140,000 per year
Key requirements: Securing production infrastructure, Cloud environments, APIs, Multi-tenant SaaS, Incident response, AWS, Terraform, Kubernetes, Email infrastructure, SOC 2 preparation, Security observability, Developer experience
Resend

Resend is a B2B SaaS platform headquartered remotely, specializing in modern email API services for developers, enabling scalable transactional email delivery globally.

Remote policy: Resend is a fully remote company with team members across various countries, primarily hiring from regions aligned with Brazilian and European Time Zones (UTC-3 to UTC+2).

Security Engineer, Platform

3 days ago
Full-time
Europe
$120,000 to $140,000 per year
Key requirements: Production infrastructure security, Cloud environment security, API security, Multi-tenant SaaS security, Code writing and reading, Authentication and authorization, Secrets management, Incident response, Security observability, Email infrastructure security, SOC 2 preparation, High-scale developer product security
Resend

Resend is a B2B SaaS platform headquartered remotely, specializing in modern email API services for developers, enabling scalable transactional email delivery globally.

Remote policy: Resend is a fully remote company with team members across various countries, primarily hiring from regions aligned with Brazilian and European Time Zones (UTC-3 to UTC+2).

Senior Incident Manager

3 days ago
Full-time
United States
Key requirements: 8 years of experience, Critical incident management, High-availability infrastructure, Large-scale GPU clusters, Cloud infrastructure platforms, Incident Command leadership, Incident management frameworks (ITIL, SRE), Incident tracking tools (PagerDuty, ServiceNow), Post-incident reviews (PIRs), Root cause analysis, Crisis communication
Lambda

Lambda is a Seattle-based AI cloud infrastructure provider specializing in scalable GPU resources for AI researchers and enterprises, operating in a B2B model.

Site Reliability Engineer

3 days ago
Full-time
United Kingdom
£90,000 to £110,000 per year
Key requirements: Ansible, AWS, GCP, Financial technology experience, Automated infrastructure provisioning, Continuous Delivery pipeline management, Systems Administration, Network Administration, Vendor relationship management, Collaborative problem-solving
MLabs

MLabs is a remote fintech company specializing in DeFi solutions, providing a unified API for financial institutions to access on-chain liquidity across major blockchains.

Remote policy: MLabs supports remote work for positions located within the EMEA region, allowing for flexible hours and a remote-first environment.

Observability Engineer (Prometheus / Grafana / Datadog)

3 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, SRE principles, High-cardinality metrics, Distributed tracing, CI/CD integration, Linux internals, Container platforms
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

OpenShift Engineer

3 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, OpenShift, Kubernetes internals, Linux administration, Infrastructure-as-code (Ansible, Terraform, Helm), CI/CD pipelines (Tekton, Jenkins, Argo CD), Scripting (Bash, Python, Go), Monitoring and logging tools, Container image security, Multi-tenant OpenShift platforms, Disaster recovery strategies
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Service Mesh Engineer (Istio / Linkerd)

3 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Istio, Linkerd, Envoy, Kubernetes, mTLS, Traffic management policies, Distributed tracing, Go, Python, Multi-cluster deployments
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Senior Infrastructure Engineer

4 days ago
Full-time
North America
$190,000 to $225,000 per year
Key requirements: 5 years of experience, AWS (ECS/Fargate, RDS/Aurora, MSK, DynamoDB), Terraform, CI/CD platform ownership, Observability fundamentals, Developer experience improvement
Rally UXR

Rally UXR is a New York-based B2B SaaS platform that automates user research operations for product, design, and UX research teams, enabling continuous user insights for informed product decisions.

Remote policy: Rally UXR is a remote-first company, hiring primarily from the United States and Canada, with core collaboration hours that overlap across U.S. time zones.

Senior Platform Engineer (SRE) | Hybrid & Remote | USD Salary

4 days ago
Full-time
Pakistan
Key requirements: 7 years of experience, Kubernetes, AWS, Golang, Infrastructure as Code, CI/CD optimization, OpenTelemetry, DevSecOps, Custom Resource Definitions, Production Observability, Developer Experience
HR Ways

HR Ways is a global technical recruitment firm headquartered in Pakistan, specializing in connecting IT talent with software houses and IT product companies across multiple countries.

Remote policy: HR Ways embraces a fully remote work model, hiring globally from various regions including Pakistan, Canada, the US, the UK, and more, allowing team members to collaborate across time zones.

Senior System Reliability Engineer

4 days ago
Full-time
United States
$168,000 to $264,500 per year
Key requirements: 8 years of experience, Reliability testing, FMEA, DoE, Statistical analysis, Hardware reliability, Cross-functional collaboration, Project management
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Kafka Platform Engineer

4 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Apache Kafka, Confluent Platform, Kafka internals, Kafka security, Kafka Connect, Schema Registry, Kafka Streams, HA/DR strategies, Python, Bash, Go, infrastructure-as-code, Terraform, Ansible, observability tooling
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Observability Engineer (Prometheus / Grafana / Datadog)

4 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, SRE principles, High-cardinality metrics, CI/CD integration, Linux internals, Distributed tracing, Observability cost optimization
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Site Reliability Engineer (SRE)

4 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Python, Go, Kubernetes, Prometheus, Grafana, CI/CD pipelines, Distributed systems, Incident response, Chaos engineering, Cloud platforms
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Senior Site Reliability Engineer

5 days ago
Full-time
Worldwide
$113,082 to $175,725 per year
Key requirements: 6 years of experience, Puppet, Kubernetes, Python, Linux system-level troubleshooting, Distributed caching systems, Incident response, Automation of tasks, Open source contribution, Linux kernel tuning, Monitoring infrastructure (Prometheus, Grafana)
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.

Senior Site Reliability Engineer, Wikimedia Enterprise

5 days ago
Full-time
Worldwide
$116,633 to $181,243 per year
Key requirements: SRE best practices, Kubernetes, CI/CD pipelines, GitOps workflows, Infrastructure as Code, Cloud platforms (AWS, GCP), Observability (Prometheus, OpenTelemetry), Incident response, Capacity planning, Automation tools (Terraform, Ansible)
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.

Security Platform Engineer

5 days ago
Full-time
United States
$160,000 to $180,000 per year
Key requirements: 5 years of experience, Claude Code, GitOps patterns, Python, AWS, Kubernetes, Security telemetry, Reliability engineering, Agentic coding tools, Infrastructure as code, Cloud-native systems
Lumin Digital

Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.

Remote policy: Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.

Senior Site Reliability Engineer, GeForce NOW

7 days ago
Full-time
United States
$168,000 to $270,250 per year
Key requirements: 8 years of experience, Kubernetes, Automation, Multi-region cloud deployments, Datadog, Prometheus, Deployment pipelines, Go, Python, Bash scripting, Anomaly detection tools, AI usage in SRE
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Observability Engineer (Prometheus / Grafana / Datadog)

7 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, SRE principles, High-cardinality metrics, Distributed tracing, CI/CD integration, Linux internals, Container platforms
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

DevOps / Infrastructure Engineer

7 days ago
Full-time
North America
$100,000 to $130,000 per year
Key requirements: Tailscale, AWS, Container orchestration, Infrastructure-as-Code, CI/CD, Blockchain familiarity, Financial SRE background
MLabs

MLabs is a remote fintech company specializing in DeFi solutions, providing a unified API for financial institutions to access on-chain liquidity across major blockchains.

Remote policy: MLabs supports remote work for positions located within the EMEA region, allowing for flexible hours and a remote-first environment.

Nutanix Engineer

7 days ago
Full-time
United States
Key requirements: 3 years of experience, Nutanix AOS, Nutanix AHV, Prism management tools, HCI troubleshooting, Client-facing experience, Infrastructure project support, Backup and DR configurations, Managed services experience
MetroSys

MetroSys is a San Diego-based B2B technology solutions and staffing company specializing in IT consulting, backup and recovery solutions, and cloud services for enterprise clients.

Remote policy: MetroSys offers remote work opportunities primarily for candidates located in the United States, with a focus on hiring across the Americas.

Senior Release Engineer

8 days ago
Full-time
Worldwide
Key requirements: 4 years of experience, Kubernetes, Helm, CI/CD pipelines, Infrastructure as Code, GitOps tooling, Security tooling integration, Linux systems, Networking fundamentals, Distributed systems
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Agora - Senior Infrastructure Engineer

9 days ago
Full-time
North America, South America
Key requirements: 5 years of experience, TypeScript, Kubernetes, AWS architecture, Infrastructure-as-code, Distributed systems fundamentals, Reusable systems and abstractions, SLIs/SLOs and incident tooling, Observability practices, GitOps patterns, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Agora - Senior Infrastructure Engineer

9 days ago
Full-time
North America, South America
Key requirements: 5 years of experience, TypeScript, AWS architecture, Kubernetes, Infrastructure-as-code, Distributed systems, Reusable systems, SLIs/SLOs, Observability practices, GitOps, Developer Productivity, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Site Reliability Engineer

10 days ago
Full-time
Worldwide
Key requirements: Linux/Unix, Cloud providers (AWS, Google Cloud, Azure), Infrastructure provisioning (Terraform, CloudFormation, Ansible), Containerization (Docker), Orchestration (Kubernetes), Monitoring tools (Prometheus, Grafana, Datadog), CI/CD pipelines (Jenkins, GitLab CI, CircleCI), Incident management practices, Scripting (Bash, Perl)
OXIO

OXIO is a North America-based B2B Telecom-as-a-Service (TaaS) platform enabling businesses to build and manage customizable mobile networks through a cloud-based solution.

Remote policy: OXIO supports flexible work arrangements and hires from various regions, with team members located in cities such as New York, Mexico City, and Montreal. Candidates are encouraged to apply regardless of their location.