Remote Site Reliability Engineer Jobs

Explore 66 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Latest Site Reliability Engineer Jobs (66)

Incident Manager

about 9 hours ago
Full-time
United States
Key requirements: 5 years of experience, Incident management, SaaS experience, Healthcare experience, ITIL frameworks, PagerDuty, Jira, Datadog, Root Cause Analysis, Cross-functional leadership, Strategic documentation, Process improvement
SmithRx

SmithRx is a health-tech B2B Pharmacy Benefit Manager (PBM) focused on providing transparent pharmacy benefits and cost-saving solutions to employers and health plans across the U.S.

Platform Engineer

about 21 hours ago
Full-time
United Kingdom
Key requirements: Kubernetes (EKS), AWS core services, GitOps (ArgoCD), CI/CD tools, Airflow, DevOps/SRE practices, IaC Tooling (Terraform), Linux
sonyinteractiveentertainmentglobal

Sony Interactive Entertainment is a San Mateo-based global video game and digital entertainment company, primarily B2C, known for the PlayStation brand and its innovative gaming hardware, software, and network services.

Remote policy: Sony Interactive Entertainment supports flexible remote work arrangements, hiring from various regions globally, including locations such as the USA, UK, and Japan.

Senior Software Engineer, SRE

1 day ago
Full-time
Worldwide
Key requirements: AWS expertise, Terraform, Kubernetes fundamentals, Amazon EKS, Go, Python, CI/CD pipelines, GitHub Actions, ArgoCD, Datadog, SLIs/SLOs
Socure

Socure is a U.S.-based B2B SaaS provider specializing in AI-driven identity verification and fraud prevention solutions for enterprises across financial services, e-commerce, and government sectors.

Remote policy: Socure is a fully remote organization, supporting team members across various locations, with some roles requiring in-person engagement in specific regions such as Washington, D.C.

Support Engineer (Java)

3 days ago
Full-time
United States
$140,000 to $155,000 per year
Key requirements: 3 years of experience, federal enclave experience, AWS production environment, strong troubleshooting skills, scripting ability (Python, Bash, PowerShell), experience with monitoring platforms (Kibana, Grafana, Datadog, Splunk), CI/CD pipeline experience
Smarsh

Smarsh Inc. is a Portland-based B2B SaaS provider specializing in digital communications governance and compliance solutions for regulated industries.

Remote policy: Smarsh supports remote work for various roles, including positions available for candidates in the United States. The company values a diverse workforce and encourages applications from individuals across different regions.

Cloud & AI Operations Administrator

3 days ago
Full-time
Canada
$67,000 to $82,000 per year
Key requirements: 5 years of experience, Azure, PowerShell, Azure CLI, Azure Active Directory, AI services management, CCNA, Microsoft Teams, Office365, Cisco Meraki Solutions
Heart & Stroke

Heart & Stroke is a Canadian non-profit organization headquartered in Ontario, dedicated to preventing heart disease and stroke through research, advocacy, and public education, targeting diverse communities across Canada.

Remote policy: Heart & Stroke supports remote work for candidates residing in Canada, with the requirement to travel to the Toronto office when requested.

General Application for Engineering

3 days ago
Contract
Brazil
Key requirements: Python, Go, Typescript, ETL, AWS, Kubernetes, Terraform, Agile, Learning Agility, Collaborative Spirit, Ownership
Loadsmart

Loadsmart is a Chicago-based logistics technology company specializing in innovative freight management solutions for the B2B market.

Remote policy: Loadsmart supports remote work and has a globally distributed team, currently hiring for remote positions in Brazil.

Senior Systems Software Engineer, AI Infrastructure

4 days ago
Full-time
United States
$184,000 to $287,500 per year
Key requirements: 5 years of experience, Python, C/C++ or Go or Perl or Ruby, Linux or Windows systems engineering, AWS or Azure or GCP or OCI, SRE principles, Terraform CDK, observability platforms, CI/CD systems, AI training and inferencing, deep learning frameworks, distributed systems, cloud or hardware health monitoring
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Software Engineer - Infrastructure

5 days ago
Full-time
United States
Key requirements: 8 years of experience, AWS infrastructure at scale, Infrastructure as Code (Terraform), Designing and operating distributed systems, CI/CD systems and automation pipelines, Containerized environments (Docker), Observability and performance tuning, Experience architecting infrastructure for multiple payment rails, Understanding of payment rail mechanics, Experience scaling high-volume financial workloads, Familiarity with security and risk standards for money movement
Modern Treasury

Modern Treasury is a San Francisco-based fintech B2B company specializing in payment operations tools, offering APIs and dashboards for automating money movement for enterprises.

Remote policy: Modern Treasury supports remote work for certain roles and primarily hires from the United States, with team members located in cities such as San Francisco and New York.

Intermediate Site Reliability Engineer, Environment Automation

5 days ago
Full-time
Worldwide
Key requirements: Golang, Kubernetes, Terraform, Ansible, Infrastructure as Code, SaaS experience, Multi-tenant environments, Observability stack, Automation mindset, Git-based workflows
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

AI Platform Engineer

6 days ago
Full-time
United States
$168,000 to $322,000 per year
Key requirements: 10 years of experience, Python, Kubernetes, AI/ML platforms, Distributed systems, Observability design, AI-assisted development tools, Automation-first approach, AI-native infrastructure roadmaps
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer

6 days ago
Full-time
United States, Canada
Key requirements: 10 years of experience, Cloud-native infrastructure, Kubernetes deployment, Terraform, Ansible, IaC / GitOps tooling, On-prem compute knowledge, Platform optimization, Distributed systems troubleshooting, Python, Bash, CUDA-based GPU programs, Security in sensitive environments
Planet

Planet Labs is a San Francisco-based B2B company specializing in satellite imagery and earth data analytics for applications in agriculture, environmental monitoring, and disaster response.

Remote policy: Planet hires remotely from the United States, supporting team members to work from anywhere within the country.

Site Reliability Engineer

6 days ago
Full-time
United States, Canada
$160,600 to $200,800 per year
Key requirements: 10 years of experience, Cloud-native infrastructure, Kubernetes deployment, Terraform, Ansible, IaC / GitOps tooling, On-prem compute knowledge, Platform optimization, Distributed systems troubleshooting, Python, Bash, CUDA-based GPU programs, Security in sensitive environments
Planet

Planet Labs is a San Francisco-based B2B company specializing in satellite imagery and earth data analytics for applications in agriculture, environmental monitoring, and disaster response.

Remote policy: Planet hires remotely from the United States, supporting team members to work from anywhere within the country.

Senior Infrastructure Engineer

6 days ago
Full-time
United States, Canada
$200,700 to $250,900 per year
Key requirements: 5 years of experience, AWS, Terraform, Prometheus, Grafana, OpenTelemetry, Technical writing, Infrastructure project leadership, Regulated environments experience, Large-scale Terraform implementations, Coding experience
Mercury

Mercury Bank is a B2B financial institution offering simplified business banking services, headquartered in an unspecified location, targeting businesses primarily in the North American market.

Remote policy: Mercury Marine offers remote work opportunities for various positions, with current roles available for candidates in Canada and the United States. For specific hiring regions and remote work policies, please refer to the company's careers page.

Senior Site Reliability Engineer, EU or UK

7 days ago
Full-time
Europe
Key requirements: Linux systems management, AWS or Azure or Google Cloud, Docker, CI/CD pipeline, Prometheus or OpenTelemetry or eBPF, Cloud security and IAM policies, Python, Automation and API coding
Auros

Auros is a Hong Kong-based B2B cryptocurrency market making firm specializing in high-frequency trading and liquidity provision services for the global digital asset market.

Remote policy: Auros Global embraces a hybrid work model, allowing remote and flexible work arrangements while hiring from various regions globally, including the UK and EU.

Senior Platform Engineer

7 days ago
Full-time
Worldwide
$100,000 to $150,000 per year
Key requirements: AWS, Kubernetes, Terraform, CI/CD pipelines, Go, SRE principles, AI-assisted engineering tools, Cloud security, Production observability technologies
Trust Wallet

Trust Wallet is a leading B2C multi-chain, non-custodial cryptocurrency wallet enabling users to manage over 10 million digital assets, headquartered remotely with a global user base.

Remote policy: Trust Wallet operates as a fully remote company, hiring globally with team members working from various countries. Candidates must have the right to work in their respective locations.

Senior Site Reliability Engineer

7 days ago
Full-time
Worldwide
$113,082 to $175,725 per year
Key requirements: 6 years of experience, Puppet, Kubernetes, Python, Linux troubleshooting, Distributed caching systems, TCP/IP, HTTP, TLS, DNS, Incident response, Automation of tasks, Monitoring tools (Prometheus, Grafana)
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.

Escalation Engineer

7 days ago
Full-time
India
Key requirements: 4 years of experience, Strong networking fundamentals, Expert troubleshooting skills, Linux, Troubleshooting tools (IXIA, tcpdump, Wireshark, iPerf3), Python, Terraform or other IaC tools, AWS/Azure/GCP Cloud networking, Cloud-native architecture, Strong leadership ability, Ability to learn new technologies
Alkira, Inc.

Alkira, Inc. is a California-based B2B cloud networking company specializing in Network Infrastructure as a Service (NaaS) for enterprises, offering hybrid and multi-cloud connectivity solutions globally.

Senior Software Engineer, Infrastructure Automation and Distributed Systems

8 days ago
Full-time
North America
$224,000 to $431,250 per year
Key requirements: 12 years of experience, Infrastructure automation, Distributed systems design, Python, Go, Perl, Ruby, Linux, Networking, Storage, Containers, Multi-cloud infrastructure, Kubernetes, OpenStack, Docker, Slurm, NVIDIA Collective Communication Library (NCCL)
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Software Engineer - SRE (Remote)

8 days ago
Full-time
United States
Key requirements: 8 years of experience, Site Reliability Engineering, DevOps, Kubernetes, AWS, On-call experience, Monitoring and alerting, Collaboration across teams
Rula

Rula Health is a remote-first B2C telehealth SaaS platform based in the U.S., specializing in online therapy and psychiatry services for individuals aged 5 and older, addressing over 90 mental health conditions.

Remote policy: Rula Health is a 100% remote-first company, hiring primarily in the United States, with the exception of Hawaii.

Senior Site Reliability Engineer

8 days ago
Full-time
India
Key requirements: 5 years of experience, AWS, Kubernetes, Terraform, Docker, CI/CD, Bash, Python, Authentication technologies, Monitoring tools, Data pipelines with Databricks, Infrastructure cost management
Teikametrics

Teikametrics is a US-based B2B SaaS platform specializing in AI-driven marketplace optimization for e-commerce brands, helping them maximize profitability on platforms like Amazon and Walmart.

Remote policy: Teikametrics embraces a remote-first culture, hiring talented individuals across 25 states in the USA, as well as in China and India, allowing flexibility for employees to work when they are most productive.

Senior Site Reliability Engineer

8 days ago
Full-time
India
Key requirements: 3 years of experience, AWS, Kubernetes, Terraform, Docker, CI/CD, Bash, Python, Authentication technologies, Monitoring tools, DevOps best practices, On-call support
Teikametrics

Teikametrics is a US-based B2B SaaS platform specializing in AI-driven marketplace optimization for e-commerce brands, helping them maximize profitability on platforms like Amazon and Walmart.

Remote policy: Teikametrics embraces a remote-first culture, hiring talented individuals across 25 states in the USA, as well as in China and India, allowing flexibility for employees to work when they are most productive.

FBS AIOps Engineer

8 days ago
Full-time
Worldwide
Key requirements: 2 years of experience, Dynatrace, Copilot Studio, AIOps platform design, Event correlation, Runbook automation, ITSM integration, Anomaly detection, Consultative partnership
Capgemini

Capgemini is a Paris-based B2B IT services and consulting company specializing in digital transformation, technology services, and consulting, serving diverse industries globally.

Remote policy: Capgemini supports flexible work arrangements, including remote and office-based options, and operates globally across more than 40 countries, welcoming applicants from various regions.

Senior Production Engineer

9 days ago
Full-time
United States
$165,000 to $195,000 per year
Key requirements: 5 years of experience, AWS, Kubernetes (EKS), Terraform, Go, Python, CI/CD systems, Observability tools, SLIs/SLOs implementation, GenAI tools
Legion

Legion is a remote B2B SaaS provider specializing in intelligent automation workforce management solutions for labor-intensive industries, headquartered in the United States.

Sr Cloud Engineer (Contract-to-Hire)

9 days ago
Contract
United States
$140,000 to $165,000 per year
Key requirements: 5 years of experience, Cloud-native solutions, HITRUST & SOC2 compliance, Infrastructure automation, Containerization (Docker, Kubernetes), DevSecOps principles, CI/CD pipelines (Azure DevOps), Observability platforms (Datadog), Multi-cloud deployment, Stateful database infrastructure, Microservices architecture
Lirio

Lirio is a U.S.-based healthtech B2B SaaS company specializing in behavioral health interventions and personalized care navigation through its AI-driven platform.

Remote policy: Lirio supports remote work with opportunities for hybrid arrangements for candidates located in Tennessee. Currently, hiring is focused on candidates authorized to work in the US.

Application Support Engineer

9 days ago
Full-time
United States
$100,000 to $120,000 per year
Key requirements: 2 years of experience, Microservices architectures, Full-stack troubleshooting, Log analysis tools, Incident management methodologies, FinTech experience, Jira, Confluence, Scripting skills, Customer satisfaction focus, High-pressure problem-solving, Excellent communication skills
Lumin Digital

Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.

Remote policy: Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.

Site Reliability Engineer

10 days ago
Full-time
United States
$110,000 to $175,000 per year
Key requirements: 8 years of experience, Linux administration, Python, Cloud platforms (OCI, AWS, GCP), Configuration management (Ansible, Puppet), Database administration (MySQL, MongoDB, PostgreSQL), Production support for large-scale environments, Advanced scripting (Perl, Bash), DevOps tools (Docker, K8s, Gitlab CICD, Jenkins, Terraform), Monitoring best practices (ELK stack, Prometheus, Nagios, Grafana), Technical project leadership
Ooma, INC

Ooma, Inc. is a Sunnyvale-based telecommunications company offering cloud-based VoIP and unified communications services as a SaaS provider, targeting both B2B and B2C markets across the US and Canada.

Site Reliability Engineer - India

10 days ago
Full-time
India
Key requirements: Kubernetes, Docker, Java, Python, Continuous Delivery tools, Unix, Infrastructure components, DataDog monitoring, Automation of operational work, Self-healing patterns, Resiliency patterns
Zimperium

Zimperium, Inc. is a Dallas-based B2B cybersecurity company specializing in mobile security solutions for enterprises, offering real-time protection against mobile threats on iOS and Android devices.

Remote policy: Zimperium supports remote work and is currently hiring for various roles, including remote positions in regions such as India. For specific hiring locations and remote work details, please refer to the company's official careers page.

Software Reliability Engineer - LPU Hardware DataFlow

11 days ago
Full-time
Europe
Key requirements: 8 years of experience, Reliability engineering, Hardware testing, Driver testing, Functional programming (Haskell, Nix), System programming (C++, Rust, Java), Linux scripting (Python, Shell), Automated test pipelines, CI/CD experience, GPU reliability testing, Hardware durability testing, Driver development, Kernel debugging, Reliability standards knowledge
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Network Site Reliability Engineer

11 days ago
Full-time
Asia, Israel
Key requirements: 8 years of experience, Network automation, Prometheus, Grafana, Python, Go, TCP/UDP, BGP, VPN, L2 switching, Firewalls, Load Balancers, SNMP, Syslog, Streaming Telemetry, Mellanox/Cumulus Linux, Palo Alto firewalls, Netscalers, F5 load balancers
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior DevOps / SRE Engineer

11 days ago
Full-time
United States
$120,000 to $150,000 per year
Key requirements: Kubernetes (EKS), Blockchain reliability, Zero-downtime operations, CI/CD pipeline development, Observability tooling, Real-time systems, Infrastructure as Code (IaC), Incident leadership, Security focus
MLabs

MLabs is a remote fintech company specializing in DeFi solutions, providing a unified API for financial institutions to access on-chain liquidity across major blockchains.

Remote policy: MLabs supports remote work for positions located within the EMEA region, allowing for flexible hours and a remote-first environment.