Remote Site Reliability Engineer Jobs

Explore 31 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Latest Site Reliability Engineer Jobs (31)

Senior Infrastructure Security Engineer

about 9 hours ago
Full-time
Worldwide
Key requirements: Cloud security (AWS/GCP/Azure), Container security, Kubernetes hardening, Infrastructure-as-Code security (Terraform, Ansible, CloudFormation), Programming (Go, Python, Ruby), Technical initiative leadership, Security risk identification, AI process automation, High-reliability domain experience (finance, healthcare, government, telecom), Regulatory compliance familiarity (PCI-DSS, FedRAMP, ISO27001, SOC II)
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Site Reliability Engineer

3 days ago
Full-time
United States
$115,000 to $135,000 per year
Key requirements: 2 years of experience, Python, Linux, Relational Databases, SQL, gRPC microservices, Postgres, Pandas, Golang, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes, Production systems support, Code maintainability best practices, Analytical problem-solving, Technical communication, Attention to detail, Growth mindset
The Voleon Group

The Voleon Group is a Berkeley-based quantitative investment management firm specializing in AI-driven trading strategies for institutional investors in the finance industry.

Remote policy: The Voleon Group offers a hybrid work model, allowing for both in-office and remote work, and hires from various regions including the United States, Canada, and the United Kingdom.

*Monitoring and Observability Analyst (Sat, Sun, Holidays)

4 days ago
Full-time
United States
Key requirements: 3 years of experience, Prometheus/Grafana, Cloud environments (AWS/Azure/GCP), Monitoring automation (AIOps), Distributed Tracing (Jaeger, Zipkin, OpenTelemetry), Scripting (Python, Bash), Linux operating systems, Proactivity Orientation, Analysis Skill
Coderio

Coderio is a remote-first B2B tech company specializing in IT staff augmentation and scalable digital solutions for global businesses.

Remote policy: Coderio is a remote-first company that values talent regardless of location, hiring from various regions to support a collaborative international team.

Monitoring and Observability Analyst (M-F)

4 days ago
Full-time
United States
Key requirements: 3 years of experience, Prometheus/Grafana, ELK Stack, New Relic, Datadog, Cloud environments (AWS/Azure/GCP), Containers (Docker, Kubernetes), Logs (fluentd, Logstash, Loki), Distributed Tracing (Jaeger, Zipkin, OpenTelemetry), Scripting (Python, Bash), Linux operating systems, Root Cause Analysis, Proactivity Orientation
Coderio

Coderio is a remote-first B2B tech company specializing in IT staff augmentation and scalable digital solutions for global businesses.

Remote policy: Coderio is a remote-first company that values talent regardless of location, hiring from various regions to support a collaborative international team.

Site Reliability Engineer

4 days ago
Full-time
United States
$115,000 to $135,000 per year
Key requirements: 2 years of experience, Python, Linux, Relational Databases, SQL, gRPC microservices, Postgres, Pandas, Golang, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes, Production systems support, Code maintainability, Technical communication, Analytical problem-solving
The Voleon Group

The Voleon Group is a Berkeley-based quantitative investment management firm specializing in AI-driven trading strategies for institutional investors in the finance industry.

Remote policy: The Voleon Group offers a hybrid work model, allowing for both in-office and remote work, and hires from various regions including the United States, Canada, and the United Kingdom.

IT Staff Systems Engineer

9 days ago
Part-time
United States
$154,000 to $201,300 per year
Key requirements: 10 years of experience, Okta, Terraform, Python, Bash, PowerShell, Palo Alto, Meraki, JAMF Pro, Google Workspace, O365, SOC2, HIPAA, HITRUST, SOX, Incident management, Automation, Infrastructure-as-Code, Cross-functional communication, Technical change management
Omada Health

Omada Health is a San Francisco-based B2B healthcare technology company providing virtual integrated care solutions for chronic disease management through digital tools and personalized coaching.

Remote policy: Omada Health supports fully remote work for various roles, primarily hiring from the United States, with team members located across different regions.

Job Description : Cloud Administrator

11 days ago
Full-time
Worldwide
Key requirements: AWS, Azure, IaaS services, Compute, Networking, Cloud application design, AWS/Azure certification, Strong communication skills, Problem solving, Process improvement
Rackspace

Rackspace Technology is a San Antonio-based B2B cloud computing company specializing in multicloud solutions, data management, and security services for global enterprises.

Remote policy: Rackspace Technology supports remote work and hires globally, with team members located in various regions including North America, Europe, and Asia. Specific roles may have location requirements, such as being based in certain areas of Mexico.

Job Opprtunity : Cloud (GCP) Engineer III - IN

11 days ago
Full-time
India
Key requirements: 5 years of experience, GCP support & administration, Active GCP certification, Terraform, 24×7 support experience, ITIL processes, Cloud-native services troubleshooting, CI/CD tools
Rackspace

Rackspace Technology is a San Antonio-based B2B cloud computing company specializing in multicloud solutions, data management, and security services for global enterprises.

Remote policy: Rackspace Technology supports remote work and hires globally, with team members located in various regions including North America, Europe, and Asia. Specific roles may have location requirements, such as being based in certain areas of Mexico.

Junior Site Reliability Engineer

11 days ago
Full-time
United Kingdom
Key requirements: Terraform, AWS, Azure, GCP, CI/CD, Python, Bash, Linux, Git, Prometheus, Grafana, Coralogix, Kubernetes
Accesso Technology Company

Accesso Technology Group PLC is a Berkshire-based B2B SaaS provider specializing in integrated software solutions for the leisure and entertainment sectors, serving over 1,100 venues globally.

Remote policy: Accesso Technology Group PLC offers remote work flexibility for certain roles, such as the Jr. Site Reliability Engineer position, which allows for 100% remote work within the United Kingdom. The company operates globally, with a presence in various regions, including the UK and the US, but does not have a formal remote work policy applicable to all positions.

Senior Site Reliability Engineer

14 days ago
Full-time
United States
$200,000 to $220,000 per year
Key requirements: 5 years of experience, Kubernetes, Container orchestration, Air-gapped environments, Customer-managed infrastructure, Troubleshooting deployment issues, Observability solutions, DevOps tooling, Helm, PostgreSQL, Collaboration with Product and Design teams
Tines

Tines is a Dublin-based B2B automation platform specializing in no-code workflow automation for IT and security teams, serving a global market.

Remote policy: Tines offers remote positions with some roles requiring candidates to be based in specific locations, such as Washington D.C., MD, or VA, for in-person meetings. The company is open to hiring from various regions, supporting a flexible work environment.

Blockchain Site Reliability Engineer

16 days ago
Full-time
United States
Key requirements: Linux administration, Golang, Python, Blockchain node deployment, Monitoring tools, Incident response, Technical documentation, Blockchain protocols
InfStones

InfStones is a global B2B blockchain infrastructure provider headquartered in an undisclosed location, offering a comprehensive PaaS platform for managing blockchain nodes and APIs, serving developers and institutional clients across the Web3 industry.

Remote policy: InfStones supports remote work for certain roles and is open to hiring from various regions, including locations such as Texas, USA. The company operates globally, with team members potentially located across multiple countries.

Senior Infrastructure Engineer - Postgres

16 days ago
Full-time
United States
$140,000 to $208,000 per year
Key requirements: 7 years of experience, Postgres operations, AWS, Terraform, Go, Multi-cloud expertise, SLOs, Incident management, Observability tools
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Software Engineer, Infrastructure

17 days ago
Full-time
Worldwide
$160,000 to $190,000 per year
Key requirements: 2 years of experience, Rust, AWS, Python, SRE experience, Infrastructure as code, CI/CD, Linux systems, Observability tooling, Relational Databases (Postgres)
Lithic

Lithic is a fintech B2B company providing flexible card issuing solutions and payment infrastructure for technology companies, headquartered in an unspecified location.

Remote policy: Lithic supports a hybrid work model, requiring employees in the NYC area to work from the office three days a week, while offering flexibility for remote work from various locations around the world.

Senior Site Reliability Engineer

19 days ago
Full-time
Oceania
$145,000 to $175,000 per year
Key requirements: SRE expertise, High reliability and availability, Automation across builds and deployments, Postmortem leadership, Safety culture application, Azure services, Terraform, OpenTelemetry, C# application SDLC, CI/CD with TeamCity and GitHub Actions
Octopus Deploy

Octopus Deploy is an Australia-based B2B SaaS provider specializing in deployment automation solutions for DevOps, serving enterprise and medium-sized companies globally.

Remote policy: Octopus Deploy supports remote work and has a global hiring practice, with team members working remotely from regions such as Australia and New Zealand.

Staff Site Reliability Engineer, Streaming

20 days ago
Full-time
United States, Canada, Brazil, United Kingdom, Nigeria, Japan
$120,000 to $160,000 per year
Key requirements: 5 years of experience, RabbitMQ, Redpanda, Kubernetes, Go, Prometheus, Linux, SLIs, SLOs, SLA design, High-availability systems, Troubleshooting message broker performance
Alpaca

Alpaca is a US-based fintech company providing self-clearing brokerage infrastructure and APIs for stocks, ETFs, options, and crypto, serving financial institutions globally.

Remote policy: Alpaca embraces a remote-first culture, hiring globally from various regions including the USA, Canada, Japan, Hungary, Nigeria, Brazil, and the UK, allowing team members to work from their preferred locations.

Site Reliability Engineer (SRE)

23 days ago
Full-time
United States
Key requirements: 5 years of experience, Terraform, Datadog, Kubernetes, AWS, Azure, Python, CI/CD pipelines, SLIs, SLOs, SLAs, Infrastructure as Code, Observability stacks, Root-cause analysis
Neovera

Neovera is a Reston, Virginia-based B2B provider of managed IT services specializing in cybersecurity and enterprise cloud solutions for highly regulated industries such as financial services, healthcare, and government.

Remote policy: Neovera primarily hires US-based employees, focusing on operational needs within US time zones. Specific remote work policies are not detailed.

Senior Site Reliability Engineer

23 days ago
Full-time
United States
$120,000 to $160,000 per year
Key requirements: 10 years of experience, Kubernetes management, Cloud automation tools, SaaS security standards, Building tools from scratch, Automation mindset, Cloud native technologies, Incident response, Designing secure systems, Effective communication skills
ScienceLogic

ScienceLogic is a global B2B SaaS provider of AI-driven IT operations management solutions, headquartered in an unspecified location, focusing on enterprise IT departments and managed service providers in the cloud computing and IT service management industries.

DevOps/Site Reliability Engineer (SRE)

23 days ago
Full-time
United States
Key requirements: 5 years of experience, SaaS platform experience, IaC proficiency, Cloud platform management, CI/CD pipeline management, Containerization, Linux expertise, DevOps best practices, Security mindset, Startup environment adaptability
Elligint Health

Elligint Health is a New Jersey-based B2B healthcare technology company that provides a data-driven precision healthcare platform for payers and value-based care organizations.

Remote policy: Elligint Health supports flexible remote work arrangements and is hiring from various locations, with a requirement for candidates to reside in the U.S. for all positions.

Cloud Site Reliability Engineer

24 days ago
Full-time
United States
$100,000 to $120,000 per year
Key requirements: Azure expertise, Kubernetes, Openshift, Kafka, Elastic stack, Chaos engineering, Performance troubleshooting, Performance testing methodologies, Security and Compliance (SOC2, HIPAA, ISO27001), Terraform, Ansible, Infrastructure as code, Linux production systems, High-performance distributed systems, Cloud service management
Smile Digital Health

Smile Digital Health is a healthtech B2B company providing a FHIR-based clinical data repository as a service, headquartered in Canada and the US, focusing on healthcare interoperability and data management solutions for global healthcare organizations.

Remote policy: Smile Digital Health supports remote work and hires from various locations, including Canada and the United States, with team members collaborating across time zones.

Staff SRE - Solana

24 days ago
Contract
Worldwide
Key requirements: Solana validator experience, Performance tuning, Infrastructure-as-Code (Terraform), Docker, Ansible, Multi-cloud environments (GCP preferred), Cryptography principles, Programming in Go, Rust, or Python
P2P

P2P.org is a leading fintech company specializing in cryptocurrency staking solutions, headquartered remotely, serving a global market with a focus on decentralized finance.

Remote policy: P2P.org is a fully remote company, hiring talented individuals from various regions around the world to foster a diverse and inclusive team.

SRE Observability Engineer

25 days ago
Full-time
Poland
Key requirements: 7 years of experience, Prometheus, Grafana, VictoriaMetrics, Opensearch, ELK stack, fluentbit, fluentd, rsyslog, journald, Ansible, Nginx, Docker, kvm, qemu, GIT, Gitlab CI/CD, network configurations, incident reporting, root cause analysis
capital.com

Capital.com is a Cyprus-based fintech B2C online trading platform specializing in CFDs and spread betting across over 3,000 global financial markets.

Remote policy: Capital.com offers remote work opportunities, including the flexibility to work from various locations, with team members enjoying benefits such as 30 extra days to work remotely from anywhere in the world.

Site Reliability Engineer (India 3rd Shift)

25 days ago
Full-time
India
Key requirements: 5 years of experience, Terraform, AWS, EC2, S3, RDS, IAM, Load balancing, NGINX, HAProxy, Redis, CDN, Splunk, Datadog, Java, Go, Python, Linux Shell, CI/CD pipelines
Rackspace

Rackspace Technology is a San Antonio-based B2B cloud computing company specializing in multicloud solutions, data management, and security services for global enterprises.

Remote policy: Rackspace Technology supports remote work and hires globally, with team members located in various regions including North America, Europe, and Asia. Specific roles may have location requirements, such as being based in certain areas of Mexico.

Site Reliability Engineer (India Based- Bangalore & Hyderabad)

25 days ago
Full-time
India
Key requirements: Kubernetes, Docker, Java, Python, Continuous Delivery tools, Unix, Infrastructure components, Automation of operational work, Self-healing patterns, DataDog monitoring
Zimperium

Zimperium, Inc. is a Dallas-based B2B cybersecurity company specializing in mobile security solutions for enterprises, offering real-time protection against mobile threats on iOS and Android devices.

Remote policy: Zimperium supports remote work and is currently hiring for various roles, including remote positions in regions such as India. For specific hiring locations and remote work details, please refer to the company's official careers page.

Site Reliability Engineer

25 days ago
Full-time
Australia
Key requirements: 5 years of experience, Linux Production Systems Engineer, Python, Kubernetes, AWS, Distributed data storage, NoSQL, Monitoring & Alerting, Configuration Management, S/W Performance analysis
sonyinteractiveentertainmentglobal

Sony Interactive Entertainment is a San Mateo-based global video game and digital entertainment company, primarily B2C, known for the PlayStation brand and its innovative gaming hardware, software, and network services.

Remote policy: Sony Interactive Entertainment supports flexible remote work arrangements, hiring from various regions globally, including locations such as the USA, UK, and Japan.

Site Reliability Engineer (India Based- Bangalore & Hyderabad)

26 days ago
Full-time
India
Key requirements: Kubernetes, Docker, Java, Python, Continuous Delivery tools, Unix, Infrastructure components, DevOps experience, DataDog monitoring, Self-healing patterns, Resiliency patterns
Zimperium

Zimperium, Inc. is a Dallas-based B2B cybersecurity company specializing in mobile security solutions for enterprises, offering real-time protection against mobile threats on iOS and Android devices.

Remote policy: Zimperium supports remote work and is currently hiring for various roles, including remote positions in regions such as India. For specific hiring locations and remote work details, please refer to the company's official careers page.

Virtual Platform Administrator (Future Opportunity)

27 days ago
Full-time
United States
$100,000 to $115,000 per year
Key requirements: 7 years of experience, GCP, AWS, Azure, DoD cloud solutions, Multicloud solutions, Network architecture, Virtualization, Automation scripting, IaaS, PaaS, SaaS, Secret Clearance, DoD 8570 IAT Level II
AGE Solutions

AGE Solutions is a technology and professional services company headquartered in Alexandria, VA, specializing in consulting and advanced technology solutions for the U.S. government, defense, and intelligence sectors in a B2B context.

Remote policy: AGE Solutions offers remote positions within the United States, focusing on roles that support U.S. government and defense sectors.

Senior Production Engineer (REMOTE)

27 days ago
Full-time
Worldwide
Key requirements: 5 years of experience, Kubernetes, Go, Infrastructure-as-Code, Observability, Honeycomb, OpenTelemetry, Distributed systems, SaaS management, Incident response, Crossplane
Upbound

Upbound is a Seattle-based B2B technology company specializing in cloud-native infrastructure solutions through its Platform Cloud™, targeting enterprises in the cloud computing industry.

Remote policy: Upbound is a Remote-First company that hires globally, with team members located in various regions such as EMEA and the western United States, allowing for flexibility in remote work.

Staff Site Reliability Engineer, Database

29 days ago
Full-time
United States, Canada, Brazil, United Kingdom, Nigeria, Japan
$120,000 to $180,000 per year
Key requirements: 5 years of experience, PostgreSQL, Go, Prometheus, Linux, Fintech knowledge, Incident Management, SLIs/SLOs/SLAs design, Performance troubleshooting, Schema design, Distributed tracing, Scaling PostgreSQL clusters
Alpaca

Alpaca is a US-based fintech company providing self-clearing brokerage infrastructure and APIs for stocks, ETFs, options, and crypto, serving financial institutions globally.

Remote policy: Alpaca embraces a remote-first culture, hiring globally from various regions including the USA, Canada, Japan, Hungary, Nigeria, Brazil, and the UK, allowing team members to work from their preferred locations.

Senior Site Reliability Engineer

29 days ago
Contract
Poland
285,000 zł to 385,000 zł per year
Key requirements: Kubernetes, AWS, Distributed systems, Automation of deployments, Troubleshooting cloud systems, Technical strategy execution, Cross-functional collaboration, On-call operations management
Affirm

Affirm is a U.S.-based fintech company offering a buy now, pay later (BNPL) service that allows consumers to make purchases in installments, primarily targeting the retail sector with both B2C and B2B business models.

Remote policy: Affirm is a remote-first company, allowing employees to work from various locations within their country of employment, including Poland, where the majority of roles can be remote.

Senior Site Reliability Engineer

29 days ago
Full-time
Spain
€80,000 to €110,000 per year
Key requirements: Kubernetes, AWS, Distributed systems, Automation of deployments, Troubleshooting cloud systems, Technical strategy execution, Cross-functional collaboration, On-call operations management
Affirm

Affirm is a U.S.-based fintech company offering a buy now, pay later (BNPL) service that allows consumers to make purchases in installments, primarily targeting the retail sector with both B2C and B2B business models.

Remote policy: Affirm is a remote-first company, allowing employees to work from various locations within their country of employment, including Poland, where the majority of roles can be remote.

Staff Site Reliability Engineer-Federal, Security Clearance

29 days ago
Full-time
United States
$119,000 to $170,000 per year
Key requirements: 5 years of experience, Active Secret Security Clearance, Site Reliability Engineering in classified environments, Monitoring activities (vulnerability scanning, patch management), Linux administration, Automation tools (Ansible, Terraform), Python coding, Container-based architectures (AWS ECS, Kubernetes), Experience with air-gapped environments, High/Moderate FedRAMP authorization levels
Zscaler

Zscaler is a San Jose-based B2B cloud security company specializing in SaaS solutions for zero trust architecture and secure access service edge (SASE), serving enterprise customers globally.

Remote policy: Zscaler supports remote work and hires globally, with team members located in various regions, allowing for collaboration across time zones.