Remote Site Reliability Engineer Jobs

Explore 46 fresh remote Site Reliability Engineer jobs. Whether you're working from home or from anywhere in the world, our curated listings deliver clear insights for your next move.

Latest Site Reliability Engineer Jobs (46)

Senior Site Reliability Engineer

1 day ago
Full-time
United States
$100,000 to $120,000 per year
Key requirements: 6 years of experience, Cloud infrastructure (AWS, GCP, Azure), Observability systems design, Incident management, Automation and tooling, Ownership of reliability outcomes, Mentorship of engineers
ujet.cx

UJET is a B2B SaaS provider specializing in AI-powered Cloud Contact Center solutions that enhance customer experience for enterprises globally.

Infrastructure Engineer

1 day ago
Full-time
United States
Key requirements: Terraform, AWS, Kubernetes, CI/CD, Go, TypeScript, Rust, Observability, Database performance optimization, Security hardening, Financial technology experience
Bastion

Bastion is a fintech B2B platform headquartered in New York City, specializing in Stablecoin-as-a-Service for financial institutions and enterprises.

Member of Technical Staff (Infrastructure): World Models

1 day ago
Full-time
United States, Canada, United Kingdom
Key requirements: Linux-native, GPU infrastructure, Kubernetes, Slurm, Distributed systems, Production discipline, ML familiarity, Resource-constrained thinking
Moonvalley AI

Moonvalley AI is a Los Angeles-based generative AI company specializing in text-to-video tools for the entertainment industry, operating in both B2B and B2C markets.

Remote policy: Moonvalley supports remote work and hires globally, welcoming candidates from various regions, including the UK and Europe, with a fully remote culture that accommodates collaboration across time zones.

Senior SRE (blockchain networks)

1 day ago
Contract
Worldwide
Key requirements: 5 years of experience, Kubernetes, Terraform, GCP, Observability tooling, Blockchain infrastructure, Distributed systems, High-availability environments, Linux systems, Scripting (Go, Python)
P2P

P2P.org is a fully remote fintech company specializing in cryptocurrency staking solutions, serving a global market with a focus on decentralized finance.

Remote policy: P2P.org is a fully remote company, hiring talented individuals from various regions around the world to foster a diverse and inclusive team.

Senior Software Engineer - Stability

1 day ago
Full-time
United States, Canada
$166,600 to $250,900 per year
Key requirements: PostgreSQL expertise, Temporal workflows, Tracing and OpenTelemetry, SRE or DevOps experience, Haskell or functional programming, Experience with event streaming and OLAP
Mercury

Mercury is a B2B provider of simplified business banking services, targeting businesses primarily in the North American market.

Remote policy: Mercury offers remote work opportunities for various positions, with current roles available for candidates in Canada and the United States. For specific hiring regions and remote work policies, please refer to the company's careers page.

SRE / Platform Infrastructure Engineer Intern

1 day ago
Full-time
Worldwide
$700 per month
Key requirements: Kubernetes, AWS, Golang, Python, Prometheus, Grafana, Docker, CI/CD, Distributed systems, Automation tools
Sezzle

Sezzle is a Minneapolis-based fintech B2C company specializing in Buy Now, Pay Later (BNPL) solutions, aiming to enhance financial inclusion for consumers across the U.S. market.

Remote policy: Sezzle embraces a flexible remote work culture and hires from various locations, with a current focus on U.S.-based candidates. Team members are encouraged to collaborate across time zones.

Senior Site Reliability Engineer

2 days ago
Full-time
United States
$145,000 to $203,000 per year
Key requirements: 5 years of experience, Kubernetes, Google Cloud Platform, Helm, Crossplane, Container security, CloudSQL, Pub/Sub, Prometheus, Grafana, OpenTelemetry, Terraform, GitHub Actions, ArgoCD, gRPC microservices, Linux, Application Security tooling, Continuous Deployment
Censys

Censys is a cybersecurity B2B platform providing real-time Internet intelligence and actionable threat insights, headquartered in the U.S. and serving global governments and over 50% of Fortune 500 companies.

Remote policy: Censys supports remote work and is hiring remotely within the continental United States, with team members located across various regions.

Senior Site Reliability Engineer, Tenant Services: Geo

2 days ago
Full-time
Worldwide
Key requirements: SaaS experience, Kubernetes, Terraform, Ansible, Go, Data replication, Disaster recovery technologies, Customer-facing communication, Automation and tooling, Observability systems
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. The company embraces flexible scheduling to accommodate various time zones.

Site Reliability Engineer II

2 days ago
Full-time
India
$3,000 to $6,000 per month
Key requirements: 3 years of experience, Golang, Kubernetes, AWS, RDS (MySQL/Postgres), Observability tools, Distributed systems design, AI enablement, CI/CD pipelines, Microservices architecture
Sezzle

Sezzle is a Minneapolis-based fintech B2C company specializing in Buy Now, Pay Later (BNPL) solutions, aiming to enhance financial inclusion for consumers across the U.S. market.

Remote policy: Sezzle embraces a flexible remote work culture and hires from various locations, with a current focus on U.S.-based candidates. Team members are encouraged to collaborate across time zones.

Site Reliability Engineer/Developer

5 days ago
Full-time
Canada
$109,600 to $116,100 per year
Key requirements: 6 years of experience, AWS expertise, Infrastructure-as-code (Terraform), CI/CD pipeline development, Containerization (Docker, Kubernetes), Observability tools (Prometheus, Grafana), Python, Go, or Java proficiency, Incident response leadership, Disaster recovery planning, Semiconductor industry experience
TechInsights

TechInsights is a global B2B SaaS provider specializing in semiconductor reverse engineering and market intelligence, offering AI-powered analytics tools for the semiconductor industry.

Remote policy: TechInsights Inc. offers remote work opportunities for certain roles, including positions like the Senior Consulting Associate, while primarily operating from its headquarters in Ottawa, Canada. The company supports a diverse team across various regions.

Site Reliability Engineer

10 days ago
Contract
South America, Mexico
Key requirements: Prometheus, Grafana, Datadog, OpenTelemetry, Azure, Terraform, Python, Bash, Docker, Kubernetes, PostgreSQL monitoring
Capital Markets Gateway

Capital Markets Gateway (CMG) is a Chicago-based B2B fintech firm specializing in modernizing equity capital markets through a digital platform that connects institutional investors and financial institutions.

Remote policy: Capital Markets Gateway (CMG) hires remotely from various locations, including the United States and select countries in Latin America such as Brazil, Argentina, and Mexico, with roles requiring collaboration across time zones.

Senior Technical Support Engineer

10 days ago
Full-time
United States
Key requirements: 3 years of experience, AWS, GCP, Azure, Kubernetes, Docker, Python, VPC networking, Debugging production systems, Troubleshooting, Customer communication, High ownership
Unstructured Technologies

Unstructured Technologies is a Loomis, California-based B2B AI platform company specializing in transforming unstructured data into structured formats for government and enterprise clients.

Remote policy: Unstructured Technologies supports remote work and is hiring for various roles, with team members located across the United States. While specific positions may require occasional travel, the company embraces a flexible work environment.

Senior Platform Engineer - Observability

10 days ago
Full-time
Japan
Key requirements: AWS, Terraform, Kubernetes, Datadog, Grafana, Prometheus, Rootly, Service Level Objectives (SLOs), Service Level Indicators (SLIs), Python, Observability tooling, Cost management of observability tools, Experience in data-focused teams, SaaS platform experience, Building observability tooling for large-scale services
Kraken

Kraken is a global cryptocurrency exchange platform serving both B2B and B2C customers, dedicated to accelerating crypto adoption. It operates fully remotely, with a presence in over 70 countries.

Remote policy: Kraken is a fully remote company with team members in over 70 countries, hiring globally to support a diverse workforce across various regions.

Sr. Platform Engineer (AKS/EKS)

10 days ago
Full-time
United States
Key requirements: 6 years of experience, Kubernetes (EKS + AKS), Terraform, AWS cloud technologies, Cloud-native architecture, Payment industry standards, Serverless architectures, Containers and orchestration, Security best practices for containers
ASCENDING

ASCENDING is a B2B technology services company specializing in AI-driven solutions and cloud transformation strategies, serving various industries including finance, healthcare, and education.

Remote policy: ASCENDING offers fully remote positions and hires exclusively from the United States for these roles.

Site Reliability / DevOps Engineer - 100% Remote (m/f/d)

12 days ago
Contract
Worldwide
Key requirements: 3 years of experience, Kubernetes, CI/CD, Cloud Services, Automation, Problem-Solving, Communication Mastery, Collaboration Wizardry, Self-organization
Digistore24

Digistore24 is a Germany-based e-commerce platform specializing in digital product sales and affiliate marketing, operating primarily in the B2B and B2C sectors.

Remote policy: Digistore24 embraces a flexible remote work culture, allowing team members to work from home or coworking spaces, as long as they can ensure reliable internet access. The company operates globally, welcoming talent from various regions.

Sr. Reliability Operations Engineer (Mexico)

12 days ago
Full-time
Mexico
Key requirements: 5 years of experience, Incident response leadership, Grafana/Prometheus, GCP Monitoring, Automation scripting, Runbook development, Distributed systems support, IoT device operations, Operational documentation improvement, Incident management tools, Networking fundamentals
Serve Robotics

Serve Robotics is a B2B hardware robotics company based in the U.S. specializing in self-driving delivery robots for last-mile food delivery, targeting urban markets.

Remote policy: Serve Robotics is open to hiring qualified talent working remotely, with a preference for candidates located in the United States. Team members may be based in various locations, allowing for flexible collaboration.

Reliability Operations Engineer (Mexico)

12 days ago
Full-time
Mexico
Key requirements: 2 years of experience, Grafana, Prometheus, GCP Monitoring, OpenTelemetry, Linux, Incident response, Cloud platforms, Runbooks, Distributed systems, IoT systems, Scripting, Networking fundamentals, Jira
Serve Robotics

Serve Robotics is a B2B hardware robotics company based in the U.S. specializing in self-driving delivery robots for last-mile food delivery, targeting urban markets.

Remote policy: Serve Robotics is open to hiring qualified talent working remotely, with a preference for candidates located in the United States. Team members may be based in various locations, allowing for flexible collaboration.

Senior HPC Cluster Administrator - Deep Learning Frameworks Infrastructure

13 days ago
Full-time
Poland
221,250 PLN to 383,500 PLN per year
Key requirements: 5 years of experience, HPC cluster administration, Deep learning frameworks, Linux systems administration, Slurm, Ansible, High-speed networking, Distributed filesystems, MLOps tooling, NVIDIA GPU infrastructure tools
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Systems Software Engineer, AI Infrastructure

14 days ago
Full-time
India
Key requirements: 5 years of experience, Python, C/C++, SRE principles, AI training, deep learning frameworks, cloud platforms, observability platforms, distributed systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Linux System Administrator

16 days ago
Full-time
Asia, Israel
Key requirements: 5 years of experience, NVIDIA hardware experience, Bash or Python coding, Zabbix, Prometheus, or Nagios, Infoblox, Jenkins and Git familiarity, Strong system-level understanding
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Cloud Infrastructure Engineer

16 days ago
Full-time
United States
Key requirements: 7 years of experience, AWS, GCP, Azure, Docker, Kubernetes, Terraform, Pulumi, Distributed systems, High availability, Fault tolerance, Cost efficiency, Technical leadership, Infrastructure strategy, Regulated industries
Valon Tech

Valon is a New York City-based fintech company specializing in AI-native mortgage servicing solutions, operating a proprietary platform for both B2B and B2C markets across the US.

Remote policy: Valon supports remote work and hires primarily within the United States, with team members located across various regions. Candidates are encouraged to apply regardless of their specific location within the US.

Senior Cloud Infrastructure Engineer

16 days ago
Full-time
United States
Key requirements: 3 years of experience, AWS, GCP, Azure, Vitess, Clickhouse, Redis, Docker, Kubernetes, Terraform, Pulumi, Distributed systems, Infrastructure-as-code, Product mindset, High-growth environments, Regulated industries
Valon Tech

Valon is a New York City-based fintech company specializing in AI-native mortgage servicing solutions, operating a proprietary platform for both B2B and B2C markets across the US.

Remote policy: Valon supports remote work and hires primarily within the United States, with team members located across various regions. Candidates are encouraged to apply regardless of their specific location within the US.

Senior Site Reliability Engineer - Government Cloud

17 days ago
Full-time
United States
$105,000 to $130,000 per year
Key requirements: GCP, FedRAMP compliance, Kubernetes, Go, CI/CD pipelines, Infrastructure as Code, NIST 800-53 controls, Distributed systems architecture
Ping Identity

Ping Identity is a Denver-based B2B software company specializing in identity and access management (IAM) solutions for enterprise customers across technology, finance, healthcare, and government sectors.

Database Reliability Engineer - Core Team

17 days ago
Full-time
United Kingdom, Germany, Netherlands
Key requirements: 5 years of experience, ClickHouse, SQL databases, Distributed database internals, Shell scripting, Python, C++ reading, AWS, Azure, Google Cloud Platform, Production debugging, Incident response processes
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Site Reliability Engineer

17 days ago
Full-time
North America, South America
$130,000 to $140,000 per year
Key requirements: 3 years of experience, AWS, Kubernetes, Database optimization, Incident response management, Observability tooling, North or South America based
Circle.so

Circle.so is a global SaaS platform for community building and online education, enabling creators and businesses to engage audiences through discussions, courses, and events.

Remote policy: Circle.so is a fully remote company with team members from over 30 countries, hiring globally and supporting candidates in various regions, including preferences for European and North/South American time zones.

Senior Systems Engineer

18 days ago
Full-time
Germany
Key requirements: 5 years of experience, Ansible, Terraform, CI/CD pipelines, Virtualization technologies, Containerized workloads, Linux systems, Windows systems, Automation expertise
Riot Games

Riot Games is a Los Angeles-based video game developer and publisher specializing in competitive multiplayer esports titles, operating globally with a B2C business model.

NOC Engineer – Tier III

18 days ago
Full-time
United States
Key requirements: 8 years of experience, IP networking (BGP, OSPF, MPLS, VLANs, VPNs, QoS), Fiber transport systems (DWDM, CWDM, Ethernet), Juniper (Junos), Cisco, MikroTik, Calix XGS-PON, Fixed wireless and LTE platforms, Advanced diagnostic and analytical skills, Network monitoring and telemetry tools, Lead technical response during major incidents
Vero Networks

Vero Fiber Networks is a Boulder, Colorado-based telecommunications company specializing in fiber-to-the-premise (FTTP) broadband services for underserved communities across the US, operating in both B2B and B2C markets.

Remote policy: Vero Networks offers remote work opportunities for certain roles, including positions like the Director of Fiber Engineering. However, specific hiring locations and broader remote work policies are not clearly defined.

Senior Systems Performance Engineer

18 days ago
Full-time
United States
$168,000 to $258,750 per year
Key requirements: 5 years of experience, Dynamo, TensorRT, Slurm, BCM, vLLM, SGLang, CUDA, cuBLAS, CUTLASS, Python, x86/Arm server architectures, GPU computing
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Distinguished Engineer, Cloud Site Reliability Engineering

19 days ago
Full-time
United States
$320,000 to $488,750 per year
Key requirements: 18 years of experience, Cloud infrastructure maintenance, AI development, Java, Python, Shell scripting, Distributed systems, REST APIs, SQL/NoSQL databases, Docker, Kubernetes, OpenStack, Machine Learning, Deep Learning, High-performance software design, Scalable software systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Forward Deploy Engineer

19 days ago
Full-time
Sweden, Norway, Finland, Denmark
Key requirements: 8 years of experience, Linux, Kubernetes, Networking, GPU-accelerated compute, Edge compute, Modular data centers, AI workloads, Nordic market experience, Technical account management, Customer relationship management
Armada

Armada is a global edge computing startup specializing in IoT and AI solutions for remote areas, primarily serving B2B clients across various industries.