Fresh remote Site Reliability Engineer jobs in North America

Explore latest remote Site Reliability Engineer opportunities from leading companies hiring in North America. 48 jobs posted last 30 days.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Fresh remote Site Reliability Engineer jobs in North America (48)

Senior Site Reliability Engineer, GeForce NOW

1 day ago
Full-time
United States
$168,000 to $270,250 per year
Key requirements: 8 years of experience, Kubernetes, Automation, Multi-region cloud deployments, Datadog, Prometheus, Deployment pipelines, Go, Python, Bash scripting, Anomaly detection tools, AI usage in SRE
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Release Engineer

2 days ago
Full-time
North America, Worldwide
Key requirements: 4 years of experience, Kubernetes, Helm, CI/CD pipelines, Infrastructure as Code, GitOps tooling, Security tooling integration, Linux systems, Networking fundamentals, Distributed systems
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Senior Platform Engineer (Kubernetes, Middleware, DevOps) - (W2PE)

3 days ago
Contract
United States
$80 to $110 per hour
Key requirements: Ansible, Kubernetes, Rancher, Oracle middleware stack, Unix/Linux, Shell scripting, Neo4j, Cognos, Tableau, Graylog, Grafana, AWS
Trace3

Trace3 is an Irvine, California-based B2B IT consulting firm specializing in technology solutions, digital transformation, and cybersecurity for enterprise and government clients across the United States.

Remote policy: Trace3 hires primarily within the United States and offers remote roles, with team members located across various states including California, Texas, and Colorado.

Agora - Senior Infrastructure Engineer

3 days ago
Full-time
North America, South America
Key requirements: 5 years of experience, TypeScript, Kubernetes, AWS architecture, Infrastructure-as-code, Distributed systems fundamentals, Reusable systems and abstractions, SLIs/SLOs and incident tooling, Observability practices, GitOps patterns, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Agora - Senior Infrastructure Engineer

3 days ago
Full-time
North America, South America
Key requirements: 5 years of experience, TypeScript, AWS architecture, Kubernetes, Infrastructure-as-code, Distributed systems, Reusable systems, SLIs/SLOs, Observability practices, GitOps, Developer Productivity, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Site Reliability Engineer

4 days ago
Full-time
North America, Worldwide
Key requirements: Linux/Unix, Cloud providers (AWS, Google Cloud, Azure), Infrastructure provisioning (Terraform, CloudFormation, Ansible), Containerization (Docker), Orchestration (Kubernetes), Monitoring tools (Prometheus, Grafana, Datadog), CI/CD pipelines (Jenkins, GitLab CI, CircleCI), Incident management practices, Scripting (Bash, Perl)
OXIO

OXIO is a North America-based B2B Telecom-as-a-Service (TaaS) platform enabling businesses to build and manage customizable mobile networks through a cloud-based solution.

Remote policy: OXIO supports flexible work arrangements and hires from various regions, with team members located in cities such as New York, Mexico City, and Montreal. Candidates are encouraged to apply regardless of their location.

Kafka Platform Engineer

4 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Kafka internals, Kafka security, Kafka Connect, Schema Registry, Kafka Streams, HA/DR strategies, Python, Terraform, Observability tooling, Confluent platform
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Observability Engineer (Prometheus / Grafana / Datadog)

4 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, SRE principles, High-cardinality metrics, Distributed tracing, CI/CD integration, Linux internals, Container platforms
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Site Reliability Engineer (SRE)

4 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Python, Go, Kubernetes, Prometheus, Grafana, CI/CD pipelines, Distributed systems, Incident response, Chaos engineering, Cloud platforms
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Senior AI Tools Engineer, SRE Operations - GeForce NOW

6 days ago
Full-time
North America
$144,000 to $230,000 per year
Key requirements: 5 years of experience, Python, AI tools development, LLM-based systems, Kubernetes, AWS, Data pipeline management, Automation, SRE principles, Monitoring tools (Grafana)
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer

9 days ago
Full-time
United States
$150,000 to $200,000 per year
Key requirements: 5 years of experience, SRE, Linux, Container management, Distributed systems, SLIs/SLOs, Incident response leadership, Scripting, Monitoring systems, GPU infrastructure, High-growth reliability improvement
RunPod

RunPod is a Mt. Laurel, New Jersey-based B2B cloud computing platform specializing in GPU infrastructure for AI and machine learning applications, serving a global market of developers and enterprises.

Remote policy: RunPod operates as a remote-first organization, welcoming candidates from various locations, primarily focusing on those eligible to work in the United States.

Staff Site Reliability Engineer (AI Enablement)

10 days ago
Full-time
United States
$150,000 to $230,000 per year
Key requirements: 8 years of experience, AI-assisted development tools, Building AI/LLM-powered developer tools, Driving org-wide tooling adoption, Prompt engineering techniques, Go or Python proficiency, Operating production environments in AWS, Strong experience with Terraform, Container orchestration (ECS/Kubernetes), Observability practices, AI Enablement Strategy
Coalition, Inc.

Staff Site Reliability Engineer (AI Enablement)

10 days ago
Full-time
Canada
$153,400 to $230,400 per year
Key requirements: 8 years of experience, AI-assisted development tools, Building AI/LLM-powered developer tools, AI Enablement Strategy, Proficiency in prompt engineering, Go or Python proficiency, AWS production environment experience, Terraform expertise, Container orchestration (ECS/Kubernetes), Observability practices, Driving org-wide tooling adoption
Coalition, Inc.

Site Reliability Engineer

10 days ago
Full-time
United States, Canada, United Kingdom, Brazil, Japan, Nigeria
$100,000 to $150,000 per year
Key requirements: 4 years of experience, PostgreSQL, Kubernetes, GitOps, Cloud networking, Incident response, Go, Python, Observability stack
Alpaca

Alpaca is a US-based fintech company providing self-clearing brokerage infrastructure and APIs for stocks, ETFs, options, and crypto, serving financial institutions globally.

Remote policy: Alpaca embraces a remote-first culture, hiring globally from various regions including the USA, Canada, Japan, Hungary, Nigeria, Brazil, and the UK, allowing team members to work from their preferred locations.

Senior Infrastructure Engineer

10 days ago
Full-time
United States
$105,000 to $135,000 per year
Key requirements: 4 years of experience, PowerShell, Python, Cloud management tools, ITIL 4 Foundation, AI platforms, SharePoint Online, Vulnerability remediation, Disaster Recovery (DR), Business Continuity Plans (BCP), Automated deployment frameworks, Global infrastructure scaling
Omnidian

Omnidian is a Seattle-based B2B tech-enabled service company specializing in solar energy monitoring and maintenance, serving residential and commercial markets globally.

Remote policy: Omnidian supports remote work for most roles, allowing flexibility for employees to work from various locations, including regions such as Seattle, WA, and Australia.

Managed Services Engineer I (Raleigh/Durham/Chapel Hill, NC area)

11 days ago
Full-time
United States
$40,000 to $60,000 per year
Key requirements: 1 years of experience, Microsoft Exchange, SQL, Windows Server, ConnectWise, LAN/WAN troubleshooting, Microsoft Active Directory, Azure AD, Imaging Solutions, Basic PowerShell, Strong organization skills, Strong interpersonal skills
Logically

Logically is a Brighouse, England-based B2B Managed Security Services Provider (MSSP) specializing in cybersecurity solutions and IT services for organizations across various industries.

Remote policy: Logically supports remote work and hires from various regions, including the Raleigh–Durham–Chapel Hill area in North Carolina, while encouraging a collaborative team environment.

Senior Systems Administrator (temp to hire) - Marcus Hook, PA - HYBRID

11 days ago
Contract
United States
Key requirements: 12 years of experience, Microsoft Active Directory, VMWare VSphere 7.x, Citrix XenApp 2507+, Cisco routers and switches, IT SOX Controls, endpoint security tools, server management best practices, scripting, capacity planning, cybersecurity tools
Arctiq

Arctiq is a Toronto-based B2B DevOps and cloud solution integrator specializing in professional IT services and managed services for enterprise organizations across North America.

Technology Operations Manager

12 days ago
Full-time
North America, Worldwide
$200,000 to $225,000 per year
Key requirements: 5 years of experience, AWS, Hybrid cloud infrastructure, Site Reliability Engineering, Incident investigation, Observability practices, Service reliability, Root cause management, Data center technologies, Virtualization, Infrastructure as Code (IaC), Operational KPIs
Business Wire

Business Wire is a San Francisco-based B2B service provider specializing in global news release distribution and regulatory disclosure for various industries, including finance and healthcare.

Remote policy: Business Wire supports remote work and hires from various locations, with team members collaborating across different time zones.

Senior Debug System Engineer, Datacenter

13 days ago
Full-time
United States
$200,000 to $322,000 per year
Key requirements: 12 years of experience, Failure analysis on datacenter products, Debugging GPU baseboards and servers, Enabling DFx requirements, Hardware, Software, Component, Process, Test, Validation expertise, Familiarity with oscilloscopes and analyzers, Strong negotiation and organization skills, Problem solving mentality, Ability to travel to factory sites
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Production Engineer - DGX Cloud

14 days ago
Full-time
North America
$168,000 to $333,500 per year
Key requirements: 8 years of experience, Production Engineering, DevOps, SRE, Kubernetes, Slurm, Go, Python, Large-scale distributed systems, Incident management, Monitoring and alerting, Automated deployments
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Site Reliability Engineer

16 days ago
Full-time
United States, Canada, United Kingdom, Singapore, India, Ireland, Finland
$120,000 to $180,000 per year
Key requirements: 8 years of experience, Production SaaS systems, Python, AWS, Kubernetes, Networking fundamentals, Monitoring & alerting, Advanced observability, Incident management, Troubleshooting skills, AIOps strategy
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Cloud Reliability & Recovery Engineer

16 days ago
Full-time
United States, Canada, United Kingdom, Singapore, India, Ireland, Finland
$100,000 to $150,000 per year
Key requirements: 5 years of experience, AWS expertise, Disaster Recovery architecture, Multi-region failover, Terraform, Kubernetes, CI/CD pipelines, Python scripting, AWS Backup administration, Chaos engineering, Business Continuity Planning
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Staff Database Reliability Engineer

17 days ago
Full-time
United States
$200,000 to $250,000 per year
Key requirements: PostgreSQL, Django ORM, AWS DMS, pganalyze, CloudWatch, Honeycomb, AI coding tools, OpenSearch, Redis, SQS, RabbitMQ, Python, Terraform, Cross-team leadership, Automation
Scribe

Scribe is a San Francisco-based B2B SaaS platform specializing in workflow documentation and optimization, serving over 5 million users across 600,000 businesses globally.

Senior Infrastructure Engineer, Government Systems

17 days ago
Full-time
North America, Middle East
Key requirements: Kubernetes, Terraform, AWS, CI/CD, GitOps, Linux administration, Operational mindset, Security compliance
Chainalysis

Chainalysis is a New York City-based B2B blockchain analysis firm specializing in compliance and investigation software for the cryptocurrency and financial sectors, serving clients globally.

Remote policy: Chainalysis supports remote work and is open to hiring from various regions, including North America and the Middle East, with team members located across multiple countries.

Senior Software Engineer, AV Mapping Infrastructure

18 days ago
Full-time
United States
$152,000 to $287,500 per year
Key requirements: 5 years of experience, AWS, Kubernetes, Cloud services management, Application containers, Monitoring systems (Prometheus, Datadog), Middleware systems (Redis, MongoDB, Kafka, HBase, Postgres, ElasticSearch), CI/CD deployment strategies, Networking fundamentals, Linux proficiency
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer II

18 days ago
Full-time
Mexico
Key requirements: 3 years of experience, Python, Go, Distributed Systems Expertise, Reliability Engineering Mindset, Observability & Incident Response, Cross-functional Communication, Operational Tooling & AI Fluency, Leadership & Mentorship
EarnIn

EarnIn is a fintech company headquartered in the US, specializing in earned wage access (EWA) through a mobile app that provides financial tools for hourly workers.

Senior Site Reliability Engineer

18 days ago
Full-time
Mexico
$100,000 to $150,000 per year
Key requirements: 4 years of experience, Python, Go, Distributed Systems Expertise, Reliability Engineering Mindset, Observability & Incident Response, Cross-functional Communication, Operational Tooling & AI Fluency, Leadership & Mentorship
EarnIn

EarnIn is a fintech company headquartered in the US, specializing in earned wage access (EWA) through a mobile app that provides financial tools for hourly workers.

Senior Site Reliability Engineer, Infrastructure Foundations

18 days ago
Full-time
North America, Worldwide
$113,082 to $175,725 per year
Key requirements: 6 years of experience, Puppet, Kubernetes, Python, Linux system-level troubleshooting, Infrastructure security management, Incident response leadership, Automation of tasks and processes, Monitoring and logging infrastructure (Prometheus, Grafana), Open source software contribution, Security incident technical response
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.

Infrastructure Engineer

18 days ago
Full-time
United States
Key requirements: AWS, Terraform, Kubernetes, CI/CD, Go, TypeScript, Rust, Postgres, Redis, Kafka, Datadog, Grafana, Sentry, CloudWatch, AWS Nitro Enclaves
Bastion

Bastion is a fintech B2B platform headquartered in New York City, specializing in Stablecoin-as-a-Service for financial institutions and enterprises.

Platform Engineer (Database Reliability) - Remote Canada

18 days ago
Full-time
Canada
Key requirements: 5 years of experience, MySQL management, Cloud infrastructure (GCP), Kubernetes, Terraform, Linux systems administration, Monitoring and observability, Scripting (Bash, Go, Python, JavaScript), Incident response, Operational best practices
Bold Commerce

Bold Commerce is a Winnipeg-based B2B SaaS provider specializing in e-commerce applications for Shopify merchants, focusing on tools that enhance online store performance and sales.

Remote policy: Bold Commerce supports remote work from anywhere in Canada and the United States, allowing for flexible work arrangements across these regions.