Fresh remote Site Reliability Engineer jobs in North America

Explore the latest remote Site Reliability Engineer opportunities from leading companies hiring in North America. 50 jobs posted in the last 30 days.

Sr. Reliability Operations Engineer (Mexico)

about 9 hours ago
Full-time
Mexico
Key requirements: 5 years of experience, Incident response leadership, Grafana/Prometheus, GCP Monitoring, Automation scripting, Runbook development, Distributed systems support, IoT device operations, Operational documentation improvement, Incident management tools, Networking fundamentals
Serve Robotics

Serve Robotics is a B2B hardware robotics company based in the U.S. specializing in self-driving delivery robots for last-mile food delivery, targeting urban markets.

Remote policy: Serve Robotics is open to hiring qualified talent working remotely, with a preference for candidates located in the United States. Team members may be based in various locations, allowing for flexible collaboration.

Reliability Operations Engineer (Mexico)

about 9 hours ago
Full-time
Mexico
Key requirements: 2 years of experience, Grafana, Prometheus, GCP Monitoring, OpenTelemetry, Linux, Incident response, Cloud platforms, Runbooks, Distributed systems, IoT systems, Scripting, Networking fundamentals, Jira
Serve Robotics

Serve Robotics is a B2B hardware robotics company based in the U.S. specializing in self-driving delivery robots for last-mile food delivery, targeting urban markets.

Remote policy: Serve Robotics is open to hiring qualified talent working remotely, with a preference for candidates located in the United States. Team members may be based in various locations, allowing for flexible collaboration.

Kubernetes Engineer (REMOTE)

1 day ago
Full-time
United States
Key requirements: 5 years of experience, Kubernetes, AWS (EKS), CI/CD pipelines, Terraform, Ansible, Scripting (Bash, Python, PowerShell), Containerization, Hybrid environments
Xcellent Technology Solutions (XTS)

Xcellent Technology Solutions (XTS) is a Warrenton, Virginia-based B2G company specializing in geospatial technology and GEOINT services for the defense and federal government sectors.

Staff Cloud Infrastructure Engineer

4 days ago
Full-time
United States
Key requirements: 7 years of experience, AWS, GCP, Azure, Docker, Kubernetes, Terraform, Pulumi, Distributed systems, High availability, Fault tolerance, Cost efficiency, Technical leadership, Infrastructure strategy, Regulated industries
Valon Tech

Valon is a New York City-based fintech company specializing in AI-native mortgage servicing solutions, operating a proprietary platform for both B2B and B2C markets across the US.

Remote policy: Valon supports remote work and hires primarily within the United States, with team members located across various regions. Candidates are encouraged to apply regardless of their specific location within the US.

Senior Cloud Infrastructure Engineer

4 days ago
Full-time
United States
Key requirements: 3 years of experience, AWS, GCP, Azure, Vitess, ClickHouse, Redis, Docker, Kubernetes, Terraform, Pulumi, Distributed systems, Infrastructure-as-code, Product mindset, High-growth environments, Regulated industries
Valon Tech

Valon is a New York City-based fintech company specializing in AI-native mortgage servicing solutions, operating a proprietary platform for both B2B and B2C markets across the US.

Remote policy: Valon supports remote work and hires primarily within the United States, with team members located across various regions. Candidates are encouraged to apply regardless of their specific location within the US.

Staff Site Reliability Engineer

5 days ago
Full-time
North America, Worldwide
$220,000 to $250,000 per year
Key requirements: 15 years of experience, Kubernetes, Terraform, PostgreSQL, Incident Response, Distributed systems, Cloud infrastructure, Automation scripting, Linux operations, CI/CD systems
YugabyteDB

Yugabyte is a Sunnyvale, CA-based B2B SaaS provider of YugabyteDB, a PostgreSQL-compatible distributed SQL database designed for cloud-native applications across industries like cybersecurity and financial services.

Remote policy: YugabyteDB supports remote work and is likely to hire from various regions, with team members collaborating globally across time zones.

Staff Software Engineer, System Reliability

5 days ago
Full-time
United States, India
$212,000 to $286,200 per year
Key requirements: Distributed systems, Reliability improvements, Gamedays, Chaos testing, Load testing, Reliability scorecards, Observability standards, Post-incident learning, Mentoring engineers
Temporal Technologies

Temporal Technologies is a Bellevue, WA-based B2B software company specializing in an open-source durable execution system, serving diverse industries including fintech and e-commerce.

Remote policy: Temporal Technologies supports remote work for specific roles, including opportunities in the United States and India, with team members collaborating across various time zones.

Site Reliability Engineer II - Government Cloud

5 days ago
Full-time
United States
$96,000 to $120,000 per year
Key requirements: Google Cloud Platform (GCP), FedRAMP compliance, Kubernetes, CI/CD pipelines, NIST 800-53, Infrastructure as Code, Go (Golang)
Ping Identity

Ping Identity is a Denver-based B2B software company specializing in identity and access management (IAM) solutions for enterprise customers across technology, finance, healthcare, and government sectors.

Senior Site Reliability Engineer - Government Cloud

5 days ago
Full-time
United States
$105,000 to $130,000 per year
Key requirements: GCP, FedRAMP compliance, Kubernetes, Go, CI/CD pipelines, Infrastructure as Code, NIST 800-53 controls, Distributed systems architecture
Ping Identity

Ping Identity is a Denver-based B2B software company specializing in identity and access management (IAM) solutions for enterprise customers across technology, finance, healthcare, and government sectors.

Sr. Site Reliability Engineer (Database focused)

5 days ago
Full-time
United States
$136,100 to $174,210 per year
Key requirements: 7 years of experience, MySQL, Snowflake, Amazon Redshift, Database migration, Database performance tuning, High availability, AWS (S3, Lambda, IAM), Data pipeline tools (Apache Airflow, AWS Glue), Monitoring tools (CloudWatch, Prometheus), Scripting (Python, Bash)
iSpot.tv

iSpot.tv is a Bellevue, WA-based B2B company specializing in cross-platform TV and video ad measurement solutions, serving the advertising industry with real-time analytics and insights.

Remote policy: iSpot.tv supports a hybrid and flexible workplace, allowing employees to work remotely or in the office based on their location and role. Team members are located in various regions, including Bellevue, WA; El Segundo, CA; and New York, NY, with remote work options available for those outside these areas.

Senior Site Reliability Engineer

5 days ago
Full-time
North America, South America
$130,000 to $140,000 per year
Key requirements: 3 years of experience, AWS, Kubernetes, Database optimization, Incident response management, Observability tooling, North or South America based
Circle.so

Circle.so is a global SaaS platform for community building and online education, enabling creators and businesses to engage audiences through discussions, courses, and events.

Remote policy: Circle.so is a fully remote company with team members from over 30 countries, hiring globally and supporting candidates in various regions, including preferences for European and North/South American time zones.

NOC Engineer – Tier III

6 days ago
Full-time
United States
Key requirements: 8 years of experience, IP networking (BGP, OSPF, MPLS, VLANs, VPNs, QoS), Fiber transport systems (DWDM, CWDM, Ethernet), Juniper (Junos), Cisco, MikroTik, Calix XGS-PON, Fixed wireless and LTE platforms, Advanced diagnostic and analytical skills, Network monitoring and telemetry tools, Lead technical response during major incidents
Vero Networks

Vero Fiber Networks is a Boulder, Colorado-based telecommunications company specializing in fiber-to-the-premise (FTTP) broadband services for underserved communities across the US, operating in both B2B and B2C markets.

Remote policy: Vero Networks offers remote work opportunities for certain roles, including positions like the Director of Fiber Engineering. However, specific hiring locations and broader remote work policies are not clearly defined.

Senior Systems Performance Engineer 

7 days ago
Full-time
United States
$168,000 to $258,750 per year
Key requirements: 5 years of experience, Dynamo, TensorRT, Slurm, BCM, vLLM, SGLang, CUDA, cuBLAS, CUTLASS, Python, x86/Arm server architectures, GPU computing
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Distinguished Engineer, Cloud Site Reliability Engineering

7 days ago
Full-time
United States
$320,000 to $488,750 per year
Key requirements: 18 years of experience, Cloud infrastructure maintenance, AI development, Java, Python, Shell scripting, Distributed systems, REST APIs, SQL/NoSQL databases, Docker, Kubernetes, OpenStack, Machine Learning, Deep Learning, High-performance software design, Scalable software systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Software Engineer - Site Reliability Engineering

7 days ago
Full-time
United States
$130,000 to $165,000 per year
Key requirements: 5 years of experience, Ruby on Rails, TypeScript, AWS, CI/CD frameworks, Infrastructure automation, System monitoring, Observability platform, Performance optimization
Snapsheet Inc

Snapsheet is a Chicago-based B2B SaaS provider specializing in claims management technology for the property and casualty insurance industry, serving clients across the US, Canada, and Europe.

Remote policy: Snapsheet is a remote-first company, hiring from various regions including the United States, Canada, and Europe, with no office attendance required.

Helpdesk and Cloud Operations Engineer

9 days ago
Full-time
United States
$70,000 to $100,000 per year
Key requirements: 5 years of experience, Microsoft 365 administration, Cloud infrastructure management (Azure, AWS), PowerShell automation, Incident and problem management, Cybersecurity practices (NIST, CIS), CI/CD pipelines, Remote work independence
Gorilla Commerce

Gorilla Commerce is a Westport, CT-based B2C e-commerce company specializing in high-quality, affordable home and pet products, leveraging a data-driven approach to product development.

Remote policy: Gorilla Commerce adopts a remote-first approach, allowing flexible work arrangements for team members. While specific hiring locations are not detailed, the company supports collaboration across various regions.

SRE - Infra

11 days ago
Full-time
North America, Worldwide
Key requirements: Kubernetes (EKS), AWS multi-account management, Terraform/Terragrunt automation, Linux systems, Stateful systems support, Performance debugging, End-to-end system ownership
PostHog

PostHog is a San Francisco-based B2B SaaS platform offering an integrated suite of tools for product engineers to build, test, and analyze software products, with a focus on the global market.

Remote policy: PostHog is a fully remote company with a globally distributed team, currently hiring in time zones between GMT-8 and GMT+2.

Incident Manager

12 days ago
Full-time
United States
Key requirements: 5 years of experience, Incident management, SaaS experience, Healthcare experience, ITIL frameworks, PagerDuty, Jira, Datadog, Root Cause Analysis, Cross-functional leadership, Strategic documentation, Process improvement
SmithRx

SmithRx is a health-tech B2B Pharmacy Benefit Manager (PBM) focused on providing transparent pharmacy benefits and cost-saving solutions to employers and health plans across the U.S.

Senior Software Engineer, SRE

13 days ago
Full-time
North America, Worldwide
Key requirements: AWS expertise, Terraform, Kubernetes fundamentals, Amazon EKS, Go, Python, CI/CD pipelines, GitHub Actions, ArgoCD, Datadog, SLIs/SLOs
Socure

Socure is a U.S.-based B2B SaaS provider specializing in AI-driven identity verification and fraud prevention solutions for enterprises across financial services, e-commerce, and government sectors.

Remote policy: Socure is a fully remote organization, supporting team members across various locations, with some roles requiring in-person engagement in specific regions such as Washington, D.C.

Site Reliability Engineer - Canada Wide - Remote

13 days ago
Full-time
Canada
Key requirements: AWS, chaos engineering, incident management, SLIs/SLOs/SLAs, observability, Linux Shell, Python, JavaScript, Java, self-starter
Newton

Newton is a Canadian fintech company specializing in cryptocurrency trading, providing tools for financial freedom in the crypto market.

Remote policy: Newton operates with a remote team across Canada, welcoming applicants from this region to join their innovative and collaborative environment.

Cloud & AI Operations Administrator

15 days ago
Full-time
Canada
$67,000 to $82,000 per year
Key requirements: 5 years of experience, Azure, PowerShell, Azure CLI, Azure Active Directory, AI services management, CCNA, Microsoft Teams, Office 365, Cisco Meraki Solutions
Heart & Stroke

Heart & Stroke is a Canadian non-profit organization headquartered in Ontario, dedicated to preventing heart disease and stroke through research, advocacy, and public education, targeting diverse communities across Canada.

Remote policy: Heart & Stroke supports remote work for candidates residing in Canada, with the requirement to travel to the Toronto office when requested.

Senior Systems Software Engineer, AI Infrastructure

16 days ago
Full-time
United States
$184,000 to $287,500 per year
Key requirements: 5 years of experience, Python, C/C++ or Go or Perl or Ruby, Linux or Windows systems engineering, AWS or Azure or GCP or OCI, SRE principles, Terraform CDK, observability platforms, CI/CD systems, AI training and inferencing, deep learning frameworks, distributed systems, cloud or hardware health monitoring
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Software Engineer - Infrastructure

17 days ago
Full-time
United States
Key requirements: 8 years of experience, AWS infrastructure at scale, Infrastructure as Code (Terraform), Designing and operating distributed systems, CI/CD systems and automation pipelines, Containerized environments (Docker), Observability and performance tuning, Experience architecting infrastructure for multiple payment rails, Understanding of payment rail mechanics, Experience scaling high-volume financial workloads, Familiarity with security and risk standards for money movement
Modern Treasury

Modern Treasury is a San Francisco-based fintech B2B company specializing in payment operations tools, offering APIs and dashboards for automating money movement for enterprises.

Remote policy: Modern Treasury supports remote work for certain roles and primarily hires from the United States, with team members located in cities such as San Francisco and New York.

Intermediate Site Reliability Engineer, Environment Automation

17 days ago
Full-time
North America, Worldwide
Key requirements: Golang, Kubernetes, Terraform, Ansible, Infrastructure as Code, SaaS experience, Multi-tenant environments, Observability stack, Automation mindset, Git-based workflows
GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. The company embraces flexibility in scheduling to accommodate various time zones.

AI Platform Engineer

18 days ago
Full-time
United States
$168,000 to $322,000 per year
Key requirements: 10 years of experience, Python, Kubernetes, AI/ML platforms, Distributed systems, Observability design, AI-assisted development tools, Automation-first approach, AI-native infrastructure roadmaps
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer

18 days ago
Full-time
United States, Canada
Key requirements: 10 years of experience, Cloud-native infrastructure, Kubernetes deployment, Terraform, Ansible, IaC / GitOps tooling, On-prem compute knowledge, Platform optimization, Distributed systems troubleshooting, Python, Bash, CUDA-based GPU programs, Security in sensitive environments
Planet

Planet Labs is a San Francisco-based B2B company specializing in satellite imagery and earth data analytics for applications in agriculture, environmental monitoring, and disaster response.

Remote policy: Planet hires remotely from the United States, supporting team members to work from anywhere within the country.

Site Reliability Engineer

18 days ago
Full-time
United States, Canada
$160,600 to $200,800 per year
Key requirements: 10 years of experience, Cloud-native infrastructure, Kubernetes deployment, Terraform, Ansible, IaC / GitOps tooling, On-prem compute knowledge, Platform optimization, Distributed systems troubleshooting, Python, Bash, CUDA-based GPU programs, Security in sensitive environments
Planet

Planet Labs is a San Francisco-based B2B company specializing in satellite imagery and earth data analytics for applications in agriculture, environmental monitoring, and disaster response.

Remote policy: Planet hires remotely from the United States, supporting team members to work from anywhere within the country.

Senior Infrastructure Engineer

18 days ago
Full-time
United States, Canada
$200,700 to $250,900 per year
Key requirements: 5 years of experience, AWS, Terraform, Prometheus, Grafana, OpenTelemetry, Technical writing, Infrastructure project leadership, Regulated environments experience, Large-scale Terraform implementations, Coding experience
Mercury

Mercury is a B2B financial company offering simplified business banking services, primarily for businesses in the North American market.

Remote policy: Mercury offers remote work opportunities for various positions, with current roles available for candidates in Canada and the United States. For specific hiring regions and remote work policies, please refer to the company's careers page.

Senior Platform Engineer

19 days ago
Full-time
North America, Worldwide
$100,000 to $150,000 per year
Key requirements: AWS, Kubernetes, Terraform, CI/CD pipelines, Go, SRE principles, AI-assisted engineering tools, Cloud security, Production observability technologies
Trust Wallet

Trust Wallet is a leading B2C multi-chain, non-custodial cryptocurrency wallet enabling users to manage over 10 million digital assets, operating as a remote company with a global user base.

Remote policy: Trust Wallet operates as a fully remote company, hiring globally with team members working from various countries. Candidates must have the right to work in their respective locations.

Senior Site Reliability Engineer

19 days ago
Full-time
North America, Worldwide
$113,082 to $175,725 per year
Key requirements: 6 years of experience, Puppet, Kubernetes, Python, Linux troubleshooting, Distributed caching systems, TCP/IP, HTTP, TLS, DNS, Incident response, Automation of tasks, Monitoring tools (Prometheus, Grafana)
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.