Fresh remote Site Reliability Engineer jobs in North America

Explore latest remote Site Reliability Engineer opportunities from leading companies hiring in North America. 50 jobs posted last 30 days.

Filter by Location

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Fresh remote Site Reliability Engineer jobs in North America (50) - Page 2

Senior Software Engineer, Infrastructure Automation and Distributed Systems

20 days ago
Full-time
North America
$224,000 to $431,250 per year
Key requirements: 12 years of experience, Infrastructure automation, Distributed systems design, Python, Go, Perl, Ruby, Linux, Networking, Storage, Containers, Multi-cloud infrastructure, Kubernetes, OpenStack, Docker, Slurm, NVIDIA Collective Communication Library (NCCL)
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Software Engineer - SRE (Remote)

20 days ago
Full-time
United States
Key requirements: 8 years of experience, Site Reliability Engineering, DevOps, Kubernetes, AWS, On-call experience, Monitoring and alerting, Collaboration across teams
Rula

Rula Health is a remote-first B2C telehealth SaaS platform based in the U.S., specializing in online therapy and psychiatry services for individuals aged 5 and older, addressing over 90 mental health conditions.

Remote policy: Rula Health is a 100% remote-first company, hiring primarily in the United States, with the exception of Hawaii.

FBS AIOps Engineer

20 days ago
Full-time
North America, Worldwide
Key requirements: 2 years of experience, Dynatrace, Copilot Studio, AIOps platform design, Event correlation, Runbook automation, ITSM integration, Anomaly detection, Consultative partnership
Capgemini

Capgemini is a Paris-based B2B IT services and consulting company specializing in digital transformation, technology services, and consulting, serving diverse industries globally.

Remote policy: Capgemini supports flexible work arrangements, including remote and office-based options, and operates globally across more than 40 countries, welcoming applicants from various regions.

Senior Production Engineer

21 days ago
Full-time
United States
$165,000 to $195,000 per year
Key requirements: 5 years of experience, AWS, Kubernetes (EKS), Terraform, Go, Python, CI/CD systems, Observability tools, SLIs/SLOs implementation, GenAI tools
Legion

Legion is a remote B2B SaaS provider specializing in intelligent automation workforce management solutions for labor-intensive industries, headquartered in the United States.

Sr Cloud Engineer (Contract-to-Hire)

21 days ago
Contract
United States
$140,000 to $165,000 per year
Key requirements: 5 years of experience, Cloud-native solutions, HITRUST & SOC2 compliance, Infrastructure automation, Containerization (Docker, Kubernetes), DevSecOps principles, CI/CD pipelines (Azure DevOps), Observability platforms (Datadog), Multi-cloud deployment, Stateful database infrastructure, Microservices architecture
Lirio

Lirio is a U.S.-based healthtech B2B SaaS company specializing in behavioral health interventions and personalized care navigation through its AI-driven platform.

Remote policy: Lirio supports remote work with opportunities for hybrid arrangements for candidates located in Tennessee. Currently, hiring is focused on candidates authorized to work in the US.

Application Support Engineer

21 days ago
Full-time
United States
$100,000 to $120,000 per year
Key requirements: 2 years of experience, Microservices architectures, Full-stack troubleshooting, Log analysis tools, Incident management methodologies, FinTech experience, Jira, Confluence, Scripting skills, Customer satisfaction focus, High-pressure problem-solving, Excellent communication skills
Lumin Digital

Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.

Remote policy: Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.

Senior DevOps / SRE Engineer

23 days ago
Full-time
United States
$120,000 to $150,000 per year
Key requirements: Kubernetes (EKS), Blockchain reliability, Zero-downtime operations, CI/CD pipeline development, Observability tooling, Real-time systems, Infrastructure as Code (IaC), Incident leadership, Security focus
MLabs

MLabs is a remote fintech company specializing in DeFi solutions, providing a unified API for financial institutions to access on-chain liquidity across major blockchains.

Remote policy: MLabs supports remote work for positions located within the EMEA region, allowing for flexible hours and a remote-first environment.

Senior DevOps/SRE Engineer

24 days ago
Full-time
North America, Worldwide
$100,000 to $150,000 per year
Key requirements: 6 years of experience, AWS services, Kubernetes, Terraform, GitLab CI, VictoriaMetrics, Prometheus, Grafana, Apache Kafka, Bash, Python, Go
capital.com

Capital.com is a Cyprus-based fintech B2C online trading platform specializing in CFDs and spread betting across over 3,000 global financial markets.

Remote policy: Capital.com offers remote work opportunities, including the flexibility to work from various locations, with team members enjoying benefits such as 30 extra days to work remotely from anywhere in the world.

Senior Site Reliability Engineer- Remote

25 days ago
Full-time
United States
Key requirements: 8 years of experience, Go, Python, AWS, Azure, Google Cloud, Kubernetes, Ansible, Terraform, Distributed databases, ClickHouse, Incident management, Post-mortem analysis, Problem solving, Accountability
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Site Reliability Engineer- Remote

25 days ago
Full-time
North America, Worldwide
$141,000 to $208,000 per year
Key requirements: 8 years of experience, Go, Python, AWS, Azure, Google Cloud Platform, Kubernetes, Docker Swarm, Ansible, Terraform, Puppet, Distributed databases, SQL, ClickHouse
ClickHouse

ClickHouse is a San Francisco-based B2B open-source column-oriented database system specializing in real-time analytics and SQL querying for enterprises globally.

Remote policy: ClickHouse is a globally distributed and remote-friendly company, operating in 20 countries, allowing for flexible work arrangements across various regions.

Senior Cloud Network Engineer (US Remote)

26 days ago
Full-time
United States
$110,000 to $140,000 per year
Key requirements: 7 years of experience, AWS network security, Palo Alto NGFW, Multi-cloud network management, CloudFormation, Terraform, IP overlap and static routing, Public Cloud architecture (Azure/AWS), Network performance testing methodologies, Centralized cloud connectivity design, Automation methodologies
First Advantage

First Advantage is an Atlanta-based HR Tech B2B SaaS provider specializing in global background screening and compliance solutions for various industries.

Remote policy: First Advantage offers flexibility with the possibility to work remotely, supporting a global workforce across various regions. Team members are located in 17 countries, allowing for collaboration across time zones.

Site Reliability Engineer

26 days ago
Full-time
North America, Worldwide
Key requirements: 3 years of experience, SLIs/SLOs definition, Multi-tenant SaaS platforms, Datadog, Grafana, Elastic Stack, Kubernetes, High-availability architectures, Incident response leadership, Automation and process improvement, Cloud experience (Azure preferred), Capacity planning and load testing
HostPapa

HostPapa is a Canadian-based web hosting company offering B2B and B2C solutions, including shared, reseller, and VPS hosting services, with a focus on small businesses and a global presence.

Remote policy: HostPapa offers remote work opportunities and hires from various locations, with team members and customers in 39 countries around the globe.

Systems Engineer

27 days ago
Full-time
United States
$78,900 to $116,760 per year
Key requirements: 5 years of experience, Windows server administration, Linux server administration, SRE methodologies, DevOps methodology, VMware/Nutanix administration, Incident management, Automation focus, Flexible working hours willingness
rockstargames

Rockstar Games is a New York City-based video game publisher specializing in action-adventure and racing games, operating primarily as a B2C company with a global reach.

Senior Site Reliability Engineer

27 days ago
Full-time
North America, Worldwide
$120,000 to $180,000 per year
Key requirements: 8 years of experience, Kubernetes, Multi-cloud experience, Terraform, CI/CD processes, Stateful software on Kubernetes, Kubernetes best practices, Debugging Kubernetes clusters, Scripting (bash, Python, Go)
Diagrid

Diagrid is a technology company providing a B2B SaaS platform for workflow orchestration and AI agent development, headquartered in an unspecified location, serving various industries including financial services and healthcare.

Remote policy: Diagrid operates with a fully remote and flexible work environment, supporting collaboration across various regions, including the United States and Europe.

Senior HPC and LSF Operations Engineer

27 days ago
Full-time
United States
$152,000 to $287,500 per year
Key requirements: 5 years of experience, HPC scheduling systems, LSF, Slurm, Linux systems administration, Reliability engineering practices, Observability systems, Container technologies
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Cloud Operations Engineer

27 days ago
Full-time
United States
$110,000 to $125,000 per year
Key requirements: 3 years of experience, Cloudwatch, AWS, CI/CD pipelines, Incident triage, SSL/TLS management, Automation/orchestration tools, Cross-functional collaboration, Exceptional communication skills
Lumin Digital

Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.

Remote policy: Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.

Site Reliability Engineer - AI & ML Infrastructure (Kubernetes, AWS & Terraform)

28 days ago
Full-time
United States
Key requirements: 5 years of experience, Kubernetes, Terraform, Slurm, High-performance computing, Bare metal infrastructure management, Python, AWS
Deepgram

Deepgram is a San Francisco-based B2B SaaS provider specializing in Voice AI solutions, offering speech-to-text and text-to-speech APIs for developers and enterprises in various sectors.

Site Reliability Engineer - AI & ML Infrastructure (Kubernetes, AWS & Terraform)

28 days ago
Full-time
United States
Key requirements: 5 years of experience, Kubernetes, Terraform, AWS, Slurm, Bare metal infrastructure, High-performance computing, Scripting (Python, Go, Bash)
deepgram

Deepgram is a San Francisco-based B2B AI company specializing in speech-to-text (STT) and text-to-speech (TTS) technologies, providing real-time APIs for developers in the Voice AI industry.

Senior Site Reliability Engineer

28 days ago
Full-time
United States
$172,614 per year
Key requirements: 2 years of experience, Kubernetes, Infrastructure as Code, Python, Go, JavaScript, Shell script, Observability platform, SLI/SLO/SLAs, Security risk assessments
Loadsmart

Loadsmart is a Chicago-based logistics technology company specializing in innovative freight management solutions for the B2B market.

Remote policy: Loadsmart supports remote work and has a globally distributed team, currently hiring for remote positions in Brazil.

Senior Infrastructure & Security Engineer

about 1 month ago
Full-time
United States
$160,000 to $170,000 per year
Key requirements: 6 years of experience, AWS CDK, HIPAA compliance, SOC 2 compliance, AWS Lambda, AWS ECS, AWS S3, AWS CloudWatch, AWS IAM, AWS VPC, AWS WAF, Monitoring and incident management, TypeScript, CI/CD pipelines, Multi-region AWS architecture, Healthcare industry experience
Koda Health

Koda Health is a healthcare technology company offering a cloud-based SaaS platform for advance care planning, headquartered in the US, targeting both B2B partnerships with health systems and direct B2C access for patients.

Remote policy: Koda Health offers fully remote positions for U.S.-based candidates, supporting a flexible work environment across various time zones.