Fresh remote Site Reliability Engineer jobs in North America

$120,000 to $165,000 per year

Key requirements: 5 years of experience, SLI/SLO frameworks, Incident response leadership, Datadog, Infrastructure as Code (Terraform), Kubernetes management, CI/CD pipeline ownership, SAST/DAST/SCA integration, Cloud security fundamentals, Policy-as-code frameworks, B2C/mobile backend experience

MyFitnessPal

MyFitnessPal is a health and fitness tracking mobile application (SaaS) based in the health and wellness technology industry, serving individual users globally with tools for nutrition and fitness management.

Site Reliability Engineer (US - Central/Eastern time)

about 9 hours ago

Key requirements: Kubernetes (EKS), AWS multi-account management, Terraform/Terragrunt automation, Linux systems, Stateful systems support, Performance debugging, End-to-end system ownership

PostHog

PostHog is a San Francisco-based B2B SaaS platform offering an integrated suite of tools for product engineers to build, test, and analyze software products, with a focus on the global market.

Remote policy: PostHog is a fully remote company with a globally distributed team, currently hiring in time zones between GMT-8 and GMT+2.

Staff Platform Engineer

about 9 hours ago

$190,000 to $230,000 per year

Key requirements: 5 years of experience, Rust, Python, Kubernetes, AWS, Terraform, GitOps, Incident response, Data systems fluency, Agent fluency, Guardrail engineering

Postscript

Postscript is a remote-based SaaS company specializing in SMS marketing solutions for eCommerce brands, targeting the B2B market with a focus on enhancing customer engagement and driving sales.

Remote policy: Postscript is a fully remote organization, hiring from various locations globally, allowing team members to work from anywhere.

Monitoring Engineer

1 day ago

Key requirements: 6 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, Distributed tracing, High-cardinality metrics, SLOs, CI/CD integration, Linux internals, Container platforms

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

OpenShift Administrator

1 day ago

Key requirements: 6 years of experience, OpenShift, Kubernetes internals, Linux administration, Infrastructure-as-code (Ansible, Terraform, Helm), CI/CD pipelines (Tekton, Jenkins, Argo CD), Scripting (Bash, Python, Go), Monitoring and logging tools, Container image security, Disaster recovery strategies, Cloud environments (AWS, Azure, GCP)

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Cloud Infrastructure Network Engineer

1 day ago

Key requirements: 6 years of experience, Cloud networking architecture, VPC/VNet design, AWS Transit Gateway, Hybrid connectivity, Infrastructure-as-code with Terraform, Network security controls, Kubernetes networking, Multi-cloud networking, SD-WAN familiarity, eBPF-based networking tools

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Observability Engineer

$100,000 to $160,000 per year

Key requirements: 12 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, Distributed tracing, High-cardinality metrics, SLOs, CI/CD integration, Linux internals, eBPF-based tooling

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Platform Networking Engineer

Key requirements: 6 years of experience, Istio, Linkerd, mTLS, Envoy, Kubernetes, Go, Python, Distributed tracing, Traffic management policies, Zero-trust patterns

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Service Mesh Engineer

Key requirements: 6 years of experience, Istio, Linkerd, Envoy, mTLS, Kubernetes, Go, Python, Distributed tracing, Traffic management policies, Zero-trust patterns

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Systems Observability Specialist

Key requirements: 6 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, Distributed tracing, High-cardinality metrics, SLOs, CI/CD integration, Linux internals, Container platforms

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Virtual Platform Engineer

Key requirements: 6 years of experience, vSphere, vSAN, NSX-T, Tanzu Kubernetes Grid, PowerCLI, Terraform, Ansible, vRealize Automation, disaster recovery, VMware Cloud Foundation, VMware Cloud on AWS

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Platform Engineer II

$118,000 to $151,000 per year

Key requirements: AWS, Docker, Kubernetes, Python, CI/CD, Terraform, Observability, Security best practices, Production systems operation

Plüm énergie

Plüm énergie is a French B2C startup supplying green electricity with a focus on energy reduction and customer savings, operating in the renewable energy sector.

Senior Cloud Operations Engineer (Hybrid) Bellevue, WA; Newtown Square, PA; New York, New York, or San Francisco, CA

Key requirements: 4 years of experience, Azure administration, Azure DevOps, Identity management, Zero Trust architecture, Scripting/automation, Monitoring tools, Cost optimization, Cloud-native application design, CI/CD pipeline management, Disaster recovery strategies

Arctiq

Arctiq is a Toronto-based B2B DevOps and cloud solution integrator specializing in professional IT services and managed services for enterprise organizations across North America.

Staff Infrastructure Engineer – Kubernetes Platform

$120,000 to $160,000 per year

Key requirements: 7 years of experience, Kubernetes at scale, Multi-tenant cluster models, Control plane architecture design, CNI plugins (Cilium preferred), Multi-region scaling, Deep troubleshooting across Kubernetes, Networking stack expertise, Observability platforms (Prometheus, Grafana), Virtual cluster technologies (vcluster, Kamaji)

TensorWave

TensorWave is a Las Vegas-based B2B cloud computing provider specializing in AI and high-performance computing infrastructure, utilizing AMD Instinct GPUs to deliver scalable solutions for enterprises and AI researchers.

Cloud Infrastructure Engineer – AWS

Key requirements: 6 years of experience, AWS core services, Infrastructure-as-code (Terraform, CloudFormation), Amazon EKS or ECS, CI/CD pipelines, Cloud security, Observability and monitoring, Python scripting, Multi-account AWS Organizations, Cost optimization frameworks, Regulated workloads (HIPAA, PCI-DSS)

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

DevOps & SRE Engineer

Key requirements: 6 years of experience, Kubernetes, Prometheus, Grafana, Python, Go, CI/CD pipelines, Chaos engineering, SLOs and error budgets, Linux at scale, Observability tooling

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Infrastructure Engineer – Automation

Key requirements: 6 years of experience, Terraform, Infrastructure-as-code, CI/CD pipelines, Policy-as-code, Cloud networking, Multi-cloud provisioning, Python, Git-based workflows, Terraform state management

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Kafka Platform Engineer

Key requirements: 6 years of experience, Kafka internals, Kafka security (SASL, mTLS, ACLs, RBAC), Kafka Connect, Schema Registry, Kafka Streams or ksqlDB, HA/DR strategies for Kafka, Infrastructure-as-code (Terraform, Ansible), Observability tooling for Kafka, Confluent Certified Administrator or Developer, Kafka on Kubernetes (Strimzi, Confluent Operator)

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Platform Infrastructure Engineer

Key requirements: 5 years of experience, OpenShift, Kubernetes, Linux administration, Infrastructure-as-code (Ansible, Terraform), CI/CD pipelines (Tekton, Jenkins), Scripting (Bash, Python, Go), Monitoring tools (Prometheus, Grafana), Container image security, Multi-tenant platform design, Disaster recovery strategies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Reliability Monitoring Engineer

Key requirements: 5 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, SLOs, distributed tracing, high-cardinality metrics, CI/CD integration, Linux internals, container platforms

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Telemetry Engineer

Key requirements: 6 years of experience, Prometheus, Grafana, Datadog, OpenTelemetry, Distributed tracing, High-cardinality metrics, SLOs, CI/CD integration, Linux internals, Container platforms

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Senior Systems Software Engineer, Observability and Telemetry Platform

4 days ago

$184,000 to $356,500 per year

Key requirements: 8 years of experience, Infrastructure automation, Distributed systems design, Python, Kubernetes, OpenStack, Grafana, Observability tools, Systematic problem-solving, Strong communication skills

NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer, Cloud

4 days ago

$116,000 to $189,750 per year

Key requirements: 3 years of experience, AWS Cloud Platform, Kubernetes, Python, ML model serving frameworks, MLOps frameworks, Akamai CDN, SRE on-call experience, Automation of operational steps, Incident management process, Generative AI/LLM applications

NVIDIA

Site Reliability Engineer (SRE)

4 days ago

Key requirements: 5 years of experience, Kubernetes, Python, Go, Prometheus, Grafana, CI/CD pipelines, Chaos engineering, SLOs and error budgets, Distributed systems design, Incident response leadership

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Site Reliability Engineer, Intermediate to Senior Staff — Infrastructure Platforms

6 days ago

$126,400 to $314,400 per year

Key requirements: Kubernetes, Terraform, Go, AWS, GCP, Infrastructure as Code, Observability practices, Automation, Incident response, Strong written communication

GitLab

GitLab is a San Francisco-based DevOps platform offering B2B and B2C solutions for software development, security, and collaboration, with a global presence.

Remote policy: GitLab is a fully remote company that hires globally, with team members located in over 65 countries. We embrace flexibility in scheduling to accommodate various time zones.

Senior Infrastructure Engineer

7 days ago

Key requirements: 5 years of experience, Network architecture, Observability strategy, Terraform, Datadog, SNMPv2 and SNMPv3, GitHub for CI/CD, Cloud platforms (AWS, Azure, GCP), Scripting/automation (Python, PowerShell)

EverOps

EverOps is a San Francisco-based B2B consulting firm specializing in DevOps and IT services, focusing on cloud operations for innovative companies across various industries.

Remote policy: EverOps is a fully remote company, having operated remotely since its inception, and hires from various locations globally.

Observability Implementation Consultant

7 days ago

Contract

Key requirements: Datadog, Legacy monitoring migration, Observability concepts, Cloud platforms (AWS, Azure, GCP), Infrastructure as Code (Terraform), AI-driven alerting (Watchdog), Serverless observability, Strong troubleshooting, Excellent communication

Arctiq

Arctiq is a Toronto-based B2B DevOps and cloud solution integrator specializing in professional IT services and managed services for enterprise organizations across North America.

Site Reliability Specialist

8 days ago

Key requirements: Terraform, Kubernetes, AWS, GitLab CI/CD, Python, Bash, Go, Prometheus, Grafana, Ansible, Chef, AI-assisted engineering tools

Canada

Ubisoft

Ubisoft is a French video game developer and publisher headquartered in Saint-Mandé, specializing in AAA titles for global consumers in the B2C gaming industry.

Customer Reliability Engineer

8 days ago

Key requirements: 7 years of experience, AWS Cloud (EC2, RDS, S3, VPC, IAM), Network and Security troubleshooting, Scripting (Ruby, Python, Bash, Powershell), Infrastructure as Code (Terraform, CDK, CloudFormation), Production On-call experience, Windows/Linux server administration, Monitoring platforms (CloudWatch, Grafana, Datadog), Experience with AI-driven development environments

OpsGuru, a Carbon60 Company

OpsGuru is a Vancouver-based B2B cloud consulting firm specializing in AWS solutions, data modernization, and generative AI, serving SMBs and enterprises globally.

Remote policy: OpsGuru embraces a remote-first work environment, offering flexibility in work hours and location. While specific hiring regions are not detailed, the company supports a global team, welcoming applicants from various locations.

Senior Cloud Infrastructure Engineer

9 days ago