Fresh remote Site Reliability Engineer jobs in Canada

Explore latest remote Site Reliability Engineer opportunities from leading companies hiring in Canada. 21 jobs posted last 30 days.

Subscribe to our Telegram bot to receive instant notifications about new remote jobs

TelegramSubscribe Now

Fresh remote Site Reliability Engineer jobs in Canada (21)

Senior Release Engineer

2 days ago
Full-time
Canada, Worldwide
Key requirements: 4 years of experience, Kubernetes, Helm, CI/CD pipelines, Infrastructure as Code, GitOps tooling, Security tooling integration, Linux systems, Networking fundamentals, Distributed systems
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Agora - Senior Infrastructure Engineer

3 days ago
Full-time
Canada, North America, South America
Key requirements: 5 years of experience, TypeScript, Kubernetes, AWS architecture, Infrastructure-as-code, Distributed systems fundamentals, Reusable systems and abstractions, SLIs/SLOs and incident tooling, Observability practices, GitOps patterns, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Agora - Senior Infrastructure Engineer

3 days ago
Full-time
Canada, North America, South America
Key requirements: 5 years of experience, TypeScript, AWS architecture, Kubernetes, Infrastructure-as-code, Distributed systems, Reusable systems, SLIs/SLOs, Observability practices, GitOps, Developer Productivity, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Site Reliability Engineer

4 days ago
Full-time
Canada, Worldwide
Key requirements: Linux/Unix, Cloud providers (AWS, Google Cloud, Azure), Infrastructure provisioning (Terraform, CloudFormation, Ansible), Containerization (Docker), Orchestration (Kubernetes), Monitoring tools (Prometheus, Grafana, Datadog), CI/CD pipelines (Jenkins, GitLab CI, CircleCI), Incident management practices, Scripting (Bash, Perl)
OXIO

OXIO is a North America-based B2B Telecom-as-a-Service (TaaS) platform enabling businesses to build and manage customizable mobile networks through a cloud-based solution.

Remote policy: OXIO supports flexible work arrangements and hires from various regions, with team members located in cities such as New York, Mexico City, and Montreal. Candidates are encouraged to apply regardless of their location.

Senior AI Tools Engineer, SRE Operations - GeForce NOW

6 days ago
Full-time
Canada, North America
$144,000 to $230,000 per year
Key requirements: 5 years of experience, Python, AI tools development, LLM-based systems, Kubernetes, AWS, Data pipeline management, Automation, SRE principles, Monitoring tools (Grafana)
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Site Reliability Engineer (AI Enablement)

10 days ago
Full-time
Canada
$153,400 to $230,400 per year
Key requirements: 8 years of experience, AI-assisted development tools, Building AI/LLM-powered developer tools, AI Enablement Strategy, Proficiency in prompt engineering, Go or Python proficiency, AWS production environment experience, Terraform expertise, Container orchestration (ECS/Kubernetes), Observability practices, Driving org-wide tooling adoption
Coalition, Inc.

Site Reliability Engineer

10 days ago
Full-time
Canada, United States, United Kingdom, Brazil, Japan, Nigeria
$100,000 to $150,000 per year
Key requirements: 4 years of experience, PostgreSQL, Kubernetes, GitOps, Cloud networking, Incident response, Go, Python, Observability stack
Alpaca

Alpaca is a US-based fintech company providing self-clearing brokerage infrastructure and APIs for stocks, ETFs, options, and crypto, serving financial institutions globally.

Remote policy: Alpaca embraces a remote-first culture, hiring globally from various regions including the USA, Canada, Japan, Hungary, Nigeria, Brazil, and the UK, allowing team members to work from their preferred locations.

Technology Operations Manager

12 days ago
Full-time
Canada, Worldwide
$200,000 to $225,000 per year
Key requirements: 5 years of experience, AWS, Hybrid cloud infrastructure, Site Reliability Engineering, Incident investigation, Observability practices, Service reliability, Root cause management, Data center technologies, Virtualization, Infrastructure as Code (IaC), Operational KPIs
Business Wire

Business Wire is a San Francisco-based B2B service provider specializing in global news release distribution and regulatory disclosure for various industries, including finance and healthcare.

Remote policy: Business Wire supports remote work and hires from various locations, with team members collaborating across different time zones.

Senior Production Engineer - DGX Cloud

14 days ago
Full-time
Canada, North America
$168,000 to $333,500 per year
Key requirements: 8 years of experience, Production Engineering, DevOps, SRE, Kubernetes, Slurm, Go, Python, Large-scale distributed systems, Incident management, Monitoring and alerting, Automated deployments
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Staff Site Reliability Engineer

16 days ago
Full-time
Canada, United States, United Kingdom, Singapore, India, Ireland, Finland
$120,000 to $180,000 per year
Key requirements: 8 years of experience, Production SaaS systems, Python, AWS, Kubernetes, Networking fundamentals, Monitoring & alerting, Advanced observability, Incident management, Troubleshooting skills, AIOps strategy
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Cloud Reliability & Recovery Engineer

16 days ago
Full-time
Canada, United States, United Kingdom, Singapore, India, Ireland, Finland
$100,000 to $150,000 per year
Key requirements: 5 years of experience, AWS expertise, Disaster Recovery architecture, Multi-region failover, Terraform, Kubernetes, CI/CD pipelines, Python scripting, AWS Backup administration, Chaos engineering, Business Continuity Planning
AlphaSense

AlphaSense is a New York City-based B2B fintech platform specializing in AI-driven market intelligence and search solutions for financial institutions and top companies globally.

Remote policy: AlphaSense supports remote work and hires from various regions, with team members located in countries such as the United States, U.K., Finland, India, Singapore, Canada, and Ireland.

Senior Infrastructure Engineer, Government Systems

17 days ago
Full-time
Canada, North America, Middle East
Key requirements: Kubernetes, Terraform, AWS, CI/CD, GitOps, Linux administration, Operational mindset, Security compliance
Chainalysis

Chainalysis is a New York City-based B2B blockchain analysis firm specializing in compliance and investigation software for the cryptocurrency and financial sectors, serving clients globally.

Remote policy: Chainalysis supports remote work and is open to hiring from various regions, including North America and the Middle East, with team members located across multiple countries.

Senior Site Reliability Engineer, Infrastructure Foundations

18 days ago
Full-time
Canada, Worldwide
$113,082 to $175,725 per year
Key requirements: 6 years of experience, Puppet, Kubernetes, Python, Linux system-level troubleshooting, Infrastructure security management, Incident response leadership, Automation of tasks and processes, Monitoring and logging infrastructure (Prometheus, Grafana), Open source software contribution, Security incident technical response
Wikimedia Foundation

The Wikimedia Foundation is a San Francisco-based nonprofit organization providing free, multilingual educational content through its wiki-based projects, including Wikipedia, targeting a global audience.

Remote policy: The Wikimedia Foundation is a remote-first organization, hiring globally from various countries including the United States, Canada, and many others across different continents. Team members collaborate across time zones, supporting a diverse and inclusive workforce.

Platform Engineer (Database Reliability) - Remote Canada

18 days ago
Full-time
Canada
Key requirements: 5 years of experience, MySQL management, Cloud infrastructure (GCP), Kubernetes, Terraform, Linux systems administration, Monitoring and observability, Scripting (Bash, Go, Python, JavaScript), Incident response, Operational best practices
Bold Commerce

Bold Commerce is a Winnipeg-based B2B SaaS provider specializing in e-commerce applications for Shopify merchants, focusing on tools that enhance online store performance and sales.

Remote policy: Bold Commerce supports remote work from anywhere in Canada and the United States, allowing for flexible work arrangements across these regions.

Platform Engineer (Database Reliability) - Remote Canada

18 days ago
Full-time
Canada
Key requirements: 5 years of experience, MySQL management, Cloud infrastructure (GCP), Kubernetes, Terraform, Linux systems administration, Monitoring and observability, Scripting (Bash, Go, Python, JavaScript), Incident response, Configuration management (Ansible)
Bold Commerce

Bold Commerce is a Winnipeg-based B2B SaaS provider specializing in e-commerce applications for Shopify merchants, focusing on tools that enhance online store performance and sales.

Remote policy: Bold Commerce supports remote work from anywhere in Canada and the United States, allowing for flexible work arrangements across these regions.

Infrastructure Engineer

21 days ago
Full-time
Canada, Worldwide
Key requirements: 5 years of experience, Kubernetes, Cloud security fundamentals, Production infrastructure in DevOps/DevSecOps, Full-stack engineering, Major cloud providers, Packaging workloads for air-gapped environments, Ability to obtain SECRET clearance
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Site Reliability Engineer

21 days ago
Contract
Canada, United States
Key requirements: 3 years of experience, Terraform, Prometheus, Grafana, CI/CD, Incident Response, NIST SP 800-53, Python, Docker, Kubernetes, AWS, Azure, GCP
Arctiq

Arctiq is a Toronto-based B2B DevOps and cloud solution integrator specializing in professional IT services and managed services for enterprise organizations across North America.

Senior DevOps / Platform Reliability Engineer

25 days ago
Full-time
Canada, Worldwide
$120,000 to $160,000 per year
Key requirements: 5 years of experience, GitHub Actions, Terraform, Kubernetes (EKS), Cloudflare, Prometheus, Grafana, OpenTelemetry, AWS networking, CI/CD pipelines, AI-native DevOps, Lambda, Kafka/MSK, Security best practices, Auto-remediation agents, Model Context Protocol (MCP)
Zingtree

Zingtree is an AI-powered B2B SaaS platform headquartered in an unspecified location, specializing in interactive decision trees for customer experience management and workflow automation in the customer service industry.

Remote policy: Zingtree supports flexible remote work, allowing employees to work from anywhere, although specific hiring locations are not detailed.

Principal Platform Infrastructure Engineer (Containers)

26 days ago
Full-time
Canada
$141,000 to $249,000 per year
Key requirements: Kubernetes expertise, Terraform, Google Cloud Platform, GitOps methodologies, Python, Bash, Go, Network protocols, Observability solutions
Menlo Security

Menlo Security is a Mountain View, CA-based B2B cybersecurity company specializing in secure enterprise browser solutions that protect against phishing and malware for government agencies and global enterprises.

Remote policy: Menlo Security supports remote work and hires from various regions globally, with team members located in places such as India and the United States, allowing for collaboration across time zones.

Senior Software Engineer - SRE

27 days ago
Full-time
Canada, Worldwide
Key requirements: AWS expertise, Kubernetes fundamentals, Terraform at scale, Production-quality code in Go/Python, CI/CD pipeline experience, GitHub Actions, ArgoCD, Observability principles, Datadog or similar tool, SLIs/SLOs for reliability
Socure

Socure is a U.S.-based B2B SaaS provider specializing in AI-driven identity verification and fraud prevention solutions for enterprises across financial services, e-commerce, and government sectors.

Remote policy: Socure is a fully remote organization, supporting team members across various locations, with some roles requiring in-person engagement in specific regions such as Washington, D.C.

Distinguished Site Reliability Engineer - Cloud

29 days ago
Full-time
Canada, North America
$320,000 to $488,750 per year
Key requirements: 16 years of experience, Kubernetes, OpenStack, Python, Go, Perl, Ruby, Infrastructure automation, Distributed systems design, Linux, Networking, Containers
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.