Fresh remote Site Reliability Engineer jobs in the United States

Explore the latest remote Site Reliability Engineer opportunities from leading companies hiring in the United States. 36 jobs were posted in the last 30 days.

VibeOps Engineer (remote)

3 days ago
Full-time
United States
$55 to $70 per hour
Key requirements: Infrastructure operations, SRE practices, AI-powered operational assistants, Highly available production systems, Linux/Unix or Windows Server administration, Cloud infrastructure operations, Incident management processes, Root cause analysis, Banking operations knowledge, IBM Z technologies
AIDA Recruitment

AIDA Recruitment is a Lithuania-based B2B recruitment firm specializing in AI-driven candidate screening for the tech sector, with a global reach.

Remote policy: AIDA Recruitment operates remotely with team members based in Lithuania and hires globally from various regions, including the EU, Ukraine, and India, supporting flexible work arrangements.

Senior Deployment Engineer

3 days ago
Full-time
United States
$137,040 to $171,300 per year
Key requirements: 8 years of experience, Electrical systems, Mechanical/HVAC systems, Controls/BAS systems, Low-voltage systems, BMS, EPMS, and DCIM familiarity, Independent judgment in deployments, Root cause analysis in operational environments, Technical leadership in field execution
Armada

Armada is a global edge computing startup specializing in IoT and AI solutions for remote areas, primarily serving B2B clients across various industries.

Deployment Engineer

3 days ago
Full-time
United States
$113,760 to $142,200 per year
Key requirements: 4 years of experience, Electrical systems, Mechanical/HVAC systems, Controls/BAS systems, Low-voltage systems, BMS, EPMS, DCIM familiarity, Independent troubleshooting, Field diagnostic equipment proficiency, Strong analytical skills, Excellent communication skills, U.S. citizenship, Eligibility for U.S. security clearance
Armada

Armada is a global edge computing startup specializing in IoT and AI solutions for remote areas, primarily serving B2B clients across various industries.

Senior Site Reliability Engineer

7 days ago
Full-time
United States
$176,000 to $276,000 per year
Key requirements: 5 years of experience, Kubernetes, BMC interfaces (Redfish), KVM, IPMI tools, OpenStack, SQL/MySQL, Prometheus, Grafana, Kibana, Jenkins, Ansible, Docker, Virtualization, Security methodologies, NVIDIA hardware management
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Site Reliability Engineer

7 days ago
Full-time
United States, Canada
$108,000 to $125,000 per year
Key requirements: 7 years of experience, Kubernetes, AWS, Terraform, Pulumi, Security awareness, CI/CD systems, Observability stack, Async communication, Proactive issue identification, Open-source community experience
Mozilla

Mozilla is a Mountain View, California-based non-profit organization that develops open-source software, including the privacy-focused Firefox browser and Thunderbird email client, targeting global consumers.

Remote policy: Mozilla embraces a flexible remote work environment and hires globally, with team members located in various regions, including the United States. We welcome applications from diverse locations to support our mission of building an open and accessible internet.

Senior Site Reliability Engineer

7 days ago
Full-time
United States, North America
$123,000 to $144,000 per year
Key requirements: 7 years of experience, Kubernetes, Terraform, Pulumi, AWS, Security awareness, CI/CD systems, Observability stack, Async communication, Proactive issue identification, Collaboration with software engineers
Mozilla

Mozilla is a Mountain View, California-based non-profit organization that develops open-source software, including the privacy-focused Firefox browser and Thunderbird email client, targeting global consumers.

Remote policy: Mozilla embraces a flexible remote work environment and hires globally, with team members located in various regions, including the United States. We welcome applications from diverse locations to support our mission of building an open and accessible internet.

Service Mesh Engineer (Istio / Linkerd)

8 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Istio, Linkerd, Envoy, Kubernetes, mTLS, Traffic management policies, Distributed tracing, Go, Python, Multi-cluster deployments, Zero-trust networking
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Site Reliability Engineer (SRE)

8 days ago
Full-time
United States
$100,000 to $150,000 per year
Key requirements: 5 years of experience, Python, Go, Kubernetes, Prometheus, Grafana, CI/CD pipelines, Chaos engineering, Distributed systems design, Incident response, SLOs and error budgets
Bright Vision Technologies

Bright Vision Technologies is a New Jersey-based IT staffing firm specializing in placing technical professionals in software development roles across the U.S. government and enterprise sectors.

Senior Systems Engineer, Storage - DGX Cloud

9 days ago
Full-time
United States, North America
$208,000 to $414,000 per year
Key requirements: 12 years of experience, Kubernetes, Telemetry and observability, Infrastructure-as-code, Large-scale storage systems, Python, Go, Java, Analytical troubleshooting, CI/CD, Linux-based systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Managed Services Engineer I (Mentor, OH area highly preferred)

9 days ago
Full-time
United States
Key requirements: 1 year of experience, Microsoft Exchange, SQL, Windows Server, ConnectWise, Remote Desktop Services, LAN/WAN troubleshooting, Microsoft Active Directory, Azure AD, Imaging Solutions, Basic PowerShell, Strong organization skills, Strong interpersonal skills
Logically

Logically is a Brighouse, England-based B2B Managed Security Services Provider (MSSP) specializing in cybersecurity solutions and IT services for organizations across various industries.

Remote policy: Logically supports remote work and hires from various regions, including the Raleigh–Durham–Chapel Hill area in North Carolina, while encouraging a collaborative team environment.

AI Infrastructure Operations Engineer

10 days ago
Full-time
United States
$120,000 to $140,000 per year
Key requirements: 5 years of experience, Kubernetes operations, Azure AKS, Observability engineering, Incident response, Operational security, AI platform reliability, Cloud-native infrastructure, SRE mindset, Monitoring and logging practices, HIPAA compliance
Private Health Management

Private Health Management is a remote-first B2C healthcare navigation service specializing in personalized patient advocacy and support for complex medical conditions, headquartered in the United States.

Remote policy: Private Health Management (PHM) supports fully remote work and is hiring from various locations, including the United States, allowing team members to work from wherever they call home.

Staff Infrastructure Engineer

11 days ago
Full-time
United States, Worldwide
Key requirements: 5 years of experience, Kubernetes, Cloud security fundamentals, Production infrastructure in DevOps/DevSecOps, Full-stack engineering, Major cloud providers, Packaging workloads for air-gapped environments, Ability to obtain SECRET clearance
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Principal Infrastructure Engineer

11 days ago
Full-time
United States, Worldwide
Key requirements: 8 years of experience, Kubernetes, Cloud security fundamentals, Production infrastructure in DevOps/DevSecOps, Full-stack engineering, Major cloud providers, Packaging workloads for air-gapped environments, Leadership/mentorship, Ability to obtain SECRET clearance
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Staff Cloud Operations Engineer

11 days ago
Full-time
United States
$185,000 to $200,000 per year
Key requirements: 5 years of experience, GCP, Cloud infrastructure engineering, Observability tools, Python, Docker, Kubernetes, AI tools usage, 24/7 on-call availability
Branch

Branch is a remote-first fintech company specializing in workforce payment solutions, headquartered in the mid-Atlantic region, targeting working Americans with B2B/B2C financial services.

Remote policy: Branch is a remote-first company with employees located throughout the USA, emphasizing collaboration and transparency across teams.

Cloud Operations Engineer

11 days ago
Full-time
United States
$135,000 to $150,000 per year
Key requirements: 3 years of experience, GCP, Cloud infrastructure engineering, Observability tools, Python, Docker, Kubernetes, Network security practices, AI tools for automation, 24/7 on-call rotation
Branch

Branch is a remote-first fintech company specializing in workforce payment solutions, headquartered in the mid-Atlantic region, targeting working Americans with B2B/B2C financial services.

Remote policy: Branch is a remote-first company with employees located throughout the USA, emphasizing collaboration and transparency across teams.

Senior AI Infrastructure Engineer - DGX Cloud

12 days ago
Full-time
United States
$152,000 to $287,500 per year
Key requirements: 5 years of experience, Kubernetes, OpenStack, Infrastructure automation, Distributed systems architecture, Python, Go, C/C++, Java, Linux, Networking, Storage, Containers, Infrastructure as Code (IaC), Terraform, Large-scale cloud systems
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Senior Incident Manager

12 days ago
Full-time
United States
Key requirements: 8 years of experience, Critical incident management, High-availability infrastructure, Large-scale GPU clusters, Cloud infrastructure platforms, Incident Command leadership, Incident management frameworks (ITIL, SRE), Incident tracking tools (PagerDuty, ServiceNow), Post-incident reviews (PIRs), Root cause analysis, Crisis communication
Lambda

Lambda is a Seattle-based AI cloud infrastructure provider specializing in scalable GPU resources for AI researchers and enterprises, operating in a B2B model.

Senior Infrastructure Engineer

13 days ago
Full-time
United States, North America
$190,000 to $225,000 per year
Key requirements: 5 years of experience, AWS (ECS/Fargate, RDS/Aurora, MSK, DynamoDB), Terraform, CI/CD platform ownership, Observability fundamentals, Developer experience improvement
Rally UXR

Rally UXR is a New York-based B2B SaaS platform that automates user research operations for product, design, and UX research teams, enabling continuous user insights for informed product decisions.

Remote policy: Rally UXR is a remote-first company, hiring primarily from the United States and Canada, with core collaboration hours that overlap across U.S. time zones.

Senior System Reliability Engineer

13 days ago
Full-time
United States
$168,000 to $264,500 per year
Key requirements: 8 years of experience, Reliability testing, FMEA, DoE, Statistical analysis, Hardware reliability, Cross-functional collaboration, Project management
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Security Platform Engineer

14 days ago
Full-time
United States
$160,000 to $180,000 per year
Key requirements: 5 years of experience, Claude Code, GitOps patterns, Python, AWS, Kubernetes, Security telemetry, Reliability engineering, Agentic coding tools, Infrastructure as code, Cloud-native systems
Lumin Digital

Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.

Remote policy: Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.

Senior Site Reliability Engineer, GeForce NOW

16 days ago
Full-time
United States
$168,000 to $270,250 per year
Key requirements: 8 years of experience, Kubernetes, Automation, Multi-region cloud deployments, Datadog, Prometheus, Deployment pipelines, Go, Python, Bash scripting, Anomaly detection tools, AI usage in SRE
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

DevOps / Infrastructure Engineer

16 days ago
Full-time
United States, North America
$100,000 to $130,000 per year
Key requirements: Tailscale, AWS, Container orchestration, Infrastructure-as-Code, CI/CD, Blockchain familiarity, Financial SRE background
MLabs

MLabs is a remote fintech company specializing in DeFi solutions, providing a unified API for financial institutions to access on-chain liquidity across major blockchains.

Remote policy: MLabs supports remote work for positions located within the EMEA region, allowing for flexible hours and a remote-first environment.

Nutanix Engineer

16 days ago
Full-time
United States
Key requirements: 3 years of experience, Nutanix AOS, Nutanix AHV, Prism management tools, HCI troubleshooting, Client-facing experience, Infrastructure project support, Backup and DR configurations, Managed services experience
MetroSys

MetroSys is a San Diego-based B2B technology solutions and staffing company specializing in IT consulting, backup and recovery solutions, and cloud services for enterprise clients.

Remote policy: MetroSys offers remote work opportunities primarily for candidates located in the United States, with a focus on hiring across the Americas.

Senior Release Engineer

17 days ago
Full-time
United States, Worldwide
Key requirements: 4 years of experience, Kubernetes, Helm, CI/CD pipelines, Infrastructure as Code, GitOps tooling, Security tooling integration, Linux systems, Networking fundamentals, Distributed systems
Onebrief

Onebrief is a Honolulu-based B2G SaaS platform specializing in AI-powered workflow software for military planning and command operations, targeting defense organizations globally.

Remote policy: Onebrief operates as an all-remote company, hiring from various regions, with team members collaborating globally, including at military commands.

Agora - Senior Infrastructure Engineer

18 days ago
Full-time
United States, North America, South America
Key requirements: 5 years of experience, TypeScript, Kubernetes, AWS architecture, Infrastructure-as-code, Distributed systems fundamentals, Reusable systems and abstractions, SLIs/SLOs and incident tooling, Observability practices, GitOps patterns, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Agora - Senior Infrastructure Engineer

18 days ago
Full-time
United States, North America, South America
Key requirements: 5 years of experience, TypeScript, AWS architecture, Kubernetes, Infrastructure-as-code, Distributed systems, Reusable systems, SLIs/SLOs, Observability practices, GitOps, Developer Productivity, Internal Development Platforms
Silver.dev

Silver.dev is a Buenos Aires-based B2B talent recruitment platform specializing in connecting venture-backed US startups with vetted software engineers from Latin America.

Remote policy: Silver.dev supports remote work for most engineering roles, primarily hiring from Latin America, with a focus on Argentina and Uruguay. Team members collaborate across Americas time zones.

Site Reliability Engineer

19 days ago
Full-time
United States, Worldwide
Key requirements: Linux/Unix, Cloud providers (AWS, Google Cloud, Azure), Infrastructure provisioning (Terraform, CloudFormation, Ansible), Containerization (Docker), Orchestration (Kubernetes), Monitoring tools (Prometheus, Grafana, Datadog), CI/CD pipelines (Jenkins, GitLab CI, CircleCI), Incident management practices, Scripting (Bash, Perl)
OXIO

OXIO is a North America-based B2B Telecom-as-a-Service (TaaS) platform enabling businesses to build and manage customizable mobile networks through a cloud-based solution.

Remote policy: OXIO supports flexible work arrangements and hires from various regions, with team members located in cities such as New York, Mexico City, and Montreal. Candidates are encouraged to apply regardless of their location.

Senior AI Tools Engineer, SRE Operations - GeForce NOW

21 days ago
Full-time
United States, North America
$144,000 to $230,000 per year
Key requirements: 5 years of experience, Python, AI tools development, LLM-based systems, Kubernetes, AWS, Data pipeline management, Automation, SRE principles, Monitoring tools (Grafana)
NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote policy: NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Site Reliability Engineer

24 days ago
Full-time
United States
$150,000 to $200,000 per year
Key requirements: 5 years of experience, SRE, Linux, Container management, Distributed systems, SLIs/SLOs, Incident response leadership, Scripting, Monitoring systems, GPU infrastructure, High-growth reliability improvement
RunPod

RunPod is a Mt. Laurel, New Jersey-based B2B cloud computing platform specializing in GPU infrastructure for AI and machine learning applications, serving a global market of developers and enterprises.

Remote policy: RunPod operates as a remote-first organization, welcoming candidates from various locations, primarily focusing on those eligible to work in the United States.

Staff Site Reliability Engineer (AI Enablement)

25 days ago
Full-time
United States
$150,000 to $230,000 per year
Key requirements: 8 years of experience, AI-assisted development tools, Building AI/LLM-powered developer tools, Driving org-wide tooling adoption, Prompt engineering techniques, Go or Python proficiency, Operating production environments in AWS, Strong experience with Terraform, Container orchestration (ECS/Kubernetes), Observability practices, AI Enablement Strategy
Coalition, Inc.