Posted at: 6 June

DevOps Engineer

Company

CompanyNVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote Hiring Policy:

NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Job Type

Full-time

Allowed Applicant Locations

Asia, Israel

Job Description

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.NVIDIA's Manufacturing Information Systems team builds the data and automation backbone that keeps global manufacturing operations running — from CI/CD and container platforms to the on-prem environments that production workflows depend on. Our DevOps team leads that infrastructure end-to-end: Azure cloud, Kubernetes at scale, delivery coordinated via GitLab, and multiple business-critical on-prem sites . With major initiatives on the roadmap — GPU-enabled Kubernetes, per-site cluster rollouts, AKS upgrades, and a broader Vault rollout — the work ahead offers a rare mix of greenfield infrastructure and hands-on stewardship of systems NVIDIA relies on daily. The team is small, senior, and deeply accountable, with a strong mentorship culture under an experienced tech lead. We are adding a third DevOps engineer to increase delivery capacity, reduce single-person risk on critical systems, and grow our database infrastructure capability from within. If building resilient cloud and on-prem systems at scale sounds like the right challenge, we'd like to hear from you.What you'll be doing:Design, build, and operate Kubernetes infrastructure across Azure AKS and on-prem clusters, including ingress, autoscaling with Keda, TLS management, and GPU-enabled workloadsExtend and harden CI/CD pipelines in GitLab, manage runners across multiple environments, and evolve GitOps-based deployments through ArgoCDMaintain and improve the critical on-prem infrastructure — Linux servers, NGINX, container platforms, and networking — that several production workflows depend onPartner with development, data, and architecture teams to streamline delivery, improve observability across Datadog, and shorten time-to-recovery during incidentsContribute to flagship initiatives on the roadmap: per-site Kubernetes cluster rollouts, AKS upgrades and node pool reorganization, GPU cluster enablement, and secret management with Azure Key Vault, and Sealed SecretsAutomate provisioning and configuration across Azure resources and on-prem systems using infrastructure-as-code and scriptingTroubleshoot across the full stack — from networking and certificates to container runtime and pipeline internals — turning incidents into durable improvementsWhat we need to see:Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)3+ years in a DevOps, SRE, or infrastructure engineering roleHands-on proficiency with Kubernetes and container tooling (Docker for example) in production environmentsTrack record of building and maintaining CI/CD pipelines, ideally in GitLab, including runner management and pipeline-as-codeFluency using AI-assisted development tools (such as Cursor, Codex or Claude) as a regular part of daily engineering workSolid Linux administration skills and fluency in BashPractical background with a major cloud platform, Azure preferred (or AWS o/GCP)Working knowledge of GitOps workflows and tooling such as ArgoCD or FluxCollaboration and ownership mentality, with the accountability needed to operate business-critical systemsWays to stand out from the crowd:Hands-on experience with on-prem Kubernetes at scale, including cluster bootstrap, MetalLB, and ingress configurationFamiliarity with secret management via HashiCorp Vault, Azure Key Vault, or Sealed SecretsOperational background with SQL (PostgreSQL, MySQL) and/or MongoDB, including backups, replication, or performance tuningContributions to observability improvements with DatadogWidely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/