Posted at: 1 May
Senior Staff Software Engineer - AI Agent Platform
Company
NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.
Remote Hiring Policy:
NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.
Job Type
Full-time
Allowed Applicant Locations
United States
Salary
$168,000 to $270,250 per year
Job Description
We are looking for a Sr. Engineer to design, build, and scale the infrastructure powering NVIDIA’s AI agent ecosystem. You will work at the intersection of distributed systems, developer platforms, and agentic AI — building the foundational services that enable teams across the company to develop, deploy, orchestrate, and operate autonomous AI agents at production scale.What you will be doing:Build and develop platform services that own the full agent lifecycle from registration through deployment, execution, and teardownArchitect Kubernetes-based execution environments with pod lifecycle management, namespace isolation, persistent storage, and identity propagationDevelop and maintain automated CI/CD pipelines using GitLab CI and ArgoCD, including reusable pipeline templates and deployment blueprints that standardize how agents are built across teamsBuild framework-agnostic infrastructure supporting multiple agent SDKs (Claude Code, OpenAI Codex, LangGraph), with hands-on experience using harnesses, lifecycle hooks, skills configurability, observability (OTEL), and memory servicesBuild and operate Kafka-based message pipelines and real-time event streaming using Redis PubSub and SSEDevelop data ingestion pipelines, access interfaces, and storage layers that power AI agent knowledge and contextImplement session management for state persistence, conversation history, and agent recovery across sessionsDevelop multi-layer auth using OAuth 2.0, JWT validation, token exchange, and gateway integration, and manage secrets lifecycle with Vault (provisioning, rotation, container injection)Partner with security teams on compliance, access controls, and approval workflows for agent operationsWhat we need to see:Bachelor's or Master's degree in Computer Science, Engineering, or related field (or equivalent experience), with 8+ years in software engineering — ideally in platform engineering, infrastructure, or developer toolsExperience building and scaling AI agents in production using frameworks like Claude Code, Codex, or LangGraphDeep Kubernetes expertise including pod orchestration, persistent storage, RBAC, and multi-cluster managementStrong Python skills with production API experience using FastAPI, Flask, or similar async frameworksProven track record designing distributed systems with Kafka, Redis, and MongoDB or PostgreSQLExpertise building and managing robust CI/CD pipelines using GitLab CI and ArgoCD for continuous delivery to KubernetesExperience designing AI data platform components (ingestion pipelines, vector stores, retrieval APIs, data preprocessing workflows) and building developer-facing platform APIs consumed by multiple engineering teamsSolid grasp of auth and identity: OAuth 2.0, JWT, token exchange, and secrets management with VaultHistory of leading sophisticated technical projects such as migrations or greenfield platform builds, with strong interpersonal skills to drive alignment across teams and write clear design documentsWays to stand out from the crowd:Experience building or operating AI agent platforms or agentic workflow systems, with hands-on expertise in agent protocols and frameworks like MCP, A2A, LangChain, or LangGraphHands-on experience with RAG architectures, embedding pipelines, and vector databases (Milvus, Pinecone, or Weaviate)Full-stack skills with React or Vue for building developer portals and dashboardsContributions to open-source infrastructure or platform toolingYour base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 270,250 USD for Level 4, and 200,000 USD - 322,000 USD for Level 5.You will also be eligible for equity and benefits.Applications for this job will be accepted at least until May 4, 2026.This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.