Posted at: 29 May

Senior Software Engineer, Agentic Systems

Company

CompanyNVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote Hiring Policy:

NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Job Type

Full-time

Allowed Applicant Locations

United States

Salary

$184,000 to $356,500 per year

Job Description

We are looking for a Senior Software Engineer to help build NeMo Platform, NVIDIA’s product for developing, evaluating, deploying, and operating AI systems at scale. This role will focus on NeMo Evaluator, which helps teams understand whether changes to AI agents are making those agents better. As AI systems become more autonomous and more deeply integrated into real workflows, teams need practical infrastructure for observing behavior, measuring progress, catching regressions, and iterating with confidence.Our roadmap is increasingly focused on agentic development and automated agent improvement: giving teams the infrastructure they need to compare versions, understand behavior, and make empirically grounded improvements over time.What you'll be doing:Design and implement Python-first APIs, SDK workflows, and plugin interfaces for building, measuring, and improving agents across multiple runtimes and product surfacesBuild reusable systems for observing behavior, measuring progress, detecting regressions, and turning runtime evidence into product decisionsBuild systems for ingesting, normalizing, validating, and analyzing agent execution data and evaluation datasetsPartner with research, product, platform, and infrastructure teams to integrate agentic capabilities broadly across NVIDIA agent runtimes and developer workflowsHelp turn emerging agent development and improvement techniques into reliable, reusable product capabilitiesImprove reliability, observability, debuggability, and performance across NeMoStack services, SDKs, plugins, jobs, and developer workflowsBuild strong test coverage across unit, integration, E2E, Docker, and Kubernetes workflowsDrive “speed of light” engineering: fast iteration, high ownership, pragmatic decisions, and performance-minded implementation under production constraintsProvide senior technical leadership through design reviews, code reviews, mentoring, and ownership of ambiguous cross-component problemsWhat we need to see:BS, MS, or equivalent experience in Computer Science, Computer Engineering, or a related technical field5+ years of professional software engineering experience building production systemsExcellent Python engineering skills, including API design, typing, testing, debugging, performance analysis, and maintainable software designExperience designing SDKs, libraries, plugins, CLIs, or other developer-facing interfacesExperience with distributed systems, cloud-native services, containers, Kubernetes, or job orchestrationStrong understanding of reliability, scalability, security, and performance tradeoffs in production infrastructureExperience with structured data modeling and validation systems such as Pydantic, typed schemas, event/trace models, or SDK-generated typesAbility to work independently, define technical scope, break down ambiguous problems, and drive work across team boundariesClear communication skills and a track record of collaborating with engineering, product, research, or customer-facing teamsWays to stand out from the crowd:Experience building, deploying, and iterating on production agentic AI systems where evaluation was used to measure and improve real product outcomesExperience designing evaluation workflows for heterogeneous agents, including tool-using agents, RAG agents, workflow agents, coding agents, or long-running autonomous systemsExperience integrating evaluation capabilities across multiple products, runtimes, or internal platforms, especially through Python SDKs, plugins, or shared developer toolingStrong ability to connect technical evaluation work to business outcomes, product quality, user experience, reliability, or operational efficiencyExperience with enterprise AI systems where measurement, regression testing, observability, governance, and continuous improvement are required for production deploymentNVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you’re passionate about leading breakthrough AI research and building exceptional teams that shape the future of computing, we want to hear from you.Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.You will also be eligible for equity and benefits.Applications for this job will be accepted at least until June 1, 2026.This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.