Posted at: 21 April
Software Platform Support Engineer - GPU Cloud
Company
NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.
Remote Hiring Policy
NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.
Job Type
Full-time
Allowed Applicant Locations
United States
Salary
$76,000 to $172,500 per year
Job Description
The NVIDIA DGX Cloud organization is looking for passionate software support engineers to partner closely with our internal customers and support them on our internal platforms. This partnership requires you to gain a deep understanding of customer needs and how their applications work, assist them in troubleshooting issues, and create documentation that makes it easier for users to troubleshoot issues themselves in an ambiguous, fast-moving environment. The support you provide will help our users have a better experience and help shape our platform. We expect you to have knowledge of supporting cloud-based deployments across compute, storage, and networking environments.
What you will be doing:
- Partner with multiple internal teams to provide Tier 1 support for complex cloud platforms
- Define and improve operational workflows (runbooks, escalation paths, support processes)
- Triage and investigate the root cause of customer issues and escalate as needed
- File bugs and report issues while working closely with the Site Reliability team
- Build tooling to improve the customer support process and visibility
- Deeply understand user workloads and use cases
- Partner with multiple internal teams to give feedback to engineering teams and develop solutions to aid in their success
- Be part of an on-call rotation to support production systems
What we need to see:
- BS/MS degree in Computer Science or related areas (or equivalent experience)
- 2+ years of experience supporting distributed software systems and end-user software platforms, and experience with Linux
- Experience with Kubernetes, AWS, Azure, OCI, and GCP
- Background in infrastructure, networking, storage, and DevOps scripting/tooling
- Understanding of data storage technologies (databases, file, block, blob)
- Customer service/support experience
- Willingness to work up and down the stack as well as across multiple teams
- Strong troubleshooting and communication skills
Ways to stand out from the crowd:
- Experience with MLOps workflows or ML infrastructure
- Familiarity with GPU workloads or distributed training systems
- Previous SLURM or HPC experience
- Strong drive to work with internal customers and make them successful
- A drive to improve processes, with strong organizational skills
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 76,000 USD - 126,500 USD for Level 2, and 108,000 USD - 172,500 USD for Level 3. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until April 24, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.