Posted at: 7 April

Deep Learning Solution Architect

Company

NVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote Hiring Policy:

NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Job Type

Full-time

Allowed Applicant Locations

China

Job Description

NVIDIA is a leading company in AI computing. At NVIDIA, our employees are passionate about AI, HPC, visualization, and gaming. Our Solution Architect (SA) team focuses on bringing NVIDIA's new technologies into different industries. We help design the architecture of AI computing platforms and analyze AI and HPC applications to deliver value to our customers. In this role, you will work closely with industry sales, developer relationship managers, and product teams.

What you’ll be doing:
Drive research, development, and optimization of Reinforcement Learning algorithms and infrastructure for Large Language Models and multimodal models.
Collaborate with internal research and engineering teams to adapt and validate state-of-the-art RL methods on NVIDIA GPU platforms at scale.
Improve Reinforcement Learning initiatives and engagements with customers, providing technical guidance on integrating NVIDIA RL technologies into their AI workflows.
Develop and maintain reusable toolchains, experiment management workflows, and technical documentation to accelerate both internal and customer-facing projects.
What we need to see:
MS or PhD in Computer Science, Artificial Intelligence, Mathematics, or related fields, with solid foundations in algorithms and programming.
5+ years of experience (including research) in Reinforcement Learning, Large Language Model training, or multimodal learning.
Proficient in PyTorch and familiar with RL training frameworks and workflows.
Strong engineering skills with experience in distributed training, task orchestration, or evaluation pipelines.
Ability to work independently with minimal day-to-day direction, and willingness to conduct exploratory experiments on frontier problems.
Desire to be involved in multiple diverse and innovative projects.
Outstanding verbal and written communication skills.

Ways to stand out from the crowd:
Experience with RLHF, GRPO, DPO, or other alignment and post-training methods for LLMs.
Experience with scale-out HPC or cloud architectures for large-scale model training.
CUDA optimization or GPU performance tuning experience.
Experience with agentic AI systems, code generation models, or multimodal RL.
Publications in top-tier venues in RL, NLP, or multimodal learning.

With competitive salaries and a generous benefits package, we are widely considered to be one of the world’s most desirable employers! We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous person with a real passion for technology, we want to hear from you.