Posted at: 28 May

Deep Learning Compiler Engineer - CUDA

Company

CompanyNVIDIA

NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.

Remote Hiring Policy:

NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.

Job Type

Full-time

Allowed Applicant Locations

China

Job Description

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.We are now looking for cuTile Core Compiler Architect in our group! The NVIDIA Architecture group is looking for world class architects and engineers to join and lead our various architecture efforts. A key part of NVIDIA's strength is to innovate in the graphics and parallel computing fields delivering the highest performance in the world for parallel processing algorithms. We are constantly looking for ways to improve our GPU architecture and maintain our leadership by developing new parallel programming models, new architectures and new infrastructure that is required to make this successful.What you'll be doing:Design and implement the DSL and the core compiler of tile-aware GPU programming model for emerging GPU architecturesContinuously innovate and iterate on the core architecture of the compiler to consistently optimize performanceInvestigation of next-generation GPU architectures and provide solutions in the DSL and compiler stackPerformance analysis on emerging AI/LLM workloads and integrate with AI/ML frameworksWhat we need to see:Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI) 2+ years of relevant work experienceExcellent C/C++ programming and software engineering skills, ACM background is a plusGood fundamental knowledges on computer architectureStrong ability in abstracting problems and the methodology in resolving problemsStrong compiler backgrounds including MLIR/TVM/Triton/LLVM is desiredGood knowledge of GPU architecture and fast kernel programming skills is a plusKnowledge of LLM algorithms or a certain HPC domain is a plusKnowledge of multi-GPU distributed communication is a plusExcellent oral communication in English is a plusWidely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/