Posted at: 21 May
Principal Machine Learning Engineer, Accelerated Apache Spark
Company
NVIDIA Corporation is a Santa Clara-based technology company specializing in designing GPUs and AI solutions for gaming, professional visualization, and cloud services, operating in both B2B and B2C markets globally.
Remote Hiring Policy:
NVIDIA supports flexible remote work arrangements and hires from various regions globally, including the Americas, Europe, Asia, and the Middle East, with roles that may require collaboration across time zones.
Job Type
Full-time
Allowed Applicant Locations
United States
Salary
$272,000 to $431,250 per year
Job Description
NVIDIA is looking for a Machine Learning (ML) Engineer to join the GPU accelerated Apache Spark team. Apache Spark is the most popular data processing engine in data centers for running large scale workloads for ETL, SQL, and ML/DL model training and inference pipelines, spanning many domains and use cases. NVIDIA GPUs offer a promising avenue for significantly speeding up and/or lowering the cost of running Apache Spark applications at massive scales. You will work with the open source community to accelerate Apache Spark with GPUs. You will apply the latest ML/AI methods to empower enterprises to migrate Spark workloads onto GPUs at scale.What you’ll be doing:Design and implement machine learning solutions for performance prediction and optimization of GPU accelerated enterprise Apache Spark workloads.Develop advanced algorithms and adaptive systems to continuously improve the performance of Apache Spark workloads on GPUs.Develop AI-based agents and tools to assist with fixing system issues and application optimization.Collaborate with key partners and customers on the deployment of complex machine learning solutions in various environments.Maintain deep domain expertise by knowing the latest published advances in ML systems and algorithms.Provide technical mentorship and leadership in data science and machine learning to a team of engineers.What we need to see:BS, MS, or PhD or equivalent experience in Machine Learning, Data Science, Computer Science or a closely related field.12+ years of professional experience in designing, implementing, and productionizing high-quality ML/DL solutions.5+ experience as technical lead in ML model development.Proven hands-on experience (2+ years) with large-scale data processing platforms, such as Apache Spark.Proven ability to employ modern tooling and sound techniques for all aspects of crafting, deploying, and maintaining machine learning models.Excellent programming skills in Python and Python data science related libraries like numpy, pandas, scikit-learn, scipy, pytorch, and tensorflow.Deep experience with sophisticated ML methodologies, including LLM/GenAI, reinforcement learning, and adaptive, on-line ML systems.Strong expertise in feature engineering, feature importance assessment, and developing boosted tree model solutions (e.g., XGBoost).Ways to stand out from the crowd:Understanding of the internal workings and architecture related to Apache Spark.Familiarity with NVIDIA GPUs and CUDA.Experience coding in Scala, Java, and/or C++.NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and dedicated people in the world working for us. If you are passionate about what you do, creative and autonomous, we want to hear from you!Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.You will also be eligible for equity and benefits.Applications for this job will be accepted at least until May 25, 2026.This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.