Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
NVIDIA

Manager, Deep Learning Inference - ONNX

Santa Clara, CA

At NVIDIA, we are building the world's leading AI computing platform. The mission of the TensorRT team is to deliver software solutions for achieving state-of-the-art performance and efficiency in Machine Learning inference with NVIDIA GPUs.

We are looking for a hands-on, highly technically experienced and motivated engineering manager to help lead critical work in the Deep Learning Inference Software team, and drive the development of ONNX and TensorRT software.

What you'll be doing:

In this role, you will help shape the strategy for inference deployment workflows and lead the ONNX development efforts at NVIDIA.

  • Help define and drive ONNX inference workflows and development objectives
  • Collaborate closely with industry partners in advancing ONNX
  • Coordinate planning and execution of inference strategy in concert with various internal teams at NVIDIA
  • Grow and develop a team of world-class engineers

Want more jobs like this?

Get Software Engineering jobs in Santa Clara, CA delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

What we need to see:

  • Masters (or equivalent experience) or PhD and at last 12 overall years of relevant industry experience in Computer Science, Artificial Intelligence, Applied Math, or related field
  • 5+ years of demonstrated experience in leading and mentoring multiple software engineering teams
  • Strong experience with C++11/C++14
  • Working knowledge or experience with TensorRT, PyTorch, TensorFlow, JAX, ONNX Runtime or other ML frameworks.
  • Excellent understanding of software development practices including architecting, development, testing, continuous integration, and documentation
  • Excellent communication skills, strong analytical, and organization skills

Ways to Stand Out From the Crowd:

  • Significant contributions to Deep Learning optimizations for inference.
  • Strong Python programming experience
  • Familiarity with CUDA kernel programming
  • Experience working directly with AI hardware and software development teams.
  • Exceptional project management skills, with a demonstrated ability to lead complex projects to completion.
  • A charismatic leader who inspires innovation and drives the team towards achieving NVIDIA's vision.

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's an outstanding legacy of innovation that's fueled by phenomenal technology-and amazing people.

Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join our team and see how you can make a lasting impact on the world.

The base salary range is 220,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Client-provided location(s): Santa Clara, CA, USA
Job ID: NVIDIA-JR1982697
Employment Type: Full Time