Santa Clara or United States On-site USD 184,000 - 287,500 Yearly

NVIDIA is hiring a Senior Software Engineer

NVIDIA is looking for a Senior Software Engineer to develop groundbreaking technologies in the inference systems software stack. You will build innovative AI systems software to accelerate AI inference and develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture.

What You'll Do

  • Innovate and develop new AI systems technologies for efficient inference
  • Design, implement, and optimize kernels for high impact AI workloads
  • Design and implement extensible abstractions for LLM serving engines
  • Build efficient just-in-time domain specific compilers and runtimes
  • Collaborate closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
  • Contribute to open source communities like FlashInfer, vLLM, and SGLang

What We're Looking For

  • Masters degree in Computer Science, Electrical Engineering, or related field (or equivalent experience)
  • 6+ years (academic/industry) experience with ML/DL systems development preferable
  • Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC
  • Strong Python and C/C++ programming skills
  • Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)

Nice to Have

  • PhD are preferred
  • Background in domain specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
  • Expertise in inference engines like vLLM and SGLang
  • Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
  • Open source project ownership or contributions

Technical Stack

  • PyTorch, JAX, TensorFlow, ONNX, vLLM, SGLang, MLC, Python, C/C++, CUDA C/C++, cuTile, Triton, FlashInfer, Flash Attention, Apache TVM, MLIR

Team & Environment

Collaborate closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams.

Benefits & Compensation

  • Compensation: $184,000 USD - $287,500 USD + equity: Eligible
  • Equity
  • Benefits

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required Skills
PyTorchJAXTensorFlowONNXvLLMSGLangMLCPythonC/C++CUDA C/C++GPU kernel developmentperformance optimizationscuTileTritonML/DL systems PyTorchJAXTensorFlowONNXvLLMSGLangMLCPythonC/C++CUDA C/C++GPU kernel developmentperformance optimizationscuTileTritonML/DL systems
Invoicing holding you back?

Focus on work, not paperwork

Stop worrying about invoicing, taxes, and compliance. Glopay handles the business setup, you handle the client work. Get paid faster and look professional.

Auto-generated compliant invoices
Built-in expense management
Income reports for tax season
95% of earnings stay with you
Try Glopay free
No credit card needed
About company
NVIDIA
NVIDIA builds accelerated computing platforms and AI technologies that power advancements in areas such as generative AI, data centers, robotics, and digital twins.
All jobs at NVIDIA Visit website
Job Details
Department Software Development
Category data
Posted 2 months ago