Job Information
Nvidia Deep Learning Performance Architect in Shanghai, China
Are you passionate about exploring computer architectures for deep learning? Do you like to work at the intersection of hardware and software? NVIDIA is seeking world-class programmers and performance architects who love to squeeze out every cycle of performance from deep learning codes. In this role, you will write code that ships in our deep learning libraries, as well as guide the direction of our future GPU architectures. This position offers the opportunity to have real impact in a fast-moving, technology-focused company.
What you'll be doing:
Developing production-quality software that ships as part of NVIDIA's AI software stack.
Analyzing the performance of important workloads, tuning our current software, and proposing improvements for future software.
Working with cross-collaborative teams of deep learning software engineers and GPU architects to develop innovative solutions
Adapting to the constantly evolving AI industry by being agile and excited to contribute across the codebase, including API design, software architecture, testing, and GPU kernel development.
What we need to see:
MS or PhD in relevant discipline (CS, EE, Math) or equivalent experience
Strong background in computer architecture
Strong C/C++ programming and software design skill
GPU programming experience (CUDA) desired
LLVM/MLIR development experience a plus
Good communication and organizational skills
#deeplearning
NVIDIA is a Learning Machine
NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.
Learn more about NVIDIA .