🇬🇧 SearchUKJobs.co.uk

Britain's Premier Job Portal

← Back to Jobs

Solutions Architect - Top AI Labs

NVIDIA | Beijing, China | Posted June 03, 2026

Position Overview

NVIDIA is leading company of AI computing. At NVIDIA, our employees are passionate about AI, HPC, GAMING. Our SA team is more focusing to bring NVIDIA new technology into difference industries. We help to design the architecture of AI computing platform, analysis the AI and HPC applications to deliver our value to customers, focusing on defining and solving computational challenges in LLM inference and training acceleration, as well as network communication and data transfer optimization.


What You'll Be Doing:
+ Contribute to the development of open-source inference frameworks such as SGLang and vLLM, including feature and operator development, performance optimization, and model support, in collaboration with the community.
+ Develop and optimize KV cache offloading frameworks for LLM workloads, supporting multi-level cache offloading and reuse across CPU, SSD, and remote storage to improve inference efficiency. (Team project: FlexKV)
+ Drive R&D on compute per...

Ready to Apply?

Apply for this Position