Expoint - all jobs in one place

The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Nvidia Senior AI System Software Engineer 
Taiwan, Taiwan Province, Hsinchu 
854640668

31.07.2024

This challenging role requires someone who deeply understands Large Language and Multi-modal (LLM/LMM) Foundation models to advance the application of artificial intelligence and machine learning in the Manufacturing AI market. The role involves optimizing inference performance, crafting efficient inference pipelines, and building scalable AI systems using NVIDIA technologies like TensorRT, Triton, and CUDA.

What You’ll Be Doing:

  • Collaborate with diverse software, research, and hardware teams across geographies to analyze the interplay of hardware and software architectures, solve critical problems, and shape future applications.

  • Support engagements with customers and their third-party software providers. Collaborate with Product Management, Marketing, and Developer Technology teams.

  • Develop algorithms and pipelines for multi-modal large models (text, image, video, audio, etc.), optimize and scale AI models for efficient and reliable performance.

  • Work on microservices architectures and inference APIs for AI model serving, ensuring modularity, scalability, and resilience.

  • Drive the design and implementation of complex AI projects, providing technical guidance and support, and mentoring junior engineers.

What We Need To See:

  • 3 years or more of working experience and an MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field with a focus on Deep Learning, Machine Learning, and Computer Vision.

  • Familiarity with AI model datasetpreparation/curation,model training, and inference flow/pipeline.

  • Proficiency in working with deep learning frameworks such as TensorFlow and PyTorch. Strong programming skills in Python and/or C++, and experience developing integrated AI solutions.

  • Experience with microservices and inference API architectures for AI model serving.

  • Knowledge of software development best practices, including version control, code review, and documentation.

  • Proven ability to lead projects, manage timelines, and deliver results.

  • Strong communication skills and ability to work in a collaborative environment.

Ways To Stand Out From The Crowd:

  • Familiarity withcloud-based machine learning systems and CI/CD skills, including Kubernetes, containers, and Helm.

  • Experience with deploying and managing inference solutions with NVIDIA TensorRT, CUDA, and other acceleration technologies.

  • Skilled inlarge-scale data processing and distributed computing systems.