Bachelor’s degree or equivalent practical experience.
2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
1 year of experience with one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field.
1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
Experience with GPU programming.
Experience with LLMs (large language models).
Preferred qualifications:
Master's or PhD in Computer Science or Computer Engineering or equivalent practical experience.
Experience with NVIDIA GPU architecture, performance and profiling.
Experience writing or optimizing NVIDIA GPU CUDA kernels.
Experience optimizing inference performance for LLM models like Gemini or other open source models.
Experience in performance and resource optimization.