Minimum qualifications are required to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates. Minimum Qualifications: �MS or PhD graduate at Computer Science, Computer Engineering, or related field. (Electrical Engineering with Software background, BS with relevant experience will be considered too)-4+ years of Industry experience and developing high quality code in C/C++, Python, C# or similar programming language -2+ years of experience in deep learning models, with understanding of common model architectures, numeric, and data types, and model optimization techniques (fusions, sparsity, quantization, compression, etc.)-2+ years of experience in parallel computing and performance engineering-Intermediate to Advanced English lev � Preferred Qualifications: �Background in modern CPU, GPU architectures, runtimes, compiler code generation, multi-threading. Intel Xeon architecture is a plus.-Kernel development experience with HLSL, CUDA, OpenCL, etc. HLSL a plus-Software optimization, benchmarking, compiler, runtimes and AL/ML background-Knowledge of Deep Learning Runtimes and frameworks like TensorFlow, PyTorch, ONNX Runtime-Experience with framework development, performance (latency/throughput) analysis and optimizations a big plus �Requirements listed would be obtained through a combination of industry relevant job experience, internship experiences and or schoolwork/classes/research.We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, bonuses, as well as, benefit programs which include health, retirement, and vacation. Find more information about all of our Amazing BenefitsThis role will require an on-site presence.