Develop and validate microarchitecture simulations of a massively parallel machine for AI training, including system architecture, core architecture, memory hierarchy, and interconnects
Performance correlation, evaluation and debug of existing and future hardware implementations
Collaborate with architects and engineers to understand the requirements of the simulation and ensure that it accurately models the behavior of the system
Develop and maintain software frameworks and tools to support testing and deployment
Participate in code reviews, testing, and debugging to ensure high-quality software
Stay up-to-date with the latest developments in AI workloads, computer architecture, and simulation techniques
What You’ll Bring
Degree in Engineering, Computer Science, or equivalent in experience and evidence of exceptional ability
3+ years of experience in simulation development, computer architecture, and AI workloads
Strong understanding of CPU and/or GPU microarchitecture, including pipelining, caching, and memory hierarchy
Experience with analytical and cycle-based simulation techniques, including modeling of complex systems and validation of simulation results
Strong programming skills in languages such as C++ and Python. Parallel and distributed programming skills are a strong plus.
Excellent problem-solving skills, with the ability to analyze complex problems and develop creative solutions
Strong communication and collaboration skills, with the ability to work effectively with Architects, Engineers, and Researchers