

Share
What you will be doing:
Study and develop cutting-edge techniques in deep learning, graphs, machine learning, and data analytics, and perform in-depth analysis and optimization to ensure the best possible performance on current- and next-generation GPU architectures.
Work directly with key customers to understand the current and future problems they are solving and provide the best AI solutions using GPUs.
Collaborate closely with the architecture, research, libraries, tools, and system software teams at NVIDIA to influence the design of next-generation architectures, software platforms, and programming models.
What we need to see:
A Masters degree or PhD in an engineering or computer science related discipline or equivalent experience and 2+ years of relevant work or research experience.
Strong knowledge of C/C++, software design, programming techniques, and AI algorithms.
Firsthand work experience with parallel programming, ideally CUDA C/C++.
Strong communication and organization skills, with a logical approach to problem solving, good time management, and task prioritization skills.
Some travel is required for conferences and for on-site visits with developers.
These jobs might be a good fit

Share
What you'll be doing:
In this position, you will expected to lead all block/chip level PD activities.
PD activities includes floor plans, abstract view generation, RC extraction, PNR, STA, EM,IR DROP, DRCs & schematic to layout verification. Work in collaboration with design team for addressing design challenges.
Help team members in debugging tool/design related issues.
Constantly look for improvement in RTL2GDS flow to improve PPA. Troubleshoots a wide variety up to and including difficult design issues and applied proactive intervention.
Responsible for all aspects of physical design and implementation of GPU and other ASICs targeted at the desktop, laptop, workstation, and mobile markets.
What we need to see:
BE/BTECH/MTECH, or equivalent experience.
4+ years of experience in Physical Design.
Strong understanding in the RTL2GDSII flow or design implementation in leading process technologies. Good understanding of the RTL2GDSII concepts related to synthesis, place & route, CTS, timing convergence, layout closure.
Expertise on high frequency design methodologies. Good knowledge and experience in Block-level and Full-chip Floor-planning and Physical verification. Working experience with tools like ICC2/Innovus, Primetime/Tempus etc used in the RTL2GDSII implementation.
Strong knowledge and experience in standard place and route flows ICC2/Synopsys and Innovus/Cadence flows preferred. Well versed with timing constraints, STA and timing closure.
Good automation skills in PERL, TCL, tool specific scripting on one of the industry leading Place & Route tools.
Ability to multi-task and flexibility to work in global environment.
Good communication skills and strong motivation, Strong analytical & Problem solving skills. Proficiency using Perl, Tcl, Make scripting is preferred.

Share
What you’ll be doing:
Own ASIC verification of IP/Cluster for complicated designs in RTL.
Work with HW architects and designers to make the right implementation choices.
Interact with the Performance verification teams to augment verification through dynamic simulations and/or Formal verification techniques.
You will work with the specifications and ensure functional and code coverage of all the RTL which you will verify.
Partner with and enable FPGA and S/W teams to ensure that S/W is tested.
Be involved with post-silicon verification and debug.
What we need to see:
BS / MS or equivalent experience.
2+ years of design experience.
Experience in ASIC verification of complex design units for at least one or two projects.
Background with design and verification tools (VCS or equivalent simulation tools, debug tools like Debussy, GDB).
Exposure to System Verilog and UVM based methodology for ASIC verification is highly desired.
Ways to stand out from the crowd:
Knowledge of Memory controllers or prior experience with verification of IP/clusters involving access to Memory.
Good debugging and problem solving skills.
Scripting knowledge (Python/Perl/shell).
Good interpersonal skills and ability & desire to work as a part of a team.

Share
What will you be doing:
Responsibilities will include development of test plans and strategies, develop simulation environments, system bring-up, validation, and automation to deliver best-in-class CPUs.
Develop and maintain CPU simulator infrastructure, hardware CPU test and performance infrastructure.
Analyze and validate CPU and fabric performance, helping to understand current, and guide the development of future CPU products.
Definition and development of tool chain and workflows that enables the full system performance alignment.
Silicon based competitive analysis of NVIDIA CPUs.
What we need to see:
Master's or Bachelor's degree in EE/CS or equivalent experience
5+ years of experience preferably in the areas of CPU / SOC Performance Verification and Analysis
Strong understanding of computer system architecture and operating system fundamentals.
Hands-on experience with HDLs such as Verilog / System Verilog.
Knowledge of verification methodologies and tools for IP and SoC level verification.
Experience with System Verilog, C/C++, Python languages and relevant frameworks.
Background with debug on Silicon.
Ways to stand out from the crowd:
Detailed knowledge of the ARM and/or x-86 architecture.
Prior experience with performance analysis of CPUs.
Experience with analysis and characterization of CPU workloads.

Share
What you will be doing:
Lead design and development of NVIDIA’s Assembler and Disassembler for GPU compute.
Work on binary analysis & instrumentation features like call graphs generation, program register usage and patching of GPU binaries
Work with GPU architecture and debugger/profiler development teams to understand their requirements and deliver new features & product improvements.
Collaborate closely with teams developing other related components to ensure compatibility, reliability, and high-quality code generation
Working with customers/partners to collect feedback and drive innovative ideas and features to incorporate into the product
What we need to see:
BS or MS degree in Computer Science, Computer Engineering, or related fields with 5+ years of experience in low-level system SW development and a minimum of 3 years related to assemblers, binary analysis tools, debuggers
Good analytical and C/C++ programming skills
Experience in any one area of compiler development including feature support, code generation and compiler infrastructure
Understanding of Assembly Language / Processor ISA (GPU ISA not required but a plus)
Knowledge of object file formats such as ELF and debugging formats (DWARF).
Ways to stand out from the crowd:
Understanding of debugger / profiler tools / bintools / Linker internals, experience in binary analysis / instrumentation tools like BOLT etc.
Usage of AI tools in everyday work like Cursor, Windsurf etc.
Knowledge of GPU development and compute APIs such as CUDA and OpenCL

Share
What you'll be doing:
Enable NVIDIA Cumulus Linux on next generation ASICs.
Define, design and develop features for NVIDIA Cumulus Linux.
Sustain the existing deployments of NVIDIA Cumulus Linux.
Working closely with customers to understand the pain points, new use cases, deployment strategies and come up with innovative solutions.
Translating requirements to the SDK and ASIC Engineers for enabling end-to-end solutions.
What we need to see:
Strong knowledge of forwarding path for L2 and L3 including concepts like ECMP etc.
Strong and proven experience in C and Python programming.
Worked with VxLAN and EVPN routing protocols.
Strong knowledge in areas of QoS, ACLs and VxLAN. And working knowledge of hardware resource management (tables, TCAMs, etc).
Battle scars from troubleshooting production network deployments.
BS or MS degree in Computer Engineering, Computer Science, or related degree, or equivalent experience.
5+ years of hands on experience.
Ways to stand out from the crowd:
Experience with Merchant Silicon for Switching/Routing.
Contributions to SONiC, SwitchDev or Switch Abstraction Interface (SAI) projects.

Share
What You’ll Be Doing:
Develop and maintain robust equivalence checking flows (FEC/FEV) for different stages of the VLSI design cycle, including RTL-to-RTL, RTL-to-Gate, and Gate-to-Gates equivalence checking.
Develop and enhance the RTL Physical-Synthesis work flows, both at full-chip and block-level.
Partner with implementation teams, on both synthesis and P&R side, to understand their requirements and pain-points if any, and enhance the methodology to cover for the same.
Develop and deploy various methodologies based on technology updates, design requirements, and tool-feature/version updates and bug-fixes.
Finetune the recipes for designs with aggressive PPA targets while ensuring logical-equivalence is maintained.
Optimize flows and methodologies for performance, capacity, and debug capabilities, ensuring efficient and effective verification of sophisticated VLSI designs.
Investigate and resolve equivalence checking issues, including setup and constraint-related problems, debug failures, and performance bottlenecks.
Develop custom scripts using tcl/perl/python to automate the post-processing of the reports/logs to present the results in a user-friendly format.
What We Need To See:
B.Tech/BS inVLSI/Electronics/ElectricalEngineering or Computer Science, with 5+ years of relevant ASIC design experience and/or CAD experience, with a focus on Logic Equivalence Checking and RTL-Synthesis.
Be familiar with Verilog and ASIC design flow along with experience in commercial EDA tools
Strong scripting skills in languages such as Perl, Python, and TCL.
Excellent problem-solving, debugging, and analytical skills.
Ability to work in a team environment and collaborate efficiently with multi-functional teams.
Strong communication and documentation skills

What you will be doing:
Study and develop cutting-edge techniques in deep learning, graphs, machine learning, and data analytics, and perform in-depth analysis and optimization to ensure the best possible performance on current- and next-generation GPU architectures.
Work directly with key customers to understand the current and future problems they are solving and provide the best AI solutions using GPUs.
Collaborate closely with the architecture, research, libraries, tools, and system software teams at NVIDIA to influence the design of next-generation architectures, software platforms, and programming models.
What we need to see:
A Masters degree or PhD in an engineering or computer science related discipline or equivalent experience and 2+ years of relevant work or research experience.
Strong knowledge of C/C++, software design, programming techniques, and AI algorithms.
Firsthand work experience with parallel programming, ideally CUDA C/C++.
Strong communication and organization skills, with a logical approach to problem solving, good time management, and task prioritization skills.
Some travel is required for conferences and for on-site visits with developers.
These jobs might be a good fit