

Share
What you'll be doing:
Be a technical specialist on networking products, directly supporting sales account managers to secure design wins.
Actively establish and nurture technical relationships with engineers, management, and architects at key customer accounts.
Identify customer architectures and key product requirements in the CSP/OEM AI market to successfully implement NVIDIA's solutions.
Provide onsite support to solve hardware and software problems, with a focus on deep learning inference.
Lead the product through its entire lifecycle, from the design-in phase to end-of-life, ensuring flawless execution and customer happiness.
Develop technical solutions including hardware & software demos and example system designs.
Offer technical and sales training to direct sales teams and channel partners.
Establish strong communication channels and collaborative relationships with internal teams to ensure a positive customer experience.
What we need to see:
BS or MS in Engineering, Electrical Engineering, Physics, or Computer Science (or equivalent experience).
3+ years of work-related experience in the high-tech electronics industry, particularly in networking product Hardware design or technical customer support roles.
Capable of excelling in a dynamic, constantly evolving environment.
Remarkable talent for effectively managing multiple initiatives and priorities.
Expert analytical and problem-solving abilities.
Strong time-management and organizational skills for coordinating complex projects.
Excellent written and oral communication skills in English, with the ability to collaborate effectively with both management and engineering teams.
Ways to stand out from the crowd:
Experience with Networking product Hardware/System architecture development, and the areas of high-speed board design/signalintegrity/qualificationtesting.
In-depth understanding of networking protocols InfiniBand/Ethernet, RDMA/RoCE, BlueField DPU product, DOCA SW stack.
Proficiency in C/C++ and Python programming.
Knowledge of Embedded Linux Systems, APIs, and similar embedded OS.
Experience working with ODMs/EMs in industrial, military, and ruggedized computing spaces.
These jobs might be a good fit

Share
As a Data Center Infrastructure Specialist, you will be interacting with customers, partners, and internal teams, to analyze, define and implement large scale Datacenter projects. The scope of these efforts includes a combination of Datacenter Infrastructure design and cluster deployment planning.
What you will be doing:
Design and implement flawless data center infrastructure solutions to meet the needs of our customers.
Collaborate with cross-functional teams to ensure the successful deployment of data center infrastructure.
Datacenter Planning: Floor Plan, Rack Elevation, Simulation
Cable deployment planning including - ensuring requirements are accurate such as number and type of connections, port assignments, and timing of activity while following standard methodologies, Point-to-Point Design.
Deploy and Support NVIDIA products.
Validating and updating all related work instructions for Datacenter activities.
Responsible for providing input for Data Center Standards updates as required; balancing multiple activities and priorities; participate in projects calls.
Documenting processes and keeping event logs
What we need to see:
5+ years of proven experience as a data center infrastructure engineer, field service engineer or similar with background in designing and implementing data center infrastructures.
Bachelor's degree or equivalent experience in a relevant field.
In-depth knowledge of data center environments, servers, and network equipment.
Extensive experience in installing, monitoring, and maintaining data center equipment.
Solid understanding of networking, storage, and virtualization technologies.
Excellent problem-solving skills and attention to detail.
Ability to work as part of a team, in a fast and highly dynamic environment.
Exceptional communication and interpersonal skills.
Proficiency in documenting processes.
Willingness to travel (25%).
Ways to stand out from the crowd:
Network certification such as: Cisco Certified Network Associate (CCNA), Juniper Networks Certified Associate - Junos (JNCIA-Junos)
Knowledge with InfiniBand Technology
Collaboration with R&D and network engineering teams.
Outstanding interpersonal skill.
These jobs might be a good fit

Share
What you'll be doing:
Developing a high-performance, highly available real-time analytics system for both internal teams and external partners.
Designing and building various services and agentic applications utilizing diverse AI models and machine learning techniques to address challenges in GPU server manufacturing.
Continuously improving AI/ML model performance through ongoing finetuning or training.
Participating in planning and design meetings with stakeholders, offering professional suggestions, and consistently tracking follow-up actions.
What we need to see:
BS or MS in Computer Science, Artificial Intelligence, or equivalent experience
6+ years of demonstrable experience in AI / ML
Expertise in the full lifecycle of AI model development, from training to real-world production deployment, with a deep understanding of machine learning algorithms.
Proficient in developing sophisticated agentic applications, especially those using RAG architecture or MCP protocol.
Comprehensive knowledge and hands-on experience with various ML strategies (Regression, Classification, Clusterization), understanding their principles and optimal use.
Proven track record in designing, implementing, and maintaining robust, scalable distributed systems for large data and computational demands.
Strong ability to clearly articulate complex technical concepts to diverse audiences and collaborate effectively.
Highly proficient in spoken and written English for international professional communication.
Self-motivated, takes initiative, owns tasks, delivers high-quality results, and meets deadlines consistently.
Ways to stand out from the crowd:
Proficient in programming languages such as Python, R, or C/C++.
Experienced with the Linux development environment.
Familiar with data science libraries and tools, including scikit-learn, SciPy, NumPy, pandas, Matplotlib, PyTorch, natural language processing (NLP), and searching/indexing techniques.
Skilled in SQL and other data query languages.
Knowledgeable in various database systems (SQL, NoSQL, embedding databases, etc.).
These jobs might be a good fit

Share
Deploy and support NVIDIA products
Guide and supervise 3rd party contractors in field or remote settings
Ensure accurate cable installations, including the number and type of connections, port assignments, and timing of activity while following standard methodologies
Print labels for re-labeling cables based on a Point to Point (P2P) scheme
Conduct quality testing of all cables used
Resolve issues involving cabling components
Validate and update all related work instructions for cabling activities
Provide input on Data Center Standards updates as needed, while balancing multiple tasks and priorities and actively participating in project calls
Ensure all relevant physical assets are recorded in Asset Manager and that all inventory-related fields are accurately recorded
Inspect all received hardware shipments and receiving duties
Document processes and maintain event logs
Install various network systems at customer sites
2-3 years of experience as a data center technician, field service engineer, or similar
Bachelor's degree or equivalent experience
In-depth knowledge of data center environments, servers, and network equipment
Experience with liquid-cooled systems
Extensive experience in installing, monitoring, and maintaining data center equipment
Outstanding ability to work as part of a team, provide IT support, and resolve errors
Proficiency in detailing network processes
Willingness to travel (~30-35%)
BICSI, CNCDP
FOA Data center - CFOSDC - certified fabrics optics specialist, Datacenter
CNCI – certified network cable installer
Network fix, including looking at port counters, errors, etc.
Collaboration with R&D and network engineering teams
Positive interaction capabilities
These jobs might be a good fit

Share
NVIDIA is looking for Senior CloudInfrastructure/DevOps
What you'll be doing:
Maintain large scale HPC/AI clusters with monitoring, logging and alerting Manage Linux job/workload schedulers and orchestration tools.
Develop and maintain continuous integration and delivery pipelines
Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources.
Deploy monitoring solutions for the servers, network and storage.
Perform troubleshooting bottom up from bare metal, operating system, software stack and application level.
Being a technical resource, develop, re-define and document standard methodologies to share with internal teams Support Research & Development activities and engage in POCs/POVs for future improvements.
What we need to see:
BS/MS/PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields.
At least 8 years of professional experience in networking fundamentals, TCP/IP stack, and data center architecture.
Knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software.
Extensive knowledge and hands-on experience with Kubernetes, including container orchestration for AI/ML workloads, resource scheduling, scaling, and integration with HPC environments.
Experience in managing and installing HPC clusters, including deployment, optimization, and troubleshooting.
Excellent knowledge of Linux systems (Redhat/CentOS and Ubuntu), including internals, ACLs, OS-level security protections, and common protocols like TCP, DHCP, DNS, etc.
Experience with multiple storage solutions, including Lustre, GPFS, ZFS, and XFS. Familiarity with newer and emerging storage technologies is a plus.
Proficiency in Python programming and bash scripting.
Comfortable with automation and configuration management tools, including Jenkins, Ansible, Puppet/Chef, etc.
Ways to stand out from the crowd:
Knowledge of CI/CD pipelines for software deployment and automation.
Knowledge of Kubernetes, container related microservice technologies.
Experience with GPU-focused hardware/software (DGX, CUDA.)
Background with RDMA (InfiniBand or RoCE) fabrics.
These jobs might be a good fit

Share
What you'll be doing:
Primary responsibilities will include building AI/HPC infrastructure for new and existing customers.
Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, real-time monitoring, logging, and alerting.
Engage in and improve the whole lifecycle of services—from inception and design through deployment, operation, and refinement.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.
What we need to see:
BS/MS/PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields.
At least 8 years of professional experience in networking fundamentals, TCP/IP stack, and data center architecture
Proficiency in configuring, testing, validating, and resolving issues in LAN and InfiniBand networks, especially in medium to large-scale HPC/AI environments.
Advanced knowledge of EVPN, BGP, OSPF, VXLAN protocols.
Hands-on experience with network switch/router platforms like Cumulus Linux, SONiC, IOS, JunosOS, and EOS.
Extensive experience delivering automated network provisioning solutions using tools like Ansible, Salt, and Python.
Ability to develop CI/CD pipelines for network operations.
Strong focus on customer needs and satisfaction.
Self-motivated with leadership skills to work collaboratively with customers and internal teams.
Strong written, verbal, and listening skills in English are essential.
Ways to stand out from the crowd:
Familiarity with cloud networks (AWS, GCP, Azure) is a plus.
Linux or Networking Certifications.
Experience with High-performance computing architectures. Understanding of how job schedulers(Slurm, PBS) work.
luster management technologies knowledge (bonus credit for BCM (Base Command Manager).)
Experience with GPU (Graphics Processing Unit) focused hardware/software.
These jobs might be a good fit

Share
NVIDIA Diag Team is now seeking an extraordinaryData Engineerto build up a data platform enabling advanced and intelligent log data analysis to assist engineers in solving GPU server manufacturing challenges. We aim at delivering a platform to streamline the server manufacturing process. As a data engineer, you will partner with Data Scientists, engineers, and key stakeholders to design and implement the data architecture turning data into insights for opportunities of optimization and efficiency in the server manufacturing process.
What you’ll be doing:
Build up robust, flexible, idempotent ELT data pipelines
Ensure the data integrity and keep queries performant per database’s indexing, partitioning, clustering, caching characteristics
Adopt best practices to secure data access internally and externally and identify potential security leaks with proposed solutions
Maintain and operate performant reporting system and interactive analytical dashboard given different business requirement
Solve data-related bugs in a timely manner
Enable the architecture and the use of off-the-shelf technologies for rolling update with minimal downtime
What we need to see:
BS or MS degree in one of the areas of Electrical Engineering, Computer Engineering, Computer Science (or equivalent) with 5+yearsof meaningful work experience
Skillful in SQL and relational databases systems
Demonstrable production experience in custom ELT design, implementation, maintenance, and performance tuning
Familiar with workflow management engines (i.e. Airflow, Luigi, Prefect, Dagster, and etc)
Extensive experience in Data Modeling for different database systems
Solid hand-on experience in tuning database systems read/write performances
Familiar with NoSQL technologies and Linux development environment
We expect to see excellent communication skills, ability to document and present the current status and final project deliveries
Intermediate/upper-intermediateoral and written technical English
Skills of working in an internationally distributed team
Self-motivated, engaged, eager for self-education
Ways to stand out from the crowd:
Experience with interactive analytical BI tools
Background with search/index systems
We appreciate additional programming skills in R, Java, or C/C++
Experience with notebook-based Data Science workflow
Background with anomaly/outlier detection
These jobs might be a good fit

Share
What you'll be doing:
Be a technical specialist on networking products, directly supporting sales account managers to secure design wins.
Actively establish and nurture technical relationships with engineers, management, and architects at key customer accounts.
Identify customer architectures and key product requirements in the CSP/OEM AI market to successfully implement NVIDIA's solutions.
Provide onsite support to solve hardware and software problems, with a focus on deep learning inference.
Lead the product through its entire lifecycle, from the design-in phase to end-of-life, ensuring flawless execution and customer happiness.
Develop technical solutions including hardware & software demos and example system designs.
Offer technical and sales training to direct sales teams and channel partners.
Establish strong communication channels and collaborative relationships with internal teams to ensure a positive customer experience.
What we need to see:
BS or MS in Engineering, Electrical Engineering, Physics, or Computer Science (or equivalent experience).
3+ years of work-related experience in the high-tech electronics industry, particularly in networking product Hardware design or technical customer support roles.
Capable of excelling in a dynamic, constantly evolving environment.
Remarkable talent for effectively managing multiple initiatives and priorities.
Expert analytical and problem-solving abilities.
Strong time-management and organizational skills for coordinating complex projects.
Excellent written and oral communication skills in English, with the ability to collaborate effectively with both management and engineering teams.
Ways to stand out from the crowd:
Experience with Networking product Hardware/System architecture development, and the areas of high-speed board design/signalintegrity/qualificationtesting.
In-depth understanding of networking protocols InfiniBand/Ethernet, RDMA/RoCE, BlueField DPU product, DOCA SW stack.
Proficiency in C/C++ and Python programming.
Knowledge of Embedded Linux Systems, APIs, and similar embedded OS.
Experience working with ODMs/EMs in industrial, military, and ruggedized computing spaces.
These jobs might be a good fit