

Share
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.
What will you be doing:
You will bring together and understand internal and external customer requirements to improve AI cluster resiliency and design AIOps-based solutions that address these needs.
Develop automated workflows for issue detection and root cause analysis and closely collaborate with operators to debug sophisticated, full-stack AI cluster problems. We will bring to bear the findings for product improvements!
Deliver compelling technical presentations and lead hands-on demos or training. You'll also handle evaluation deployments (POC/POV) and ensure smooth, reliable installations by staying engaged and encouraging throughout the customer journey.
What we need to see:
Bachelor of Science or equivalent experience
8+ years of networking experience in enterprise or service provider environments, with strong hands-on expertise in routing and switching.
Proficient in scripting and automation using Python or similar languages, with strong Linux expertise.
Proven experience working directly with customers to resolve issues and ensure success in Systems Engineer or SRE roles.
Exceptional oral, written, and presentation skills for clearly communicating complex technical topics.
Demonstrated ability to collaborate effectively across teams, partnering with operations, engineering, and product development
Ways to stand out from the crowd:
Experience with data center infrastructure and cloud architectures
Background in network performance monitoring or observability
Previous experience working at a technological start-up
These jobs might be a good fit

Share
What you’ll be doing:
Take part in support of developers teams' equipment, maintain existing physical and virtual servers and switches in different labs in multiple locations.
Rack and stack new hardware and software, make configurations according to the processes, while maintain equipment management and knowledgebase systems
Work closely and directly with engineering teams and resolve IT related issues, collaborate with other teams (IT / facilities / operations)
Collect the demands for new hardware from the customers, take part in purchasing / budgeting activities
Use and improve existing workflows
Strive for maximum equipment utilization in labs and plan the infrastructure needs for future
What we need to see:
BSc in Engineering/ Relevant Certifications/ equivalent experience
One or more IT certifications such as: CompTIA (Server+ / Linux+) or Microsoft (MCITP/ MSCE / Azure Administrator) or LPIC-1 or CCNA or equivalent experience
4+ years of experience as a support engineer / SysAdmin / datacenter operations engineer
Good understanding in computer hardware and various IT technologies
Proven hands-on experience with Linux and Windows based servers and virtualization technologies
Self-learning, knowledge sharing, organized person that has documentation skills and able to efficiently prioritize tasks, while paying attention to details.
Willingness to travel between sites on regular basis
Good English level
Ways to stand out from the crowd:
Experience with different server hardware
Experience with networking devices (switches / NICs)
Understanding of datacenter power, air and liquid cooling infrastructure, networking and cabling
These jobs might be a good fit

Share
What you'll be doing:
You own silicon bring-up schedule from power on to production release
Negotiate silicon and board demand with teams and drive a bottom-up forecast
Oversee and manage chip and board allocations across the company
Lead prototype chip delivery to internal customers
Track and coordinate engineering deliverables, key milestones andqualification/validation
Identify and mitigate risks to schedules and programs
Communicate status to cross-functional teams as well as upper management
Continuously evaluate internal tools and processes and drive fixes to improve productivity
Create new and fix existing processes between different teams
Drive implementation of SW tools for data analytics and process evaluation
What we need to see:
Bachelor's or Master's degree in Electrical Engineering, Mechanical Engineering, Materials Science, or a related technical field
At least 5 years of relevant experience
Proven experience in engineering roles, with a significant portion focused on semiconductors industry
Strong background in planning methodologies
Excellent communication, interpersonal, and leadership skills to effectively collaborate with internal teams
Ability to travel internationally to supplier sites as needed
Ways to stand out from the crowd:
Deep understanding of ASIC Technology and Productization requirements
Strong verbal and written communication skills, and the ability to coordinate with multiple technical and business teams
Self-directed and driven, highly motivated, creative, and have a consistent record of handling multiple tasks at any given time
Have the ability to work independently and follow complex procedures
These jobs might be a good fit

Share
These jobs might be a good fit

Share
NVIDIA is a leading supplier of innovative end-to-end InfiniBand and Ethernet connectivity solutions and services for servers and storage. We offer market-leading solutions that include adapter cards, switches, cables and software to support networking technologies. Our products optimize data center performance and deliver industry-leading bandwidth and scalability. In addition, we serve a wide range of sectors including high performance computing, enterprise, data centers, cloud computing, big data and Web 2.0. We are constantly reinventing ourselves to stay ahead of the market and bring groundbreaking products and services to the industry.
What you'll be doing:
What we need to see:
Ways to stand out from the crowd:
These jobs might be a good fit

Share
These jobs might be a good fit

Share
AWS Infrastructure Services (AIS)You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Key job responsibilities- Support the maintenance and monitoring of all data center systems, including incidents, events, problems, changes, and escalations.
- Troubleshoot and monitor mechanical, electrical, HVAC, voice/data, cooling, fire/life safety systems, and generators.
- Assist contractors or engineers in maintaining facility equipment, deploying new equipment (racks, cabling, etc.), and conducting site walkthroughs to verify equipment and system operations.
- In this role, you will act as a First Responder to critical events and support both new and existing data center facilities.A day in the life
- Monitoring alarms and systems that manage power, cooling, and fire suppression in the data center.
- Coordinating vendor work on-site to ensure equipment is maintained and uptime is preserved.- Handling rack installations and powering up equipment.
- Monitoring and facilitating the handover of new equipment.- Participating in event response drills.About the team
Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Why AWS
Work/Life BalanceMentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
- High school or equivalent diploma
- 1+ years of electrical or mechanical experience
- Experience leading and managing operations of critical facilities which require in-depth knowledge in electrical, mechanical, and control systems
- Knowledge of generic mechanical room infrastructure such as chillers, cooler units, and fan controls
- Knowledge of mechanical systems (Mechanical, HVAC systems, Controls)
- Knowledge of key electrical competencies and theory
These jobs might be a good fit

Share
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.
What will you be doing:
You will bring together and understand internal and external customer requirements to improve AI cluster resiliency and design AIOps-based solutions that address these needs.
Develop automated workflows for issue detection and root cause analysis and closely collaborate with operators to debug sophisticated, full-stack AI cluster problems. We will bring to bear the findings for product improvements!
Deliver compelling technical presentations and lead hands-on demos or training. You'll also handle evaluation deployments (POC/POV) and ensure smooth, reliable installations by staying engaged and encouraging throughout the customer journey.
What we need to see:
Bachelor of Science or equivalent experience
8+ years of networking experience in enterprise or service provider environments, with strong hands-on expertise in routing and switching.
Proficient in scripting and automation using Python or similar languages, with strong Linux expertise.
Proven experience working directly with customers to resolve issues and ensure success in Systems Engineer or SRE roles.
Exceptional oral, written, and presentation skills for clearly communicating complex technical topics.
Demonstrated ability to collaborate effectively across teams, partnering with operations, engineering, and product development
Ways to stand out from the crowd:
Experience with data center infrastructure and cloud architectures
Background in network performance monitoring or observability
Previous experience working at a technological start-up
These jobs might be a good fit