מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר
What You’ll Be Doing:
Leverage AI-powered testing tools to improve test automation, increase coverage, and accelerate testing cycles for cloud-based infrastructure.
Collaborate with product engineering teams to deeply understand cloud service architectures and provide mentorship to SWQA teams on testing cloud-native applications at scale.
Craft and develop end-to-end test strategies for validating cloud infrastructure, including compute, storage, networking, security, and orchestration layers.
Lead NVIDIA Cloud bring-up activities from a software quality assurance perspective, ensuring scalability, reliability, and performance.
Architect and implement cloud-native test automation frameworks to validate multi-cloud (AWS, Azure, Google Cloud) and hybrid-cloud environments.
Develop scalable and resilient infrastructure automation by using Infrastructure as Code (IaC), Configuration Management, and optimization techniques.
Improve observability and monitoring through AI-powered anomaly detection, predictive analytics, and intelligent alerting.
Ensure resilience and failover testing of cloud-based microservices and distributed architectures.
Collaborate with internal teams and cloud service partners to ensure alignment with industry standard methodologies and real-world use cases.
What We Need to See:
Master’s or Ph.D. in Computer Science, Cloud Computing, or a related field, or equivalent experience.
4+ years of hands-on experience in cloud-native cluster management, including Docker, Slurm, Kubernetes, OpenShift, and Ansible.
8+ years of experience working with cloud infrastructure platforms like AWS, Azure, and Google Cloud, with deep expertise in multi-cloud and hybrid-cloud architectures.
Strong hands-on experience with Cloud Networking (VPCs, Load Balancers, Service Mesh, API Gateways) and Storage Technologies (EBS, S3, Azure Blob, GFS).
Advanced proficiency in Infrastructure as Code (IaC) and Configuration Management tools (e.g., Terraform, CloudFormation, Pulumi, Ansible).
Deep expertise in Kubernetes administration, service mesh technologies (Istio, Linkerd), and container security.
Proficiency in Python, Go, or Java for cloud automation, testing frameworks, and infrastructure scripting.
Expertise in CI/CD pipelines using GitOps models, GitLab, Jenkins, ArgoCD, and Spinnaker for automated cloud deployments.
Hands-on experience with cloud observability and monitoring tools (Prometheus, Grafana, CloudWatch, Thanos, Datadog, New Relic).
Strong cloud security knowledge, including Kubernetes security, IAM policies, encryption, and vulnerability management.
Proven track record to debug complex cloud infrastructure issues, involving DNS, HTTP, Linux, cloud networking, and containers.
Ways to Stand Out from the Crowd:
A true innovator who isn't afraid to challenge the status quo and bring fresh ideas to the table. You're always looking for ways to improve existing systems and processes. Passion and curiosity about the latest technologies and trends in cloud infrastructure and distributed systems. You're not just familiar with the tools, but you understand the underlying principles and can demonstrate this knowledge to make strategic decisions. Committed to personal and professional growth. You're crafting opportunities to learn new skills and deepen your expertise.
Deep expertise in bringing to bear cloud testing powered by AI, demonstrating machine learning for predictive failure analysis, anomaly detection, and self-healing infrastructure.
Strong knowledge of Kubernetes Operators, Helm charts, and custom controllers for automating cloud operations.
Familiarity with Confidential Computing, Zero Trust Security models, and cloud-native security frameworks.
Excitement for the latest cloud architectures, like edge computing, infrastructure driven by AI, and serverless computing.
You will also be eligible for equity and .
משרות נוספות שיכולות לעניין אותך