

This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.

What you'll be doing:
You will be part of the NVIDIA AIR team, which is building the SaaS/IaaS platform for digital twins of AI data centers.
You will be responsible specifically for the DevOps, infrastructure, and Site Reliability Engineering (SRE) requirements for AIR.
Focus on efficiency by automating repetitive workflows.
Working on a microservices-based architecture.
Deploying and troubleshooting non-disruptive cloud operations with an emphasis on secure production infrastructure.
Continuously evaluating existing systems and driving improvements.
Managing deployments and upgrades of operating systems, Kubernetes (k8s) clusters, and/or other orchestration tools.
Providing day-to-day support for engineering activities with CI/CD tools such as Git and Jenkins.
Multitasking efficiently across different tracks to address evolving priorities.
What we need to see:
BSc in Engineering, relevant certifications, or equivalent experience.
5+ years of experience with complex microservices-based architectures
Highly skilled in Kubernetes and Docker
Experience in an IaaS environment: deploying, configuring, and administering Linux-based bare-metal servers
Strong networking background (VLANs, routing, VPNs)
Experience with relational databases (MySQL) and SQL.
Experience with modern deployment architectures for non-disruptive cloud operations, including blue-green and canary rollouts
Infrastructure as code (IaC) skills with tools such as Ansible and Terraform
Expert in AWS
Knowledge of the best practices and discipline required to manage and monitor a highly available, secure production infrastructure
Ways to stand out from the crowd:
Strong expertise in Infrastructure as a Service (IaaS)
Skills in Linux/Unix Administration
Experience with Prometheus/Grafana.
Experience with APM tools like Dynatrace, Datadog, AppDynamics, New Relic, etc.
Experience implementing robust metrics collection and alerting

Your expertise will transform our infrastructure and deployment. You'll design scalable cloud architectures to accelerate innovation, champion a world-class DevOps culture to empower developers, and build the foundation for our future growth.

Unify online/offline for features: Drive Flink adoption and patterns that keep features consistent and low-latency for experimentation and production.
Make self-serve real: Build golden paths, templates, and guardrails so product/analytics/DS engineers can move fast safely.
Run multi-tenant compute efficiently: EMR on EKS powered by Karpenter on Spot instances; right-size Trino/Spark/Druid for performance and cost.
Cross-cloud interoperability: BigQuery + BigLake/Iceberg interop where it makes sense (analytics, experimentation, partnership).


You’ll contribute to a high-impact codebase that scans, analyzes, and interprets complex Kubernetes configurations and behaviors at scale. This is a unique opportunity to shape the future of Orca’s KSPM product – owning core backend components, driving architectural improvements, and delivering features that address real-world customer needs.
Tech stack: Python, Go, K8s APIs, SingleStore, Postgres, Redis, Kafka, AWS, GCP, Azure, ElasticSearch
On a typical day you’ll