In this role, you will manage very large-scale, highly available Cloud and Big Data Platforms supporting exabytes of data for Analytics.

- Lead innovation by exploring, investigating, instrumenting, recommending, benchmarking, and implementing data-centric technology solutions for the platform.
- Provide hardware architectural guidance, plan and estimate cluster capacity, and create roadmaps.

Responsibilities

- Infrastructure Management: Design, implement, and maintain cloud infrastructure on AWS and GCP, leveraging best practices to ensure high availability, scalability, and resilience.
- Monitoring and Alerting: Set up, maintain, and continuously improve monitoring, alerting, and logging solutions to ensure application health, using tools like Prometheus, Grafana, CloudWatch, and Splunk.
- Automation and Scripting: Build and manage infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation for automated provisioning and configuration management.
- Database Management: Oversee the maintenance, backup, and performance tuning of Postgres databases to ensure data reliability and accessibility.
- Security and Compliance: Implement and maintain security best practices, including access control, network security, and data encryption, ensuring compliance with industry standards.
- Troubleshooting and Optimization: Provide support to resolve infrastructure and application performance issues, conducting root cause analysis and implementing long-term solutions.