About the Role:
As a
Senior Cloud Systems Integration Engineer - Observability Specialist, you will play a crucial role in developing and implementing cutting-edge Observability solutions powered by Big Data, streaming pipelines, Machine Learning, and Large Language models.
- You'll focus on integrating public and private cloud solutions into the SAP ecosystem, optimizing alerts, metrics, and logs using AI-driven Observability solutions.
- You will work as part of an implementation project, with an eye toward integration and operationalization, where you will ultimately join the core SRE teams in support of the environments.
- You will be expected to bridge the gap between Infrastructure, Platform, and Application Observability from the point of first contact.
- You will also support troubleshooting during major incidents related to our global cloud infrastructure, ensuring excellence in triage and resolution.
- You will help the team to reduce critical KPI's around MTTD/MTTR, Signal to Noise Ratio, and other relevant metrics using these advanced methods.
Key Responsibilities:
- Collaborate with cross-functional teams following Agile methodologies like SCRUM.
- Prioritize and deliver high-quality developments within tight timelines.
- Build expertise in hyperscaler provider architectures and API integration models.
- Ensure seamless operations and maximum uptime for our services.
- Participate in On-Call rotational coverage, including weekends and holidays.
- Share knowledge and drive hyperscaler adoption and integration.
- Support ongoing Observability and Monitoring enhancements/development across the SAP Cloud Ecosystem.
- Assist SRE teams in Reliability Services.
Required Skills:
- Rapid adoption of cutting-edge technologies.
- Advanced analytical and problem-solving abilities.
- Strong team player with exceptional communication skills.
- Self-driven with a sense of urgency to resolve issues efficiently.
- Proficient in spoken and written English.
Experience:
- Development: 4+ years of professional or enterprise development experience.
- Strong knowledge of Python & JavaScript programming.
- Experience in REST API implementation (Flask or FastAPI).
- Microservice-based development expertise.
DevOps:
- CI/CD pipelines using Azure, Jenkins, or similar tools.
- Hands-on experience with Docker containers & Kubernetes.
- Public cloud environments (GCP/AWS/Azure).
- Solid grasp of JSON, YAML, & Github.
- Enterprise/Service Provider Data Center Architecture.
- Familiarity with Fault Monitoring and Performance Management tools.
Hyperscalers:
- Certifications with public cloud providers (GCP, AWS, Azure, IBM, Alibaba Cloud, etc.).
- Adoption and integration methodologies between cloud solutions
- Observability data ingestion and pipelines working knowledge.
- Algorithms, data structures & patterns.
Preferred:
- Experience with Elasticsearch, Splunk, or similar platforms.
- Web development frameworks knowledge.
- Familiarity with Terraform, HelmChart, Ansible, or similar tools.
- Understanding of Kubeflow, MLFlow, Dataflow, or similar technologies.
Education:
- Bachelor's or equivalent in Software Engineering, Computer Science, or related fields.
- Industry Technical Certifications (CCNA, CKA, RHCE, AZ-900, etc.) and ITIL courseware are beneficial.