A DevOps engineer is responsible for ensuring the smooth operation of software systems, from development to deployment.
Job responsibilities
- Executes software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems.
- Manages and Implement AWS Cloud infrastructure, ensuring scalable, highly available, and fault-tolerant systems.
- Designing and implementing infrastructure and tools for software development, testing, and deployment
- Develops and maintain data pipelines, leveraging Apache Spark, AWS Glue and other data processing frameworks to support efficient data ingestion, transformation, and storage.
- Collaborating with development teams to ensure seamless integration and delivery.
- Automating processes and workflows using tools like Jenkins, Docker, and Kubernetes, etc.
- Ensuring continuous integration, delivery, and monitoring.
- Fostering a culture of continuous improvement and collaboration.
- Troubleshooting and resolving issues in production environments.
- Staying up to date with industry trends and best practices.
- Monitors and optimize system performance, using advanced observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. to ensure system reliability and performance metrics are met.
- Implements and monitors robust backup and disaster recovery solutions to ensure the app / data durability and high availability.
- Drives continuous improvement initiatives in the infrastructure, focusing on automation, cost optimization and process refinement.
- Ensures compliance with data governance and security polices, applying best practices in data management and cloud operations.
Required qualifications, capabilities, and skills:
- Formal training or certification on software engineering concepts and 5+ years applied experience.
- Hands-on practical experience in system design, application development, testing, and operational stability.
- Excellent troubleshooting skills.
- Proficient in coding in one or more languages. Emerging knowledge of continuous integration and continuous delivery tools like Terraform.
- Build solutions, Jenkins, Spinnaker, Deployment Automation etc.
- Experience in developing, debugging, and maintaining code in a large corporate environment with one or more modern programming languages and database querying languages.
- Experience maintaining a Cloud-base infrastructure. Familiar with tools such as Kafka, MSK, Splunk, Grafana, Dynatrace, Prometheus, Datadog, Splunk, Kafka.
- Solid understanding of agile methodologies such as CI/CD (Jenkins, GitHub, Terraform), Logging, Alerting & Monitoring (Grafana, Dynatrace, Prometheus, Datadog, Splunk) Python/Java, AWS, Application Resiliency, and Security
- Demonstrated knowledge of software applications and technical processes within a technical discipline (e.g., cloud, artificial intelligence, machine learning, mobile, etc.)
- Overall knowledge of the Software Development Life Cycle.
Preferred qualifications, capabilities, and skills
- Familiarity with modern front-end technologies. Proficient in coding in one or more languages, Python preferable
- Prior experience of working in AWS & Kubernetes. Strong understanding of AWS services & architecture best practices. AWS Certified Solutions Architect is plus.
- Exposure to cloud technologies. Knowledge of software development methodologies like Agile, DevOps & tools, Git, Jenkins