Design, develop, test, deploy, maintain, and improve software.
Utilize multiple technologies to solve complex problems.
Possess a results-oriented mindset with a passion for producing high-quality code.
Lead the gathering of functional requirements, develop technical specifications, and oversee project and test planning.
Demonstrate interest and ability to learn additional coding languages as needed.
Have experience with Docker container-based packaging and deployments.
Experienced in developing highly scalable and available web applications.
Resolve defects/bugs during testing, pre-production, production, and post-release patches.
Create specifications for onboarding new offerings, including troubleshooting, patch processes, cross-organizational incident management processes, and security breach response plans.
Develop implementation plans for application disaster recovery, migration, roll-back plans, expansion, routine deployments, and system upgrades.
Collaborate cross-functionally with various Intuit teams including product management, QA/QE, various product lines, and business units to drive forward results.
Qualifications
8+ years of experience in designing and developing web, software, or mobile applications.
Experience working with three or more of the following: Unix/Linux environments, distributed and parallel systems, workflow automation, CI/CD platform development, cloud platforms development (e.g., AWS/GCP), and Kubernetes/container platforms-based development.
Proficiency in one or more general-purpose programming languages such as Java and Python.
Experience in Kubernetes and Docker container-based development, packaging, and deployments.
Familiarity with open-source technologies.
Experience with various modern web frameworks, cloud/SaaS, and the entire Software Development Life Cycle (SDLC).
Experience in messaging/queueing systems.
Ability to generate incident management reports, technical documentation on supported applications, operational tools, and application deployment plans.
Respond to monitoring alerts according to defined playbooks and procedures.
Participate in Root Cause Analysis (RCA) processes.
Manage application deployment and RCA processes for specific applications.
Implement improved operational processes.
Maintain real-time application dashboards showing the overall health of the system.
Adhere to code standards to build better solutions.
Provide metrics reporting on application performance, availability, reliability, etc.
Conduct design reviews of operational approaches and solutions.
Contribute to operational standards and requirements.
Perform risk analysis and root cause analysis.
Make decisions on technical feasibility and approach.
Exhibit strong communication skills and the ability to explain complex technical issues to both technical and non-technical audiences.
Experience with data-intensive applications and proficiency in using Kafka, Kinesis, Amazon S3, and SQS for scalability.
Solid understanding of basic AWS networking principles to enhance system performance and reliability.
Capable of improving scalability to efficiently manage increased data and traffic demands.
Demonstrate a strong understanding of software design/architecture processes.
Preferred Qualification:
SRE experience with hands-on skills in debugging, checking logs, and metrics.
Understanding of the Observability stack and experience working with Splunk.
Experience with fluentbit/fluentd is advantageous.