In this role, you will:
- Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area
- Contribute in increasing system efficiencies and lowering the human intervention time on related tasks
- Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability
- Work with vendors and other technical personnel for problem resolution
- Lead team to meet technical deliverables while leveraging solid understanding of technical process controls or standards
- Collaborate with vendors and other technical personnel to resolve technical issues and achieve highest levels of systems and infrastructure availability
Required Qualifications:
- 4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Desired Qualifications:
- Lead and participate in incident response activities, including identifying, investigating, and resolving incidents to minimize impact on service availability and performance. Conduct post-incident reviews (postmortems) to identify root causes and implement preventative measures.
- Define and monitor SLOs and SLIs for critical services to ensure they meet performance and reliability targets. Regularly review and adjust these metrics as necessary
- Continuously evaluate and improve processes, tools, and infrastructure to enhance reliability, efficiency, and scalability. Stay up-to-date with industry trends, emerging technologies, and best practices, and drive innovation within the organization.
- Monitor system health and performance using monitoring tools and alerting systems, and respond promptly to alerts and incidents.
- Drive efficiency by automating repetitive tasks and processes.
- Evaluate and implement technology options for managing our enterprise SaaS products in the cloud.
- Enhance our platform by identifying areas for improvement based on monitoring data.
- Work closely with the development team to create a development environment that fosters productivity and innovation.
- Propose and drive adoption of new solutions that enhance our platform.
Job Expectations:
- Hold a Bachelor degree in Engineering or a related field.
- Minimum 4 years of relevant experience in Platform Engineering, SRE, and/or DevOps in production environments.
- Expertise in Clous setup with 3+ years of hands-on experience.
- Proven track record of owning the uptime of distributed cloud-based systems.
- Possess at least 3 years of experience with scripting languages and related automation projects.
- Experience in building and using Observability frameworks for a microservice based distributed cloud setup with tools such as Prometheus, Grafana, AppDynamics, Splunk etc.
- Proficient in setting up and managing CI/CD pipelines and deployment tools (e.g., Jenkins, Git, GitHub etc).
- Strong Database knowledge is required :Oracle / MongoDB
- Experienced is 24x7 Support model for Cloud uptime and maintenance activities
- Strong spoken and written English communication skills.
- Self-driven, responsible, eager to learn, and proactive.
- Independent, goal-oriented, and proactive attitude.
- Disciplined and effective in remote work environments.
10 Jul 2025
Wells Fargo Recruitment and Hiring Requirements:
b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.