As a Technology Support Lead on the AI/ML & Data Platform team, you will play a leadership role in ensuring the operational stability, availability, and performance of our production services. Critical thinking will be key as you oversee day-to-day maintenance of the firm’s systems and identify, troubleshoot, and resolve issues to ensure a seamless user experience.
Job responsibilities
- Lead teams of technologists that provide end-to-end application or infrastructure service delivery for the successful business operations of the firm
- Execute policies and procedures that ensure operational stability and availability
- Monitor production environments for anomalies, address issues, and evolve the use of standard observability tools
- Escalate and communicate issues and solutions to business and technology stakeholders, actively participating from incident resolution to service restoration
- Lead incident, problem, and change management in support of full stack technology systems, applications, or infrastructure
Required qualifications, capabilities, and skills
- Formal training or certification in technology support concepts and 5+ years of applied experience
- Experience in supporting applications distributed across multiple servers or hosted on cloud platforms, particularly Amazon Web Services (AWS), including managing, troubleshooting, and optimizing for high availability and performance.
- Proficiency in using Unix commands and writing shell scripts to automate tasks, manage system operations, and enhance productivity, including creating scripts for routine maintenance, monitoring, and deployment.
- Expertise in crafting complex SQL queries and a deep understanding of Oracle Database concepts such as indexing and execution plans, crucial for optimizing database performance and efficient data retrieval.
- Hands-on experience in developing automation utilities using shell scripting, Python, or other programming tools to streamline processes, reduce manual intervention, and improve operational efficiency.
- Familiarity with scheduling tools like Autosys and Control-M for automating job scheduling and workflow management, ensuring tasks run on time and in the correct order to minimize downtime and maximize resource utilization.
Preferred qualifications, capabilities, and skills
- AWS Certified Cloud Practitioner certification.
- Experience with monitoring and visualization tools such as Geneos, Grafana, Datadog, Splunk, Kibana, and AWS CloudWatch.
- Familiarity with application performance monitoring tools like AppDynamics and Dynatrace.
- Familiarity with the ServiceNow tool and environment.
- Familiarity with deploying, monitoring, and supporting cloud-based applications.