המקום בו המומחים והחברות הטובות ביותר נפגשים
Senior DEs are:
Deep technical experts and thought leaders that help accelerate adoption of the very best engineering practices, while maintaining knowledge on industry innovations, trends and practices
Visionaries, collaborating on Capital One’s toughest issues, to deliver on business needs that directly impact the lives of our customers and associates
Role models and mentors, helping to coach and strengthen the technical expertise and know-how of our engineering and product community
Evangelists, both internally and externally, helping to elevate the Distinguished Engineering community and establish themselves as a go-to resource on given technologies and technology-enabled capabilities
Responsibilities:
Play a pivotal role in setting the roadmap and overseeing the day-to-day management of our AI and ML platforms including: setting strategies and overseeing container management in public cloud (AWS), cloud resource provisioning, ensuring low latency, high availability of cloud resources, cloud optimization, etc.
Maintain a deep understanding of the technical aspects of the platform, including infra, algorithms, APIs and integrations. Provide operations leadership to the engineering and production teams.
Implement robust processes and operations dashboard to monitor platform performance, user feedback, and adherence to service level agreements (SLAs), observability, resiliency, and key operational metrics in real time.
Collaborate with cyber, technology risk management, security and compliance teams to understand the company cyber, risk and compliance requirements. Work closely with product and engineering to ensure the platform adheres to industry best practices, corporate cyber and tech risk managementstandards
Implement automation and dashboards to visualize vulnerabilities, platform incidents, cloud controls compliance, cloud resource utilization etc. to enable proactive decision making, and risk mitigation.
Work closely with executive leadership and buy-ings to develop a long-term vision and roadmap for the platform operations enhancements
Build a high performing operations team, recruiting world class SREs, production engineers, data engineers, groom, and retain talent on team
Basic Qualifications:
Bachelor’s Degree
At least 9 years of experience managing Platform, infrastructure operations or Site Reliability Engineering.
At least 5 years’ experience with public cloud technologies
Preferred Qualifications:
Master's Degree in “STEM” field (Science, Technology, Engineering, or Mathematics)
5+ years of experience in managing large-scale, high-performance, distributed systems as a Site Reliability Engineer or a product engineer
5+ years years experience in setting up and scaling observability platform, providing monitoring and telemetry support and creating Operational health dashboards
3+ years experience in building systems and solutions within a regulated environment
3+ years of experience in Artificial Intelligence, Machine Learning or Cloud infrastructure
3+ years experience with managing distributed systems, multi-tenant, micro services, and container orchestration (Kubernetes)
Experience partnering with technology peers responsible for data architecture and distributed computing infrastructure or platforms
5+ years of experience with machine learning lifecycle(building, training models, serving models, setting up cloud infrastructure or data pipelines) and familiarity with major Machine Learning frameworks
. Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.
If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at 1-800-304-9102 or via email at . All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.
משרות נוספות שיכולות לעניין אותך