המקום בו המומחים והחברות הטובות ביותר נפגשים
You will be passionate about both development and operations, and have a strong desire to learn about Amazon's development lifecycle. This is a true DevOps role; you will be both allowed and expected to follow problems all the way to the end and work on implementing real solutions, not just quick fixes. You will want to dive deep and understand how these large systems work.Key job responsibilities
· Develop and deploy operational tools to automate processes and reduce operational overhead
· Hire, develop, and mentor team members.
· Work with other engineers to build and manage massively scaled services
· Work with other engineers to diagnose and resolve customer issues
· Track the health of our services, identify and fix problems on complex systems with massive scale
· Collaborate with some of the leading minds in distributed systems
· Various distros and their corresponding package management tools
· Permissions and permissions management
· System logging and auditing
· Log parsing using grep, awk, sed, cut, tr, etc
· Remote access via ssh
· Knowledge of procfs & sysfs
· Linux tools (ip, ifconfig, netstat, tcpdump, strace, netcat, ping, telnet, etc)
· System Monitoring Tools (top, *stat tools, etc)
· DNS and tools (bind, host, dig, nslookup, etc)
· Runlevel management tools (systemd, rc.d, inittab, chkconfig, etc)
· Git (or other CM tools)
· Agile development
· CI/CD or development pipelinesA day in the life
In this role, you will collaborate with Product teams to design solutions that make efficient use of resources and technologies. You will bring a detail-oriented approach and excellent problem-solving abilities, backed by a deep understanding of distributed system design and delivery. Your contributions will leverage cross-functional business and technical skills, assessing and managing risks, measuring and reporting on progress, anticipating and resolving bottlenecks, providing escalation management, making tradeoffs, and balancing business needs with technical constraints. You will drive engineering deliverables through influence rather than authority, manage repair requests in a high-volume environment, and respond to high-severity events. Regular presentations to senior leadership will require you to synthesize problems and solutions into a simple and consumable manner. Additionally, you will be responsible for data collection, analytics, and dashboard creation to help others identify the best paths for business improvement.Work/Life Balance
Mentorship & Career Growth
• Minimum 5 years as the Systems Engineering/DevOps leader in a management role.
• Experience in building and managing a team of strong technical people.
• Experience with analyzing operational processes and network events to identify and implement systems and tool improvements.
• 5+ years of prior programming experience with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby experience
• Experience with Agile engineering practices (Kanban, continuous delivery, etc.)
• Knowledge of Data Center Facilities Infrastructure (Rack level hardware) with working knowledge of enterprise ticketing system
• Project management, organization and problem solving skills with a drive for results and the ability to handle multiple tasks.
משרות נוספות שיכולות לעניין אותך