Key job responsibilities
* Design and build tools to improve overall service availability and operational stability.
* Collaborate with service teams to achieve Full Continuous Deployment of tier 1/tier 2 services.
* Improve service resiliency by managing security, availability and other risks.
* Own Infrastructure Market Rate for all services on the team, exploring efficiency opportunities across various products/platforms.
* Own the initiatives to reduce operational burden and improve the quality of resolution to customer contacts.
* Own and operate the entire production operational space including monitoring, alarming, dashboard generation, documentation and other engineering excellence areas.
* Collaborate with service teams on design reviews, to make the product platform robust and scalable.
- 5+ years of deploying and operating in a Linux/Unix environment experience
- 5+ years of development/programming/scripting language (Python/Java/Bash/Perl) experience
- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
- * 3+ years of experience with Relational, Object-Relational, and non-relational databases
- * Experience building with AWS services (S3, DynamoDB, CloudFront, EC2, Lambda)
- * Ability to dive deep to analyze complex issues, solve problems, and automate repetitive tasks
- * Proven ability to troubleshoot large distributed systems and dive code to identify root causes
- * Experience with service-oriented architecture and current web service technologies
- * Experience building and supporting distributed systems running on complex, large-scale networks
- * Excellent documentation skills
- * Excellent problem-solving skills with a strong attention to detail
- * Experience with agile software development practices
משרות נוספות שיכולות לעניין אותך