Expoint - all jobs in one place

The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Amazon Software Development Engineer Open Data Analytics - Engines 
United States, California, East Palo Alto 
650747527

Yesterday
DESCRIPTION

Athena and EMR are services that our customer use to run large scale analytics, leveraging open source engines like Apache Spark and Trino, with datalake open table formats like Apache Iceberg, Hudi and Delta. The analytics engines organization makes significant modifications to these engines to run in serverless environments and with superior performance and scalability than what is available in Open Source. In the last 3 years we have improved our engines by a factor of 5x by making changes to the optimizer, query runtime and storage connectors. We have also made significant changes to the compiler to enable enterprise features like fine grain access control with these engines and table formats. Additionally, we strive to regularly contribute features, bug fixes and optimizations back to open-source, as well be current with the latest open-source versions of these frameworks. This is a “must-win” strategic area in a growing and very technical space.Key job responsibilities
• Develop and optimize core components of query engines and open table formats (Iceberg, Hudi, Delta) to enhance performance, scalability, and reliability.
• Design and implement innovative solutions and algorithms to improve feature capabilities, stability, and security in table format integrations with query engines.
• Collaborate with the open-source community, contributing to discussions, driving improvements, and integrating upstream changes.
• Ensure data consistency and durability while achieving breakthrough performance and scalability for large-scale data lake workloads.
• Improve the organizations automation and testing capabilities.
• Manage complex deliverables project and research projects with deadlines.
• Mentor and train other team members on design techniques and coding best practices.
• Be a point of contact for challenging customer issues related to data lake workloads and query engine.About the team
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Work/Life Balance

BASIC QUALIFICATIONS

- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 3+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience


PREFERRED QUALIFICATIONS

- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Experience in developing and operating distributed systems or applications at large scale
- Experience working on open table formats (Iceberg, Hudi, Delta) or query engines (Spark, Trino, Flink etc) is a huge plus
- Experience contributing to open source code bases, and collaborating with open source communities