Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Amazon Software Development Engineer Open Data Analytics - Engines 
United States, Washington, Redmond 
851694536

Today
DESCRIPTION

Athena and EMR are services that our customer use to run large scale analytics, leveraging open source engines like Apache Spark and Trino, with datalake open table formats like Apache Iceberg, Hudi and Delta. The analytics engines organization makes significant modifications to these engines to run in serverless environments and with superior performance and scalability than what is available in Open Source. In the last 3 years we have improved our engines by a factor of 5x by making changes to the optimizer, query runtime and storage connectors. We have also made significant changes to the compiler to enable enterprise features like fine grain access control with these engines and table formats. Additionally, we strive to regularly contribute features, bug fixes and optimizations back to open-source, as well be current with the latest open-source versions of these frameworks. This is a “must-win” strategic area in a growing and very technical space.Key job responsibilities
• Develop and optimize core components of query engines and open table formats (Iceberg, Hudi, Delta) to enhance performance, scalability, and reliability.
• Design and implement innovative solutions and algorithms to improve feature capabilities, stability, and security in table format integrations with query engines.
• Collaborate with the open-source community, contributing to discussions, driving improvements, and integrating upstream changes.
• Ensure data consistency and durability while achieving breakthrough performance and scalability for large-scale data lake workloads.
• Improve the organizations automation and testing capabilities.
• Manage complex deliverables project and research projects with deadlines.
• Mentor and train other team members on design techniques and coding best practices.
• Be a point of contact for challenging customer issues related to data lake workloads and query engine.
Solve challenging technical problems, often ones not solved before, at every layer of the stack.
Design, implement, test, deploy and maintain innovative software solutions to transform service performance, durability, cost, and security.
Build high-quality, highly available, always-on products.A day in the life
As you design and code solutions to help our team drive efficiencies in software architecture, you’ll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You’ll also:Participate in design discussions, code review, and communicate with internal and external stakeholders.Work in a startup-like development environment, where you’re always working on the most important stuff.Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSWork/Life Balance
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

BASIC QUALIFICATIONS

- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 3+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience


PREFERRED QUALIFICATIONS

- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Experience in developing and operating distributed systems or applications at large scale
- Experience working on open table formats (Iceberg, Hudi, Delta) or query engines (Spark, Trino, Flink etc) is a huge plus
- Experience contributing to open source code bases, and collaborating with open source communities