The point where experts and best companies meet
Share
Key job responsibilities- Advance the state of data engineering infrastructure on the latest tools (AWS Glue, Spark, AWS EMR, EMR Serverless, Ray etc...)
- Improve and raise the bar on proactive data quality monitoring and alarming (e.g., anomaly detection) across big data pipelines- Defining the future of Prime Science data architectureA day in the life
As a data engineer on this team, you will collaborate closely with Prime business leaders, scientists (economists, research scientists, applied scientists) and engineering leaders to build data solutions. You will leverage AWS technologies (EMR, EC2, S3, GLUE, KMS, Lambda, DynamoDB, etc.) to build novel systems and tackle challenges at scale. You will manipulate and process TB-sized data, supporting real-time access and orchestration across multiple systems. Your work will enhance our scientific models and data applications. As a consequence, you will have global impact, improving customer experiences for Prime members worldwide. As a successful candidate, you will successfully interact with both technical and business stakeholders.
- 5+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience with SQL
- Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
- Experience mentoring team members on best practices
- Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
- Experience operating large data warehouses
- Knowledge of distributed systems as it pertains to data storage and computing
- Knowledge of professional software engineering & best practices for full software development life cycle, including coding standards, software architectures, code reviews, source control management, continuous deployments, testing, and operational excellence
These jobs might be a good fit