Finding the best job has never been easier
Share
Key job responsibilities
- Lead the design, development, and maintenance of data pipelines that will collect, clean, and store data from multiple diverse sources.- Optimize data processing, storage, and retrieval solutions for scalability, cost, and performance tradeoffs.- Oversee and contribute to the implementation of data quality and validation mechanisms to ensure data and model integrity.
A day in the life
As a Senior Data Engineer, you will be a technical leader in data initiatives. You will lead data engineering activities in all agile ceremonies. This includes planning, daily updates, reviews, and retrospectives. You will work closely with data scientists, applied scientists, software engineers, and other data professionals. Together, you'll gather requirements, set goals, and track progress. You will conduct data exploration, profiling, and cleaning activities. These efforts will support data analysis and model building. You will design and implement data pipelines. When issues arise, you will troubleshoot and communicate findings to stakeholders. As a senior member, you will mentor other engineers and scientists. You will share your knowledge across teams and contribute to the Data Engineering Community of Practice across Amazon Robotics.1. Medical, Dental, and Vision Coverage
2. Maternity and Parental Leave Options
3. Paid Time Off (PTO)
4. 401(k) Plan
- 5+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
- Experience mentoring team members on best practices
- Experience with both SQL and NoSQL
- Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
- Experience operating large data warehouses
- Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
- Experience with AWS technologies such as Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM
These jobs might be a good fit