5+ years’ experience developing data engineering pipeline using technologies like Python, PySpark, Spark, SQL, DataBricks, AWS, Elastic Search, Kafka
5+ year’s experience working on BigData technologies
BS/MS in computer science or equivalent work experience
Strong experience with Python and Spark and ability to understand a large existing codebase.
Experience with the entire Software Development Life Cycle (SDLC)
Good experience building and supporting large scale enterprise data engineering pipelines and featurization
You should have a passion for Engineering and Operational Excellence. You will support what your team builds in production via quarterly on-call rotations to monitor production systems and resolve production errors & incidents
Solid communication skills: Demonstrated ability to explain complex technical issues to both technical and non-technical audiences
Strong understanding of the Software design/architecture process
Experience with unit testing & Test-Driven Development (TDD)