Required Qualifications, Capabilities, and Skills
- Formal training or certification in software engineering concepts and 2+ years of applied experience
- Basic knowledge of the data lifecycle and data management functions
- Advanced SQL skills (e.g., joins and aggregations)
- Working understanding of NoSQL databases
- Significant experience with statistical data analysis and the ability to determine appropriate tools for the analysis
- Basic knowledge of data system components to determine controls needed
- Experience with Big Data technologies (Spark architecture, performance tuning, Spark SQL, Streaming, Kafka, entitlements, etc.)
- Experience in Python/Scala/Java
- Data-driven mindset, with experience in data analysis and data mining
- Experience developing, deploying, and monitoring large distributed and parallel systems, with DevOps knowledge
- Proven experience implementing a production-ready, highly scalable, end-to-end solution on a big data platform
Preferred Qualifications, Capabilities, and Skills