Job responsibilities
- Supports review of controls to ensure sufficient protection of enterprise data
- Advises and makes custom configuration changes in one to two tools to generate a product at the business or customer request
- Updates logical or physical data models based on new use cases
- Frequently uses SQL and understands NoSQL databases and their niche in the marketplace
- Adds to team culture of diversity, equity, inclusion, and respect
Required qualifications, capabilities, and skills
- Formal training or certification on data lifecycle concepts and 2+ years applied experience
- Experience across the data lifecycle
- Experience with Batch and Real time Data processing with Spark or Flink.
- Working knowledge of AWS Glue and EMR usage for Data processing.
- Experience working with Databricks.
- Experience working with Python/Java, PySpark etc.,
- Advanced at SQL (e.g., joins and aggregations)
- Working understanding of NoSQL databases
- Significant experience with statistical data analysis and ability to determine appropriate tools and data patterns to perform analysis.
Preferred qualifications, capabilities, and skills
- Worked with building Data lake, built Data platforms, built Data frameworks, Built/Design of Data as a Service API