Job responsibilities
- Generates data models for their team using firmwide tooling, linear algebra, statistics, and geometrical algorithms
- Delivers data collection, storage, access, and analytics data platform solutions in a secure, stable, and scalable way
- Implements database back-up, recovery, and archiving strategy
- Evaluates and reports on access control processes to determine effectiveness of data asset security with minimal supervision
- Adds to team culture of diversity, equity, inclusion, and respect
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 5+ years of applied experience
- Working experience with both relational and NoSQL databases
- Experience across the data lifecycle
- Experience with batch and real-time data processing using Spark or Flink
- Working knowledge of AWS Glue and EMR for data processing
- Experience working with Databricks
- Experience working with Python/Java, PySpark, etc.
- Advanced SQL skills (e.g., joins and aggregations)
- Significant experience with statistical data analysis and the ability to determine appropriate tools and data patterns for performing analysis
Preferred qualifications, capabilities, and skills
- Experience with database back-up, recovery, and archiving strategy
- Proficient knowledge of linear algebra, statistics, and geometrical algorithms