As a Data Engineer in our Finance team you will spend each day defining, refining and delivering set goals for our firm.
Job Responsibilities :
- Develop data pipelines using Python for querying, cleaning, aggregating, and merging data with Spark/PySpark to support Machine Learning model development and deployment
- Develop and execute production-quality pipeline using Python for deploying Machine Learning models at scale
- Conduct research and development with emerging technologies, determine their applicability to business use cases, document and communicate their recommended use in the firm
- Troubleshoot/debug legacy code in Python/Pyspark/SQL
- Become a subject matter expert on data sources and best practices corresponding to a LOB or business function.
- Collaborate with fellow team members to develop skills and transfer knowledge
- Deliver high-quality solutions under tight deadlines and communicate project status and timelines effectively
- Mentor junior team members
Required qualifications, capabilities, and skills
- BS or MS in STEM field with quantitative background or equivalent knowledge in Big Data, Software Development, or Machine Learning at Scale
- Minimum 3 years of experience in Big Data, Software Development, or Machine Learning at Scale
- Strong knowledge of data structures, algorithms and other foundational concepts of Computer Science
- Strong design, coding, debugging, and analytical skills
- Excellent command over SQL, Python and Pyspark coding languages
- Expert in Big Data technologies and their best practice application (Hadoop, Spark, Hive, HDFS, etc.)
- Knowledge of version control tools and processes (e.g. Subversion, Git)
- Technical understanding or experience in common RDBMS systems; (e.g. Teradata, Oracle)
Preferred qualifications, capabilities, and skills
- Practical experience with atleast one Cloud solutions (AWS, GCP, Azure), certification is a plus.
- Effective communication and presentation skills. Can work both independently and collaboratively.
- Good project management skills (clear goal setting, well-organized, detailed planning, and ability for tight-timeline deliverables).