Job Responsibilities
- Build pipelines in Spark and tune Spark queries.
- Possess advanced knowledge of application, data, and infrastructure disciplines.
- Design, build, and maintain ML models in production, working closely with modelers and business stakeholders.
- Apply knowledge of machine learning model frameworks, algorithms, and tools for building ML solutions.
- Create and maintain technical documentation.
- Contribute to the group’s knowledge base by finding new and valuable ways to approach problems and projects.
- Stay up-to-date with the latest advancements in GenAI and LLM technologies and incorporate them into our data engineering practices.
Required Qualifications, Capabilities, and Skills
- Formal training or certification on software engineering concepts and 3+ years applied experience.
- Background with Machine Learning Frameworks and Big Data technologies such as Hadoop.
- Strong experience in programming languages such as Java or Python.
- Experience with Python Machine Learning libraries and ecosystems (e.g., Pandas and Numpy).
- Experience with Cloud technologies such as AWS or Azure.
- Experience working with databases such as Cassandra, MongoDB, or Teradata.
- Knowledge of build tools like Maven and source control like Git/SVN.
Preferred Qualifications, Capabilities, and Skills
- Familiarity with modern front-end technologies.
- Exposure to cloud technologies.