המקום בו המומחים והחברות הטובות ביותר נפגשים
We’re looking for a
Data Transformation and Analysis:
Design, develop, and maintain performant data pipelines for collecting, processing, and transforming data from various sources into a structured format
Assemble large, complex data sets that meet functional / non-functional business requirements
Provide clean data sets to end users, modeling data in a way that empowers end users to answer their own questions
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Apply software engineering best practices to analytics code (e.g. version control, testing, continuous integration)
Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics
Performance Optimization:
Identify and address performance bottlenecks in data processing, analytics, and reporting to ensure efficient and responsive user experiences
Continuously refine and optimize data pipelines for speed, accuracy, and scalability
Data Governance and Security:
Ensure data privacy and security by designing and implementing access controls in compliance with data protection regulations
Collaborate with the security team to identify and mitigate potential risks related to data handling
Data Management:
Maintain data documentation and definitions
Interface with data stewards to ensure metadata is trustworthy and current
AI/ML Infrastructure
Build and manage data architectures that facilitate machine learning model training, testing, and deployment
Implement and optimize data storage solutions tailored for AI/ML applications
Deploy machine learning models into production environments
Bachelor or Master degree in Analytics, Computer Science, Mathematics, Management Information Systems or similar quantitative discipline
5+ years of experience with SQL, including exposure to Data Manipulation Language (DML) and Data Definition Language (DDL) statements. Advanced proficiency in writing Data Query Language (DQL) statements, with experience using common table expressions and window functions
Ability to compare different options and their trade-offs
Experience using dbt (data build tool), including designing and implementing dbt macros
Experience with one or more common data analysis languages (Python, R, Spark) and associated libraries/toolkits such as NumPy, pandas, tidyverse
Experience building ELT processes using tools like Airflow, SQL Tasks & Stored Procedures
Experience with containerization, including Docker, Kubernetes
Proficiency with Git, bash, and command line
Passion for designing efficient, modular, and maintainable systems
Ability to see how the little details impact the big picture
Experience collaborating with the business to translate goals into technical specifications
Experience and interest in problem formulation based on relatively abstract information and evaluating all possible solutions
Experience analyzing internal / external data and processes to answer specific business questions and identify opportunities for improvement
Comfort working with semi-structured datasets
Proven passion for building and learning: open source contributions, pet projects, self-education, Stack Overflow
Proficiency in Machine Learning (ML) and Natural language (NLP) libraries
Experience designing and implementing topic modeling solutions using algorithms such as Latent Dirichlet Allocation (LDA), BERTopic, or similar
Knowledge of advanced NLP techniques and their applications, such as sentiment analysis and text classification to complement topic modeling efforts
The following represents the expected range of compensation for this role:
משרות נוספות שיכולות לעניין אותך