Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

IBM Senior Data Engineer 
Greece, Attica, Athens 
265361373

11.09.2024

As a Data Scientist at IBM, you will help transform our clients’ data into tangible business value by analyzing information, communicating outcomes and collaborating on product development. Work with Best in Class open source and visual tools, along with the most flexible and scalable deployment options. Whether it’s investigating patient trends or weather patterns, you will work to solve real world problems for the industries transforming how we live.
We are looking for an experienced Data Engineer with 4+ years of hands-on experience in data engineering, with a focus on building scalable data pipelines, managing data quality, and optimizing data workflows. The ideal candidate will be proficient in big data technologies and cloud platforms, ensuring the reliable and efficient processing of large datasets.

• Utilize data management and processing capabilities in PySpark/SparkSQL to design, build, and optimize scalable data pipelines.
• Leverage the big data platforms for large-scale data processing, ensuring efficient data workflows and integration.
• Implement robust ETL processes to extract, transform, and load data from various sources into data lakes and warehouses.
• Identify, troubleshoot, and resolve data quality issues, ensuring the integrity and reliability of data across all pipelines.
• Optimize data storage and retrieval for both batch and real-time data processing.
• Work with diverse datasets, ensuring data availability and consistency for stakeholders.
• Collaborate with data scientists and analysts to enable advanced analytics and machine learning models through well-engineered data pipelines.
• Collaborate with the client and other vendor teams in complex projects and lead the technical solution for data management & migration projectsRequired Technical and Professional Expertise

• Bachelor’s degree or higher in Computer Science, Information Technology, or a related field.
• 4+ years of hands-on experience in data engineering and data pipeline development.
• Proficiency in PySpark/SparkSQL/SQL for big data processing and optimization.
Familiarity with cloud platforms, particularly Azure, for deploying and managing data infrastructure.
Solid understanding of ETL processes, data warehousing, and data lake architectures.
• Experience with Databricks or similar big data platforms.
• Experience in managing data quality
• Experience in development with MSSQL server & SSIS

Preferred Technical and Professional Expertise

• Working experience in Banking sector projects
• Azure, Databricks certification or equivalent experience is highly preferred.
• Experience with version control systems (e.g. Git) and CI/CD pipelines for data engineering.
• Cluster Tuning experience, for optimization of performance