1st shift (United States of America) Please review the following job description:
Responsible for building, optimizing and maintaining the data pipelines and aiding in building the data ecosystem for delivering enterprise data for wide consumption including developing data models, corresponding data architecture documents and API’s..
Build, manage, and implement the data and/or Big Data pipeline capabilities including data modeling, process design and overall data pipeline architecture and all phases of the ETL (extract, transform, and load) processes.
Lead efforts related to partially to completely automate repeatable data preparation and integration tasks. Partner with technology teams to understand data capture, testing needs, and to build and test end-to-end solutions.
Partner with engineers, data scientists, and the data office leadership to define and refine data architecture and technology choices.
Take a new perspective on existing solutions to solve problems of the highest complexity and exercise judgment based on the analysis (e.g. modeling, testing, etc.) of multiple sources of information and make recommendations.
Lead larger, more complex data engineering projects and initiatives with significant risks and resource requirements.
Requirements
Must have a Bachelor’s degree in Computer Science, Computer Engineering, or related technical field.
Must have 8 years of progressive experience in development or IT consulting positions performing/utilizing the following:
SQL, relational databases, ETL/ELT architecture, data integration concepts and big data concepts.
Planning and Managing Enterprise Data Lake projects
Programming and scripting languages, including Spark, Scala, Python, Hive, Hadoop, Bash, and UNIX.
Data quality tools for data profiling, cleansing and standardization.
Data acquisition, transformation, and storage design using design principles, patterns, and best practices.
Working within a team environment and interacting with data professionals and business data SME’s throughout the organization.
CI/CD pipelines, including GitHub and Jenkins.
Agile methodologies and short release cycles.
Utilizing Netezza and MS Excel
Position may be eligible to work hybrid/remotely but is based out of and reports to Truist offices in Raleigh, NC. Must be available to travel to Raleigh, NC regularly for meetings and reviews with manager and project teams within 24-hours’ notice.