Unique opportunity to contribute to the development of a Foundation Model on structured business data
Design, develop and manage tools and data pipelines that enable the processing of tabular data at scale
Extract, clean, and analyze data from various sources, including SAP S/4HANA, while ensuring high quality. The data is then processed, enriched, and used as training and benchmark data for the Tabular Foundation Model.
In close partnership with LoB domain experts from the S/4HANA development organization, understand and interpret SAP S/4HANA data, metadata, and other development artifacts.
Extract relevant information from SAP S/4HANA for further processing and enrichment of training data for the Tabular Foundation Model.
Work closely with data scientists to share SAP S/4HANA data insights and specifics, enabling the model to be accurately tailored to S/4HANA data
What you bring
PhD or Master’s degree in Computer Science, Artificial Intelligence, or other relevant disciplines.
2+ years of related professional experience
General understanding of the SAP S/4HANA backend and data model (e.g., VDM, CDS Views, RAP, OData) as well as the Data Dictionary is required.
Proficiency in SQL and software engineering in Python
Hands-on experience with a cloud stack (AWS, GCP, or Azure) and with handling large-scale data (>1M files; >10TB volume) is a plus
Experience with data modelling, ABAP, Machine Learning and/or Knowledge Graph technologies (e.g., RDF, SPARQL) is a plus.
Strong communication, coordination and collaboration skills, with the ability to work effectively in cross-cultural teams.