Expoint - all jobs in one place

MSD AI/ML Platform Engineer 
Czechia 
664654033

15.12.2024

Job Description

Join the Advanced Data Analytics (ADA) Product Line as a Platform Engineer and help shape the future of the AI/ML development experience for researchers and data science teams across all divisions. The Product Line's mission is to enable users to focus on delivering value to stakeholders and to accelerate the development process. As a member of the Platform Engineering team, you will drive continuous delivery, site reliability, and cost efficiency of AI/ML platforms hosted on AWS. You will develop new platform capabilities and collaborate with our Data Science teams to deliver accelerators for our users. Our web-based accelerators range from CI/CD and MLOps to fully customizable end-to-end Retrieval-Augmented Generation pipelines. Next year, we plan to extend these capabilities with AI agents and to develop procedures for hosting and customizing LLMs on our platforms in a self-service way.

Your expected career path is to grow professionally and take responsibility for delivery leadership in one of the platforms or initiatives as a subject matter expert. The north star of your capabilities is the ability to drive delivery of new products, ensure compliance, and fulfill stakeholder requirements.

  • Infrastructure: AWS, AWS China, On-premise

  • Product Line: Dataiku, Databricks, Domino, Posit Cloud & On-prem, SAS, AWS OpenSearch, JMP, and Alteryx

  • Continuous Delivery: Terraform, Ansible, CloudFormation, Docker, Bash, Python, GitHub Actions

  • Observability: Elasticsearch, CloudWatch

  • Quality Engineering: X-Ray, Robot Framework, Selenium Grid

  • Elastic Compute: Kubernetes (Karpenter), Slurm, Databricks Clusters

  • Distributed Processing: Spark, Ray

  • Product Line Insight Database: Redshift

  • Operating Systems: Alma Linux, Red Hat, Amazon Linux, Bottlerocket

Responsibilities

  • Assess and deliver assigned tasks

  • Conduct root cause analysis

  • Participate in code reviews and technical discussions

  • Improve code maintainability, security, reliability, and platform cost efficiency

  • Collaborate to enhance practices within the engineering team

  • Automate and simplify the maintenance and lifecycle of platform services

  • Keep up with current industry trends, cloud-native concepts, best practices, and technologies

  • Ensure compliance with the System Development Lifecycle (SDLC) and company policy standards

  • Maintain up-to-date Design & Configuration Specifications

  • Assist customers and colleagues

Must-Have Qualifications

  • Self-sufficiency

  • A proactive and delivery-oriented mindset

  • Ability to effectively work in a remote environment with a global team

  • Capability to review and understand system requirements and business processes

  • Hands-on experience with:

    • Git, Docker, Ansible, Terraform, shell scripting, Python, or similar (even more modern) tooling

    • AWS Services (VPC, CloudWatch, ALB, Route53, S3, IAM, EKS, etc.) or Azure or Google Cloud

    • Networking

    • Linux system administration

  • Bachelor’s degree or equivalent in Computer Science, Computer Engineering, Information Systems, or related experience

Nice-to-Have Qualifications

  • Data Science Experience

  • AI/ML and data processing platforms

    • MLOps

    • LLMOps

    • RAGs

    • Agent Frameworks

    • Vector Databases

  • Statistical analysis tools

  • Karpenter, Slurm, Dask, GitHub Actions or Jenkins

  • Software Development

    • Design Patterns

    • UI & Visualizations

  • Data Engineering

    • Spark, Ray, Dask

What we offer:

  • Exciting work in a great team, global projects, international environment

  • Opportunity to learn and grow professionally within the company globally

  • Hybrid working model, flexible role pattern (e.g., even 80% full-time is possible in justified cases)

  • Pension and health insurance contributions

  • Internal reward system plus referral program

  • 5 weeks of annual leave, 5 sick days, 15 days of certified sick leave paid above statutory requirements annually, 40 paid hours annually for volunteering activities, 12 weeks of parental contribution

  • Cafeteria for tax-free benefits of your choice (meal vouchers, Lítačka, sport, culture, health, travel, etc.), Multisport Card

  • Vodafone, Raiffeisen Bank, Foodora, and Mall.cz discount programs

  • Up-to-date laptop and iPhone

  • Parking in the garage, showers, refreshments, massage chairs, library, music corner

  • Competitive salary, incentive pay, and much more




Availability Management, Capacity Management, Change Controls, Design Applications, High Performance Computing (HPC), Incident Management, Information Management, Information Technology (IT) Infrastructure, IT Service Management (ITSM), Release Management, Software Development, Software Development Life Cycle (SDLC), Solution Architecture, System Administration, System Designs


*A job posting is effective until 11:59:59 PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.