Job Description
What will you do?
- Architect, implement, and optimize cloud storage and computing solutions within an AWS environment, ensuring robust performance for research applications.
- Take ownership of system configuration and health, especially in Linux environments, to ensure high availability and reliability.
- Work closely with research scientists and stakeholders to understand their computational needs and translate these needs into technical specifications and project requirements.
- Develop and communicate a clear product strategy and roadmap that aligns with research goals, ensuring all stakeholders are informed and engaged.
- Lead technical implementation efforts within an agile framework, conducting sprint planning, backlog refinement, and retrospectives.
- Collaborate with product managers and engineers to create a feedback loop that informs ongoing development, product improvements and realized value .
- Analyze the total cost of ownership for all solutions and make recommendations to ensure cost-effective operations without compromising quality.
- Implement best practices for cloud resource management, performance monitoring, and fault tolerance.
- Research and integrate new technologies and methodologies that enhance cloud solutions and improve user experience.
- Provide mentorship and guidance to junior engineers, fostering an environment of collaboration and innovation.
Qualifications, Skills & Experience Required
- University degree in a relevant engineering or computer science field
- At least 8 years of experience in cloud engineering, with strong hands-on skills in AWS services, cloud architecture, and HPC
- Proven track record of product management experience or being very close partner, including working directly with technical teams to deliver solutions
- Familiarity with agile methodologies and experience leading cross-functional teams in a technical setting
- Strong analytical skills to assess performance, identify issues, and implement solutions
Nice to have
- Experience with scientific applications such asGaussianfor computational chemistry andGracefor scientific data visualization and plotting, especially in cloud environments
- Proficiency in container technologiesSingularityorApptainerfor managing scientific software environments and workflows
- Hands-on experience with HPC job schedulers such asPBSandSlurmfor effective workload management in high-performance computing environments, and experience usingNextflowfor managing data-driven workflows
- Familiarity withKubernetes
- In depth understanding ofGPUdriven workloadsincluding performance observability, performance tuning and scalingparticularly for AI/ML and computational research use cases
- Experience with AI/ML platforms (such as Databricks) and their use in scientific research
- Knowledge of automation and scripting tools to improve efficiency in cloud operations and scientific workflows
What we offer
- Exciting work in a great team, global projects, international environment
- Opportunity to learn and grow professionally within the company globally
- Hybrid working model, flexible role pattern (e.g., even 80% full-time is possible in justified cases)
- Pension and health insurance contributions
- Internal reward system plus referral programme
- 5 weeks annual leave, 5 sick days, 15 days of certified sick leave paid above statutory requirements annually, 40 paid hours annually for volunteering activities, 12 weeks of parental contribution
- Cafeteria for tax free benefits according to your choice (meal vouchers, Lítačka, sport, culture, health, travel, etc.), Multisport Card
- Vodafone, Raiffeisen Bank, Foodora, and Mall.cz discount programmes
- Up-to-date laptop and iPhone
- Parking in the garage, showers, refreshments, massage chairs, library, music corner
- Competitive salary, incentive pay, and many more
Refer this job!
Current Contingent Workers apply
Agile Application Development, Agile Application Development, Agile Methodology, Amazon Web Services (AWS), Availability Management, Business, Capacity Management, Change Controls, Cloud Engineering, Cloud Services Management, Computational Chemistry, Design Applications, High Performance Computing (HPC), Incident Management, Information Management, Information Technology (IT) Infrastructure, IT Service Management (ITSM), Management Process, Product Management, Product Strategies, Release Management, Scientific Software Development, Slurm Workload Manager, Software Development, Software Development Life Cycle (SDLC) {+ 5 more}
*A job posting is effective until 11:59:59PM on the dayBEFOREthe listed job posting end date. Please ensure you apply to a job posting no later than the dayBEFOREthe job posting end date.