As a Machine Learning Engineer within our platform operations team, you will be tasked with the design, construction, and maintenance of our AIOps solution. This role demands a profound knowledge of AI/ML technologies, IT infrastructure, and platform engineering.
Job Responsibilities:
- Design, develop, and maintain software applications with integrated AI/ML capabilities with a key focus on AIOps.
- Collaborate with cross-functional teams to gather and analyze requirements, translating them into technical solutions.
- Develop and implement software architecture and design patterns to ensure scalability and performance.
- Write clean, maintainable, and efficient code in one or more general-purpose programming languages: Python, Java, C, C++, Go.
- Implement and manage data pipelines to preprocess and transform data for AI/ML models.
- Integrate AI/ML models into software applications and ensure seamless deployment into production environments.
- Optimize software applications for performance, reliability, and scalability.
- Conduct code reviews and provide technical guidance to junior developers.
- Stay up-to-date with the latest advancements in software engineering and AI/ML technologies.
- Ensure adherence to software development best practices, including agile and lean methodologies.
- Apply SRE principles to enhance system reliability, performance, and availability. Implement monitoring and alerting solutions to proactively identify and resolve issues.
Required Qualifications, Capabilities, and Skills:
- Bachelor’s degree in Computer Science or equivalent practical experience.
- Strong experience working with multiple technology teams in a matrixed organization.
- 7+ years of experience as a software developer with a focus on integrating AI/ML solutions.
- Strong programming skills in Python, with experience in developing and maintaining production-level code.
- Experience with application architecture and design patterns.
- Proficiency in working with large datasets and data preprocessing.
- Solid understanding of AI/ML algorithms and techniques, including deep learning and natural language processing.
- Familiarity with AI/ML libraries and frameworks, such as TensorFlow, PyTorch, scikit-learn, and Keras.
- Experience with cloud platforms, such as AWS or Azure, for deploying and scaling software applications.
- Strong problem-solving and analytical skills and excellent communication and collaboration skills.
Preferred Qualifications, Capabilities, and Skills:
- Familiarity with DevOps practices for software deployment and monitoring.
- Experience with SQL and NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka.
- Experience with distributed computing frameworks, such as Apache Spark.
- Experience in implementing SRE practices to improve system reliability and performance.