Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Honeywell Advanced Data Engineer 
United States 
211377923

10.07.2024
JOB DESCRIPTION

Advanced Data Engineer

You will be part of the IT Information Management & Analytics team and in this role, you will at the forefront of implementing state-of-the-art Generative AI, Machine Learning and AI Cognitive Services enhanced business use cases. This Advanced Data Engineer role will work with the business stakeholders, IT business partners, functional consultants, and infrastructure/technology teams to deliver innovative AI technology solutions that accelerate digital transformation and drive operational efficiency as well as business growth. You will focus primarily on the full cycle delivery of AI/ML use cases from requirements/design to release, driving the evolution of AI/ML infrastructure/process, and enabling Intelligent Apps & Automation, next generation of AI Copilots/Assistants, etc.

Collaborate with data scientists and AI engineers to build production-level models and pipelines, including any required preprocessing of input datasets and postprocessing after model inference.

Developing batch processing and real-time data processing solutions. Writing ETL (Extract, Transform, Load) processes, designing database systems, and implementing data pipelines for Gen AI & Class AI systems.

Optimizing data retrieval and developing dashboards for monitoring the quality of Data & AI systems. Tuning the performance of the data management systems to ensure high performance.

Work within project planning constraints, communicating any identified project risks and issues to the delivery/project manager and provide inputs to the change control process.

YOU MUST HAVE

  • Bachelors in Computer Science, Data Analytics, or Engineering fields
  • 4-6 years of experience in data engineering, IT, or software development for large corporate/organizations
  • 3+ years of experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
  • Experience with SQL, Python programming and other programming language, such as Java, Scala, NodeJS.
  • Hands-on experience in building robust, scalable data pipelines for LLM training, and vector database for RAG systems, etc

Experience with Databricks, Snowflake, Informatica

  • Experience in building IT use cases / solutions especially around AI/ML cognitive services, based on Cloud infrastructure and services such as Azure cloud platforms and Databricks.

WE VALUE

  • Master's Degree in Computer Science, Data Analytics or Engineering fields

Understanding of application development framework like LangChain, LlamaIndex, and knowledge of vector index, vector databases

Experience with big data technologies such as: Hadoop, Hive, Spark, EMR

Experience operating large data warehouse

  • Project experience with NLP/NLG, AI Conversational Agent (Chatbot), OCR
  • Experience with DevSecOps framework, develop CICD pipelines, orchestrate workflow using Airflow, Opsera
  • Experience with Docker and Kubernetes
  • Working Experience in an Agile/Scrum/Scaled Agile and DevOps based team environment
  • Certifications/ Course work in Generative AI, AI/ML and Cloud platforms
  • Great communication skills
Additional Information
  • JOB ID: HRD225861
  • Category: Engineering
  • Location: Lot 115 (P),Nanakramguda Village,,Serilinganpally Madndal, RR District,Hyderabad,TELANGANA STATE,500019,India
  • Exempt