You will report directly to our Engineering Fellow, and you'll work out of our Raleigh, NC location on a hybrid work schedule.
You will use your experience and judgment to plan and accomplish goals. You will also generate innovative solutions in work situations, trying different and novel ways to deal with problems and opportunities. This is a hands-on position on a green field project supported by a software development and data science team with plenty of domain experience. The ideal candidate will focus on design, delivery, and execution while balancing speed with wise planning, judgement, and risk management. This is a design and development position, not an operational position.
KEY RESPONSIBILITIES
- Analysis of data use cases to inform design
- Data analysis and quality characterization
- Data preparation including imputing missing values, feature extraction, normalization
- Feature definition and extraction
- Model building, testing, validation, and deployment
- Using AutoML or related technologies to accelerate model development and experimentation
- Implementing complex rules/domain logic on data – especially time series techniques and pattern matching/recognition
- Ability to code in Python, pyspark, pandas, and SQL
YOU MUST HAVE
- 2 years of experience working with big data, IoT data, or timeseries data
- 2 years data science or data engineering experience
WE VALUE
- Big data handling expertise – data ingestion, storage, batch, streaming analytics and data processing, and understanding how model performance can be scaled
- BS. or equivalent degree in Engineering or Computer Science
- Experience with wide variety of classification, regression, clustering, and anomaly detection techniques
- Experience with time series techniques like ARIMA , dynamic time warping
- Experience evaluating model performance
- Experience with data labelling and label generation or capture techniques
- Experience with Azure, Azure Databricks, Apache Spark, Azure Data Factory, Azure Synapse Analytics, Azure Data Lake Storage, Apache Spark for Azure Synapse, Azure HDInsight
- Experience with cybersecurity concerns in cloud systems including ingress, storage, and egress
- Experience with Kubernetes, helm, docker, containerization principles and micro-service architecture foundations
- Familiarity with cloud identity management solutions
- Experience handling PII data in cloud environments
- Experience working with DevSecOps teams to manage deployments
- Experience with continuous integration or continuous delivery processes for cloud-native software
- Experience with data modelling for warehousing
- Experience with Data Warehouse and/or Lake House architectures
- Experience with ETL processes
- Knowledge of software configuration management and change management practices
- Diverse and global teaming and collaboration
- Effective communicator
- Wide degree of creativity and latitude
- Individuals who are self-motivated and able to work with little supervision, who consistently take the initiative to get things done, do things before being asked by others or forced to by events.
- Ability to consistently make timely decisions even in the face of complexity, balancing systematic analysis with decisiveness.
- Can quickly analyze, incorporate and apply new information and concepts.
Additional Information - JOB ID: HRD249875
- Category: Data & Analytics
- Location: 208 South Rogers Lane,Raleigh,North Carolina,27610,United States
- Exempt