MS or PhD in an appropriate technology field (Computer Science, Statistics, Applied Math, Operations Research, etc.).
At least 8+ years of experience with data science for MS or at least 4+ for PhD.
Experience in modern advanced analytical tools and programming languages such as R or Python with scikit-learn.
Efficient in SQL, Hive, or SparkSQL, etc.
Application building using LLM’s
Comfortable in Linux environment
Experience in data mining algorithms and statistical modeling techniques such as clustering, classification, regression, decision trees, neural nets, support vector machines, anomaly detection, recommender systems, sequential pattern discovery, and text mining.
Solid communication skills: Demonstrated ability to explain complex technical issues to both technical and non-technical audiences.
Preferred Additional Experience
Apache Spark
The Hadoop ecosystem
Java
HP Vertica
TensorFlow, reinforcement learning
Ensemble Methods, Deep Learning, and other topics in the Machine Learning community