DESCRIPTIONMajor responsibilities Leverage Bandits and Reinforcement Learning for Experimentation and Optimization Systems. Develop offline policy estimation tools and integrate with reporting systems. Establish scalable, efficient, automated processes for large scale...