Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Microsoft Senior Data Applied Scientist 
United States, Washington 
841297546

20.11.2024
Data and Applied Scientist


Maintaining a comprehensive, rich, diverse, clean, and fresh index presents significant challenges that require a strategic approach. We must balance various improvement pillars while making necessary tradeoffs to ensure scalability and efficiency for billions of images. Our commitment to innovation and imagination is tempered by the need to be mindful of cost and latency. Our challenges are diverse and complex, including:

  • Image Content Discovery: Efficiently and swiftly identifying a wide array of Image content by crawling the internet.
  • AI Selection and Ranking Models: Training and developing SLM, LLM, Deep learning, and ML models for index selection and document understanding leveraging text and pixel information.
  • Real-Time Indexing: Designing and implementing models and pipelines that support near-real-time (NRT) indexing and ensure content freshness.
  • AI Integration: Innovating methods to leverage AI within platform for optimal inference.

This is a great opportunity for someone who loves to tackle deep technical challenges and strives for industry-wide impact. In this role, you will be instrumental in bringing new AI technology into production, pushing the boundaries of state-of-the-art (SOTA) both within the company and in the broader industry to fulfill Multimedia product requirements. You will play a significant role in the research, design, development, data analysis, feature creation, and implementation of applied research projects, applying scientific principles and techniques to solve real world problems. You will create Deep learning, SLM, LLM, and multimodal models to analyze and interpret various forms of content to enhance system capabilities, manage models and data at a Pentabyte scale, and address the extreme challenges posed by new generation AI systems.

Required Qualifications:

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research)
    • OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
    • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
    • OR equivalent experience.
  • 1+ years experience developing and deploying AI products or systems at multiple points in the product cycle from ideation to shipping.
  • 1+ years experience developing AI models in live production systems, as part of a product team.

Preferred Qualifications:

  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical, or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
    • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical, or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
    • OR equivalent experience.
  • 3+ years experience developing and deploying AI products or systems at multiple points in the product cycle from ideation to shipping.
  • 3+ years experience developing AI models in live production systems, as part of a product team.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:Microsoft will accept applications for the role until November 28, 2024.


Responsibilities
  • Master a broad area or research and understand any applicable research techniques. Serve as a team expert on changes in industry trends, products, and other advances, and apply this knowledge to influence product needs.
  • Review business and product requirement, incorporate research, and provide strategic direction for problem solving. You’ll also ensure scientific rigor, support the development of methods, and apply your expertise to support business impact.
  • Identify and inspire peers and new research talent to join Microsoft, build relationships, and advocate for research initiatives. Share research findings through industry outreach, collaborate with the academic community, and help develop the recruiting pipeline.
  • Document work and experimentation results and share findings to promote innovation. Provide guidance when capturing processes and contribute to ethics and privacy policies related to research processes and data collection.
  • Research and develop LLMs (Large Language Models), SLMs (Small Language Models), ML (Machine Learning), DL (Deep Learning) techniques to extract document data, parsing, clustering, selection, ranking, etc.
  • Enable everyday experimentation on petabytes of web multimedia index data using Big Data technologies. Perform data analysis and experimentation for business centric metrics and deliver SOTA (State of the Art) models for Image index platform.
  • Collaborate with other teams in Microsoft and work with other engineers, scientists and product managers.