Expoint - all jobs in one place

Microsoft Research Software Developer II 
United States, Washington 
33850267

07.01.2025

Training world-class models requires world-class data. Even the best model architectures are useless if you don’t have the data to train them.

As a Research Software Developer II, you will work in the fields of data science, data mining, machine learning, deep neural networks, and natural language processing. You will collaborate directly with experienced Machine Learning, NLP, and Machine Translation scientists. This position requires work in both research and engineering domains.

If you are excited about making a real difference in the quality of our system, we would love to hear from you.

Required/Minimum Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C# or Python.
    • OR equivalent experience.
  • At least one of the following:
    • Experience with web-scale data mining / data science (Hadoop MapReduce, Scope, Spark, R, etc.)
    • OR experience working in AI, Machine Learning, NLP, or Machine Translation, preferably in a production environment

Other Requirements

  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening:
    • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Additional or Preferred Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • Experience programming in cloud ecosystems (Azure Storage, ADLS, Cosmos or their AWS equivalents)
  • Linguistics and/or NLP coursework or experience

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:


Microsoft will accept applications for the role until January 16, 2025.

Responsibilities
  • Discover new data sources and evaluate their quality to enrich our data coverage
  • Work with researchers to create and evaluate prototypes for new algorithms
  • Explore and evaluate new data processing tools and algorithms (for example, text extraction, sentence extraction, parallel data alignment, normalization, duplicate identification) to improve the existing data processing pipelines
  • Productize research prototypes into end-to-end pipelines

Maintain the existing pipelines for:

  • Automated discovery and identification of language data from multiple domains at web scale
  • Text data processing (text extraction, sentence extraction, parallel data alignment, normalization, duplicate identification)
  • Large scale text data storage infrastructure (import, export, query)
  • Data cleaning and filtering

Embody our and