

Key job responsibilities
- Design and implement highly scalable and realistic simulation environments for training agents using reinforcement learning.
- Develop and deploy generative AI approaches to automate the creation of diverse and complex simulation environments and scenarios.
- Analyze, troubleshoot, and profile complex machine learning systems within the simulation context, identifying and resolving performance bottlenecks.
- PhD, or Master's degree or 5+ years of applied research experience
- Experience programming in C++, Python or related language
- Experience with simulation and synthetic data generation
- Experience with neural deep learning methods and machine learning
- PhD in Computer Science, Machine Learning, or a related field
- 3+ years' experience building machine learning models (includes internships)
- Demonstrated experience in developing and implementing simulation environments or synthetic data generation for reinforcement learning.
- Strong programming skills in Python and experience with deep learning frameworks such as TensorFlow or PyTorch
- Excellent problem-solving skills, with the ability to think creatively and critically about complex problems
- Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Your day will be filled with exciting challenges and opportunities to showcase your technical expertise. You'll engage with Amazonians across various teams, owning and recognizing their unique IT needs and providing personalized, high-quality support. You'll be a trusted advisor and problem-solver, taking initiative to diagnose, troubleshoot, and resolve a wide variety of specialized hardware and software issues and implementing successful solutions. Our Engineers guide and empower technical and non-technical Amazonians through the ever-evolving digital landscape, implementing solutions that fit their needs.
Key job responsibilities
- Deliver on-site, high quality, hands-on support diagnosing, troubleshooting and resolving issues successfully, evaluating log files to determine the health of systems, software and hardware
- Assist with activities to triage and escalate system or network outages to reduce downtime
- Understand and execute change management activities in a high availability environment
- Participate with partner teams and vendors on continuous improvement projects, defining requirements and managing execution to deliver operational excellence and value
- Effectively manage and oversee IT asset inventories
- Continuously expand skills, learning the latest technologies and maintaining knowledge of IT policies to provide technically accurate solutions
- Occasional travel required based on the needs of the business
- Participate in 24/7 on-call duty required for isolated high-severity incidents, serving as escalation point outside regular hours
A day in the life
Medical, Dental, and Vision Coverage
Maternity and Parental Leave Options
Paid Time Off (PTO)
401(k) Plan
- High school or equivalent diploma
- 2+ years of experience supporting Windows, Mac, or Linux operating systems in a corporate setting
- 2+ years of experience supporting and maintaining a corporate network environment
- 2+ years of troubleshooting experience in a multi-user, high-availability environment
- 2+ years of experience with PC repair, troubleshooting, deployment, and liquidation
- Excellent customer facing skills
- Bachelor's degree
- CompTIA A+, CompTIA Network+, Cisco/CCNA, Linux (Red Hat), Microsoft hardware (installation), AWS, or other industry-relevant certifications
- Experience supporting video conference and teleconference equipment
- Experience in Active Directory and Windows Server backup solutions
- Ability to write simple scripts in an administrative language
- Strong analytical skills with demonstrated problem solving abilities
- Proven ability to develop clear, concise change management and standard operating procedure (SOP) documentation

Key job responsibilities
You will contribute directly to AI agent development in an applied research role, including model training, dataset design, and pre- and post-training optimization. You will be hired as a Member of Technical Staff.
- 3+ years' experience building machine learning models
- Proficiency in Python, Java, C++, or related language
- Experience with deep learning methods and tools, e.g., PyTorch, JAX
- PhD or Master's degree in computer science or related field
- Strong background in scientific research with a proven ability to generate and implement new ideas in machine learning
- Willingness to step outside typical role boundaries to get things done — every member of technical staff is expected to write code, design experiments, and interpret results
- Ability to communicate results and insights to both technical and non-technical audiences, including through presentations and written reports
- Ability to think big about the arc of development of AI over a multi-year horizon, and identify new opportunities to apply these technologies to solve real-world problems
- Capacity to mentor and guide junior scientists and engineers, and contribute to the overall growth and development of the team
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Key job responsibilities
* Design, build, and deploy machine learning models, frameworks, and data pipelines
* Optimize ML training, inference, and evaluation workflows for reliability and performance
* Evaluate and improve ML model performance and metrics
* Develop tools and infrastructure to enhance ML development productivity
* 3+ years experience building and deploying machine learning systems
* Experience with ML frameworks (e.g. PyTorch, TensorFlow) and ML orchestration tools
* Proven track record of improving ML systems, performance, and workflows
* Experience building and optimizing production-grade AI systems, e.g., for self-driving, simulation, or LLM applications
* Willingness to step outside typical role boundaries to get things done — every member of technical staff is expected to write code, design experiments, and interpret results
* Capacity to work autonomously and with a small team to drive research progress
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

In this role, you will work closely with research teams to design, build, and maintain systems for training and evaluating state-of-the-art agent models.
Key job responsibilities
* Evaluate performance of the training infrastructure, diagnose problems and address any gaps that exist.
* Develop reliable infrastructure to schedule training and model evaluation jobs across clusters.
* Work closely with researchers to create new techniques, infrastructure, and tooling around emerging research capabilities and evaluating models to meet customer needs.
* Manage project prioritization, deliverables, timelines, and stakeholder communication.
* Illuminate trade-offs, educate the team on best practices, and influence technical strategy.
* Operate in a dynamic environment to deliver high quality software.
* 5+ years of non-internship professional software development experience
* 5+ years of programming experience with at least one software programming language
* 5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
* Experience as a mentor, tech lead or leading an engineering team
* 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
* Bachelor's degree in computer science or equivalent
* Excellent knowledge of theory and practice of ML
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

About the Role
You Will
- Design and build scalable infrastructure to train, deploy and manage ML models
- Develop and automate software for ML workflows
- Optimize cost and performance of training and inference workloads
- Contribute to ML infrastructure roadmap planning
Perks
* Medical, Dental, Vision & Disability Insurance
* 401(k)
* Maternity & Parental Leave
* Flexible PTO
* Amazon Employee Discount
- 3+ years of professional software development experience
- Experience designing and building scalable and easy-to-use ML infrastructure systems
- Experience productionizing, scaling or extending ML models to solve real world use cases
- Experience collaborating with ML platform consumers
- Excellent coding skills in modern languages and frameworks
- Experience with AWS technologies such as ECS, SageMaker, Redshift, Batch, DynamoDB, Lambda, SQS, and Step Functions
- Minimum of Bachelor’s degree in Computer Science or equivalent experience
- Knowledge of supervised ML algorithms
- Experience in building and managing data platforms
- Familiarity with Twitch and/or streaming on Twitch
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

- 5+ years of non-internship professional software development experience
- 5+ years of programming experience with at least one software programming language
- 5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
- Experience as a mentor, tech lead or leading an engineering team
- Strong software engineering background with full-stack development experience
- Deep understanding of machine learning fundamentals, particularly large-scale model training
- Expertise in distributed systems, cloud computing, and scalable data processing
- Experience with data pipeline design, ETL processes, and data management systems
- Proficiency in translating academic concepts into production systems
- 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Bachelor's degree in computer science or equivalent
- Experience with dataset curation and quality assessment techniques
- Knowledge of computer vision and multimodal data processing
- Background in research environments or supporting ML research workflows
- Experience with data visualization and annotation tooling
- Familiarity with modern data filtering and deduplication methodologies
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.