DevJobs

Applied Scientist, Personalization

Skills
  • C++
  • Python
  • PyTorch
  • TensorFlow
  • Spark
  • AWS
  • JAX
Description

Our team is responsible for the AWS Neuron software stack, which powers Generative AI and other advanced ML workloads on AWS's custom-built ML accelerators — Inferentia and Trainium. These accelerators deliver best-in-class performance and cost-efficiency for ML inference and training in the cloud.

We're building a new core group of engineers in TLV (Tel Aviv) to drive innovation in ML systems performance and software. As a Machine Learning Performance Engineer, you'll help shape the direction of the team from the ground up and work on:

  • Optimizing system performance across the entire ML software stack
  • Analyzing high-performance ML workloads running on Annapurna hardware
  • Developing high-performance kernels for critical ML operations
  • Enhancing the Neuron SDK to improve developer experience and system capabilities
  • Collaborating across Compiler, Frameworks, and Hardware teams to maximize end-to-end performance

As part of the Performance Engineering Team, you'll contribute to projects involving instruction scheduling, memory management, parallelism, kernel optimization, and compiler enhancements to maximize end-to-end performance.

This is a unique opportunity to be at the intersection of ML and systems within AWS, helping to build the future of AI infrastructure — right here in Tel Aviv.

Key job responsibilities

Our engineers collaborate across diverse teams, projects, and environments to have a firsthand impact on our global customer base. You will:

  • Solve challenging technical problems, often ones not solved before, at every layer of the stack.
  • Design, implement, test, deploy, and maintain innovative software solutions to transform service performance, durability, cost, and security.
  • Research implementations that deliver the best possible experiences for customers.

A day in the life

As you design and code solutions to help our team drive efficiencies in software architecture, you’ll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You’ll also:

Build high-impact so

Basic Qualifications

  • B.S. or M.S. in computer science or related field
  • Proficiency in one or more of the following programming languages: Python (preferred), C++
  • Experience working with TensorFlow, PyTorch, and/or JAX
  • 3+ years of non-internship professional software development experience
  • 3+ years of experience in performance optimization of LLM, vision, or other deep learning models

Preferred Qualifications

  • Experience in building and launching deep learning and machine learning models for business applications
  • Solid knowledge of big data and cloud technologies (e.g., Spark, AWS, etc.)
  • Experience with information retrieval, recommender systems, natural language processing, and/or personalization algorithms
  • Publications at top web, machine learning, and natural language processing conferences such as KDD, ICML, NeurIPS, ACL, EMNLP, etc.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.


Company - Annapurna Labs Ltd.

Job ID: A10423317
Amazon Web Services (AWS)