Who we are?eSelf AI is developing the first face-to-face conversational-AI platform. By uniting advanced video generation, low-latency voice & ASR, and LLMs, we enable lifelike, real-time interactions that power AI tutors, sales agents, banking advisers, and more. Our technology already serves major clients such as Christie’s real-estate teams and supports a nationwide initiative with Israel’s Center for Educational Technology (CET) to give every student a personal AI tutor. Led by veterans of Snap’s AI group and elite Israeli tech units, we’re pushing the limits of real-time multimodal AI.
Why this role mattersDelivering fluent two-way video conversations requires combining cutting-edge research with production pragmatism. As an ML Researcher (student) you will explore and prototype across three pillars:
- Video – talking-head generation, expression transfer, gaze correction
- Voice & ASR – high-quality speech synthesis and accurate real-time transcription
- LLMs – dialogue planning, retrieval-augmented responses, tool use
What you’ll do- Stay current with the literature: track emerging papers, distill insights, and present them to the team.
- Rapid prototyping: implement promising ideas, reproduce baselines, and benchmark against internal KPIs.
- Experimental rigor: design experiments, run ablations, analyze results, and write concise technical notes.
- Model optimization: compress and fine-tune models for deployment on consumer GPUs and mobile chips.
- Cross-functional integration: work closely with backend and streaming engineers to ship your models into production.
Minimum qualifications- Current M.Sc. or Ph.D. student in Machine Learning, Computer Science, EE, or a related field
- Demonstrated ML research skills: literature review, experiment design, statistical analysis, clear technical writing
- Familiarity with PyTorch or similar deep-learning frameworks
Ready to push the boundaries of face-to-face AI?
We can’t wait to meet you!