Established in 1996 in Amsterdam, Booking.com has grown from a small Dutch start-up to one of the largest travel e-commerce companies in the world. With a mission to empower people to experience the world, Booking.com invests in digital technology that helps take the friction out of travel. We connect travellers with the world’s largest selection of incredible places to stay, including everything from apartments, vacation homes, and family-run B&Bs to 5-star luxury resorts, tree houses and even igloos. Each day, more than 1,550,000 room nights are reserved on our platform in over 40 languages and 140,354 destinations in 231 countries and territories worldwide.
Moran Beladev

Senior ML Manager
Moran is a Senior Machine Learning Manager at booking.com, researching and developing GenAI, NLP and CV models for the tourism domain.
Moran is a Ph.D candidate in information systems engineering at Ben Gurion University, researching NLP aspects in temporal graphs.
Previously worked as a Data Science Team Leader at Diagnostic Robotics, building ML solutions for the medical domain and NLP algorithms to extract clinical entities from medical visit summaries.
14:00 - 14:30
Masterclass
Thu, Nov 26
Building the Future of AI Trip Planning: LLMs, Inference Optimization, and Agentic Designs at Booking.com
In this practical talk, we share how Booking.com built its AI Trip Planner - an LLM-powered experience that personalizes travel planning at scale. We’ll walk through real-world design decisions, technical challenges, and infrastructure optimizations involved in delivering real-time hotel and destination recommendations using large language models (LLMs).
We’ll cover key challenges like moderating user input, classifying intent, structuring dialogues, and generating grounded responses. Through prompt engineering and custom model development, we tailored LLM interactions to our product needs while ensuring speed and relevance.
To address inference latency, we implemented speculative decoding and integrated Medusa-1, a novel architecture that predicts multiple tokens in parallel, achieving a 1.8x speedup with no loss in quality. We’ll detail its design and training trade-offs.
Beyond acceleration, we’ll highlight our move toward agentic AI systems - modular components that orchestrate LLMs, retrieval services, and Booking.comAPIs to solve complex travel queries. For example: A Question-Answering Agent that fuses LLMs, real-time data, and APIs for context-aware answers.
Finally, we’ll show how we evaluate quality in production using LLM-based evaluations, including Judge LLMs for automatic assessment, dialog quality and more.












.png)