공지사항
[일반] [생산서비스운영 초청 세미나] 서울대학교 데이터사이언스대학원 오민환 교수
서울대학교 경영대학 생산서비스운영 전공에서 아래와 같이 연사 초청 세미나를 개최하고자 합니다.
관심 있는 분들의 많은 참여를 부탁드립니다.
일시: 3월 19일 (수) 10시 - 11시 30분
장소: 58동 119호
발표자: 오민환 교수 (서울대학교 데이터사이언스대학원)
Title: Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
Abstract:
In this talk, we address the contextual multinomial logistic (MNL) bandit problem, where a decision-making agent sequentially selects assortments of items based on contextual information, with user choices modeled by the MNL framework. Although widely applicable in various domains such as online retail and online advertising, a significant theoretical gap has persisted between known upper and lower regret bounds for this problem.We first focus on the case of uniform revenues, establishing a minimax regret lower bound. We then introduce OFU-MNL+, a novel, computationally efficient algorithm achieving matching minimax-optimal upper bounds (up to logarithmic factors). Additionally, we extend our theoretical results by providing instance-dependent regret guarantees. Subsequently, we examine the case of non-uniform revenues, again proving matching lower and upper regret bounds. We demonstrate that OFU-MNL+ remains minimax-optimal (up to logarithmic factors) in this broader setting as well. Hence, our proposed algorithm is the first to achieve minimax optimality in either case. Empirical evaluations further validate our theoretical findings, confirming the effectiveness and practical performance of OFU-MNL+ based on cumulative regret and computational efficiency. Overall, this work provides the first minimax-optimal solution in the contextual MNL bandit literature, bridging theoretical efficiency with practical applicability.
This work is a joint work with my PhD student, Joongkyu Lee.
담당 조교: 전현주 (jeon.h@snu.ac.kr)
참석 가능 대상: 전공불문 학부생 및 대학원생 누구나 (사전 신청 불필요)
참석 가능 대상: 전공불문 학부생 및 대학원생 누구나 (사전 신청 불필요)