查看原文
其他

【直播】【QuACT系列报告】阮雨霏:Linear Bandits with Limited Adaptivity and...

KouShare 蔻享学术 2022-12-26


本系列报告由中国科学院计算技术研究所主办,于2021年6月1日10:00开始,授权蔻享学术进行网络直播。




直播二维码


Linear Bandits with Limited Adaptivity 

and Learning Distributional Optimal Design

报告人


阮雨霏 (University of Illinois at Urbana-Champaign)

时间


2021年6月1日 10:00-11:00


Motivated by practical needs such as large-scale learning, we study the impact of adaptivity constraints to linear contextual bandits, a central problem in online learning and decision making. Unlike traditional online learning problem which has full adaptivity at a per-time-step scale, our work focuses on the model in which the learning process is executed in parallelization but still wants to achieve optimal performance. In this talk, I will show that in such batch learning model, only ‍‍ batches are needed to achieve the optimal regret. Along the way in the proof, I will introduce the distributional optimal design, which is a natural extension of the optimal experiment design in statistical learning, and introduce our statistically and computationally efficient learning algorithm for the problem, which may be of independent interest.This is joint work with Jiaqi Yang and Yuan Zhou.


报告人简介


图 | 阮雨霏





Yufei Ruan is a third year Ph.D. student in Industrial & Enterprise Systems Engineering from University of Illinois at Urbana-Champaign. Her research focuses on the theoretical part of machine learning, especially on online learning and reinforcement learning. She completed her undergraduate studies in Mathematics at Tsinghua University.




QuACT系列报告】专题链接:https://www.koushare.com/frontiers/fop/intro

编辑:王茹茹




往期回顾












为满足更多科研工作者的需求,蔻享平台开通了各科研领域的微信交流群。进群请添加微信18019902656(备注您的科研方向)小编拉您入群哟!
蔻享网站www.koushare.com已开通自主上传功能,期待您的分享!

欢迎大家提供各类学术会议或学术报告信息,以便广大科研人员参与交流学习。

联系人:李盼 18005575053(微信同号)

戳这里,观看精彩直播哟!

您可能也对以下帖子感兴趣

文章有问题?点此查看未经处理的缓存