Taehyun Hwang
Taehyun Hwang
Home
Publications
Light
Dark
Automatic
1
Generalized Linear Bandits with Memory
Create your slides in Markdown - click the Slides button to check out the example. Supplementary notes can be added here, including [code, math, and images](https://wowchemy.com/docs/writing-markdown-latex/). --
Heesang Ann
,
Hyunjun Choi
,
Taehyun Hwang
,
Younghoon Shin
,
Haeju Cheong
,
Min-hwan Oh
Diversified Multinomial Logit Contextual Bandits
Existing contextual
multinomial logit
(MNL) bandits model relevance-driven choice but ignore the potential benefits of …
Heesang Ann
,
Taehyun Hwang
,
Min-hwan Oh
PDF
Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities
We study the
multinomial logit
(MNL) contextual bandit problem for sequential assortment selection. Although most existing research …
Taehyun Hwang
,
Dahngoon Kim
,
Min-hwan Oh
PDF
Lasso Bandit with Compatibility Condition on Optimal Arm
We consider a stochastic sparse linear bandit problem where only a sparse subset of context features affects the expected reward …
Harin Lee
,
Taehyun Hwang
,
Min-hwan Oh
PDF
Combinatorial Neural Bandits
We consider a contextual combinatorial bandit problem where in each round a learning agent selects a subset of arms and receives …
Taehyun Hwang
,
Kyuwook Chai
,
Min-hwan Oh
PDF
Model-Based Reinforcement Learning with Multinomial Logistic Function Approximation
We study model-based reinforcement learning (RL) for episodic Markov decision processes (MDP) whose transition probability is …
Taehyun Hwang
,
Min-hwan Oh
PDF
Cite
×