<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Research Themes on Kenshi Abe</title><link>https://bakanaouji.github.io/research/</link><description>Recent content in Research Themes on Kenshi Abe</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026 Kenshi Abe</copyright><atom:link href="https://bakanaouji.github.io/research/index.xml" rel="self" type="application/rss+xml"/><item><title>Bandits and Online Learning</title><link>https://bakanaouji.github.io/research/bandits-online-learning/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/research/bandits-online-learning/</guid><description>How can agents learn efficiently while making decisions online?</description></item><item><title>Fairness in Recommender Systems and Allocation</title><link>https://bakanaouji.github.io/research/fairness-recsys-allocation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/research/fairness-recsys-allocation/</guid><description>How can we allocate limited resources and opportunities fairly?</description></item><item><title>Language Model Alignment and Preference Optimization</title><link>https://bakanaouji.github.io/research/language-model-alignment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/research/language-model-alignment/</guid><description>How can we generate language model outputs that align with human preferences?</description></item><item><title>Learning Dynamics and Equilibrium Computation in Games</title><link>https://bakanaouji.github.io/research/learning-dynamics-equilibrium-games/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/research/learning-dynamics-equilibrium-games/</guid><description>How can learning algorithms converge quickly to Nash equilibrium?</description></item><item><title>Reinforcement Learning and Sequential Decision Making</title><link>https://bakanaouji.github.io/research/reinforcement-learning-sequential-decision/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/research/reinforcement-learning-sequential-decision/</guid><description>How can agents improve and evaluate policies over time?</description></item></channel></rss>