<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>研究テーマ on 阿部 拳之</title><link>https://bakanaouji.github.io/ja/research/</link><description>Recent content in 研究テーマ on 阿部 拳之</description><generator>Hugo -- gohugo.io</generator><language>ja</language><copyright>© 2026 阿部 拳之</copyright><atom:link href="https://bakanaouji.github.io/ja/research/index.xml" rel="self" type="application/rss+xml"/><item><title>Bandits and Online Learning</title><link>https://bakanaouji.github.io/ja/research/bandits-online-learning/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/ja/research/bandits-online-learning/</guid><description>オンライン環境で意思決定をしながら効率的に学習するには？</description></item><item><title>Fairness in Recommender Systems and Allocation</title><link>https://bakanaouji.github.io/ja/research/fairness-recsys-allocation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/ja/research/fairness-recsys-allocation/</guid><description>限られた資源や機会を公平に配分するには？</description></item><item><title>Language Model Alignment and Preference Optimization</title><link>https://bakanaouji.github.io/ja/research/language-model-alignment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/ja/research/language-model-alignment/</guid><description>言語モデルの出力を人間の選好にどう整合させるか？</description></item><item><title>Learning Dynamics and Equilibrium Computation in Games</title><link>https://bakanaouji.github.io/ja/research/learning-dynamics-equilibrium-games/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/ja/research/learning-dynamics-equilibrium-games/</guid><description>ナッシュ均衡へ高速に収束する学習アルゴリズムとは？</description></item><item><title>Reinforcement Learning and Sequential Decision Making</title><link>https://bakanaouji.github.io/ja/research/reinforcement-learning-sequential-decision/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://bakanaouji.github.io/ja/research/reinforcement-learning-sequential-decision/</guid><description>逐次的な意思決定において、方策をどう改善・評価するか？</description></item></channel></rss>