Interests
My core research interest is in machine learning for interactive systems that maximizes a utility function by taking actions, which is in contrast to prediction-oriented machine learning like supervised learning. I am working on large language models, reinforcement learning, contextual bandits, and related areas. I have applied my work to recommendation, Web search, advertising, and conversational systems.
Most of my work can be grouped into several clusters:
More information can be found in Google Scholar, DBLP, LinkedIn.
Somewhat up-to-date