A Reinforcement Learning-based Dialogue Agent for Referring Expression Generation.
References:
[1] Heidrich-Meisner, V., Lauer, M., Igel, C., & Riedmiller, M. A. (2007). Reinforcement learning in a nutshell. In ESANN (pp. 277–288).
[2] Cuayáhuitl, H., Dethlefs, N., Frommberger, L., Richter, K.-F., & Bateman, J. (2010). Generating adaptive route instructions using hierarchical reinforcement learning. In Proceedings of the International Conference on Spatial Cognition (Spatial Cognition VII).
[3] Mitchell, C. M., Boyer, K. E., & Lester, J. C. (2013). A Markov decision process model of tutorial intervention in task-oriented dialogue. In Proceedings of the International Conference on Artificial Intelligence in Education (pp. 828–831). Memphis, Tennessee.