Preference-based reinforcement learning: A formal framework and a policy iteration algorithm
J. Fürnkranz, E. Hüllermeier, W. Cheng, S.H. Park, Machine Learning 89 (2012) 123–156.
Journal Article | English
Author
Fürnkranz, J.;
Hüllermeier, Eyke;
Cheng, W.;
Park, S.H.
Publishing Year
2012
Journal Title
Machine Learning
Volume
89
Issue
1
Page
123-156
LibreCat-ID
Cite this
Fürnkranz J, Hüllermeier E, Cheng W, Park SH. Preference-based reinforcement learning: A formal framework and a policy iteration algorithm. Machine Learning. 2012;89(1):123-156.
Fürnkranz, J., Hüllermeier, E., Cheng, W., & Park, S. H. (2012). Preference-based reinforcement learning: A formal framework and a policy iteration algorithm. Machine Learning, 89(1), 123–156.
@article{Fürnkranz_Hüllermeier_Cheng_Park_2012, title={Preference-based reinforcement learning: A formal framework and a policy iteration algorithm}, volume={89}, number={1}, journal={Machine Learning}, author={Fürnkranz, J. and Hüllermeier, Eyke and Cheng, W. and Park, S.H.}, year={2012}, pages={123–156} }
Fürnkranz, J., Eyke Hüllermeier, W. Cheng, and S.H. Park. “Preference-Based Reinforcement Learning: A Formal Framework and a Policy Iteration Algorithm.” Machine Learning 89, no. 1 (2012): 123–56.
J. Fürnkranz, E. Hüllermeier, W. Cheng, and S. H. Park, “Preference-based reinforcement learning: A formal framework and a policy iteration algorithm,” Machine Learning, vol. 89, no. 1, pp. 123–156, 2012.
Fürnkranz, J., et al. “Preference-Based Reinforcement Learning: A Formal Framework and a Policy Iteration Algorithm.” Machine Learning, vol. 89, no. 1, 2012, pp. 123–56.