Preference-Based Reinforcement Learning: evolutionary direct policy search using a preference-based racing algorithm
R. Busa-Fekete, B. Szörényi, P. Weng, W. Cheng, E. Hüllermeier, Machine Learning 97 (2014) 327–351.
Journal Article | English
Author
Busa-Fekete, Robert;
Szörényi, B.;
Weng, P.;
Cheng, W.;
Hüllermeier, Eyke
Publishing Year
2014
Journal Title
Machine Learning
Volume
97
Issue
3
Page
327-351
LibreCat-ID
Cite this
Busa-Fekete R, Szörényi B, Weng P, Cheng W, Hüllermeier E. Preference-Based Reinforcement Learning: evolutionary direct policy search using a preference-based racing algorithm. Machine Learning. 2014;97(3):327-351.
Busa-Fekete, R., Szörényi, B., Weng, P., Cheng, W., & Hüllermeier, E. (2014). Preference-Based Reinforcement Learning: evolutionary direct policy search using a preference-based racing algorithm. Machine Learning, 97(3), 327–351.
@article{Busa-Fekete_Szörényi_Weng_Cheng_Hüllermeier_2014, title={Preference-Based Reinforcement Learning: evolutionary direct policy search using a preference-based racing algorithm}, volume={97}, number={3}, journal={Machine Learning}, author={Busa-Fekete, Robert and Szörényi, B. and Weng, P. and Cheng, W. and Hüllermeier, Eyke}, year={2014}, pages={327–351} }
Busa-Fekete, Robert, B. Szörényi, P. Weng, W. Cheng, and Eyke Hüllermeier. “Preference-Based Reinforcement Learning: Evolutionary Direct Policy Search Using a Preference-Based Racing Algorithm.” Machine Learning 97, no. 3 (2014): 327–51.
R. Busa-Fekete, B. Szörényi, P. Weng, W. Cheng, and E. Hüllermeier, “Preference-Based Reinforcement Learning: evolutionary direct policy search using a preference-based racing algorithm,” Machine Learning, vol. 97, no. 3, pp. 327–351, 2014.
Busa-Fekete, Robert, et al. “Preference-Based Reinforcement Learning: Evolutionary Direct Policy Search Using a Preference-Based Racing Algorithm.” Machine Learning, vol. 97, no. 3, 2014, pp. 327–51.