Repository | Book | Chapter
Learning nursery rhymes using adaptive parameter neurodynamic programming
pp. 196-209
Abstract
In this study on music learning, we develop an average reward based adaptive parameterisation for reinforcement learning meta-parameters. These are tested using an approximation of user feedback based on the goal of learning the nursery rhymes Twinkle Twinkle Little Star and Mary Had a Little Lamb. We show that a large reduction in learning times can be achieved through a combination of adaptive parameters and random restarts to ensure policy convergence.
Publication details
Published in:
Randall Marcus (2015) Artificial life and computational intelligence: first Australasian conference, acalci 2015, Newcastle, nsw, India, february 5-7, 2015. proceedings. Dordrecht, Springer.
Pages: 196-209
DOI: 10.1007/978-3-319-14803-8_16
Full citation:
Walker Josiah, Chalup Stephan K. (2015) „Learning nursery rhymes using adaptive parameter neurodynamic programming“, In: M. Randall (ed.), Artificial life and computational intelligence, Dordrecht, Springer, 196–209.