Learning nursery rhymes using adaptive parameter neurodynamic programming

Walker Josiah; Chalup Stephan K.

Repository | Book | Chapter

Learning nursery rhymes using adaptive parameter neurodynamic programming

Josiah Walker Stephan K. Chalup

pp. 196-209

Abstract

In this study on music learning, we develop an average reward based adaptive parameterisation for reinforcement learning meta-parameters. These are tested using an approximation of user feedback based on the goal of learning the nursery rhymes Twinkle Twinkle Little Star and Mary Had a Little Lamb. We show that a large reduction in learning times can be achieved through a combination of adaptive parameters and random restarts to ensure policy convergence.

Publication details

Published in:

Randall Marcus (2015) Artificial life and computational intelligence: first Australasian conference, acalci 2015, Newcastle, nsw, India, february 5-7, 2015. proceedings. Dordrecht, Springer.

Pages: 196-209

DOI: 10.1007/978-3-319-14803-8_16

Full citation:

Walker Josiah, Chalup Stephan K. (2015) „Learning nursery rhymes using adaptive parameter neurodynamic programming“, In: M. Randall (ed.), Artificial life and computational intelligence, Dordrecht, Springer, 196–209.