Constraints on underspecified target trajectories

Much of the recent interest in artificial neural networks is founded on the development of supervised learning algorithms for nonlinear problems [1, 30, 39, 42, 47]. These algorithms, the most well-known being backpropagation, are able to model a large class of nonlinear transformations by assigning credit to internal “hidden” units. The remaining units—those connected directly to the environment—are generally assumed to be provided with target states. This assumption appears to be a liability; it is by no means clear that such desired outputs can always be provided. Consider, for example, a network serving as a feedforward controller for a robot. Such a network must produce torques as a function of the environmental goal and the current state of the robot. In general, however, the environment provides only the goal and not the torques that achieve the goal. Furthermore, if we assume the existence of an oracle that provides the torques as training data, then there appears to be little reason (other than perhaps speed) not to use the oracle as the controller in place of the network.

[1]  T. Flash,et al.  The coordination of arm movements: an experimentally confirmed mathematical model , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[2]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[3]  E. Bizzi,et al.  Posture control and trajectory formation during arm movement , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[4]  M. Posner Foundations of cognitive science , 1989 .

[5]  Mitsuo Kawato,et al.  Computational schemes and neural network models for formulation and control of multijoint arm trajectory , 1990 .

[6]  Bernard Widrow,et al.  Adaptive Signal Processing , 1985 .

[7]  James Lubker,et al.  Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation , 1979 .

[8]  Elliot Saltzman,et al.  Skilled actions: a task-dynamic approach. , 1987, Psychological review.

[9]  David J. Reinkensmeyer,et al.  Using associative content-addressable memories to control robots , 1988, Proceedings of the 27th IEEE Conference on Decision and Control.

[10]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[11]  Enrico Clementi,et al.  Biological and Artificial Intelligence Systems , 1988 .

[12]  P. H. Greene,et al.  Why is it easy to control your arms ? , 1982, Journal of motor behavior.

[13]  J. Cutting,et al.  The Role of Speech in Language. , 1979 .

[14]  O. I. Fukson,et al.  Adaptability of innate motor patterns and motor control mechanisms , 1986, Behavioral and Brain Sciences.

[15]  Björn Lindblom,et al.  Economy of Speech Gestures , 1983 .

[16]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[17]  Michael A. Arbib,et al.  Perceptual Structures and Distributed Motor Control , 1981 .

[18]  Dan Koditschek,et al.  Natural motion for robot arms , 1984, The 23rd IEEE Conference on Decision and Control.

[19]  Peter F. MacNeilage,et al.  The Production of Speech , 2011, Springer New York.

[20]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[21]  A. W. Melton Categories of Human Learning , 1964 .

[22]  W. Thomas Miller,et al.  Sensor-based control of robotic manipulators using a general learning algorithm , 1987, IEEE J. Robotics Autom..

[23]  Fernando J. Pineda,et al.  Generalization of Back propagation to Recurrent and Higher Order Neural Networks , 1987, NIPS.

[24]  A. G. Feldman Superposition of motor programs—I. Rhythmic forearm movements in man , 1980, Neuroscience.

[25]  S. Öhman Coarticulation in VCV Utterances: Spectrographic Measurements , 1966 .

[26]  B. Malmberg Manual of phonetics , 1968 .

[27]  N. Hogan An organizing principle for a class of voluntary movements , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[28]  Raymond D. Kent,et al.  Coarticulation in recent speech production models , 1977 .

[29]  Paul M. Fitts,et al.  Perceptual-Motor Skill Learning1 , 1964 .

[30]  E. Bizzi,et al.  Theoretical and Experimental Perspectives on Arm Trajectory Formation: A Distributed Model of Motor Redundancy , 1988 .

[31]  Michael I. Jordan Supervised learning and systems with excess degrees of freedom , 1988 .

[32]  Michael Kuperstein,et al.  Adaptive visual-motor coordination in multijoint robots using parallel architecture , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[33]  J. Kelso,et al.  Exploring a vibratory systems analysis of human movement production. , 1980, Journal of neurophysiology.

[34]  G. Brindley,et al.  THE UNDERSTANDING OF THE BRAIN , 1973 .

[35]  E. Bizzi,et al.  Processes controlling arm movements in monkeys. , 1978, Science.

[36]  J. Salisbury,et al.  Active stiffness control of a manipulator in cartesian coordinates , 1980, 1980 19th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[37]  Michael I. Jordan Motor Learning and the Degrees of Freedom Problem , 2018, Attention and Performance XIII.

[38]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[39]  Paul J. Werbos,et al.  Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[40]  Neville Hogan,et al.  Impedance Control: An Approach to Manipulation: Part I—Theory , 1985 .

[41]  L Saltzman Elliot,et al.  A Dynamical Approach to Gestural Patterning in Speech Production , 1989 .