Print Email Facebook Twitter Comparison of Optimal Control Techniques for Learning-based RRT Title Comparison of Optimal Control Techniques for Learning-based RRT Author Paramkusam, Deepak (TU Delft Mechanical, Maritime and Materials Engineering; TU Delft Delft Center for Systems and Control) Contributor Wisse, Martijn (mentor) Bharatheesha, Mukunda (mentor) Grammatico, Sergio (graduation committee) Wolfslag, Wouter (graduation committee) Degree granting institution Delft University of Technology Date 2018-02-27 Abstract Kinodynamic motion planning for a robot involves generating a trajectory from a given robot state to goal state while satisfying kinematic and dynamic constraints. Rapidly-exploring Random Trees (RRT) is a sampling-based algorithm that has been widely adopted for this. However, RRT is not fast enough to enable its use in industrial applications. Recently, supervised learning has been used to pre-learn time consuming steps of RRT which resulted in improvement in planning times. The supervised learning models require cost and control input of the system as training data which are generated using optimal control.The training data can be obtained either by indirect optimal control or direct optimal control techniques. In this thesis, both the techniques are each used to generate cost and control inputs for a two-link manipulator using random initial-final state pairs. Then each dataset is used to train a model and the datasets are compared based on certain training metrics. K-nearest neighbours regression and multi-layer perceptron neural network are the supervised learning models used in this thesis. It is observed that both the datasets result in similar convergence of the models, but indirect optimal control approach allows upto 24-fold faster data generation and upto 3-fold reduction in dimensionality of training data compared to the direct optimal approach.Real-world robots have torque limits based on actuator configuration. The torque limits are modeled as control constraints in both the optimal control techniques and the effect ofthis restriction on data generation and supervised learning is studied in this thesis. Directoptimal control is found to be better for data generation in this case due to the ease ofapplying control bounds as inequality constraints on the function approximations. Indirectoptimal control is very tedious as active constraints should be known a priori to determinethe switching points. An alternate method is explored instead where samples are generated similar to the unconstrained case but samples violating the constraints are removed. Poor control input learning is observed in both approaches and the models struggled to extrapolate. It is hypothesised that this is due to inability of the constrained data to fully capture the system dynamics. However, good cost prediction is achieved using neural networks. Subject RRTSupervised LearningOptimal controlMotion Planning To reference this document use: http://resolver.tudelft.nl/uuid:742ed24e-0525-4ae2-b6d4-2dc6f69e60e1 Part of collection Student theses Document type master thesis Rights © 2018 Deepak Paramkusam Files PDF mscThesis_deepak.pdf 75.13 MB Close viewer /islandora/object/uuid:742ed24e-0525-4ae2-b6d4-2dc6f69e60e1/datastream/OBJ/view