RL based Differential Drive Primitive Policy for Transfer Learning / (Record no. 609005)

000 -LEADER
fixed length control field 01764nam a22001577a 4500
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 629.8
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name Shahid, Mahrukh
245 ## - TITLE STATEMENT
Title RL based Differential Drive Primitive Policy for Transfer Learning /
Statement of responsibility, etc. Mahrukh Shahid
264 ## - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture Islamabad :
Name of producer, publisher, distributor, manufacturer SMME- NUST;
Date of production, publication, distribution, manufacture, or copyright notice 2022.
300 ## - PHYSICAL DESCRIPTION
Extent 52p.
Other physical details Soft Copy
Dimensions 30cm
500 ## - GENERAL NOTE
General note To ensure the steady navigation for robot stable controls are the basic unit and control values<br/>selection is highly environment dependent. Adding Generalization to system is the key to<br/>reusability of control parameters to ensure adaptability in robots to perform with sophistication, in<br/>the environments about which they have no prior knowledge, for this Reinforcement Leaning (RL)<br/>based control systems are promising. However, tuning appropriate parameters to train RL<br/>algorithm is a challenge. Therefore, we designed a continuous reward function to minimizing the<br/>sparsity and stabilizes the policy convergence, to attain control generalization for differential drive<br/>robot. We Implemented Twin Delayed Deep Deterministic Policy Gradient-TD3 on Open-AI Gym<br/>Race Car. System was trained to achieve smart primitive control policy, moving forward in the<br/>direction of goal by maintaining an appropriate distance from walls to avoid collisions. Resulting<br/>policy was tested on unseen environments and observed precisely performing results. Upon<br/>comparative analysis of TD3 with DDPG, TD3 policy outperformed the DDPG policy in both<br/>training and testing phase, proving TD3 to be resource efficient and stable.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element MS Robotics and Intelligent Machine Engineering
700 ## - ADDED ENTRY--PERSONAL NAME
Personal name Supervisor : Dr. Yasar Ayaz
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier <a href="http://10.250.8.41:8080/xmlui/handle/123456789/30557">http://10.250.8.41:8080/xmlui/handle/123456789/30557</a>
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type Thesis
Holdings
Withdrawn status Permanent Location Current Location Shelving location Date acquired Full call number Barcode Koha item type
  School of Mechanical & Manufacturing Engineering (SMME) School of Mechanical & Manufacturing Engineering (SMME) E-Books 04/29/2024 629.8 SMME-TH-757 Thesis
© 2023 Central Library, National University of Sciences and Technology. All Rights Reserved.