Normal view MARC view ISBD view

RL based Differential Drive Primitive Policy for Transfer Learning / (Record no. 609005)

000 -LEADER
fixed length control field	01764nam a22001577a 4500
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	629.8
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name	Shahid, Mahrukh
245 ## - TITLE STATEMENT
Title	RL based Differential Drive Primitive Policy for Transfer Learning /
Statement of responsibility, etc.	Mahrukh Shahid
264 ## - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture	Islamabad :
Name of producer, publisher, distributor, manufacturer	SMME- NUST;
Date of production, publication, distribution, manufacture, or copyright notice	2022.
300 ## - PHYSICAL DESCRIPTION
Extent	52p.
Other physical details	Soft Copy
Dimensions	30cm
500 ## - GENERAL NOTE
General note	To ensure the steady navigation for robot stable controls are the basic unit and control values<br/>selection is highly environment dependent. Adding Generalization to system is the key to<br/>reusability of control parameters to ensure adaptability in robots to perform with sophistication, in<br/>the environments about which they have no prior knowledge, for this Reinforcement Leaning (RL)<br/>based control systems are promising. However, tuning appropriate parameters to train RL<br/>algorithm is a challenge. Therefore, we designed a continuous reward function to minimizing the<br/>sparsity and stabilizes the policy convergence, to attain control generalization for differential drive<br/>robot. We Implemented Twin Delayed Deep Deterministic Policy Gradient-TD3 on Open-AI Gym<br/>Race Car. System was trained to achieve smart primitive control policy, moving forward in the<br/>direction of goal by maintaining an appropriate distance from walls to avoid collisions. Resulting<br/>policy was tested on unseen environments and observed precisely performing results. Upon<br/>comparative analysis of TD3 with DDPG, TD3 policy outperformed the DDPG policy in both<br/>training and testing phase, proving TD3 to be resource efficient and stable.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element	MS Robotics and Intelligent Machine Engineering
700 ## - ADDED ENTRY--PERSONAL NAME
Personal name	Supervisor : Dr. Yasar Ayaz
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier	<a href="http://10.250.8.41:8080/xmlui/handle/123456789/30557">http://10.250.8.41:8080/xmlui/handle/123456789/30557</a>
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type	Thesis

Holdings
Withdrawn status	Permanent Location	Current Location	Shelving location	Date acquired	Full call number	Barcode	Koha item type
	School of Mechanical & Manufacturing Engineering (SMME)	School of Mechanical & Manufacturing Engineering (SMME)	E-Books	04/29/2024	629.8	SMME-TH-757	Thesis

NUST Institutions Library Catalogue

NUST INSTITUTIONS' LIBRARY CATALOGUE

RL based Differential Drive Primitive Policy for Transfer Learning / (Record no. 609005)