Q-Learning Accelerator Chip Design for Robot Path Planning

Advani, Harsh Shekhar

Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/1116

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Maiti, Tapas Kumar	-
dc.contributor.author	Advani, Harsh Shekhar	-
dc.date.accessioned	2024-08-22T05:21:04Z	-
dc.date.available	2024-08-22T05:21:04Z	-
dc.date.issued	2022	-
dc.identifier.citation	Advani, Harsh Shekhar (2022). Q-Learning Accelerator Chip Design for Robot Path Planning. Dhirubhai Ambani Institute of Information and Communication Technology. xi, 65 p. (Acc. # T01036).	-
dc.identifier.uri	http://drsr.daiict.ac.in//handle/123456789/1116	-
dc.description.abstract	A variety of frameworks and set of tools used for creating complex and difficult behaviours are provided by reinforcement learning in robotics. Q-learning is one of the most popularly used algorithms in reinforcement learning. The implementation of this algorithm is mostly done in MATLAB and Python using various tools and libraries available there. These types of implementations are software based only, which actually requires more time to provide the output. Another way is to implement the algorithm on hardware i.e., to implement it on Field Programmable Gate Arrays (FPGA) or to design a chip based on the applications. This way of implementation reduces the processing time and provides a fastersimulation compared to the one done on software. This thesis aims to reduce the processing time i.e., latency required for the agent to perform an action in any particular state while performing in an episode. We propose an efficient hardware architecture that incorporates the testing of the algorithm and provides a reduction in processing time. We also provided a chip design based on the proposed hardware architecture. This in turn will increase the performance and accuracy of the application. Thus, we majorly focus on the two parameters i.e., latency and accuracy.	-
dc.publisher	Dhirubhai Ambani Institute of Information and Communication Technology	-
dc.subject	Faster simulation compared	-
dc.subject	Latency and accuracy	-
dc.subject	Hardware architecture	-
dc.classification.ddc	629.892 ADV	-
dc.title	Q-Learning Accelerator Chip Design for Robot Path Planning	-
dc.type	Dissertation	-
dc.degree	M. Tech	-
dc.student.id	202011049	-
dc.accession.number	T01036	-
Appears in Collections:	M Tech Dissertations

Files in This Item:

File	Size	Format
202011049.pdf	13.84 MB	Adobe PDF	View/Open

Show simple item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets