Please use this identifier to cite or link to this item:
http://drsr.daiict.ac.in//handle/123456789/1116
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Maiti, Tapas Kumar | - |
dc.contributor.author | Advani, Harsh Shekhar | - |
dc.date.accessioned | 2024-08-22T05:21:04Z | - |
dc.date.available | 2024-08-22T05:21:04Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Advani, Harsh Shekhar (2022). Q-Learning Accelerator Chip Design for Robot Path Planning. Dhirubhai Ambani Institute of Information and Communication Technology. xi, 65 p. (Acc. # T01036). | - |
dc.identifier.uri | http://drsr.daiict.ac.in//handle/123456789/1116 | - |
dc.description.abstract | A variety of frameworks and set of tools used for creating complex and difficult behaviours are provided by reinforcement learning in robotics. Q-learning is one of the most popularly used algorithms in reinforcement learning. The implementation of this algorithm is mostly done in MATLAB and Python using various tools and libraries available there. These types of implementations are software based only, which actually requires more time to provide the output. Another way is to implement the algorithm on hardware i.e., to implement it on Field Programmable Gate Arrays (FPGA) or to design a chip based on the applications. This way of implementation reduces the processing time and provides a fastersimulation compared to the one done on software. This thesis aims to reduce the processing time i.e., latency required for the agent to perform an action in any particular state while performing in an episode. We propose an efficient hardware architecture that incorporates the testing of the algorithm and provides a reduction in processing time. We also provided a chip design based on the proposed hardware architecture. This in turn will increase the performance and accuracy of the application. Thus, we majorly focus on the two parameters i.e., latency and accuracy. | - |
dc.publisher | Dhirubhai Ambani Institute of Information and Communication Technology | - |
dc.subject | Faster simulation compared | - |
dc.subject | Latency and accuracy | - |
dc.subject | Hardware architecture | - |
dc.classification.ddc | 629.892 ADV | - |
dc.title | Q-Learning Accelerator Chip Design for Robot Path Planning | - |
dc.type | Dissertation | - |
dc.degree | M. Tech | - |
dc.student.id | 202011049 | - |
dc.accession.number | T01036 | - |
Appears in Collections: | M Tech Dissertations |
Files in This Item:
File | Size | Format | |
---|---|---|---|
202011049.pdf | 13.84 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.