Improvement of deep reinforcement models using extreme learning machine for autonomous agents in unstructured environment /

Creating an autonomous agent, that gets real observations such as sensory data and images from the surrounding environment and learns optimal sequential actions, has been considered as one of the main goals of Artificial General Intelligence (AGI). Deep (Hierarchical) Reinforcement Learning (HRL/DRL...

Full description

Saved in:
Bibliographic Details
Main Author: Aldahoul, Nouar (Author)
Format: Thesis
Language:English
Published: Kuala Lumpur : Kulliyyah of Engineering, International Islamic University Malaysia, 2021
Subjects:
Online Access:http://studentrepo.iium.edu.my/handle/123456789/10691
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 040100000a22002890004500
008 210915s2021 my a f m 000 0 eng d
040 |a UIAM  |b eng  |e rda 
041 |a eng 
050 0 0 |a Q325.6 
100 1 |a Aldahoul, Nouar,  |e author 
245 1 0 |a Improvement of deep reinforcement models using extreme learning machine for autonomous agents in unstructured environment /  |c by Nouar Aldahoul 
264 1 |a Kuala Lumpur :  |b Kulliyyah of Engineering, International Islamic University Malaysia,  |c 2021 
300 |a xxii, 267 leaves :  |b colour illustrations ;  |c 30cm. 
336 |2 rdacontent  |a text 
347 |2 rdaft  |a text file  |b PDF 
502 |a Thesis (Ph.D)--International Islamic University Malaysia, 2021. 
504 |a Includes bibliographical references (leaves 248-262). 
520 |a Creating an autonomous agent, that gets real observations such as sensory data and images from the surrounding environment and learns optimal sequential actions, has been considered as one of the main goals of Artificial General Intelligence (AGI). Deep (Hierarchical) Reinforcement Learning (HRL/DRL) can address this objective. Traditional deep reinforcement learning methods suffer from long learning and training time resulted from the need to fine-tune the weights iteratively in the network. This research investigates the previous problem by utilizing a random weights generation approach that is based on Extreme Learning Machine. This method benefits from the randomness of input weights and least square solution in output weights calculation to reduce the training time by an order of magnitude. Hierarchical ELM (H-ELM) and Local Receptive Field ELM (LRF-ELM) are recent versions of multilayer ELM to respectively learn and extract features by hierarchical learning scheme. They have outperformed other existing deep models in terms of learning time (speed). H-ELM's architecture was found to be similar to gradient-based (GB) auto-encoder without weights fine-tuning. However, H-ELM gives higher learning speed compared to the GB autoencoder. Moreover, LRF-ELM was found as similar to Convolutional Neural Network (CNN) without weights fine-tuning. It has outperformed the traditional CNN in the term of learning time. Therefore, in this research, the proposed method, which combines RL with H-ELM or LRF-ELM, is an efficient solution to approximate the action-value function and learn an optimal policy directly from visual data (images) in a short time. In addition, this research proposed a novel method called Convolutional H-ELM (CH-ELM) which is a combination of pre-trained CNN and H-ELM. This method has outperformed either CNN or H-ELM in terms of accuracy and RMSE. The experimental results have been analyzed and evaluated in different applications such as target reaching arm, 2D maze navigation, slide puzzle game , objects sorting, and rock-paper-scissor game. The data samples have been trained and tested to investigate the robustness of the proposed systems. It was found that the proposed models can reduce the learning time by an order of magnitude in various tasks without degrading the performance. The big improvement in learning speed in the proposed method can neglect the slight drop in accuracy in few tasks compared to traditional methods. Therefore, the proposed method can balance the trade-off between learning speed and good performance. In addition, it is able to run on traditional CPUs that are available in the most of the low cost embedding systems. 
596 |a 1 
598 |a NEWGBK 
655 7 |a Theses, IIUM local 
690 |a Dissertations, Academic  |x Kulliyyah of Engineering  |z IIUM 
710 2 |a International Islamic University Malaysia.  |b Kulliyyah of Engineering 
856 4 |u http://studentrepo.iium.edu.my/handle/123456789/10691 
900 |a sz-asbh 
999 |c 439122  |d 470663 
952 |0 0  |1 0  |2 lcc  |4 0  |6 T Q 00325.00006 A00357I 02021  |7 3  |8 IIUMTHESIS  |9 762481  |a IIUM  |b IIUM  |c THESIS  |d 2022-09-09  |g 0.00  |o t Q 325.6 A357I 2021  |p 11100393421  |r 1900-01-02  |t 1  |v 0.00  |y THESIS