Adaptive online fault detection on network-on-chip based on packet logging mechanism

The shrinking size of transistors and on-chip interconnects contribute to increasing probability of on-chip faults. Fault tolerance is one of the key features in Network-on-Chip (NoC) architecture. Current NoCs use Error Detection and Correction (EDC) and acknowledgement mechanisms for fault and err...

Full description

Saved in:
Bibliographic Details
Main Author: Loo, Ling Kim
Format: Thesis
Language:English
Published: 2015
Subjects:
Online Access:http://eprints.utm.my/id/eprint/54603/1/LooLingKimMFKE2015.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.54603
record_format uketd_dc
spelling my-utm-ep.546032020-10-20T07:48:12Z Adaptive online fault detection on network-on-chip based on packet logging mechanism 2015-04 Loo, Ling Kim TK Electrical engineering. Electronics Nuclear engineering The shrinking size of transistors and on-chip interconnects contribute to increasing probability of on-chip faults. Fault tolerance is one of the key features in Network-on-Chip (NoC) architecture. Current NoCs use Error Detection and Correction (EDC) and acknowledgement mechanisms for fault and error controls. In order to maintain system functionality in presence of the faults, adapting error detection and correction based on changing error probability is required. Adapting fault detection techniques based on error probability helps NoC to achieve improved fault tolerance. End-to-end (E2E) EDC works better at low error probability whereas switch-to-switch (S2S) works better at high error probability condition. This thesis proposes an adaptive fault detection and fault diagnosis based on Negative acknowledgement (NACK) logging mechanism. In the first part, this thesis proposes a PL-Adaptive method where NoC routers are able to switch between E2E and S2S EDC depending on changing error probability. Each router tracks transmitted packets and NACK packets to continuously monitor its fault level. In the second part, this thesis proposes fault type classification of router and link faults. Based on experimental results by using constant uniform traffic pattern, our proposed PL Adaptive method gives better average latency than using only E2E or S2S. By evaluating the transmission latency with single error in a single path, our proposed PL-Adaptive method is able to achieve latency reduction in the range of [13% - 50%] compared to only S2S or E2E mechanism. Moreover, based on smaller decay rate and error probability in the range of [5x10-5-10-1], smaller threshold increases the higher probability to detect fault and error. PL-Adaptive method is able to detect faults and error up to 96%. Besides, our proposed PL-Adaptive method allows NoC routers to adapt with dynamic packet error probability and can identify router and link faults. 2015-04 Thesis http://eprints.utm.my/id/eprint/54603/ http://eprints.utm.my/id/eprint/54603/1/LooLingKimMFKE2015.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:86038 masters Universiti Teknologi Malaysia, Faculty of Electrical Engineering Faculty of Electrical Engineering
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic TK Electrical engineering
Electronics Nuclear engineering
spellingShingle TK Electrical engineering
Electronics Nuclear engineering
Loo, Ling Kim
Adaptive online fault detection on network-on-chip based on packet logging mechanism
description The shrinking size of transistors and on-chip interconnects contribute to increasing probability of on-chip faults. Fault tolerance is one of the key features in Network-on-Chip (NoC) architecture. Current NoCs use Error Detection and Correction (EDC) and acknowledgement mechanisms for fault and error controls. In order to maintain system functionality in presence of the faults, adapting error detection and correction based on changing error probability is required. Adapting fault detection techniques based on error probability helps NoC to achieve improved fault tolerance. End-to-end (E2E) EDC works better at low error probability whereas switch-to-switch (S2S) works better at high error probability condition. This thesis proposes an adaptive fault detection and fault diagnosis based on Negative acknowledgement (NACK) logging mechanism. In the first part, this thesis proposes a PL-Adaptive method where NoC routers are able to switch between E2E and S2S EDC depending on changing error probability. Each router tracks transmitted packets and NACK packets to continuously monitor its fault level. In the second part, this thesis proposes fault type classification of router and link faults. Based on experimental results by using constant uniform traffic pattern, our proposed PL Adaptive method gives better average latency than using only E2E or S2S. By evaluating the transmission latency with single error in a single path, our proposed PL-Adaptive method is able to achieve latency reduction in the range of [13% - 50%] compared to only S2S or E2E mechanism. Moreover, based on smaller decay rate and error probability in the range of [5x10-5-10-1], smaller threshold increases the higher probability to detect fault and error. PL-Adaptive method is able to detect faults and error up to 96%. Besides, our proposed PL-Adaptive method allows NoC routers to adapt with dynamic packet error probability and can identify router and link faults.
format Thesis
qualification_level Master's degree
author Loo, Ling Kim
author_facet Loo, Ling Kim
author_sort Loo, Ling Kim
title Adaptive online fault detection on network-on-chip based on packet logging mechanism
title_short Adaptive online fault detection on network-on-chip based on packet logging mechanism
title_full Adaptive online fault detection on network-on-chip based on packet logging mechanism
title_fullStr Adaptive online fault detection on network-on-chip based on packet logging mechanism
title_full_unstemmed Adaptive online fault detection on network-on-chip based on packet logging mechanism
title_sort adaptive online fault detection on network-on-chip based on packet logging mechanism
granting_institution Universiti Teknologi Malaysia, Faculty of Electrical Engineering
granting_department Faculty of Electrical Engineering
publishDate 2015
url http://eprints.utm.my/id/eprint/54603/1/LooLingKimMFKE2015.pdf
_version_ 1747817686288564224