Adaptive online fault detection on network-on-chip based on packet logging mechanism
The shrinking size of transistors and on-chip interconnects contributes to an increasing probability of on-chip faults. Fault tolerance is one of the key features of Network-on-Chip (NoC) architecture. Current NoCs use Error Detection and Correction (EDC) and acknowledgement mechanisms for fault and err...
Main Author: | Loo, Ling Kim |
---|---|
Format: | Thesis |
Language: | English |
Published: | 2015 |
Subjects: | TK Electrical engineering. Electronics Nuclear engineering |
Online Access: | http://eprints.utm.my/id/eprint/54603/1/LooLingKimMFKE2015.pdf |
id |
my-utm-ep.54603 |
---|---|
record_format |
uketd_dc |
spelling |
my-utm-ep.54603 2020-10-20T07:48:12Z Adaptive online fault detection on network-on-chip based on packet logging mechanism 2015-04 Loo, Ling Kim TK Electrical engineering. Electronics Nuclear engineering 2015-04 Thesis http://eprints.utm.my/id/eprint/54603/ http://eprints.utm.my/id/eprint/54603/1/LooLingKimMFKE2015.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:86038 masters Universiti Teknologi Malaysia, Faculty of Electrical Engineering Faculty of Electrical Engineering |
institution |
Universiti Teknologi Malaysia |
collection |
UTM Institutional Repository |
language |
English |
topic |
TK Electrical engineering Electronics Nuclear engineering |
spellingShingle |
TK Electrical engineering Electronics Nuclear engineering Loo, Ling Kim Adaptive online fault detection on network-on-chip based on packet logging mechanism |
description |
The shrinking size of transistors and on-chip interconnects contributes to an increasing probability of on-chip faults. Fault tolerance is one of the key features of Network-on-Chip (NoC) architecture. Current NoCs use Error Detection and Correction (EDC) and acknowledgement mechanisms for fault and error control. To maintain system functionality in the presence of faults, error detection and correction must adapt to the changing error probability. Adapting fault detection techniques to the error probability helps a NoC achieve improved fault tolerance. End-to-end (E2E) EDC works better at low error probability, whereas switch-to-switch (S2S) EDC works better at high error probability. This thesis proposes adaptive fault detection and fault diagnosis based on a negative acknowledgement (NACK) logging mechanism. In the first part, this thesis proposes a PL-Adaptive method in which NoC routers switch between E2E and S2S EDC depending on the changing error probability. Each router tracks transmitted packets and NACK packets to continuously monitor its fault level. In the second part, this thesis proposes fault type classification of router and link faults. In experiments using a constant uniform traffic pattern, the proposed PL-Adaptive method gives better average latency than using only E2E or only S2S. When transmission latency is evaluated with a single error in a single path, the proposed PL-Adaptive method achieves a latency reduction of 13% to 50% compared with using only the S2S or E2E mechanism. Moreover, with a smaller decay rate and an error probability in the range 5×10^-5 to 10^-1, a smaller threshold increases the probability of detecting faults and errors. The PL-Adaptive method is able to detect up to 96% of faults and errors. In addition, the proposed PL-Adaptive method allows NoC routers to adapt to dynamic packet error probability and can identify router and link faults. |
format |
Thesis |
qualification_level |
Master's degree |
author |
Loo, Ling Kim |
author_facet |
Loo, Ling Kim |
author_sort |
Loo, Ling Kim |
title |
Adaptive online fault detection on network-on-chip based on packet logging mechanism |
title_short |
Adaptive online fault detection on network-on-chip based on packet logging mechanism |
title_full |
Adaptive online fault detection on network-on-chip based on packet logging mechanism |
title_fullStr |
Adaptive online fault detection on network-on-chip based on packet logging mechanism |
title_full_unstemmed |
Adaptive online fault detection on network-on-chip based on packet logging mechanism |
title_sort |
adaptive online fault detection on network-on-chip based on packet logging mechanism |
granting_institution |
Universiti Teknologi Malaysia, Faculty of Electrical Engineering |
granting_department |
Faculty of Electrical Engineering |
publishDate |
2015 |
url |
http://eprints.utm.my/id/eprint/54603/1/LooLingKimMFKE2015.pdf |
_version_ |
1747817686288564224 |
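
The abstract describes routers that log transmitted packets and NACK packets and switch between E2E and S2S EDC as the error probability changes. The following is a minimal sketch of that idea, not the thesis's implementation: the class name `RouterMonitor`, the exponential-decay bookkeeping, and the default values of `threshold` and `decay` are all assumptions made for illustration, since the record does not specify how the decay rate or threshold are defined.

```python
from dataclasses import dataclass


@dataclass
class RouterMonitor:
    """Hypothetical per-router NACK log (illustrative names and defaults)."""

    threshold: float = 0.01  # assumed NACK-rate threshold for choosing S2S
    decay: float = 0.9       # assumed factor that ages older observations
    sent: float = 0.0        # decayed count of transmitted packets
    nacks: float = 0.0       # decayed count of NACK packets received

    def record(self, nacked: bool) -> None:
        # Age the existing history, then log the outcome of the new packet.
        self.sent = self.sent * self.decay + 1.0
        self.nacks = self.nacks * self.decay + (1.0 if nacked else 0.0)

    @property
    def nack_rate(self) -> float:
        # Estimated error probability from the decayed counters.
        return self.nacks / self.sent if self.sent else 0.0

    @property
    def mode(self) -> str:
        # E2E at low estimated error probability, S2S at high.
        return "S2S" if self.nack_rate > self.threshold else "E2E"


monitor = RouterMonitor(threshold=0.2, decay=0.5)
for _ in range(3):
    monitor.record(nacked=False)
print(monitor.mode)  # E2E: no NACKs logged yet

monitor.record(nacked=True)
print(monitor.mode)  # S2S: the recent NACK pushes the estimate over the threshold
```

In this sketch a smaller `decay` forgets old packets faster, so the estimate reacts more quickly to a burst of NACKs and also returns to E2E sooner once the errors stop.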