Serverless parallel video combiner with dominant speaker detection for ultra–high definition multipoint video communication systems

Bibliographic Details
Main Author: Baskaran, Vishnu Monn
Format: Thesis
Published: 2015
Description
Summary: The unprecedented shift towards user ubiquity in the 21st century, coupled with rapid advancements in computing and network infrastructures, has significantly increased the mass adoption of multipoint video communication among consumers, governments and corporations globally. This rapid adoption has brought about new technical challenges. The first concerns the inefficacies of the conventional centralised video combiner architecture in terms of scalability, computational efficiency and image quality. The second pertains to the latency of stitching high-resolution video frames in real time for ultra-high definition (UHD) display systems. The third concerns the variability of speech characteristics among conference participants, which gives rise to transient speech patterns that cause a dominant speaker to be misclassified. This thesis proposes original solutions to these challenges to further enhance the performance of multipoint video communication systems.