- Title
- Cognitive sensors based on ridge phase-smoothing localization and multiregional histograms of oriented gradients
- Creator
- Chen, Bo-Wei; Rho, Seungmin; Imran, Muhammad; Guizani, Mohsen; Fan, Wei-Kang
- Date
- 2019
- Type
- Text; Journal article
- Identifier
- http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/183084
- Identifier
- vital:16228
- Identifier
-
https://doi.org/10.1109/TETC.2016.2585040
- Identifier
- ISBN:2168-6750 (ISSN)
- Abstract
- This study presents a smart cognitive sensor 'iRecorder' that can spontaneously locate speakers among attendees at a boardroom using ubiquitous arrays of audiovisual sensors. The proposed system 'iRecorder' consists of two major components - Sound localization and mouth tracking. For acoustic processing, this work proposes ridge phase-smoothing direction-of-arrival (DOA) estimation, which refines the distorted phase of a signal and robustly determines acoustic directions. During visual detection, this study develops novel Multiregional Histograms of Oriented Gradients (MHOGs) to model an uttering mouth. Unlike HOGs, the proposed feature is no longer limited to fixed-sized windows or blocks. It relies on facial regions. Finally, the system uses a fusion mechanism that integrates both clues from audiovisual sensors based on majority voting to target an actual speaker. The experimental result of DOA estimation showed that the directional errors were successfully improved by 6.6 degree on average. Concerning detection of talking faces, the accuracy reached as high as a rate of 85.19 percent. The fusion test results also supported the effectiveness of the system. Such findings reveal that the proposed system is superior to the other approaches and establishes its feasibility. © 2013 IEEE.
- Publisher
- IEEE Computer Society
- Relation
- IEEE Transactions on Emerging Topics in Computing Vol. 7, no. 1 (2019), p. 123-134
- Rights
- All metadata describing materials held in, or linked to, the repository is freely available under a CC0 licence
- Rights
- Copyright @ 2016 IEEE
- Subject
- 4604 Cybersecurity and Privacy; 4606 Distributed Computing and Systems Software; Active shape model refitting; EigenSNR; Multiregional histogram of oriented gradients (MHOGs); Ridge phase-smoothing direction-of-arrival (DOA) estimation; Ubiquitous audiovisual sensing
- Reviewed
- Funder
- Part of this work is supported by the Deanship of Scientific Research, King Saud University, under Research Group No. RG1435-051.
- Hits: 590
- Visitors: 441
- Downloads: 0