• Complex
  • Title
  • Author
  • Keyword
  • Abstract
  • Scholars
Search

Author:

Chen, Jinhui (Chen, Jinhui.) | Takashima, Ryoichi (Takashima, Ryoichi.) | Guo, Xingchen (Guo, Xingchen.) | Zhang, Zhihong (Zhang, Zhihong.) | Xu, Xuexin (Xu, Xuexin.) | Takiguchi, Tetsuya (Takiguchi, Tetsuya.) | Hancock, Edwin R (Hancock, Edwin R.)

Indexed by:

Abstract:

To identify the localization of indoor sound source, especially when attempted using only a single microphone, it is a challenging problem to machine learning. To address these issues, this paper presents a distinct novel solution based on fusing visual and acoustic models. Therefore, we propose two novel approaches. First, to estimate orientation of vocal object in a stable manner, we employ the visual approach as estimation model, where we develop a robust image feature representation method that adopts Fourier analysis to efficiently extract polar descriptors. Second the distance information is estimated by calculating the signal difference between transmit receive ends. To implement these, we use phoneme-level hidden Markov models (HMMs) extracted from clean speech sound, to estimate the acoustic transfer function (ATF), which can capture the speech signal as a network of phoneme HMMs. And using the separated frame sequences of the ATF, we can indicate the signal difference between two positions, which can be used to estimate the distance of sound source. Experimental results show that the proposed method can simultaneously extract the sound source parameters of direction and distance, and thus improves the verification task of sound source localization. © 2021 Elsevier Ltd

Keyword:

Acoustic generators Fourier analysis Hidden Markov models Indoor positioning systems

Author Community:

  • [ 1 ] [Chen, Jinhui]Prefectural University of Hiroshima, Hiroshima, Japan
  • [ 2 ] [Takashima, Ryoichi]Kobe University, Kobe, Japan
  • [ 3 ] [Guo, Xingchen]Xian Jiaotong University, Xian, China
  • [ 4 ] [Zhang, Zhihong]Xiamen University, Xiamen, China
  • [ 5 ] [Xu, Xuexin]Xiamen University, Xiamen, China
  • [ 6 ] [Takiguchi, Tetsuya]Kobe University, Kobe, Japan
  • [ 7 ] [Hancock, Edwin R.]University of York, York, United Kingdom

Reprint Author's Address:

  • [Zhang, Zhihong]Xiamen University, Xiamen, China;;

Show more details

Related Keywords:

Source :

Pattern Recognition

ISSN: 0031-3203

Year: 2021

Volume: 115

7 . 7 4 0

JCR@2020

ESI Discipline: ENGINEERING;

ESI HC Threshold:30

CAS Journal Grade:2

Cited Count:

WoS CC Cited Count: 1

SCOPUS Cited Count: 10

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Affiliated Colleges:

FAQ| About| Online/Total:757/213152914
Address:XI'AN JIAOTONG UNIVERSITY LIBRARY(No.28, Xianning West Road, Xi'an, Shaanxi Post Code:710049) Contact Us:029-82667865
Copyright:XI'AN JIAOTONG UNIVERSITY LIBRARY Technical Support:Beijing Aegean Software Co., Ltd.