Weproposeanew novel two-stageblindseparationanddeconvolution (BSD) algorithmfor a realconvolutivemixtureof temporally correlatedsignals,in which a new Single-InputMultiple-Output (SIMO)-model-basedICA (SIMO-ICA) and blind multichannel inversefiltering are combined. SIMO-ICA consistsof multiple ICAs anda fidelity controller, andeachICA runsin parallelunder fidelity control of the entire separationsystem. SIMO-ICA canseparatethe mixed signals,not into monauralsourcesignals but into SIMO-model-basedsignalsfrom independent sourcesas they areat themicrophones.Thus,theseparatedsignalsof SIMOICA canmaintainthespatialqualitiesof eachsoundsource.After theseparationby SIMO-ICA, a simpleblind deconvolution techniquebasedon multichannelinversefiltering for theSIMO model canbeappliedevenwhenthemixing systemis thenonminimum phasesystemandeachsourcesignalis temporallycorrelated.The experimentalresultsobtainedunderthe reverberantconditionrevealthatthesoundqualityof theseparatedsignalsin theproposed methodis superiorto thatin theconventionalICA-basedBSD. 1. INTR ODUCTION Blind separationand deconvolution (BSD) of sourcesis an approachtakento estimateoriginal sourcesignalsusingonly theinformationof mixed signalsobserved in eachinput channel.The differencebetweentheBSD andblind sourceseparation(BSS)of the convolutive mixture [1, 2] is that not only the sourceseparation but alsothe deconvolution of the transmissionchannelcharacteristicsareconsideredin the BSD framework. Therefore,the BSD techniqueis highly applicableto robust hands-freespeech recognitionsystems,wherethedistortiondueto theroomtransfer function shouldbe reduced.For the BSD basedon independent componentanalysis(ICA), variousmethodshave beenproposed to dealwith theseparationanddeconvolution for theconvolutive mixtureof independently , identicallydistributed(i.i.d.) sourcesignals[3, 4]. TheseBSD methodsrequirethespecificassumptions that thesourcesignalsaremutually independent andeachsource signalis alsotemporallyindependent. However, thelatterassumption doesnot hold, particularly in many practicalacousticmixturesof soundsignalswhich often correspondto the temporally correlatedsignals.Theapplicationof theconventionalICA-based BSDto speechoftenyieldsthenegativeresults,e.g.,theseparated speechis adverselydecorrelatedandwhitened[5]. In order to solve the problem,we proposea novel BSD approachthatcombinesinformation-geometrytheoryandmultichannelsignalprocessing. In thisapproach, theseparation-decon volution problemis resolved into two stages:the Single-InputMultipleOutput(SIMO)-model-basedseparationandthedeconvolution in the SIMO-modelframework. Here the term “SIMO” represents the specific transmissionsystemin which the input is a single sourcesignalandtheoutputsareits transmittedsignalsobservedat multiple sensors.First,we proposea new blind separationframework usinga SIMO-model-basedICA algorithm,SIMO-ICA. In the SIMO-ICA scenario,unknown multiple sourcesignalswhich aremixed throughunknown acousticaltransmissionchannelsare detectedat the microphones,andthesesignalscanbe separated, not into monauralsourcesignalsbut into SIMO-model-basedsignals from independentsourcesas they are at the microphones. Thus,theseparatedsignalsof SIMO-ICA canmaintainthespatial qualitiesof eachsoundsource.After theseparationby theSIMOICA, a simpleblind deconvolution techniquebasedon themultichannelinversefiltering for theSIMO modelcanbeapplied.In the proposedmethod,the separation/decon volution problemscanbe solved efficiently usingthe following reasonableassumptionand properties.(1) Theassumptionof themutualindependence among the acousticsoundsourcesusuallyholds,andconsequently , this canbeusedin theSIMO-ICA-basedseparation. (2) Thetemporalcorrelationpropertyof the sourcesignalsand the nonminimum phasepropertyof themixing systemcanbetaken into accountin theblind multichannelinversefiltering. Thus,theproposedalgorithm canprovide a morefeasibleperformancefor theseparation anddeconvolutionof realacousticsignals,comparedwith theconventionalICA-basedBSD.This canbeconfirmedfrom theexperimentalresultsobtainedundertherealreverberantcondition. 2. MIXING PROCESSAND CONVENTION AL BSD 2.1. Mixing process In this study, the numberof arrayelements(microphones)is andthenumberof multiplesoundsourcesis . In general,theobservedsignalsin which multiple sourcesignalsaremixedlinearly areexpressedas ! " $#% & (' (1) where is thesourcesignalvector, is theobservedsignal vector, is themixing filter matrix with the lengthof ) , and " $#* is thez-transformof ; thesearegivenas + , ('(./.0. '1-324 6587 ' (2) 9 + , : ('/./.;. ' :&<= 65 7 ' (3) > , ?A@;BC 65 @;B ' (4) D $#* + , EF@;BC $#* 65 @;B HG ?A@;BC I# *J @;B ' (5)
[1]
Kiyohiro Shikano,et al.
SIMO-Model-Based Independent Component Analysis for High-Fidelity Blind Separation of Acoustic Signals
,
2003
.
[2]
Kiyohiro Shikano,et al.
Fast-Convergence Algorithm for Blind Source Separation Based on Array Signal Processing
,
2003,
IEICE Trans. Fundam. Electron. Commun. Comput. Sci..
[3]
Shoko Araki,et al.
Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming
,
2002,
Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.
[4]
K. Matsuoka,et al.
Minimal distortion principle for blind source separation
,
2002,
Proceedings of the 41st SICE Annual Conference. SICE 2002..
[5]
Andrzej Cichocki,et al.
Adaptive blind signal and image processing
,
2002
.
[6]
Zhi Ding,et al.
Blind Equalization and Identification
,
2001
.
[7]
Shun-ichi AMARIyy,et al.
NATURAL GRADIENT LEARNING WITH A NONHOLONOMIC CONSTRAINT FOR BLIND DECONVOLUTION OF MULTIPLE CHANNELS
,
1999
.
[8]
Paris Smaragdis,et al.
Blind separation of convolved mixtures in the frequency domain
,
1998,
Neurocomputing.
[9]
K. Furuya,et al.
Two-channel blind deconvolution of nonminimum phase FIR systems
,
1997
.
[10]
S.C. Douglas,et al.
Multichannel blind deconvolution and equalization using the natural gradient
,
1997,
First IEEE Signal Processing Workshop on Signal Processing Advances in Wireless Communications.
[11]
Hui Liu,et al.
A deterministic approach to blind identification of multi-channel FIR systems
,
1994,
Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.