- 1、本文档共8页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
LOUD A 1020-NODE MICROPHONE ARRAY AND ACOUSTIC
ICSV14
Cairns ? Australia
9-12 July, 2007
LOUD: A 1020-NODE MICROPHONE ARRAY AND ACOUSTIC
BEAMFORMER
Eugene Weinstein
1 ?
, Kenneth Steele
2 ?
, Anant Agarwal
2 3 ?
, James Glass
3 §
1
Courant Institute of Mathematical Sciences
251 Mercer Street, New York, NY 10012, USA
2
Tilera Corporation
1900 West Park Drive, Suite 290, Westborough, MA 01581, USA
3
MIT Computer Science and Artificial Intelligence Laboratory
32 Vassar Street, Cambridge, MA 02139, USA
?eugenew@
?ken@
?agarwal@
§glass@
Abstract
Recording speech and other sound is difficult in environments with a large amount of noise
and/or crosstalk. In these environments, array microphones are needed in order to obtain a clean
recording of desired speech. In this work, we have designed, implemented, and tested LOUD,
a 1020-node microphone array. To the best of our knowledge and as documented by Guin-
ness World Records [6], this is currently the largest microphone array in the world. We have
implemented an acoustic beamforming algorithm for sound source amplification in a noisy en-
vironment, and have obtained preliminary results demonstrating the efficacy of the array. From
one to 1020 microphones, we have shown a 13.7dB increase in peak SNR for a representative
utterance, an 87.2% drop in word error rate (WER) with interferer present, and an 91.3% drop
in WER without an interferer.
1. INTRODUCTION
Speech recognition, and sound recording in general, in the presence of significant noise or
crosstalk is difficult. When sound is recorded in a noisy environment through a single micro-
phone, proximity of the microphone to the speaker’s mouth is essential for the high-quality
recording needed for speech recognition. This proximity can not be achieved without tethered
close-talking microphones. However, human friendly pervasive computing environments such
as CMU’s Aura [4] or MIT’s Oxygen [14] are characterized by mobile users going about their
daily business and preclude the use of tethered microph
您可能关注的文档
- JLPT_GRAMMAR_GUIDE.pdf
- Joint Frequency and Symbol Synchronization Schemes for an ofdm System.pdf
- Joint work with Kartik Krishnan. Multiscale Optimization Methods and Applications.pdf
- Journal of Food Protection Paper Doyle 5-09.pdf
- Joint Frequency Selective Channel Estimation and Turbo Decoding in.pdf
- JLL 14Q4 Beijing Office - Client Distribution EN.pdf
- Journal des Nations Unies-28 mars 2011.pdf
- JMAG-Designer.pdf
- Journals of Knowledge Management and of Knowledge Management Practice,.pdf
- Journal of Polymer Science Part A Polymer Chemistry1.pdf
文档评论(0)