Journal

[1] Michael I Mandel, Sarah E Yoho, and Eric W Healy. Measuring time-frequency importance functions of speech with bubble noise. Journal of the Acoustical Society of America, 140:2542--2553, 2016. [ bib | DOI | Code | .pdf | Abstract ]
[2] Hugo Larochelle, Michael Mandel, Razvan Pascanu, and Yoshua Bengio. Learning algorithms for the classification restricted Boltzmann machine. Journal of Machine Learning Research, 13:643--669, March 2012. [ bib | .pdf | Abstract ]
[3] Ron Weiss, Michael I. Mandel, and Daniel P. W. Ellis. Combining localization cues and source model constraints for binaural source separation. Speech Communication, 53(5):606--621, May 2011. [ bib | DOI | .pdf | Abstract ]
[4] Michael I. Mandel, Razvan Pascanu, Douglas Eck, Yoshua Bengio, Luca M. Aiello, Rossano Schifanella, and Filippo Menczer. Contextual tag inference. ACM Transactions on Multimedia Computing, Communications and Applications, 7S(1):32:1--32:18, October 2011. [ bib | DOI | .pdf | Abstract ]
[5] Johanna Devaney, Michael I. Mandel, Daniel P. W. Ellis, and Ichiro Fujinaga. Automatically extracting performance data from recordings of trained singers. Psychomusicology: Music, Mind & Brain, 21(1--2):108--136, 2012. [ bib | .pdf | Abstract ]
[6] Michael I. Mandel, Scott Bressler, Barbara Shinn-Cunningham, and Daniel P. W. Ellis. Evaluating source separation algorithms with reverberant speech. IEEE Transactions on Audio, Speech, and Language Processing, 18(7):1872--1883, 2010. [ bib | DOI | .pdf | Abstract ]
[7] Michael I. Mandel, Ron J. Weiss, and Daniel P. W. Ellis. Model-based expectation maximization source separation and localization. IEEE Transactions on Audio, Speech, and Language Processing, 18(2):382--394, February 2010. [ bib | DOI | .pdf | Abstract ]
[8] Michael I. Mandel and Daniel P. W. Ellis. A web-based game for collecting music metadata. Journal of New Music Research, 37(2):151--165, 2008. [ bib | DOI | .pdf | Abstract ]
[9] Thomas S. Huang, Charlie K. Dagli, Shyamsundar Rajaram, Edward Y. Chang, Michael I. Mandel, Graham E. Poliner, and Daniel P. W. Ellis. Active learning for interactive multimedia retrieval. Proceedings of the IEEE, 96(4):648--667, 2008. [ bib | DOI | Abstract ]
[10] Michael I. Mandel, Graham E. Poliner, and Daniel P. W. Ellis. Support vector machine active learning for music retrieval. Multimedia Systems, 12(1):1--11, August 2006. [ bib | DOI | .pdf | Abstract ]

Theses, Chapters

[1] Johanna Devaney, Michael I Mandel, Douglas Turnbull, and George Tzanetakis, editors. Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR). New York, 2016. [ bib | http ]
[2] Thierry Bertin-Mahieux, Douglas Eck, and Michael I. Mandel. Automatic tagging of audio: The state-of-the-art. In Wenwu Wang, editor, Machine Audition: Principles, Algorithms and Systems, chapter 14, pages 334--352. IGI Publishing, 2010. [ bib ]
[3] Michael I. Mandel. Binaural Model-Based Source Separation and Localization. PhD thesis, Columbia University, February 2010. [ bib | .pdf | Abstract ]

Conference

[1] Soumi Maiti and Michael Mandel. Concatenative resynthesis using twin networks. In Proceedings of Interspeech, 2017. [ bib | .pdf | Abstract ]
[2] Ali Syed, Andrew Rosenberg, and Michael I Mandel. Active learning for low-resource speech recognition: Impact of selection size and language modeling data. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2017. [ bib | .pdf | Abstract ]
[3] Johanna Devaney and Michael I Mandel. An evaluation of score-informed methods for estimating fundamental frequency and power from polyphonic audio. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2017. [ bib | .pdf | Abstract ]
[4] Michael I Mandel and Jon P Barker. Multichannel spatial clustering for robust far-field automatic speech recognition in mismatched conditions. In Proceedings of Interspeech, pages 1991--1995, 2016. [ bib | DOI | Slides | .pdf | Abstract ]
[5] Michael I Mandel. Directly comparing the listening strategies of humans and machines. In Proceedings of Interspeech, pages 660--664, 2016. [ bib | DOI | Poster | .pdf | Abstract ]
[6] Hakan Erdogan, John Hershey, Shinji Watanabe, Michael Mandel, and Jonathan Le Roux. Improved MVDR beamforming using single-channel mask prediction networks. In Proceedings of Interspeech, pages 1981--1985, 2016. [ bib | DOI | .pdf | Abstract ]
[7] Xiong Xiao, Shinji Watanabe, Hakan Erdogan, Liang Lu, John Hershey, Michael L Seltzer, Guoguo Chen, Yu Zhang, Michael Mandel, and Dong Yu. Deep beamforming networks for multi-channel speech recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 5745--5749. IEEE, March 2016. [ bib | DOI | .pdf | Abstract ]
[8] Deblin Bagchi, Michael I Mandel, Zhongqiu Wang, Yanzhang He, Andrew Plummer, and Eric Fosler-Lussier. Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pages 496--503, 2015. [ bib | DOI | .pdf | Abstract ]
[9] Sreyas Srimath Tirumala and Michael I Mandel. Exciting estimated clean spectra for speech resynthesis. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015. [ bib | Poster | .pdf | Abstract ]
[10] Michael I Mandel and Young Suk Cho. Audio super-resolution using concatenative resynthesis. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015. [ bib | Demo | Slides | .pdf | Abstract ]
[11] Michael I Mandel and Nicoleta Roman. Enforcing consistency in spectral masks using Markov random fields. In Proceedings of EUSIPCO, pages 2028--2032, 2015. [ bib | .pdf | Abstract ]
[12] Michael I Mandel, Young-Suk Cho, and Yuxuan Wang. Learning a concatenative resynthesis system for noise suppression. In Proceedings of the IEEE GlobalSIP conference, 2014. [ bib | Demo | Poster | .pdf | Abstract ]
[13] Michael I Mandel, Sarah E Yoho, and Eric W Healy. Generalizing time-frequency importance functions across noises, talkers, and phonemes. In Proceedings of Interspeech, 2014. [ bib | Poster | .pdf | Abstract ]
[14] Michael I Mandel and Arun Narayanan. Analysis-by-synthesis feature estimation for robust automatic speech recognition using spectral masks. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2014. [ bib | Poster | .pdf | Abstract ]
[15] Arnab Nandi, Lilong Jiang, and Michael Mandel. Gestural query specification. In Proceedings of the International Conference on Very Large Data Bases, volume 7, 2014. [ bib | Slides | .pdf | Abstract ]
[16] Michael I. Mandel. Learning an intelligibility map of individual utterances. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013. [ bib | .pdf | Abstract ]
[17] Nicoleta Roman and Michael Mandel. Classification based binaural dereverberation. In Proceedings of Interspeech, 2013. [ bib | Abstract ]
[18] Johanna Devaney, Michael I. Mandel, and Ichiro Fujinaga. A study of intonation in three-part singing using the automatic music performance analysis and comparison toolkit (AMPACT). In Proceedings of the International Society for Music Information Retrieval conference, 2012. [ bib | .pdf | Abstract ]
[19] Johanna Devaney, Michael I. Mandel, and Ichiro Fujinaga. Characterizing singing voice fundamental frequency trajectories. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pages 73--76, October 2011. [ bib | Poster | .pdf | Abstract ]
[20] Michael I. Mandel, Douglas Eck, and Yoshua Bengio. Learning tags that vary within a song. In Proceedings of the International Society for Music Information Retrieval conference, pages 399--404, August 2010. [ bib | Slides | .pdf | Abstract ]
[21] James Bergstra, Michael I. Mandel, and Douglas Eck. Scalable genre and tag prediction with spectral covariance. In Proceedings of the International Society for Music Information Retrieval conference, pages 507--512, August 2010. [ bib | .pdf | Abstract ]
[22] Michael I. Mandel and Daniel P. W. Ellis. The ideal interaural parameter mask: a bound on binaural separation systems. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pages 85--88, October 2009. [ bib | DOI | Poster | .pdf | Abstract ]
[23] Johanna Devaney, Michael I. Mandel, and Daniel P. W. Ellis. Improving MIDI-audio alignment with acoustic features. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pages 45--48, October 2009. [ bib | DOI | .pdf | Abstract ]
[24] Edith Law, Kris West, Michael Mandel, Mert Bay, and J. Stephen Downie. Evaluation of algorithms using games: the case of music annotation. In Proceedings of the International Society for Music Information Retrieval conference, pages 387--392, October 2009. [ bib | .pdf | Abstract ]
[25] Ron J. Weiss, Michael I. Mandel, and Daniel P. W. Ellis. Source separation based on binaural cues and source model constraints. In Proceedings of Interspeech, pages 419--422, September 2008. [ bib | Demo | .pdf | Abstract ]
[26] Michael I. Mandel and Daniel P. W. Ellis. Multiple-instance learning for music information retrieval. In Proceedings of the International Society for Music Information Retrieval conference, pages 577--582, September 2008. [ bib | Poster | .pdf | Abstract ]
[27] Daniel P. W. Ellis, Courtenay V. Cotton, and Michael I. Mandel. Cross-correlation of beat-synchronous representations for music similarity. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 57--60, April 2008. [ bib | DOI | .pdf | Abstract ]
[28] Michael I. Mandel and Daniel P. W. Ellis. EM localization and separation using interaural level and phase cues. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pages 275--278, October 2007. [ bib | DOI | Poster | .pdf | Abstract ]
[29] Michael I. Mandel and Daniel P. W. Ellis. A web-based game for collecting music metadata. In Simon Dixon, David Bainbridge, and Rainer Typke, editors, Proceedings of the International Society for Music Information Retrieval conference, pages 365--366, September 2007. [ bib | Poster | .pdf | Abstract ]
[30] Michael I. Mandel, Daniel P. W. Ellis, and Tony Jebara. An EM algorithm for localizing multiple sound sources in reverberant environments. In B. Schölkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information Processing Systems, pages 953--960. MIT Press, Cambridge, MA, 2007. [ bib | Poster | .pdf | Abstract ]
[31] Michael I. Mandel and Daniel P. W. Ellis. Song-level features and support vector machines for music classification. In Joshua D. Reiss and Geraint A. Wiggins, editors, Proceedings of the International Society for Music Information Retrieval conference, pages 594--599, September 2005. [ bib | Poster | .pdf | Abstract ]
[32] Erik B. Sudderth, Michael I. Mandel, William T. Freeman, and Alan S. Willsky. Distributed occlusion reasoning for tracking with nonparametric belief propagation. In Lawrence K. Saul, Yair Weiss, and Léon Bottou, editors, Advances in Neural Information Processing Systems, pages 1369--1376. MIT Press, Cambridge, MA, 2005. [ bib | Demo | .pdf | Abstract ]

Other

[1] Hussein Ghaly and Michael Mandel. Analyzing human and machine performance in resolving ambiguous spoken sentences. In 1st Workshop on Speech-Centric Natural Language Processing (SCNLP), pages 18--26, 2017. [ bib | .pdf ]
[2] Jiyoung Choi and Michael I Mandel. Perception of Korean fricatives and affricates in 'bubble' noise by native and nonnative speakers. In International Circle of Korean Linguistics, 2017. [ bib ]
[3] Michael I Mandel and Nicoleta Roman. Integrating Markov random fields and model-based expectation maximization source separation and localization. In Acoustical Society of America Spring Meeting, 2015. [ bib | Slides ]
[4] Michael I Mandel, Sarah E Yoho, and Eric W Healy. Listener consistency in identifying speech mixed with particular “bubble” noise instances. In Acoustical Society of America Spring Meeting, 2015. [ bib | Poster ]
[5] Michael I Mandel and Song Hui Chon. Using auditory bubbles to determine spectro-temporal cues of timbre. In Cognitively Based Music Informatics Research (CogMIR), 2014. [ bib | Slides | Abstract ]
[6] Arnab Nandi and Michael Mandel. The interactive join: Recognizing gestures for database queries. In CHI Works-In-Progress, 2013. [ bib | Poster | .pdf | Abstract ]
[7] Michael Mandel, Razvan Pascanu, Hugo Larochelle, and Yoshua Bengio. Autotagging music with conditional restricted Boltzmann machines. March 2011. Online: http://arxiv.org/abs/1103.2832. [ bib | arXiv | http | Abstract ]
[8] Michael I. Mandel and Daniel P. W. Ellis. A probability model for interaural phase difference. In ISCA Workshop on Statistical and Perceptual Audio Processing SAPA, pages 1--6, 2006. [ bib | Demo | Slides | .pdf | Abstract ]
[9] Erik B. Sudderth, Michael I. Mandel, William T. Freeman, and Alan S. Willsky. Visual hand tracking using nonparametric belief propagation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 189--197, 2004. [ bib | DOI | Demo | .pdf | Abstract ]