Journal Papers

[13] Shiwei Yu, Hongjuan Zhang, and Zhiyao Duan, Singing voice separation by low-rank and sparse spectrogram decomposition with pre-learned dictionaries, Journal of the Audio Engineering Society, accepted.

[12] Andrea Cogliati, Zhiyao Duan, and Brendt Wohlberg, Piano transcription with convolutional sparse lateral inhibition, IEEE Signal Processing Letters, vol. 24, no. 4, pp. 392-396, 2017.

[11] David Temperley, Iris Ren, and Zhiyao Duan, Mediant mixture and "blue notes" in rock: An exploratory study, Music Theory Online, vol. 23, no. 1, 2017.

[10] Bochen Li and Zhiyao Duan, An approach to score following for piano performances with the sustained effect, IEEE/ACM Trans. Audio Speech Language Process., vol. 24, no. 12, pp. 2425-2438, 2016. <pdf> <project>

[9] Na Yang, Jianbo Yuan, Yun Zhou, Ilker Demirkol, Zhiyao Duan, Wendi Heinzelman, and Melissa Sturge-Apple, Enhanced multiclass SVM with thresholding fusion for speech-based emotion classification, International Journal of Speech Technology, doi:10.1007/s10772-016-9364-2, 2016.

[8] Andrea Cogliati, Zhiyao Duan, and Brendt Wohlberg, Context-dependent piano music transcription with convolutional sparse coding, IEEE/ACM Trans. Audio Speech Language Process., vol. 24, no. 12, pp. 2218-2230, 2016. <pdf>

[7] Yichi Zhang and Zhiyao Duan, Supervised and unsupervised sound retrieval by vocal imitation, Journal of the Audio Engineering Society, vol. 64, no. 7/8, pp. 533-543, 2016. <pdf>

[6] Francisco J. Rodriguez-Serrano, Zhiyao Duan, Pedro Vera-Candeas, Bryan Pardo, and Julio J. Carabias-Orti, Online score-informed source separation with adaptive instrument models, Journal of New Music Research, 2015. <pdf>

[5] Zafar Rafii, Zhiyao Duan, and Bryan Pardo, Combining rhythm-based and pitch-based methods for background and melody separation, IEEE/ACM Trans. Audio Speech Language Process., vol. 22, no. 12, pp. 1884-1893, 2014. <pdf>

[4] Zhiyao Duan, Jinyu Han, and Bryan Pardo, Multi-pitch streaming of harmonic sound mixtures, IEEE/ACM Trans. Audio Speech Language Process., vol. 22, no. 1, pp. 138-150, 2014. <pdf> <code>

[3] Zhiyao Duan and Bryan Pardo, Soundprism: an online system for score-informed source separation of music audio, IEEE Journal of Selected Topics in Signal Process., vol. 5, no. 6, pp. 1205-1215, 2011. <pdf> <slides> <sound files> <code>

[2] Zhiyao Duan, Bryan Pardo, and Changshui Zhang, Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions, IEEE Trans. Audio Speech Language Process., vol. 18, no. 8, pp. 2121-2133, 2010. <pdf> <code>

[1] Zhiyao Duan, Yungang Zhang, Changshui Zhang, and Zhenwei Shi, Unsupervised single-channel music source separation by average harmonic structure modeling, IEEE Trans. Audio Speech Language Process., vol. 16, no. 4, pp. 766-778, 2008. <pdf> <sound files>

Peer-reviewed Conference Papers

[31] Yichi Zhang and Zhiyao Duan, IMINET: convolutional semi-siamese networks for sound search by vocal imitation, in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2017, pp. 304-308. <pdf>

[30] Rui Lu, Zhiyao Duan, and Changshui Zhang, Metric learning based data augmentation for environmental sound classification, in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2017, pp. 1-5. <pdf>

[29] Bochen Li, Karthik Dinesh, Gaurav Sharma, and Zhiyao Duan, Video-based vibrato detection and analysis for polyphonic string music, accepted by International Society for Music Information Retrieval Conference (ISMIR), 2017.

[28] Andrea Cogliati and Zhiyao Duan, A metric for music notation transcription accuracy, accepted by International Society for Music Information Retrieval Conference (ISMIR), 2017.

[27] Bochen Li, Chenliang Xu, and Zhiyao Duan, Audio-visual source association for string ensembles through multi-modal vibrato analysis, accepted by the 14th Sound and Music Computing Conference (SMC), 2017.

[26] Bochen Li, Karthik Dinesh, Zhiyao Duan and Gaurav Sharma, See and listen: score-informed association of sound tracks to players in chamber music performance videos, accepted by IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. <pdf>

[25] Karthik Dinesh*, Bochen Li*, Xinzhao Liu, Zhiyao Duan and Gaurav Sharma, Visually informed multi-pitch analysis of string ensembles, accepted by IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. (* equal contribution) <pdf>

[24] Rui Lu, Kailun Wu, Zhiyao Duan, and Changshui Zhang, Deep ranking: triplet MatchNet for music metric learning, accepted by IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. <pdf>

[23] Sefik Emre Eskimez, Melissa Sturge-Apple, Zhiyao Duan, and Wendi Heinzelman, WISE: web-based interactive speech emotion classification, in Proc. 4th Workshop on Sentiment Analysis where AI meets Psychology (SAAIP), 2016. <pdf> <slides>

[22] Andrea Cogliati, David Temperley, and Zhiyao Duan, Transcribing human piano performances into music notation, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2016. <pdf>

[21] Sefik Emre Eskimez, Kenneth Imade, Na Yang, Melissa Sturge-Apple, Zhiyao Duan, and Wendi Heinzelman, Emotion Classification: How Does an Automated System Compare to Naive Human Coders?, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016. <pdf> <slides>

[20] Yichi Zhang and Zhiyao Duan, IMISOUND: An Unsupervised System For Sound Query By Vocal Imitation, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016. <pdf>

[19] Andrea Cogliati, Zhiyao Duan, and Brendt Wohlberg, Piano Music Transcription with Fast Convolutional Sparse Coding, in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP), 2015. <pdf>

[18] Yichi Zhang and Zhiyao Duan, Retrieving Sounds by Vocal Imitation Recognition, in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP), 2015. <pdf>

[17] Jun Zhou, Shuo Chen, and Zhiyao Duan, Rotational reset strategy for online semi-supervised NMF-based speech enhancement for long recordings, in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015. <pdf> <poster>

[16] Bochen Li and Zhiyao Duan, Score following for piano performances with sustain-pedal effects, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2015. <pdf> <poster>

[15] Andrea Cogliati and Zhiyao Duan, Piano music transcription modeling note temporal evolution, in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015. <pdf>

[14] Zhiyao Duan and David Temperley, Note-level music transcription by maximum likelihood sampling, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2014. <pdf>

[13] Zhiyao Duan, Bryan Pardo, and Laurent Daudet, A novel cepstral representation for timbre modeling of sound sources in polyphonic mixtures, in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014. <pdf> <poster> <code>

[12] Jonathan Springer, Zhiyao Duan, and Bryan Pardo, Approaches to multiple concurrent species bird song recognition, in the 2nd International Workshop on Machine Listening in Multisource Environments, ICASSP, 2013. <pdf> <poster>

[11] Zhiyao Duan, Gautham J. Mysore, and Paris Smaragdis, Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments, in Proc. Interspeech, 2012. <pdf> <slides> <sound files>

[10] Zhiyao Duan, Gautham J. Mysore, and Paris Smaragdis, Online PLCA for real-time semi-supervised source separation, in Proc. International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA), LNCS 7191, pp. 34-41, 2012. <pdf> <slides>

[9] Zhiyao Duan and Bryan Pardo, Aligning semi-improvised music audio with its lead sheet, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2011, pp. 513-518. <pdf> <poster> <sound files>

[8] Zhiyao Duan and Bryan Pardo, A state space model for online polyphonic audio-score alignment, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, pp. 197-200. <pdf> <poster> <sound files>

[7] Zhiyao Duan, Jinyu Han, and Bryan Pardo, Song-level multi-pitch tracking by heavily constrained clustering, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2010, pp. 57-60. <pdf> <slides>

[6] Zhiyao Duan, Jinyu Han, and Bryan Pardo, Harmonically informed multi-pitch tracking, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2009, pp. 333-338. <pdf> <slides>

[5] Zhiyao Duan, Lie Lu, and Changshui Zhang, Collective annotation of music from multiple semantic categories, in Proc. International Conference on Music Information Retrieval (ISMIR), 2008, pp. 237-242. <pdf> <poster>

[4] Zhiyao Duan, Lie Lu, and Changshui Zhang, Audio tonality mode classification without tonic annotations, in Proc. International Conference on Multimedia & Expo (ICME), 2008, pp. 1361-1364. <pdf> <poster>

[3] Zhiyao Duan and Changshui Zhang, A probabilistic approach to multiple fundamental frequency estimation from the amplitude spectrum peaks, in Proc. Music, Brain and Cognition workshop in the Twenty-first Annual Conference on Neural Information Processing Systems (NIPS), 2007. <pdf> <slides> <poster>

[2] Zhiyao Duan, Dan Zhang, Changshui Zhang, and Zhenwei Shi, Multi-pitch estimation based on partial event and support transfer, in Proc. International Conference on Multimedia & Expo (ICME), 2007, pp. 216-219. <pdf> <poster> <sound files>

[1] Nelson Lee, Zhiyao Duan, and Julius O. Smith, Excitation signal extraction for guitar tones, in Proc. International Computer Music Conference (ICMC), 2007, pp. 450-457. <pdf>

Conference Abstracts

[3] Andrea Cogliati, Zhiyao Duan, and Brendt Wohlberg, Transcribing piano music in the time domain into music notation, The 5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan, Honolulu, Hawaii, December 2016.

[2] Bochen Li, Zhiyao Duan, and Gaurav Sharma, Associating players to sound tracks for musical performance videos, Late Breaking Demo in the International Society for Music Information Retrieval Conference, 2016.

[1] Iris Yuping Ren, David Temperley, and Zhiyao Duan, Blue notes in rock: an exploratory study, The 6th workshop on Cognitively Based Music Informatics Research (CogMIR), 2016.

Book Chapters

[1] Bryan Pardo, Zafar Rafii, and Zhiyao Duan, Audio source separation in a musical context, Springer Handbook of Systematic Musicology, Springer-Verlag Berlin Heidelberg, 2017.

Patents

[2] Andrea Cogliati, Zhiyao Duan, and Brendt Wohlberg, Context-dependent piano music transcription with convolutional sparse coding, U.S. Patent filed in 2016.

[1] Gautham J. Mysore, Paris Smaragdis, and Zhiyao Duan, Online Non-negative Source Separation, U.S. Patent filed in 2011.

Theses

[5] Jonathan Downing, Joint Source Separation and Dereverberation of Single-channel Drum Kit Recordings, M.S. Thesis, Department of Electrical and Computer Engineering, University of Rochester, December 2016. Advisor: Zhiyao Duan. Reading Committee: Gonzalo Mateos and David Temperley.

[4] Xinzhao Liu, Creating an Audio-visual Musical Performance Dataset for Enhanced Multi-pitch Analysis, M.S. Thesis, Department of Electrical and Computer Engineering, University of Rochester, May 2016. Advisor: Zhiyao Duan. Reading Committee: Gaurav Sharma and David Temperley.

[3] Andrew Trahan, A Two Part Event-Based Drum Kit Transcription System, M.S. Thesis, Department of Electrical and Computer Engineering, University of Rochester, May 2014. Advisor: Zhiyao Duan. Reading Committee: Jack Mottley and David Temperley.

[2] Zhiyao Duan, Computational Music Audio Scene Analysis, Ph.D. Dissertation, Department of Electrical Engineering and Computer Engineering, Northwestern University, August 2013. Advisor: Bryan Pardo. Reading Committee: Thrasyvoulos N. Pappas, Michael Honig, DeLiang Wang. <pdf>

[1] Zhiyao Duan, Research on Polyphonic Music Pitch Estimation, M.S. Thesis, Department of Automation, Tsinghua University, July 2008 (in Chinese). Advisor: Changshui Zhang.