Publications / Patents


  • Bochen Li and Aparna Kumar, Systems, Methods & Computer Program Products for Associating Media Content Having Different Modalities, U.S. Patent Application 16/439,626. June 2019.
    (Patent filed with )

  • Bochen Li, Karthik Dinesh, Chenliang Xu, Gaurav Sharma, and Zhiyao Duan, Online Audio-Visual Source Association for Chamber Music Performances, Transactions of the International Society for Music Information Retrieval (TISMIR), vol. 2, no. 2, pp. 29-42, 2019. DOI: http://doi.org/10.5334/tismir.25

  • Bochen Li and Aparna Kumar, Query by Video: Cross-modal Music Retrieval, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2019.

  • Bochen Li*, Xinzhao Liu*, Karthik Dinesh, Zhiyao Duan, and Gaurav Sharma, Creating a multi-track classical music performance dataset for multi-modal music analysis: challenges, insights, and applications, IEEE Transactions on Multimedia, vol. 21, no. 2, pp. 522-535, 2019. (* equal contribution) <pdf> <project>

  • Yapeng Tian, Jing Shi, Bochen Li, Zhiyao Duan, and Chenliang Xu, Audio-visual event localization in unconstrained videos, in Proc. European Conference on Computer Vision (ECCV), 2018. <pdf>

  • Bochen Li, Akira Maezawa, and Zhiyao Duan, Skeleton plays piano: online generation of pianist body movements from MIDI performance, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2018. <pdf> <demo>

  • Bochen Li and Akira Maezawa, MIDI2Pose: Online keyboard performance motion generation from performance data, in Proc. Information Processing Society of Japan, 2018. <link>

  • Xueyang Wang, Ryan Stables, Bochen Li, and Zhiyao Duan, Score-aligned polyphonic microtiming estimation, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018. <pdf> <poster>

  • Bochen Li, Karthik Dinesh, Gaurav Sharma, and Zhiyao Duan, Video-based vibrato detection and analysis for polyphonic string music, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2017, pp. 123-130. (best paper nomination) <pdf> <slides>

  • Bochen Li, Chenliang Xu, and Zhiyao Duan, Audio-visual source association for string ensembles through multi-modal vibrato analysis, in Proc. The 14th Sound and Music Computing Conference (SMC), 2017, pp. 159-166. (best paper award) <pdf> <slides>

  • Bochen Li, Karthik Dinesh, Zhiyao Duan and Gaurav Sharma, See and listen: score-informed association of sound tracks to players in chamber music performance videos, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, pp. 2906-2910. <pdf> <slides>

  • Karthik Dinesh*, Bochen Li*, Xinzhao Liu, Zhiyao Duan and Gaurav Sharma, Visually informed multi-pitch analysis of string ensembles, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, pp. 3021-3025. (* equal contribution) <pdf> <slides>

  • Bochen Li and Zhiyao Duan, An approach to score following for piano performances with the sustained effect, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 12, pp. 2425-2438, 2016. <pdf> <project>

  • Bochen Li and Zhiyao Duan, Score following for piano performances with sustain-pedal effects, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2015, pp. 469-475. <pdf> <poster>

  • Li, N., Wang, R., Deng, Y., Liu, Y., Li, B., Wang, C., and Balz, T. Unsupervised polarimetric synthetic aperture radar classification of large-scale landslides caused by Wenchuan earthquake in hue-saturation-intensity color space. Journal of Applied Remote Sensing, vol. 8, no. 1, 2014.

  • Li, N., Wang, R., Deng, Y., Liu, Y., Wang, C., Balz, T., and Li, B. Polarimetric Response of Landslides at X-Band Following the Wenchuan Earthquake. IEEE Geoscience and Remote Sensing Letters, vol. 11, no. 10, pp. 1722-1726, 2014.