Course Calendar/Schedule (Subject to Change)

Week Day Date Topic Speaker Required Reading Extended Reading Assigned Due
1 Thu Aug 29 Introduction Zhiyao Duan Lyon: Machine Hearing: An Emerging Field Class Paper Review;
Presentation of Research;
Course Project;
Peer Feedback;
HW0 (Warm-up)
2 Tue Sep 3 Auditory Scene Analysis Zhiyao Duan Bregman: ASA Book Chapter 1 Wang & Brown: CASA Book Chapter 1 HW1
2 Thu Sep 5 Signal Processing Review Zhiyao Duan Mueller: Fundamentals of Music Processing Book Chapter 2
3 Tue Sep 10 Human Auditory Sensation Zhiyao Duan Yost: Hearing Book Chapter 11 Patterson: Auditory Images;
Lyon et al.: Sparse Auditory Representations
 
3 Thu Sep 12 Human Auditory Sensation Zhiyao Duan Yost: Hearing Book Chapter 13 Shamma: Encoding Sound Timbre in the Auditory System
Wang & Shamma: Spectral Shape Analysis
HW2 HW1 (due Wed)
4 Tue Sep 17 Single Pitch Detection Zhiyao Duan Cheveigne: CASA Book Chapter 2.1-2.3 Cheveigne & Kawahara: YIN;
Boersma: Praat
4 Thu Sep 19 Rhythm Analysis Zhiyao Duan Mueller: Fundamentals of Music Processing Book Chapter 6 Ellis: Beat Tracking by Dynamic Programming
Klapuri et al.: Meter Analysis;
HW3 HW2 (due Sat)
5 Tue Sep 24 Timbre Representation Zhiyao Duan Herrera-Boyer et al: Signal Processing Methods for Music Transcription Book Chapter 6 Childers et al.: The Cepstrum;
Davis & Mermelstein: MFCC
5 Thu Sep 26 Timbre Representation Zhiyao Duan Tzanetakis: Music Data Mining Book Chapter 2 Hermansky: PLP;
Hermansky & Morgan: RASTA
Paper Review Batch 1 (due Wed)
6 Tue Oct 1 NMF Audio Modeling Zhiyao Duan Smaragdis & Brown: NMF Polyphonic Music Transcription Lee & Seung: NMF HW4 HW3 (due Mon)
6 Thu Oct 3 NMF Audio Modeling Zhiyao Duan Smaragdis et al.: PLCA Virtanen: Monaural Sound Source Separation
7 Tue Oct 8 HMM Audio Modeling Zhiyao Duan Rabiner: HMM Mysore: PhD Thesis Chapter 2 Project Proposal (due Mon)
7 Thu Oct 10 HMM Audio Modeling Zhiyao Duan Goodfellow et al.: Deep Learning Book Chapter 6 Hinton et al.: DNN for Speech Recognition
8 Tue Oct 15 NO CLASS: Fall Break How to write a paper?
How to give a talk?
How to make a poster?
  HW4 (due Mon)
8 Thu Oct 17 Deep Learning for Audio
CIRC Intro
Zhiyao Duan & Christos Benetatos Goodfellow et al.: Deep Learning Book Chapter 9
DNN for Speech Separation;
Huang et al.: Singing Voice Separation by RNN
HW5 Paper Review Batch 2 (due Wed)
9 Tue Oct 22 Convolutional Neural Network
BP derivation
Christos Benetatos & Zhiyao Duan Goodfellow et al.: Deep Learning Book Chapter 14 Schluter & Bock: Onset Detection by CNN;
Hamel & Eck: Music Feature Learning with DBN
 
9 Thu Oct 24 Multi-pitch Analysis Zhiyao Duan Cheveigne: CASA Book Chapter 2 Klapuri: Harmonicity and Spectral Smoothness
Duan et al.: Peak and Non-peak Region
 
10 Tue Oct 29 Multi-pitch Analysis Zhiyao Duan Duan et al.: Multi-pitch Streaming Poliner & Ellis: Discriminative Model;
Sigtia et al.: Neural Network for Piano Transcription
 
10 Thu Oct 31 Score-informed Source Separation Zhiyao Duan Dannenberg & Raphael: Alignment and Accompaniment;
Ewert et al.: SISS Overview
Ewert & Mueller: Score-informed NMF;
Duan et al.: Soundprism
HW5 (due Fri)
11 Tue Nov 5 Speaker Verification Ge Zhu Reynolds et al.: Speaker Verification using Adaptive GMM; Dehak et al.: Front End Factor Analysis for Speaker Verification Variani et al.: DNN for Small Footprint Text Dependent SV; Schroff et al.: FaceNet Paper Review Batch 3 (due Tue)
11 Thu Nov 7 Music Generation Yujia Yan
12 Tue Nov 12 Audio-Visual Scene Understanding Zhiyao Duan Arandjelovic & Zisserman: Objects that Sound; Owens & Efros: AV Scene Analysis Arandjelovic & Zisserman: Look Listen and Learn; Zhao et al.: Sounds of Pixels
12 Thu Nov 14 Multi-channel Source Localization and Separation Zhiyao Duan Stern et al.: CASA Book Chapter 5;
Yilmaz & Rickard: DUET
Woodruff & Wang: Binaural Localization Reverberant Noisy Project Status Update Meeting (Wed)
13 Tue Nov 19 Acoustic Event Detection Zhiyao Duan Grzeszick et al.: Bag-of-Features Methods Cakir et al.: Convolutional RNN;
Jansen et al.: Large-scale AED from YouTube
13 Thu Nov 21 Score Following / Cover Song ID Students / Varun & Luke   Nakamura et al.: Real-Time Alignment with Repeats and Skips; Dorfer et al.: Learning to Listen Read and Follow
Seetharaman & Rafii: Cover Song ID with 2D Fourier Transform; Tralie: Early MFCC and HPCP Fusion
14 Tue Nov 26 Music Source Separation / Music Generation Shadi, Narges & Nicholas / Frank & Mojtaba Rafii et al.: Overview of Lead Accompaniment Separation; Lluis et al.: End to End Music Source Separation; Takahashi & Mitsufuji: MMDenseNets
Roberts et al.: Hierarchical Latent Vector Model; Jaques et al.: Generating Music with Reinforcement Learning; Yang et al.: MIDINET; Sturm et al.: Music Transcription Modeling and Composition
14 Thu Nov 28 NO CLASS: Happy Thanksgiving!
15 Tue Dec 3 Music Expressiveness / Music Recommendation Gavin & Claire / Michael, Tong & Raunaq Xia et al.: Spectral Learning for Expressive Interactive Performance; Cancino-Chacon et al.: An Evaluation of Expressiveness Models
van den Oord et al.: Deep Content Based Music Recommendation; Neural Collaborative Filtering
Project Report Draft (due Mon)
15 Thu Dec 5 Speech Recognition / Speech Dereverberation Tolga, Gazi & Ian / Bo, Haiqin & Meiying Chan et al.: Listen, Attend and Spell; Graves et al.: Speech Recognition with DRNN
Han et al.: Learning Spectral Mapping; Williamson et al.: TF Masking in Complex Domain
16 Tue Dec 10 Source Localization / Source Separation Kyle & Shaf / Yuxiang & Neil Yalta et al.: Sound Source Localization using Deep Learning; Sun et al.: Indoor Source Localization using Probabilistic Neural Network
Hershey et al.: Deep Clustering; Chen et al.: Deep Attractor Network
16 Fri Dec 13 Project Poster Presentations Students 10:30-12 in CSB 601  
16 Sat Dec 14 NO CLASS: Finals Week Project Report Final;
Poster Final