At the AIR lab, we conduct research in the emerging field of computer audition, i.e., designing computational systems that are able to analyze and understand sounds including music, speech, and environmental sounds. We address fundamental issues such as parsing polyphonic auditory scenes (the cocktail party effect), as well as design novel applications such as sound retrieval and music information retrieval. We also combine sound analysis with the analysis of other signal modalities such as text and video towards multi-modal scene analysis. Various projects we have been working on include audio source separation, automatic music transcription, audio-score alignment, speech enhancement, speech diarization and emotion recognition, sound retrieval, sound event detection, and audio-visual scene understanding.
We are looking for highly motivated students to join the AIR lab. Students are expected to have a solid background in mathematics, programming, and academic writing. Experiences in music activities will be a plus. Most importantly, students should be fascinated by human's ability in perceiving and understanding sounds, and are willing to make computers to achieve this capability! If you are interested, please apply to the ECE Ph.D. program, and mention Prof. Zhiyao Duan in your application. If you are a master or undergrad student at UR and want to do a project/thesis in the AIR lab, please send Dr. Duan an email or stop by his office.