Welcome to UCSD's Computer Audition Laboratory. Listen up!


Supported by NSF CAREER grant IIS-1054960, "An integrated framework for multimodal music search and discovery".


publications
2014
Y. Vaizman, B. McFee and G. Lanckriet - "Codebook-Based Audio Feature Representation for Music Information Retrieval", IEEE/ACM Transactions on Audio, Speech and Language Processing. Volume 22, Issue 10. Pages 1483-1493. 2014
G. Surges and S. Dubnov - "Feature Selection and Composition using PyOracle", Workshop on Musical Metacreation, Ninth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE-13), 2014
T. Dubnov, Z. Seldess and S. Dubnov - "Interactive projection for aerial dance using depth sensing camera", Proc. SPIE 9012, The Engineering Reality of Virtual Reality 2014
C. Wang and S. Dubnov - "Guided Music Synthesis with Variable Markov Oracle", Music Metacreation Workshop, Tenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE-14), 2014
M. Rabinovich, I. Trisan and S. Dubnov - "Nonlinear dynamics of human creativity", IEEE International Conference on Systems, Man and Cybernetics, 2014
C. Wang and S. Dubnov - "Variable Markov Oracle: A Novel Sequential Data Points Clustering Algorithm with Application to 3D Gesture Query-Matching", IEEE International Symposium on Multimedia (ISM2014)

2013
S. Dubnov - "Characterizing Time Series Variability and Predictability from Information Geometry Dynamics", volume 8085 of the Lecture Notes in Computer Science series, pp. 658-668, Springer, 2013
S. Dubnov and G. Surges - "Delegating Creativity: Use of Musical Algorithms in Machine Listening and Composition", "Digital da Vinci", pp. 127-157, Springer, 2013
J.C. Pereira, E. Coviello, G. Doyle, N. Rasiwasia, G. Lanckriet, R. Levy, N. Vasconcelos - "On the role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval". To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence.
K. Ellis, E. Coviello, A. Chan and G. Lanckriet - "A Bag of Systems Representation for Music Auto-tagging". To appear in IEEE Transactions on Audio, Speech and Language Processing.
T. Chuk, A.C.W. Ng, E. Coviello, A.B. Chan and J.H. Hsiao - "Understanding eye movements in face recognition with hidden Markov model". CogSci 2013, Berlin, Germany. 31 July - 3 Aug. 2013.
E. Coviello, A. Mumtaz, A.B. Chan and G. Lanckriet - "That was fast! Speeding up NN search of high dimensional distributions". ICML 2013, Atlanta, Georgia (USA). 16 - 21 June 2013.
D. Lim, B. McFee, G. Lanckriet - "Robust Structural Metric Learning". ICML 2013, Atlanta, Georgia (USA). 16 - 21 June 2013.
G. Surges - "PyOracle - Analysis of Musical Structure Using Python". PyCon 2013. Python Software Foundation. Santa Clara, CA. 17 March 2013

2012
E. Coviello, A.B. Chan & G. Lanckriet - The variational hierarchical EM algorithm for clustering hidden Markov models. NIPS 2012
S. Dubnov, G. Assayag - Music Design with Audio Oracle using Information Rate. MUME Workshop, AAAI 2012
E. Coviello, Y. Vaizman & G. Lanckriet - Multivariate Autoregressive Mixture Models for Music. ISMIR 2012
J. Urbano, S. Downie, B. McFee & M. Schedl - How significant is statistically significant? The case of audio music similarity and retrieval. ISMIR 2012
B. McFee & G. Lanckriet - Hypergraph models of playlist dialects. ISMIR 2012
E. Coviello, A. Mumtaz, A. Chan & G. Lanckriet - Growing a Bag of Systems Tree for Fast and Accurate Classification. IEEE CVPR 2012
L. Barrington, D. Turnbull, & G. Lanckriet - Game-Powered Machine Learning. Proceedings of the National Academy of Sciences (2012), Vol. 109, pp. 6411-6416.
B. McFee, T. Bertin-Mahieux, D.P.W. Ellis, & G. Lanckriet - The Million Song Dataset Challenge. 4th International Workshop on Advances in Music Information Research (AdMIRe), 2012. - MSD Challenge on Kaggle
McFee, B., Barrington, L., & Lanckriet, G.R.G. Learning content similarity for music recommendation. In IEEE Transactions on Audio, Speech and Language Processing, 2012.

2011
S. Dubnov - Changes in Musical Culture and Practices as a result of new Multimedia Technologies. Keynote challenge talk at IEEE ISM 2011
K. Ellis, E. Coviello, & G. Lanckriet - Semantic Annotation and Retrieval of Music Using a Bag of Systems Representation. ISMIR 2011
E. Coviello, R. Miotto, & G. Lanckriet - Combining Content-Based Auto-Taggers with Decision-Fusion. ISMIR 2011
B. McFee & G. Lanckriet - The natural language of playlists. ISMIR 2011 (data)
B. McFee & G. Lanckriet - Large-scale music similarity search with spatial trees. ISMIR 2011 (code)
Y. Vaizman, R.Y. Granot, J. Israel & G. Lanckriet - Modeling Dynamic Patterns for Emotional Content in Music. ISMIR 2011
S. Dubnov, G. Assayag and A. Cont - Audio Oracle analysis of Musical Information Rate. Proceedings of IEEE Semantic Computing Conference (ICSC), September 2011
S. Dubnov, G. Assayag and A. Cont - On the Information Geometry of Audio Streams with Applications to Similarity Computing. IEEE Transactions on Audio, Speech and Language Processing, 19(4), pp. 837 - 846, 2011.
J. Keshet, C-C Cheng, M. Stoehr, C. McAllester & L. K. Saul - Direct Error Rate Minimization of Hidden Markov Models. INTERSPEECH, August 2011
E. Coviello, A.B. Chan & G. Lanckriet - Time Series Models for Semantic Music Annotation. IEEE Transactions on Audio, Speech, and Language Processing, July 2011
C-C Cheng and B. Kingsbury - Arccosine Kernels: Acoustic Modeling with Infinite Neural Networks. ICASSP, May 2011
B. McFee, L. Barrington & G. Lanckriet - Learning content similarity for music recommendation. Submitted to IEEE Transactions on Audio, Speech and Language Processing, 2011.
B. McFee & G. Lanckriet - Learning multi-modal similarity. Journal of Machine Learning Research (JMLR), February, 2011.

2010
B. McFee, L. Barrington & G. Lanckriet - Learning Similarity from Collaborative Filters. ISMIR 2010
R. Miotto, L. Barrington & G. Lanckriet - Improving Auto-tagging by Modeling Semantic Co-occurrences. ISMIR 2010
E. Coviello, L. Barrington, A.B. Chan & G. Lanckriet - Automatic Music Tagging With Time Series Models. ISMIR 2010
N. Koenigstein, G. Lanckriet, B. McFee and Y. Shavitt - Collaborative Filtering Based on P2P Networks. ISMIR 2010
B. McFee & G. Lanckriet - Metric learning to rank. ICML 2010
S. Dubnov - Musical Information Dynamics as Models of Auditory Anticipation. Machine Audition: Principles, Algorithms and Systems, ed. W. Wang, IGI Global publication, 2010.
L. Barrington, A.B. Chan, G. Lanckriet - Modeling Music as a Dynamic Texture. IEEE Transactions on Audio, Speech and Language Processing 18-3 pp 602-612. (project page)
C.-C. Cheng, F. Sha, & L. K. Saul - Online learning and acoustic feature adaptation in large margin hidden Markov models. IEEE Journal of Selected Topics in Signal Processing 4(6): 926-942, 2010.

2009
C.-C. Cheng, F. Sha, and L. K. Saul - Large margin feature adaptation for automatic speech recognition. Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-09). Merano, Italy.
B. McFee and G. Lanckriet - Heterogeneous embedding for subjective artist similarity. Tenth International Symposium for Music Information Retrieval (ISMIR). Kobe, Japan.
L. Barrington, R. Oda, G. Lanckriet - Smarter Than Genius? Human Evaluation of Music Recommender Systems. Tenth International Symposium for Music Information Retrieval (ISMIR). Kobe, Japan.
D. J. Hu and L. K. Saul - A probabilistic model of unsupervised learning for musical-key profiles. Tenth International Society for Music Information Retrieval Conference (ISMIR-09). Kobe, Japan.
S. Dubnov, Y. Kiyoki - Opera of Meaning: film and music performance with semantic associative search. Frontiers in Artificial Intelligence and Applications, Information Modelling and Knowledge Bases XX, Volume 190, pp. 384 – 391, 2009.
C.-C. Cheng, F. Sha, and L. K. Saul - A fast online algorithm for large margin training of continuous-density hidden Markov models. In Proceedings of the Tenth Annual Conference of the International Speech Communication Association (Interspeech-09). Brighton, UK.
L. Barrington, D. Turnbull, M. Yazdani, G. Lanckriet - Combining Audio Content and Social Context for Semantic Music Discovery. SIGIR, 2009.
B. McFee, G. Lanckriet - Partial order embedding with multiple kernels. Twenty-sixth International Conference on Machine Learning (ICML), 2009.
C.-C. Cheng, F. Sha, and L. K. Saul - Matrix updates for perceptron training of continuous-density hidden Markov models. In Proceedings of the Twenty Sixth International Conference on Machine Learning (ICML-09), pages 153-160. Montreal, Canada.
Y. Cho and L. K. Saul - Learning dictionaries of stable autoregressive models for audio scene analysis. Twenty Sixth International Conference on Machine Learning (ICML), pages 169-176. Montreal, Canada.
Y. Cho and L. K. Saul - Sparse decomposition of mixed audio signals by basis pursuit with autoregressive models. In Proceedings of the International Conference of Acoustics, Speech, and Signal Processing (ICASSP), pages 1705-1708. Taipei, Taiwan.
L. Barrington, A.B. Chan, G. Lanckriet - Dynamic Texture Models of Music. In Proceedings of the International Conference of Acoustics, Speech, and Signal Processing (ICASSP). Taipei, Taiwan.
S. Dubnov, M.J. Hinich - Analyzing several musical instrument tones using the randomly modulated periodicity model. Signal Processing, Volume 89, Issue 1, pp 24-30, January 2009

2008
L. Barrington, M. Yazdani, D. Turnbull, G. Lanckriet - Combination of Feature Kernels for Semantic Music Retrieval. ISMIR 2008
D. Turnbull, L. Barrington, G. Lanckriet - Five Approaches to Collecting Tags for Music. ISMIR 2008
C. C. Cheng, D. J. Hu, and L. K. Saul - Nonnegative matrix factorization for real time musical analysis and sight-reading evaluation. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP-08), pages 2017-2020. Las Vegas, NV.
D. Turnbull, L. Barrington, D. Torres, G. Lanckriet - Semantic Annotation and Retrieval of Music and Sound Effects. IEEE Transactions on Audio, Speech, and Language Processing, February 2008
S. Dubnov - Unified View of Prediction and Repetition Structure in Audio Signals. IEEE Transactions on Audio, Speech and Language Processing, February 2008
S. Dubnov and G. Assayag - Memex and Composer Duets: computer aided composition using style modeling and mixing. Open Music Composers book 2, 2008

2007
Turnbull, Liu, Barrington & Lanckriet - A Game-Based Approach for Collecting Semantic Annotations of Music. ISMIR, Vienna, Austria, September 2007.
Torres, Turnbull, Barrington & Lanckriet - Identifying Words that are Musically Meaningful. ISMIR, Vienna, Austria, September 2007.
Turnbull, Lanckriet, Pampalk, & Goto - A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting. ISMIR, Vienna, Austria, September 2007.
Cont, Dubnov & Wessel - Realtime Multiple-pitch and Multiple-instrument Recognition For Music Signals using Sparse Non-negative Constraints. DAFx, Bordeaux, France, September 2007.
Cont, Dubnov & Assayag - GUIDAGE: A Fast Audio Query Guided Assemblage. ICMC, Copenhagen, Denmark, August 2007.
Dubnov, Cont & Assayag - Audio Oracle: A New Algorithm for Fast Learning of Audio Structures. ICMC, Copenhagen, Denmark, August 2007.
Turnbull, Barrington, Torres & Lanckriet - Towards Musical Query-by-Semantic Description using the CAL500 Data Set. To appear in SIGIR, Amsterdam, July 2007
Cont, Dubnov & Assayag - Anticipatory Model of Musical Style Imitation using Collaborative and Competitive Reinforcement Learning. In Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior, Butz, M.V.; Sigaud, O.; Pezzulo, G.; Baldassarre, G. (Eds.), Pages 285-306, LNCS 4520, Springer Verlag.
Barrington, Chan, Turnbull & Lanckriet - Audio Information Retrieval Using Semantic Similarity. International Conference on Acoustic, Speech and Signal Processing (ICASSP), Hawaii, April 2007
Sriperumbudur, Torres & Lanckriet - Sparse Eigen Methods by D.C. Programming. To appear in International Conference on Machine Learning (ICML), 2007
Turnbull, Barrington, Torres & Lanckriet - Exploring the Semantic Annotation and Retrieval of Sound. CAL Technical Report CAL-2007-01, San Diego, February 2007

2006
Turnbull, Barrington, Torres & Lanckriet - Modeling the Semantics of Sound. NIPS Workshop on Advances in Models for Acoustic Processing, Vancouver, December 2006
Turnbull, Barrington & Lanckriet - Modeling Music and Words using a Multi-Class naive Bayes Approach. International Symposium on Music Information Retrieval (ISMIR), Victoria, October 2006
Cont - Realtime Multiple Pitch Observation using Sparse Non-negative Constraints. International Symposium on Music Information Retrieval (ISMIR), Victoria, October 2006.
Cont, Dubnov & Assayag - A framework for Anticipatory Machine Improvisation and Style Imitation. Anticipatory Behavior in Adaptive Learning Systems (ABiALS), Rome, September 2006.
Cont - Realtime Audio to Score Alignment for Polyphonic Music Instruments Using Sparse Non-negative constraints and Hierarchical HMMs. ICASSP'06, Toulouse, May 2006.
Barrington, Lyons, Diegmann & Abe - Ambient Display Using Musical Effects. Intelligent User Interfaces (IUI), Sydney, January 2006