During my Tuesday presentation, I suddenly began talking about "horses", and cited a recently published paper:
B. L. Sturm, “A simple method to determine if a music information retrieval system is a ‘horse’,” IEEE Trans. Multimedia 16(6): 1636-1644, 2014.
Admittedly, this was confusing for many people. :)
- "What is a 'horse'?"
From my article: "[A horse is] a system appearing capable of a remarkable human feat, e.g., music genre recognition [from an audio signal], but actually working by using irrelevant characteristics (confounds)."
- "Did he just call the result of all my research a 'horse'?"
From my article: "[Calling] an MIR system a 'horse' is not meant to be an aspersion. As an intentional nod to Clever Hans, a 'horse' is just a system that is not actually addressing the problem it appears to be [or is claimed to be] solving. The judgment of whether a 'horse' is useful or not completely revolves around a use case of a system [1], [2]: can the requirements demanded of a use case be satisfied by a system that relies on characteristics confounded with the 'ground truth'?"
- "Why does this matter?"
From my article: "By explaining why an MIR
system produces the [figure of merit] it does [from an evaluation], our method provides a sanity test of an
MIR system, suggests ways to improve it, and thus ultimately provides a
way to complete the 'IR research and development cycle' for which MIR
has been described as falling short [3]–[5]."
- "Let's break his face!"
From my article: "!?!?!?"
So, I propose a session to have a conversation about these questions and many more, providing my face for breaking --- metaphorically speaking. For the first 10 minutes I will talk about "horses", real and virtual, with several fun examples. Then I will try to facilitate a discussion/brawl on the topic.
---
References:
[1] C. C. Liem, M. Mueller, D. Eck, G. Tzanetakis, and A. Hanjalic, “The need for music information retrieval with user-centered and multimodal strategies,” in Proc. Int. ACM workshop on Music information retrieval with user-centered and multimodal strategies, pp. 1-6, 2011.
[2] M. Schedl, A. Flexer, and J. Urbano, “The neglected user in music information retrieval research,” J. Intell. Info. Systems, vol. 41, no. 3, pp. 523-539, Dec. 2013.
[3] J. Urbano, M. Schedl, and X. Serra, “Evaluation in music information retrieval,” J. Intell. Info. Systems, vol. 41, no. 3, pp. 345-369, Dec. 2013.
[4] J. Urbano, “Evaluation in audio music similarity,” Ph.D. dissertation, University Carlos III of Madrid, 2013.
[5] X. Serra, M. Magas, E. Benetos, M. Chudy, S. Dixon, A. Flexer, E. Go mez, F. Gouyon, P. Herrera, S. Jorda, O. Paytuvi, G. Peeters, J. Schluter, H. Vinet, and G. Widmer, "Roadmap for Music Information ReSearch", G. Peeters, Ed. Creative Commons, 2013.