Mirage for Duplicate Detection

4 views
Skip to first unread message

Charlie M

unread,
Dec 9, 2008, 5:37:35 AM12/9/08
to Mirage - Automatic Playlist Generation
I have recently been looking in to making an extension for Banshee
which detects duplicates in a Banshee music library.

I have been looking at all sorts of different ways of doing it. From
simply hashing the compressed audio data (not the meta data). That
would only work for files that started from the same original encoding
but had different meta data.

I have also been looking at acoustic finger printing technologies. Its
was suggested on the mailing list could Mirage or at least some of its
techniques be used to find identical tracks. Presumabbly they would
produce the same values in Mirage even if it was encoded with a
different encoder at a different bit rate or would this just be the
same for all similar tracks.

Dominik Schnitzer

unread,
Dec 10, 2008, 4:59:34 AM12/10/08
to mirag...@googlegroups.com
Hi Charlie,

I guess duplicate detection could indeed be done with mirage. It would
work like this:

- Analyze the whole library
- Search track by track for its nearest neighbours (NN)
- If some NN tracks lie in a given (maybe
user-adjustable&interactive) threshold show the possible duplicates to
the user
- The user could then act on this..

This is related to:
http://code.google.com/p/banshee-unofficial-plugins/issues/detail?id=9

I guess that would be an interesting add-on for the next Mirage
release. Hm maybe open a bug?

But in the first case mirage is no fingerprinting technique. But I
guess real fingerprinting could be integrated in the Mirage pipeline,
in the MEL sepctrum.

greetings,
dominik.
--
DI Dominik Schnitzer
Austrian Research Inst. for Artificial Intelligence
Freyung 6/6, A 1010 Vienna, Austria, Europe
Fax: +43-1-5336112-77(Fax)
Reply all
Reply to author
Forward
0 new messages