[Imdbpy-help] Search for movie by title, year and runtime?

36 views
Skip to first unread message

Kesselheim, David DK - NOH

unread,
Sep 28, 2018, 1:09:18 AM9/28/18
to imdbp...@lists.sourceforge.net

Hi,

 

I need to look up quite a large number of titles (16k). I have the title (more or less, sometimes ‘4k’, ‘3D’, etc is appended to it), the year and the runtime. With that info I can get the correct title-ID from IMDB but it is very slow because after having searched for the title and filtered by year I need to run ia.update(title) on each of my initial matches to compare the runtime.

 

Is there a way to speed up the process?

 

I insourced the tsv.gz files from IMDB but since the title isn’t an exact match I am not sure how well SQL Server is suited for finding the right title.

 

Thanks for the help!

 

David

 

H. Turgut Uyar

unread,
Sep 28, 2018, 6:26:50 AM9/28/18
to imdbp...@lists.sourceforge.net
Hi,

Could it help to use fuzzy string matching on the local tsv files?

https://github.com/seatgeek/fuzzywuzzy


--
Turgut
> *David *
>
>  
>
>
>
>
>
> _______________________________________________
> Imdbpy-help mailing list
> Imdbp...@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/imdbpy-help
>


_______________________________________________
Imdbpy-help mailing list
Imdbp...@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help
Reply all
Reply to author
Forward
0 new messages