Hi Camille,
Your idea to import the data you need into a SQL database sounds good,
and IMDbPY could help.
There are, however, some caveats: first of all, the document you found
(README.sqldb.txt)
refers to an obsolete set of data no longer updated by IMDb since some years.
A new, up-to-date, dataset exists (
https://www.imdb.com/interfaces/ )
and IMDbPY is able to import it
into a SQL database of your choice; you can find the documentation
here:
https://imdbpy.readthedocs.io/en/latest/usage/s3.html
But you may face another problem: IMDb includes very little
information, in this new dataset.
Look at it, and decide if it's okay for your project.
If it is, you can proceed.
More or less, the workflow would be as follow:
1.
install the latest version of
https://github.com/alberanid/imdbpy/ -
see
https://imdbpy.readthedocs.io/en/latest/#installation
2.
Download the dataset.
You can do it manually or, if you prefer, you can use the
"download-from-s3" script you'll find in the docs/goodies directory
(it requires a Linux system)
3.
import the dataset; as an example, to import the data into a SQLite
database, you can do something like:
s32imdbpy.py /path/to/the/imdb-dataset-2020-07-17/ sqlite:///imdb.db --verbose
(notice the three / in sqlite:///imdb.db - they are all needed)
After a while, you will have an "imdb.db" file in the current
directory, containing the imported data.
4.
you can now search and analyze the data in this file using the
Python's "sqlite3" module.
Let me know if you have questions or something is not clear.
Hope this helps.
> _______________________________________________
> Imdbpy-help mailing list
>
Imdbp...@lists.sourceforge.net
>
https://lists.sourceforge.net/lists/listinfo/imdbpy-help
--
Davide Alberani <
davide....@gmail.com> [PGP KeyID: 0x3845A3D4AC9B61AD]
http://www.mimante.net/
_______________________________________________
Imdbpy-help mailing list
Imdbp...@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help