IMDb Text Files

149 views
Skip to first unread message

jcrubino

unread,
Oct 23, 2010, 1:29:19 AM10/23/10
to mongodb-user
Hello Everyone,
I am new to MongoDb.
I have successfully installed it but that is about it.

I am an experienced python programmer but willing to learn new
languages if it serves the purpose.

Has anyone worked on importing the IMDb text files and are there any
important things to know about doing so?
I have only found some anomalous posts about this but feel free to
point me in a specific direction if I missed something.

Thanks

Alvin Richards

unread,
Oct 23, 2010, 2:38:19 AM10/23/10
to mongodb-user
I'm not familiar with the IMDb format, so you have a couple of options

1. use mongo import. This will import a comma seperate file format.
details here

http://www.mongodb.org/display/DOCS/Import+Export+Tools?focusedCommentId=4554852#ImportExportTools-mongoimport

2. build some custom code

If you are familiar with Python, you can use the PyMongo driver to
build a script to get the data into the format you need. Info here

http://api.mongodb.org/python/1.9%2B/index.html

-Alvin

Andrew M

unread,
Oct 25, 2010, 4:50:55 PM10/25/10
to mongod...@googlegroups.com
I actually wrote a basic PHP to MongoDB importer for some of the IMDB
files. They are incredibly hard to parse due to their odd format(s).
Email me off list if you are interested and I will dig it up when I get
home.

Regards
Andrew

jcrubino

unread,
Oct 26, 2010, 6:20:30 AM10/26/10
to mongodb-user
That would be a huge help.
Thanks

JR
Reply all
Reply to author
Forward
0 new messages