Hi,
Is there anyone else who have experiences of running some kind of deep learning models on TIRA, and if so, approximately how long does it take you to run it on the article-training and article-test data? I am running a kind of RNN model trained in pytorch (which runs quite fast on a GPU) and it took me almost 20 hours to run (i.e. predict, not train) on the by-article training data on the TIRA-CPU. I know that pytorch models can be rather slow to run on the CPU, but 20 hours is totally insane, especially compared to the time it took to run more shallow models before the early-bird deadline. The model is a around 200MB so it is large but it is only loaded once in the code and then used to call predict() one time per new article. It would probably be faster to collect the whole dataset and then predict larger batches but I find it totally unreasonably that it should take so long to run anyway.
All kinds of input or suggestions on what to do in order to speed up things would be highly appreciated.
Best regards,
Fredrik