Hi Andrea,
Gensim doesn't store your original "untokenized" documents at all.
So if you want to retrieve them, you have to do that outside of Gensim. For example:
corpus = [doc1, doc2, doc3, …]
bow_corpus = …whatever processing you use…
index.num_best = 10
for doc_no, score in index[tfidf[query_vector]]:
print("original document:", corpus[doc_no])
In this example, you'd be using corpus as your outside-of-Gensim storage of your original tokens.
Hope that helps,
Radim