Newsgroups: comp.lang.python
From: "Joerg Schuster" <joerg.schuster.REMOVET...@gmail.com>
Date: 7 Mar 2005 05:36:32 -0800
Local: Mon, Mar 7 2005 8:36 am
Subject: shuffle the lines of a large file
Hello,
I am looking for a method to "shuffle" the lines of a large file. I have a corpus of sorted and "uniqed" English sentences that has been (1) sort corpus | uniq > corpus.uniq corpus.uniq is 80G large. The fact that every sentence appears only So, it would be very useful to do one of the following things: - produce corpus.uniq in a such a way that it is not sorted in any way Unfortunately, none of the machines that I may use has 80G RAM. Any ideas? Joerg Schuster You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
| ||||||||||||||