Newsgroups: comp.lang.python
From: Steven D'Aprano <st...@REMOVE-THIS-cybersource.com.au>
Date: Fri, 18 Jan 2008 22:43:17 -0000
Local: Fri, Jan 18 2008 5:43 pm
Subject: Re: Efficient processing of large nuumeric data file
On Fri, 18 Jan 2008 09:58:57 -0800, Paul Rubin wrote:

> David Sanders <dpsand...@gmail.com> writes:
>> The data files are large (~100 million lines), and this code takes a
>> long time to run (compared to just doing wc -l, for example).
>
> wc is written in carefully optimized C and will almost certainly run

However, wc -l doesn't do the same thing as what the Original Poster is trying to do. There is little comparison between counting the number of lines and building a histogram, except that both tasks have to see each line. Naturally the second task will take longer compared to wc.

("Why does it take so long to make a three-tier wedding cake? I can boil
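To make the difference between the two tasks concrete, here is a rough sketch. It assumes, purely for illustration, that each line holds a single integer; the Original Poster's actual file format may differ, and the names count_lines and build_histogram are made up for this example.

```python
import io

def count_lines(lines):
    # Roughly the job wc -l does: touch every line, keep one running total.
    return sum(1 for _ in lines)

def build_histogram(lines):
    # Assumed format: one integer per line (hypothetical, see above).
    # Each line must be parsed and a dictionary entry updated, so there
    # is more work per line than in simple counting.
    hist = {}
    for line in lines:
        value = int(line)  # int() ignores the trailing newline
        hist[value] = hist.get(value, 0) + 1
    return hist

# Small in-memory stand-in for a real data file.
sample = "3\n1\n3\n2\n3\n"
n_lines = count_lines(io.StringIO(sample))
hist = build_histogram(io.StringIO(sample))
```

Both functions make a single pass over the input, but the histogram version also converts text to a number and does a dictionary lookup and store for every line, which is where the extra time goes.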