Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.

Dismiss

re: Reading in external file, strip out duplicates, sort then save as ext. file

0 views

Skip to first unread message

Macromedia

unread,

Sep 21, 2005, 6:27:30 PM9/21/05

to begi...@perl.org

Hi,

I have a file that I would like to read in then do the following:

- Read in each line and remove any duplicate text with tags
- Sort the file so all tag IDs are in sequential order
- Save the results to a different file name.

Can this be done easily? If so, how? I'm really a newbie at this
stuff. Any help would be greatly appreciated.

Below is a sample of my input file and what I want the output file to
look like.

Input:

Output:

John W. Krahn

unread,

Sep 21, 2005, 6:58:35 PM9/21/05

to Perl Beginners

macromedia wrote:
> Hi,

Hello,

> I have a file that I would like to read in then do the following:
>
> - Read in each line and remove any duplicate text with tags
> - Sort the file so all tag IDs are in sequential order
> - Save the results to a different file name.
>
> Can this be done easily? If so, how? I'm really a newbie at this
> stuff. Any help would be greatly appreciated.

#!/usr/bin/perl
use warnings;
use strict;

my $file_in = 'somefile';
my $file_out = 'differentfile';

open my $in, '<', $file_in or die "Cannot open $file_in: $!";
open my $out, '>', $file_out or die "Cannot open $file_out: $!";

my %seen;
print $out map $_->[ 1 ],
sort { $a->[ 0 ] <=> $b->[ 0 ] }
map [ /<tag id=(\d+)>/, $_ ],
grep />([^<]+)</ && !$seen{ $1 }++,
<$in>;

__END__

John
--
use Perl;
program
fulfillment

0 new messages