Why isnt this optimized away?

Elizabeth Mattijsen

unread,

Mar 15, 2004, 2:27:58 PM3/15/04

to perl5-...@perl.org

Is there a reason why the initialization of a lexical array (or hash,
for that matter) with an empty list, isn't optimised away?

$ perl -MO=Concise -e 'my @a';
4 <@> leave[1 ref] vKP/REFC ->(end)
1 <0> enter ->2
2 <;> nextstate(main 1 -e:1) v ->3
3 <0> padav[@a:1,2] vM/LVINTRO ->4

$ perl -MO=Concise -e 'my @a = ()';
7 <@> leave[1 ref] vKP/REFC ->(end)
1 <0> enter ->2
2 <;> nextstate(main 1 -e:1) v ->3
6 <2> aassign[t2] vKS ->7
- <1> ex-list lK ->4
3 <0> pushmark s ->4
- <0> stub lP ->-
- <1> ex-list lK ->6
4 <0> pushmark s ->5
5 <0> padav[@a:1,2] lRM*/LVINTRO ->6

I see the idiom:

my @foo = ();
my %foo = ();
my $foo = undef;

quite often in code made by not too experienced Perl programmers.
Seems like an easy fix with a good potential for an overal increase
in performance of Perl. A small benchmark shows that the difference
is quite significant (relatively speaking, of course).

cmpthese( -2,{
new => sub {my @a = ()},
old => sub {my @a},
} );
__END__
Rate new old
new 648871/s -- -55%
old 1447413/s 123% --

Or maybe a warning should be issued if warnings are enabled?

Liz

Elizabeth Mattijsen

unread,

Mar 16, 2004, 5:24:59 PM3/16/04

to Rafael Garcia-Suarez, perl5-...@perl.org

At 22:51 +0100 3/16/04, Rafael Garcia-Suarez wrote:

>Elizabeth Mattijsen wrote:
>>
>> Is there a reason why the initialization of a lexical array (or hash,
>> for that matter) with an empty list, isn't optimised away?
>

>No.
>
>(but you knew that, didn't you ;)

Only 99.95% sure... wanted to be 100% sure... ;-)

Thanks for the confirmation...

Liz

Rafael Garcia-Suarez

unread,

Mar 16, 2004, 4:51:43 PM3/16/04

to perl5-...@perl.org

Elizabeth Mattijsen wrote:
>
> Is there a reason why the initialization of a lexical array (or hash,
> for that matter) with an empty list, isn't optimised away?

No.

Rafael Garcia-Suarez

unread,

Mar 16, 2004, 6:22:01 PM3/16/04

to Elizabeth Mattijsen, perl5-...@perl.org

Elizabeth Mattijsen wrote:
>
> At 22:51 +0100 3/16/04, Rafael Garcia-Suarez wrote:
> >Elizabeth Mattijsen wrote:
> >>
> >> Is there a reason why the initialization of a lexical array (or hash,
> >> for that matter) with an empty list, isn't optimised away?
> >
> >No.
> >
> >(but you knew that, didn't you ;)
>
> Only 99.95% sure... wanted to be 100% sure... ;-)

Here's a patch for turning C<my $x = undef> into C<my $x>.

The array/hash case is similar, except that it needs more optree
inspection, a new ck_aassign function, and I'm tired tonight...

Index: op.c
===================================================================
--- op.c (revision 3350)
+++ op.c (working copy)
@@ -5593,6 +5593,19 @@ Perl_ck_sassign(pTHX_ OP *o)
return kid;
}
}
+ /* optimise C<my $x = undef> to C<my $x> */
+ if (kid->op_type == OP_UNDEF) {
+ OP *kkid = kid->op_sibling;
+ if (kkid && kkid->op_type == OP_PADSV
+ && (kkid->op_private & OPpLVAL_INTRO))
+ {
+ cLISTOPo->op_first = NULL;
+ kid->op_sibling = NULL;
+ op_free(o);
+ op_free(kid);
+ return kkid;
+ }
+ }
return o;
}

End.

Yitzchak Scott-Thoennes

unread,

Mar 16, 2004, 8:10:06 PM3/16/04

to Rafael Garcia-Suarez, Elizabeth Mattijsen, perl5-...@perl.org

Is this a good idea? It's just extra bloat for a case that more
experienced perl programmers wouldn't do. Especially the idea
of adding a new ck_ routine just for this seems unworthwhile.

(Though I'd always assumed that ck_null wasn't actually called and
now I see it is. Perhaps there's a good place for an optimization.)

Robert Spier

unread,

Mar 16, 2004, 10:23:08 PM3/16/04

to Yitzchak Scott-Thoennes, Rafael Garcia-Suarez, Elizabeth Mattijsen, perl5-...@perl.org

>
> Is this a good idea? It's just extra bloat for a case that more
> experienced perl programmers wouldn't do. Especially the idea
> of adding a new ck_ routine just for this seems unworthwhile.

Yes. It is a good idea.

/pkg/perl-5.8.0/lib/5.8.0$ egrep -r 'my @[a-z]+ *= *' * | wc -l
64

/pkg/perl-5.8.0/lib/site_perl$ egrep -r 'my @[a-z]+ *= *' * | wc -l
161

Personally, I _like_

my @foo = ();

because it makes it crystal clear, no doubt about it, that @foo is the
empty list.

-R

h...@crypt.org

unread,

Mar 16, 2004, 11:19:06 PM3/16/04

to Yitzchak Scott-Thoennes, perl5-...@perl.org

Yitzchak Scott-Thoennes <stho...@efn.org> wrote:

:On Wed, Mar 17, 2004 at 12:22:01AM +0100, Rafael Garcia-Suarez <rgarci...@free.fr> wrote:
:> > >Elizabeth Mattijsen wrote:

:> > >>
:> > >> Is there a reason why the initialization of a lexical array (or hash,

:> > >> for that matter) with an empty list, isn't optimised away?

[...]
:> Here's a patch for turning C<my $x = undef> into C<my $x>.
:
:Is this a good idea? It's just extra bloat for a case that more

:experienced perl programmers wouldn't do. Especially the idea
:of adding a new ck_ routine just for this seems unworthwhile.

The bloat is minimal, the compile-time cost negligible, and the runtime
cost negative or zero. I'd say it is worth it.

If it lets perl programmers write clearer code without worrying about
nano-optimisations that's a further benefit.

Hugo

H.Merijn Brand

unread,

Mar 17, 2004, 3:14:27 AM3/17/04

to Yitzchak Scott-Thoennes, Perl 5 Porters

On Wed 17 Mar 2004 02:10, Yitzchak Scott-Thoennes <stho...@efn.org> wrote:
> Is this a good idea? It's just extra bloat for a case that more
> experienced perl programmers wouldn't do.

I agree with Robert and Hugo, that this /is/ a good idea

/I/ might know the impact, but I still use it all the time to make my code
clearer for the less experienced co-maintainers

> Especially the idea of adding a new ck_ routine just for this
> seems unworthwhile.
>
> (Though I'd always assumed that ck_null wasn't actually called and
> now I see it is. Perhaps there's a good place for an optimization.)

--
H.Merijn Brand Amsterdam Perl Mongers (http://amsterdam.pm.org/)
using perl-5.6.1, 5.8.0, & 5.9.x, and 806 on HP-UX 10.20 & 11.00, 11i,
AIX 4.3, SuSE 8.2, and Win2k. http://www.cmve.net/~merijn/
http://archives.develooper.com/daily...@perl.org/ per...@perl.org
send smoke reports to: smokers...@perl.org, QA: http://qa.perl.org

John Peacock

unread,

Mar 17, 2004, 6:35:08 AM3/17/04

to H.Merijn Brand, Yitzchak Scott-Thoennes, Perl 5 Porters

H.Merijn Brand wrote:
> On Wed 17 Mar 2004 02:10, Yitzchak Scott-Thoennes <stho...@efn.org> wrote:
>
>>Is this a good idea? It's just extra bloat for a case that more
>>experienced perl programmers wouldn't do.
>
>
> I agree with Robert and Hugo, that this /is/ a good idea

And I'd like to join the chorus saying it is a Good Thing(TM).

>
> /I/ might know the impact, but I still use it all the time to make my code
> clearer for the less experienced co-maintainers

I prefer to initialize all variables, as a matter of course, even though it is
not required in most circumstances. I think it is an important lesson to teach
new programmers, because it immediately causes them to think about their
assumptions. It's great that Perl will create a new variable on the fly without
being declared first; it's confusing to new programmers.

John

--
John Peacock
Director of Information Research and Technology
Rowman & Littlefield Publishing Group
4720 Boston Way
Lanham, MD 20706
301-459-3366 x.5010
fax 301-429-5747

Yitzchak Scott-Thoennes

unread,

Mar 17, 2004, 6:55:50 AM3/17/04

to John Peacock, H.Merijn Brand, Perl 5 Porters

On Wed, Mar 17, 2004 at 06:35:08AM -0500, John Peacock <jpea...@rowman.com> wrote:
> H.Merijn Brand wrote:
> >On Wed 17 Mar 2004 02:10, Yitzchak Scott-Thoennes <stho...@efn.org> wrote:
> >
> >>Is this a good idea? It's just extra bloat for a case that more
> >>experienced perl programmers wouldn't do.
> >
> >
> >I agree with Robert and Hugo, that this /is/ a good idea
>
> And I'd like to join the chorus saying it is a Good Thing(TM).

The chorus has overwhelmed me. I retract.

> >/I/ might know the impact, but I still use it all the time to make my code
> >clearer for the less experienced co-maintainers
>
> I prefer to initialize all variables, as a matter of course, even though it
> is not required in most circumstances. I think it is an important lesson
> to teach new programmers, because it immediately causes them to think about
> their assumptions. It's great that Perl will create a new variable on the
> fly without being declared first; it's confusing to new programmers.

This isn't about a case where there's no declaration, its about making

my $foo = undef;
my @bar = ();

be as efficient as

my $foo;
my @bar;

Rafael Garcia-Suarez

unread,

Mar 17, 2004, 7:37:16 AM3/17/04

to perl5-...@perl.org

Nicholas Clark wrote in perl.perl5.porters :
>
> Mmmm. This reminds me of a debate that IIRC propagated onto the ARM Linux
> list from the main kernel list. The crux of the matter was that some people
> were advocating explicitly initialising all static variables to 0
> (which is what C would make them by default), whereas others were advocating
> removing all the zero initialisations.
>
> The for camp were arguing that it was good to be explicit because it was
> documenting that the initial value was important.
> The against camp were arguing that it was good to remove them because gcc
> (at that time, may still be) would add explicit code to set them to zero, and
> wasn't smart enough to optimise them out.
>
> The real solution would have been to fix gcc.

And the real world solution would have been to define another C macro,
DECLARE_STATIC_INIT_TO_ZERO(type,var), and use that everywhere :-)

Nicholas Clark

unread,

Mar 17, 2004, 7:04:46 AM3/17/04

to John Peacock, H.Merijn Brand, Yitzchak Scott-Thoennes, Perl 5 Porters

On Wed, Mar 17, 2004 at 06:35:08AM -0500, John Peacock wrote:
> H.Merijn Brand wrote:
> >On Wed 17 Mar 2004 02:10, Yitzchak Scott-Thoennes <stho...@efn.org> wrote:
> >
> >>Is this a good idea? It's just extra bloat for a case that more
> >>experienced perl programmers wouldn't do.
> >
> >
> >I agree with Robert and Hugo, that this /is/ a good idea
>
> And I'd like to join the chorus saying it is a Good Thing(TM).
>
> >
> >/I/ might know the impact, but I still use it all the time to make my code
> >clearer for the less experienced co-maintainers
>
> I prefer to initialize all variables, as a matter of course, even though it
> is not required in most circumstances. I think it is an important lesson
> to teach new programmers, because it immediately causes them to think about
> their assumptions. It's great that Perl will create a new variable on the
> fly without being declared first; it's confusing to new programmers.

Mmmm. This reminds me of a debate that IIRC propagated onto the ARM Linux

list from the main kernel list. The crux of the matter was that some people
were advocating explicitly initialising all static variables to 0
(which is what C would make them by default), whereas others were advocating
removing all the zero initialisations.

The for camp were arguing that it was good to be explicit because it was
documenting that the initial value was important.
The against camp were arguing that it was good to remove them because gcc
(at that time, may still be) would add explicit code to set them to zero, and
wasn't smart enough to optimise them out.

The real solution would have been to fix gcc.

Nicholas Clark

H.Merijn Brand

unread,

Mar 17, 2004, 7:00:44 AM3/17/04

to Yitzchak Scott-Thoennes, Perl 5 Porters

Which I know, and which is what I was talking about too.

FWIW initializing new variables is a good thing for documentation purposes and
for code maintainance. Even with undef or (). Maintainers immediately SEE what
the coder expects

Nicholas Clark

unread,

Mar 17, 2004, 7:41:06 AM3/17/04

to Rafael Garcia-Suarez, perl5-...@perl.org

On Wed, Mar 17, 2004 at 12:37:16PM -0000, Rafael Garcia-Suarez wrote:
> Nicholas Clark wrote in perl.perl5.porters :

> > The real solution would have been to fix gcc.

>
> And the real world solution would have been to define another C macro,
> DECLARE_STATIC_INIT_TO_ZERO(type,var), and use that everywhere :-)

Since when were the perl5 headers "real world"? :-)

Nicholas Clark

John Peacock

unread,

Mar 17, 2004, 7:54:44 AM3/17/04

to Yitzchak Scott-Thoennes, H.Merijn Brand, Perl 5 Porters

Yitzchak Scott-Thoennes wrote:
> This isn't about a case where there's no declaration, its about making
>
> my $foo = undef;
> my @bar = ();
>
> be as efficient as
>
> my $foo;
> my @bar;

I know, that's why I want the optimization, too! I want an explicit declaration
_and_ efficiency (and a carmel if you don't mind ;~).

Rafael Garcia-Suarez

unread,

Mar 17, 2004, 1:42:56 PM3/17/04

to perl5-...@perl.org, Elizabeth Mattijsen

Rafael Garcia-Suarez wrote:
>
> Elizabeth Mattijsen wrote:
> >
> > At 22:51 +0100 3/16/04, Rafael Garcia-Suarez wrote:
> > >Elizabeth Mattijsen wrote:
> > >>
> > >> Is there a reason why the initialization of a lexical array (or hash,
> > >> for that matter) with an empty list, isn't optimised away?
> > >
> > >No.
> > >
> > >(but you knew that, didn't you ;)
> >
> > Only 99.95% sure... wanted to be 100% sure... ;-)
>
> Here's a patch for turning C<my $x = undef> into C<my $x>.
>
> The array/hash case is similar, except that it needs more optree
> inspection, a new ck_aassign function, and I'm tired tonight...

Now applied as #22520, along with the array/hash similar optimization.
In fact I *didn't* need to introduce ck_aassign, because in this case
I avoid completely constructing an OP_AASSIGN. (So that's probably
a compilation speedup too :)