interpreters & semantics

Marlene Miller

May 27, 2004, 2:27:43 AM

Q1.
"Programs called interpreters provide the most direct, executable expression
of program semantics." EoPL Preface xi.

Does this mean, if you *read the source code* for the interpreter, then you
can know the program semantics? So if I wanted to know the semantics for
Scheme, I could read the R5RS, but even better, I could read the source code
for MzScheme?

Q2.
"Most of these essentials relate to the semantics, or meaning, of program
elements. Such meanings reflect how program elements are interpreted as the
program executes." EoPL Preface xi.

The second sentence seems backwards to me. Does semantics reflect how
elements are interpreted, or does how the elements are interpreted reflect
the semantics?

Lauri Alanko

May 27, 2004, 3:53:11 AM

In article <zTftc.75275$hH.13...@bgtnsc04-news.ops.worldnet.att.net>,

Marlene Miller <marlen...@worldnet.att.net> wrote:
> Q1.
> "Programs called interpreters provide the most direct, executable expression
> of program semantics." EoPL Preface xi.
>
> Does this mean, if you *read the source code* for the interpreter, then you
> can know the program semantics?

Yes, _if_ you already know the semantics of the language that the
interpreter has been written in. If it is a metacircular interpreter
(i.e. one written in the language it is interpreting), then you have
to resort to other means for learning the language's semantics, e.g.
reading an informal English description, or some formal operational or
denotational rules, or maybe just playing around with an
implementation and learning by experiment how the language works.
This latter kind of inductive learning, of course, is always prone to
mistakes.

> Q2.
> "Most of these essentials relate to the semantics, or meaning, of program
> elements. Such meanings reflect how program elements are interpreted as the
> program executes." EoPL Preface xi.
>
> The second sentence seems backwards to me. Does semantics reflect how
> elements are interpreted, or does how the elements are interpreted reflect
> the semantics?

That depends. If the language was properly designed, then it probably
has an a priori semantics, and an implementation's job is simply to
execute programs according to the semantics. On the other hand,
sometimes a language has been implemented without exact
specifications, and then a semantics can be designed afterwards to
formally express what the interpreter does. This is often a pretty
thankless task, though, since undesigned languages tend to be full of
horribly complicated kludges.

Lauri Alanko
l...@iki.fi

Matthias Felleisen

May 27, 2004, 8:29:10 AM

Marlene Miller wrote:

> Q2.
> "Most of these essentials relate to the semantics, or meaning, of program
> elements. Such meanings reflect how program elements are interpreted as the
> program executes." EoPL Preface xi.
>
> The second sentence seems backwards to me. Does semantics reflect how
> elements are interpreted, or does how the elements are interpreted reflect
> the semantics?

One can, in principle, define a mathematical function that assigns a
mathematical object to each phrase of a language. That's the meaning of the phrase.

Say your language looks like this:

exp = 0 | 1 | -1 | ... | (+ exp exp)

Then you could decide to say that the meaning of an exp is an Integer (Z), that
the phrase "0" denotes the number "0", etc, and that the phrase "(+ exp1 exp2)"
denotes the addition of the meaning of exp1 and the meaning of exp2.

Now write this as an interpreter and use the semantics to guide you.

-- Matthias
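
As an illustration of that last step, here is a minimal sketch of such an interpreter in C, with one case per grammar rule. The names Exp, ExpKind and meaning are illustrative choices, not from the post, and the tree representation is an assumption (the post says nothing about how phrases are represented).

```c
#include <assert.h>
#include <stddef.h>

/* Abstract syntax for  exp = integer literal | (+ exp exp).
   Hypothetical representation chosen for this sketch. */
typedef enum { EXP_NUM, EXP_ADD } ExpKind;

typedef struct Exp {
    ExpKind kind;
    long value;               /* used when kind == EXP_NUM */
    const struct Exp *left;   /* used when kind == EXP_ADD */
    const struct Exp *right;  /* used when kind == EXP_ADD */
} Exp;

/* The meaning of an expression is an integer: one case per grammar rule. */
long meaning(const Exp *e)
{
    assert(e != NULL);
    switch (e->kind) {
    case EXP_NUM:
        return e->value;                             /* "0" denotes 0, etc. */
    case EXP_ADD:
        return meaning(e->left) + meaning(e->right); /* sum of the meanings */
    }
    return 0;  /* not reached for well-formed trees */
}
```

For the phrase (+ 1 (+ -1 2)), built as a tree of Exp nodes, meaning returns 2. The shape of the function mirrors the shape of the grammar, which is one sense in which the semantics guides the interpreter.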

Erann Gat

May 27, 2004, 11:52:13 AM

In article <zTftc.75275$hH.13...@bgtnsc04-news.ops.worldnet.att.net>,
"Marlene Miller" <marlen...@worldnet.att.net> wrote:

> Q1.
> "Programs called interpreters provide the most direct, executable expression
> of program semantics." EoPL Preface xi.
>
> Does this mean, if you *read the source code* for the interpreter, then you
> can know the program semantics?

It depends on what you mean by "know".

If you read a piece of sheet music, do you know what the music sounds
like? The situation is exactly analogous. If you have the right
training and the music is simple enough then you can "hear" the music by
reading the notes. If you know the semantics of the language the
interpreter is written in and the interpreter is simple enough then you
can "know" the semantics of the language by reading the interpreter.
But if you don't or it isn't then you can't.

E.

Bill Richter

Jun 5, 2004, 12:31:27 AM

Matthias Felleisen responded to Marlene Miller:

> > "Most of these essentials relate to the semantics, or meaning, of
> > program elements. Such meanings reflect how program elements are
> > interpreted as the program executes." EoPL Preface xi.

> One can, in principle, define a mathematical function that assigns a
> mathematical object to each phrase of a language. That's the meaning
> of the phrase.

Matthias, that's Schmidt's definition of Denotational Semantics (DS):

David Schmidt's book "DS: a methodology for language development"
states on p 3:

The DS method maps a program directly to its meaning, called its
denotation. The denotation is usually a mathematical value, such
as a number or a function. No interpreters are used, a valuation
function maps programs directly to its meaning.

I think that's a great definition of DS, and it includes your LC_v
Standard Reduction function eval_s, defined on p. 51 in
<http://www.ccs.neu.edu/course/com3357/mono.ps>
your paper with Matthew Flatt
Programming Languages and Lambda Calculi.

Unfortunately it seems to me that DS has been redefined to mean
specific ways of constructing the valuation function via domains
(i.e. Scott models of LC) and structural induction, the techniques
Schmidt describes in his book.

Marlene Miller

Jun 5, 2004, 12:39:32 PM

Thank you Bill for clarifying this issue.

If that's what the *preface* is about, clearly the rest of EOPL is way out
of scope for me.

"Bill Richter" <ric...@math.northwestern.edu> wrote in message
news:57189ce0.04060...@posting.google.com...

Bill Richter

Jun 5, 2004, 11:41:49 PM

"Marlene Miller" <marlen...@worldnet.att.net> responded to me:

> Thank you Bill for clarifying this issue.
>
> If that's what the *preface* is about, clearly the rest of EOPL is
> way out of scope for me.

Marlene, I didn't give you good advice. I took something Matthias
Felleisen (MFe) wrote & ran off in a different direction. I apologize.

Really, I'm just excited that MFe is now posting regularly to c.l.s.
I think he has a lot of good leadership to offer. We had a disastrous
thread last year about the R5RS DS, and there were 2 problems:

1) I made a bunch of dumb errors initially, and by the time I'd been
straightened out (mostly by MB and WC), folks were fed up, and

2) there was nobody on c.l.s. with MFe's expertise.

And now I'll try to answer your original question:

> Does semantics reflect how elements are interpreted, or does how
> the elements are interpreted reflect the semantics?

I say the first, for EoPL. The interpreter defines the semantics.
The meaning of a program (or a phrase) is what the interpreter does
to it. Sometimes this is expressed mathematically. Let me explain.

Here's the text from the EoPL introduction past your quote:
"Programs called interpreters provide the most direct executable
expression of the program semantics. They process a program by
directly analyzing an abstract representation of the program text.
We therefore choose interpreters as our primary vehicle for
expressing the semantics of programming language elements."

So it looks to me like EoPL is a book about interpreters, such as
DrScheme or Gambit. So much of the book should be accessible.

I think this is what's called Operational Semantics (OpS), and it's
got a different flavor than the DS I scared you off with. In fact,
the very previous paragraph in Schmidt's DS book is:

The OpS method uses an interpreter to define a language. The
meaning of a program in the language is the evaluation history that
the interpreter produces when it interprets the program. The
evaluation history is a sequence of internal interpreter
configurations.

So semantics = meaning which is expressed mathematically, but in OpS,
it's expressed by the interpreter. The "meaning" of your program is
what the interpreter is gonna do to it. That's OK, right?

Or as MFe said, the interpreter "assigns a mathematical object to each
phrase of a language". In the OpS/interpreter world, that means take
a phrase in your language, and evaluate it in a given environment, with
various things stored in memory (and a continuation if you like), and
ask what the interpreter is going to print, or how the memory
locations will change, etc. That's not too mathematical, right?
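
To make "evaluation history" concrete, here is a toy sketch in C for the exp language from earlier in the thread: each call to step performs one reduction, and run prints every intermediate configuration. All names (Exp, step, show, run) are made up for this sketch, and a real interpreter carries more in a configuration (environment, store, continuation) than a bare expression tree.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

typedef enum { EXP_NUM, EXP_ADD } ExpKind;

typedef struct Exp {
    ExpKind kind;
    long value;          /* valid when kind == EXP_NUM */
    struct Exp *left;    /* valid when kind == EXP_ADD */
    struct Exp *right;   /* valid when kind == EXP_ADD */
} Exp;

/* Perform one reduction: rewrite the leftmost-innermost (+ n m) whose
   operands are both numbers into their sum, in place. Returns 1 if a
   step was taken, 0 if e is already a number. */
int step(Exp *e)
{
    assert(e != NULL);
    if (e->kind == EXP_NUM)
        return 0;
    if (step(e->left))
        return 1;
    if (step(e->right))
        return 1;
    /* Both operands are numbers, so this node is the redex. */
    e->value = e->left->value + e->right->value;
    e->kind = EXP_NUM;
    return 1;
}

/* Print one configuration in the thread's prefix notation. */
void show(const Exp *e, FILE *out)
{
    if (e->kind == EXP_NUM) {
        fprintf(out, "%ld", e->value);
    } else {
        fputs("(+ ", out);
        show(e->left, out);
        fputc(' ', out);
        show(e->right, out);
        fputc(')', out);
    }
}

/* Run to completion, printing each configuration on its own line.
   Returns the number of reduction steps taken. */
int run(Exp *e, FILE *out)
{
    int steps = 0;
    show(e, out);
    fputc('\n', out);
    while (step(e)) {
        steps++;
        show(e, out);
        fputc('\n', out);
    }
    return steps;
}
```

Run on (+ (+ 1 2) (+ 3 4)), this prints the four configurations (+ (+ 1 2) (+ 3 4)), (+ 3 (+ 3 4)), (+ 3 7) and 10, and reports 3 steps. In the OpS reading, that printed sequence, and not only the final 10, is the meaning of the program.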

BTW I enjoyed MFe's ambiguous, "That's the meaning of the phrase."

However, on the next page, EoPL says:
"Frequently our interpreters [provide] a very high level view that expresses
language semantics in a very concise way, not far from that of formal
mathematical semantics."
Maybe you'd have trouble there, and that might even involve some DS.
(I haven't read EoPL myself, but the Preface is on their web page.)

But EoPL is hard, and old, and maybe there are better books for you.
What do you want to learn? (I won't be able to help, but others can.)

Marlene Miller

Jun 6, 2004, 2:14:48 AM

Thank you Bill. I hope MFe sees your question to him.

(I don't mind math. That was my subject in graduate school.)

I am beginning to suspect Essentials of Programming Languages is Essential
for people who research and design languages, not for people who use
languages to build things.

Marlene Miller

Jun 6, 2004, 2:14:49 AM

Bill Richter

Jun 6, 2004, 10:57:20 PM

"Marlene Miller" <marlen...@worldnet.att.net> responded to me:

> (I don't mind math. That was my subject in graduate school.)

Then let me be more specific, Marlene, and correct an error of my last
post. Schmidt defines DS to be the study of semantic functions

curly-E : Expression-of-our-Language ----> Some set

where the 2 sets and the function are mathematically defined. You
know what this means! And you know that curly-E can't be a computable
function by the Halting problem (i.e. Goedel Incompleteness).
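
For the toy exp language given earlier in the thread, a semantic function of this shape might be written as follows. The name \mathcal{E} follows the "curly-E" above; the double-bracket notation and the choice of \mathbb{Z} as the target set are conventional assumptions, not taken from this post.

```latex
\begin{align*}
\mathcal{E} &: \mathit{Exp} \to \mathbb{Z} \\
\mathcal{E}[\![\, n \,]\!] &= n \qquad \text{for each integer literal } n \\
\mathcal{E}[\![\, (+\ e_1\ e_2) \,]\!] &= \mathcal{E}[\![\, e_1 \,]\!] + \mathcal{E}[\![\, e_2 \,]\!]
\end{align*}
```

This particular function is computable, since every expression of the toy language evaluates in finitely many steps; the non-computability remark above concerns languages rich enough to express non-terminating programs.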

So any kind of semantics becomes DS once you fully mathematize it. In
that sense, the OpS (as studied I think in EoPL) is also DS.

So OpS must mean the subset of DS where you concentrate on
interpreters, and not their full mathematization. A good example of
this is the R5RS DS, which as MB pointed out is usually understood by
schemers as functional programming. It's a good exercise (worked out
on Anton vS's web page) to code up the R5RS DS as a functional Scheme
program. That would be called OpS. Now to mathematize even
functional programming requires hard Lambda calculus. But it's still
the "shallow end" of the DS pool, as the real mathematization of the
R5RS DS uses domains: Scott models of the Lambda calculus, which
involves non-Hausdorff Cantor sets. That's much much harder Math!
And it's this "deep end" of the DS pool which is normally called DS.

> I am beginning to suspect Essentials of Programming Languages is
> Essential for people who research and design languages, not for
> people who use languages to build things.

Yeah, maybe! What do you want to build? I suspect EoPL is a book
about how to build interpreters. From the end of Abelson's preface:

"You'll come to see yourself as a designer of languages, rather than
only a user of languages, as a person who chooses the rules by which
languages are put together, rather than only a follower of rules that
other people have chosen."

Marlene Miller

Jun 7, 2004, 1:29:05 AM

Thank you for the intro to DS and OpS. Thank you for your time - the time it
takes to write such an explanation.

I hope you get an answer soon to your original question.

Joe Marshall

Jun 7, 2004, 12:25:09 PM

"Marlene Miller" <marlen...@worldnet.att.net> writes:

> I am beginning to suspect Essentials of Programming Languages is Essential
> for people who research and design languages, not for people who use
> languages to build things.

In my experience, people who research and design languages are *much*
better at using them to build things. A huge part of solving a
complex problem is coming up with the language necessary to describe
the problem. If you know a bit about designing languages, you'll be
much further ahead than someone that just knows `how to program'.

As a math major you surely are aware of the power of a good formal
notation. (And as a computer scientist I am *very* frustrated by the
amazingly poor formal notations invented by mathematicians who don't
program!) You don't need to steep yourself in ``non-Hausdorff Cantor
sets'' (what?) and Scott's domains to understand programming
semantics. Programming usually involves creating a `mini-language' or
extending an existing language with problem-specific features.
Knowing a bit about semantics will keep you from doing amazingly dumb
things like separating program clauses into `statements' and
`expressions' that cannot be interchanged, making the syntax depend on
runtime values, forgetting about tail recursion, etc.

~jrm

p.s. Scott was trying to adapt set theory to programming, but since
programs can operate on programs, you need the set P : P->P which is,
unfortunately, empty. However, if you add some conditions to P (like
restricting it to continuous partial functions) you can make it
satisfy the conditions necessary to apply Tarski's fixed-point theorem
and construct a non-empty set suitable for modeling programming.

Some people are very uncomfortable with mathematics that has not been
proven correct. Some people are uncomfortable unless they work
through the proof themselves. I'm pretty sure that Scott and Tarski
got it right, so I haven't bothered to memorize the details.

Jens Axel Søgaard

Jun 7, 2004, 1:01:25 PM

Joe Marshall wrote:

> Knowing a bit about semantics will keep you from doing amazingly dumb
> things like separating program clauses into `statements' and
> `expressions' ...

Gold. Can I use it in a signature?

And Marlene: just reading your (very good) questions in
this group makes me absolutely sure that you will like
"Essentials of Programming Languages". Get it from the library
if you want to skim it first.

(The reason you get such good answers is that you ask
the right questions.)

--
Jens Axel Søgaard

Shriram Krishnamurthi

Jun 7, 2004, 9:47:32 PM

ric...@math.northwestern.edu (Bill Richter) writes:

> I say the first, for EoPL. The interpreter defines the semantics.
> The meaning of a program (or a phrase) is what the interpreter does
> to it. Sometimes this is expressed mathematically. Let me explain.
>
> [...]
>
> So it looks to me like EoPL is a book about interpreters [...]

Bill, have you considered actually reading the books you're talking
about before you hold forth about them? Or do you only read the
prefaces and forewords of books?

(I understand that reading books has the deleterious effect of
actually slowing down your prodigious newsgroup and mail output.)

Shriram

Bill Richter

Jun 7, 2004, 11:35:20 PM

Joe Marshall <j...@ccs.neu.edu> responded to Marlene Miller:

> Programming usually involves creating a `mini-language' or extending
> an existing language with problem-specific features. Knowing a bit
> about semantics will keep you from doing amazingly dumb things like
> separating program clauses into `statements' and `expressions' that
> cannot be interchanged, making the syntax depend on runtime values,
> forgetting about tail recursion, etc.

Cool, Joe. Sounds heavy on interpreters. Where's a place to read?

Perhaps a better book for Marlene would be Shriram Krishnamurthi's
Programming Languages: Application and Interpretation
listed right after EoPL:
<http://www.schemers.org/Documents/#all-texts>

> p.s. Scott was trying to adapt set theory to programming, but since
> programs can operate on programs, you need the set P : P->P which
> is, unfortunately, empty. However, if you add some conditions to P
> (like restricting it to continuous partial functions)

You mean P = (P->P), right? The function space of continuous maps
from P to itself is bijective with P. And so you need a topology on P
to talk about continuous maps from P to P.

> You don't need to steep yourself in ``non-Hausdorff Cantor sets''
> (what?) and Scott's domains to understand programming semantics.

I think so. As you say, you just need to know that P = (P->P), you
don't have to know why. But (responding to your (what?)) here's why
Scott's domain P = P(infty) is a ``non-Hausdorff Cantor set'':

P(omega) is the power set of the natural numbers N, i.e. the set
{ S subset N } = (N -> boolean).
See, a function f : N ---> boolean defines a subset
S = f^{-1}(true) subset N

The usual topology on (N -> boolean) has "basic" open sets the
collection of subsets of (N -> boolean) of the form

O(A,B) = { S subset N : A subset S, B disjoint from S }

where A and B are finite subsets of N.

With this topology, (N -> boolean) is the well known Cantor set, which
you should know something about from fractals: the "dust" Julia sets.

Scott's P(omega) is (N -> boolean) with a different and non-Hausdorff
topology. The basic open sets of P(omega) are

O(A) = { S subset N : A subset S }

for finite subsets A subset N. We did this a year ago on c.l.s. If
you don't know what Hausdorff means, let's just say this: the real
line R is Hausdorff, and non-Hausdorff is really strange. I think
Scott was a genius to come up with his P(omega).
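
A short check of the non-Hausdorff claim, using only the definition of O(A) given above (the derivation is an added illustration, not part of the original post). For finite subsets A and A' of N:

```latex
\begin{align*}
O(A) \cap O(A')
  &= \{\, S \subseteq \mathbb{N} : A \subseteq S \text{ and } A' \subseteq S \,\} \\
  &= O(A \cup A') \ni A \cup A' .
\end{align*}
```

So any two basic open sets have a point in common, and therefore so do any two non-empty open sets. No two distinct points can then be separated by disjoint open neighbourhoods, which is exactly what Hausdorff would require.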

Marlene Miller

Jun 8, 2004, 2:54:08 AM

Thank you Joe for your (always) helpful insights and explanations.

> A huge part of solving a
> complex problem is coming up with the language necessary to describe
> the problem.

> Programming usually involves creating a `mini-language' or
> extending an existing language with problem-specific features.

I've never thought about programming in this way. Abelson talks about
metalinguistic abstraction, which puzzled me. Is this the same as or related
to what you are saying?:

----
"We control complexity by establishing new languages for describing design,
each of which emphasizes particular aspects of the design and deemphasizes
others." SICP

"a cluster of languages, where the pieces could be flexibly combined"
preface to EOPL

"To appreciate this point [the evaluator is just another program] is to
change our images of ourselves as programmers. We come to see ourselves as
designers of languages, rather than only users of languages designed by
others." SICP

"Perhaps the whole distinction between programming and programming language
is a misleading idea, and future programmers will see themselves not as
writing programs in particular, but as creating new languages for each new
application." preface to EOPL
----

I thought of a way to explain my concern. A plumber doesn't need to read
Plato to be good at plumbing. He might enjoy reading Plato. So he reads
Plato in his spare time on Sundays. If Plato is good for the plumber, why
don't we see all plumbers reading Plato?

Marlene Miller

Jun 8, 2004, 3:12:22 AM

"Jens Axel Søgaard" <use...@soegaard.net> wrote>

> And Marlene: just reading your (very good) questions in
> this group makes me absolutely sure that you will like
> "Essentials of Programming Languages". Get it from the library
> if you want to skim it first.
>
> (The reason you get such good answers is that you ask
> the right questions.)
>
> --
> Jens Axel Søgaard

Thank you very much, Jens Axel, for your encouragement and advice.

I own the book. It looks fun to read. I like the idea of learning by
implementing ideas in code. It's so tedious having to read English prose and
map ambiguous words and metaphors to technical ideas. (I like the R5RS.) I
am trying to decide whether I am "allowed" to move this book from the Fun
queue to the queue with Tanenbaum's Computer Networks, Lea's Concurrent
Programming in Java, etc.

Joe Marshall

Jun 8, 2004, 1:51:57 PM

ric...@math.northwestern.edu (Bill Richter) writes:

> Joe Marshall <j...@ccs.neu.edu> responded to Marlene Miller:
>
>> Programming usually involves creating a `mini-language' or extending
>> an existing language with problem-specific features. Knowing a bit
>> about semantics will keep you from doing amazingly dumb things like
>> separating program clauses into `statements' and `expressions' that
>> cannot be interchanged, making the syntax depend on runtime values,
>> forgetting about tail recursion, etc.
>
> Cool, Joe. Sounds heavy on interpreters. Where's a place to read?

I haven't seen a `how to design a language' book. The Art of the
Interpreter <http://c2.com/cgi/wiki?TheArtOfTheInterpreter> is a good
paper to look at. There is a lot of good stuff at
<http://library.readscheme.org/> As for my `dumb things' list, those
are things I've encountered in various languages.

There are many languages that draw a distinction between `statements'
and `expressions'. C is one. There are statements like while, do,
if, return, for, break, etc. and there are expressions like
(a + b)*c. Semantically, there are two kinds of continuation, one for
statements and one for expressions. A statement continuation discards
its argument, but an expression continuation does not. So you cannot
use a statement where you have an expression because a statement will
not supply a return value to the continuation.

Unfortunately, most of the control-flow constructs in C are
statements. There are constructs in C for expression sequences and
conditional expressions, but they have a completely different syntax
from statement sequences and conditional statements. The C statement

    if (x > 3) {
        x += 2;
        y -= 3;
        return TRUE;
    }
    else {
        x -= 2;
        y += 7;
        return FALSE;
    }

is essentially equivalent to the C expression
    (x > 3)
        ? (x += 2,
           y -= 3,
           TRUE)
        : (x -= 2,
           y += 7,
           FALSE)

but you clearly cannot just substitute one for the other! Yet you may
wish to do exactly that sort of thing if you are refactoring code.
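
A compilable version of the two forms, as a sketch: the function names adjust_stmt and adjust_expr, the pointer parameters, and the use of <stdbool.h> in place of the TRUE/FALSE macros are all assumptions made here so the fragments can stand alone.

```c
#include <assert.h>
#include <stdbool.h>

/* Statement form: control flow via if/else; the result leaves via return. */
bool adjust_stmt(int *x, int *y)
{
    assert(x && y);
    if (*x > 3) {
        *x += 2;
        *y -= 3;
        return true;
    } else {
        *x -= 2;
        *y += 7;
        return false;
    }
}

/* Expression form: ?: and the comma operator yield the value in place. */
bool adjust_expr(int *x, int *y)
{
    assert(x && y);
    return (*x > 3)
        ? (*x += 2, *y -= 3, true)
        : (*x -= 2, *y += 7, false);
}
```

Both functions leave x and y in the same state and return the same value for the same inputs, yet moving from one form to the other means rewriting every sequencing and branching construct by hand, which is the refactoring cost described above.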

If your syntax depends on runtime values, then you cannot effectively
compile your program. In REBOL, for example, the expression
[foo bar baz] could mean (begin (foo) (bar) (baz)) or it could mean
(foo (bar (baz))) or it could mean (begin (foo bar) (baz)) or any of
about 20 other parses. What it means depends on the current values of
foo, bar, and baz, and that can change over time.
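
A toy model of that effect, in C. This is not REBOL's evaluator; it only assumes that each word has some arity at the moment the block is read, and the names parse_expr and count_top_level are hypothetical. The same three words group differently as the arity table changes.

```c
#include <assert.h>
#include <stddef.h>

/* Consume one expression starting at word `pos`: the word itself plus as
   many argument expressions as its current arity demands. Returns the
   index just past the expression, or n + 1 if the words run out. */
static size_t parse_expr(const int *arity, size_t n, size_t pos)
{
    if (pos >= n)
        return n + 1;                 /* ran out of words: malformed */
    size_t next = pos + 1;
    for (int i = 0; i < arity[pos]; i++) {
        next = parse_expr(arity, n, next);
        if (next > n)
            return n + 1;
    }
    return next;
}

/* Number of top-level expressions in a block of n words, or -1 if the
   block cannot be parsed under the given arities. */
int count_top_level(const int *arity, size_t n)
{
    assert(arity != NULL || n == 0);
    int count = 0;
    size_t pos = 0;
    while (pos < n) {
        pos = parse_expr(arity, n, pos);
        if (pos > n)
            return -1;
        count++;
    }
    return count;
}
```

With arities {0, 0, 0} for foo, bar, baz the block has three top-level expressions, as in (begin (foo) (bar) (baz)); with {1, 1, 0} it has one, as in (foo (bar (baz))); with {1, 0, 0} it has two, as in (begin (foo bar) (baz)). Since the table is runtime data, the grouping cannot be fixed ahead of time.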

Without tail recursion, you must pepper your language with looping
constructs. Users cannot create their own, so you must supply a wide
variety. But loops can only express primitive recursion, so there
will be some things that are extraordinarily painful to compute this
way. Users will also not be able to resort to
continuation-passing-style if they need complex control flow.
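
To show what a loop written as a tail call looks like, here is a small sketch in C with hypothetical function names. Note the caveat: the C standard does not require a compiler to reuse the stack frame for a tail call, whereas R5RS requires Scheme implementations to be properly tail-recursive, so the depth bound below is an artifact of using C for the illustration.

```c
#include <assert.h>

/* The loop written with a built-in looping construct. */
unsigned long sum_to_loop(unsigned long n)
{
    unsigned long acc = 0;
    while (n > 0) {
        acc += n;
        n--;
    }
    return acc;
}

/* The same loop written as a function whose last action is to call
   itself: the loop state travels in the arguments. */
unsigned long sum_to_tail(unsigned long n, unsigned long acc)
{
    if (n == 0)
        return acc;
    return sum_to_tail(n - 1, acc + n);   /* call in tail position */
}

/* Entry point. C gives no guarantee that the tail call above runs in
   constant stack space, so the depth is bounded here; a language with
   guaranteed proper tail calls would not need this check. */
unsigned long sum_to(unsigned long n)
{
    assert(n <= 10000UL);
    return sum_to_tail(n, 0);
}
```

Both versions compute the same sums, for example 5050 for n = 100. Where tail calls are guaranteed, the second style lets users define their own iteration patterns as ordinary functions instead of waiting for the language to supply a new loop form.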

> Perhaps a better book for Marlene would be Shriram Krishnamurthi's
> Programming Languages: Application and Interpretation
> listed right after EoPL:
> <http://www.schemers.org/Documents/#all-texts>
>
>> p.s. Scott was trying to adapt set theory to programming, but since
>> programs can operate on programs, you need the set P : P->P which
>> is, unfortunately, empty. However, if you add some conditions to P
>> (like restricting it to continuous partial functions)
>
> You mean P = (P->P), right? The function space of continuous maps
> from P to itself is bijective with P. And so you need a topology on P
> to talk about continuous maps from P to P.

Yes.

>> You don't need to steep yourself in ``non-Hausdorff Cantor sets''
>> (what?) and Scott's domains to understand programming semantics.
>
> I think so. As you say, you just need to know that P = (P->P), you
> don't have to know why.

Right. Scott proved that the domain is well-founded and his word is
good enough for me.

Joe Marshall

Jun 8, 2004, 2:11:47 PM

"Marlene Miller" <marlen...@worldnet.att.net> writes:

> Thank you Joe for your (always) helpful insights and explanations.
>
>> A huge part of solving a
>> complex problem is coming up with the language necessary to describe
>> the problem.
>
>> Programming usually involves creating a `mini-language' or
>> extending an existing language with problem-specific features.
>
> I've never thought about programming in this way. Abelson talks about
> metalinguistic abstraction, which puzzled me. Is this the same as or related
> to what you are saying?:

It's related. You can modify an existing language to accept a few new
constructs or you can go whole hog and write a brand new language
tailor-made for your problem. One advantage to Lisp and Scheme is
that you can do something in between these two extremes. Any new
language is going to need variables, definitions, primitive data,
etc. You'll probably want strings and numbers for interacting with
the rest of the world. You'll need to manage memory. Start with
Scheme or Lisp and you get all that for free.

> I thought of a way to explain my concern. A plumber doesn't need to read
> Plato to be good at plumbing. He might enjoy reading Plato. So he reads
> Plato in his spare time on Sundays. If Plato is good for the plumber, why
> don't we see all plumbers reading Plato?

If your only aspiration were to be a plumber, then Plato may not have
a direct impact on your life. But I'm not sure this analogy is quite
the right one.

You don't *have* to understand programming semantics to be a good
programmer, but people that do understand semantics tend to be far
better programmers than people who do not. Furthermore, given the
absolutely horrible state of software in the world, it seems that the
bulk of people writing software are not good programmers.

So instead of Plato, what about fluid dynamics? A plumber doesn't
need to understand fluid dynamics to solder pipes together, but
if plumbing were like software, we'd be ankle deep in water. A
plumber with an understanding of fluid dynamics would get more work
and be able to relax on Sundays in a dry house.

Bill Richter

Jun 8, 2004, 10:17:50 PM

Shriram Krishnamurthi <s...@cs.brown.edu> responds to me:

> > So it looks to me like EoPL is a book about interpreters [...]
>
> Bill, have you considered actually reading the books you're talking
> about before you hold forth about them? Or do you only read the
> prefaces and forewords of books?
>
> (I understand that reading books has the deleterious effect of
> actually slowing down your prodigious newsgroup and mail output.)

:D Shriram, I don't think EoPL is on-line, unlike your book, and your
expertise here greatly exceeds mine. So please bail me out:

Would it seem that Marlene ought to read your book on schemers.org
instead of EoPL? Could you compare the goals of the 2 books?

My expertise here just has to do with MFe's response on 2004-05-27:

> Does semantics reflect how elements are interpreted, or does how
> the elements are interpreted reflect the semantics?

One can, in principle, define a mathematical function that assigns
a mathematical object to each phrase of a language. That's the
meaning of the phrase.

That's DS. The point is that any semantics becomes DS if you fully
mathematize it. I've had a lot of fun thinking about LC and R5RS DS.

But as to how to write interpreters, why learning about interpreters
or semantics makes one a better programmer: I'm a rank beginner.

Shriram Krishnamurthi

Jun 9, 2004, 9:05:21 AM

ric...@math.northwestern.edu (Bill Richter) writes:

> Would it seem that Marlene ought to read your book on schemers.org
> instead of EoPL? Could you compare the goals of the 2 books?

I like EoPL; I loved the first edition (which I bet you could get real
cheap used) even more. I cut my teeth on it, and it's what convinced
me to take up programming languages. So I couldn't possibly tell
someone to read my book instead of EoPL. Nor do I want to flog my
book here.

Given what Marlene has told us of her background, EoPL may indeed be a
better book than mine for her.

My book grew out of a great frustration from teaching from EoPL.
There are several things I don't like about it as a professor. I was
once asked by a correspondent (Ehud Lamm, I think) about the pedagogic
philosophy behind the text and for whom it was written. I wrote:

My long-term vision for teaching programming languages is to integrate
the "two cultures" that have evolved in its pedagogy. I was raised in
the interpreter (EoPL) culture, which meant I looked with some disdain
at the "survey of languages" courses. After a while I realized that
otherwise intelligent people used the survey approach, so I spent some
time trying to understand what they got out of it.

I still think that not doing interpreters (broadly construed) is a
mistake, and students who go through the experience of doing it come
out with a much richer perspective. But what I have realized is that
students who don't do the survey also lose something valuable.

Since a course needs one dominant philosophy, I decided to make the
interpreters dominant but use the survey to inform the interpreters.
So students program with a new set of features first (survey), then
try to distill those principles into an actual interpreter. This has
the following benefits:

- by seeing the feature in the context of a real language, they can
build something interesting with it first, so they understand that
it isn't an entirely theoretical construct, and will actually *care*
to build an interpreter for it (in my experience, a few students who
are interested in knowledge for its own sake will get excited about
the interpreter in either case, but I want to also capture the
attention of the other 90%)

- they get at least fleeting exposure to multiple languages, which is
an important educational attribute that is fast being crushed in
this era of Java's dominance (and in the process, they come to
understand why Java will not be the last word in languages)

- because they have already coded with the feature, the explanations
and discussions are much more interesting than when all they have
seen is an abstract model

- by first building a mental model for the feature through experience,
they have a much better chance of actually figuring out how the
interpreter is supposed to work

In short, many more humans work by induction than by deduction, so a
pedagogy that supports it is much more likely to succeed than one that
suppresses it. The book currently reflects this design, though the
survey parts are done better (!) in lecture than in the book; that
will change in future versions.

Separate from this vision is a goal. My goal is to not only teach
students new material, but to also change the way they solve problems;
as Marx wrote, "The philosophers have only interpreted the world in
various ways; the point, however, is to change it." I want to show
students where languages come from (the language "nebula"), why we
should regard languages as the ultimate form of abstraction, how to
recognize such an evolving abstraction, and how to turn what they
recognize into a language. The last section of the book, on
domain-specific languages, is a very, very weak step in this
direction. The homeworks I've done in the class have conveyed this
point much better. Over time, I will update the text to reflect what
the homeworks have taught.

The book is currently the sole textbook for the programming languages
course at Brown, where it is taken primarily by juniors (3rd year),
seniors (4th year) and beginning graduate (both MS and PhD) students.
It seems very accessible to smart sophomores (2nd year) too, and
indeed those are some of my most successful students. The book has
been used at some other universities as a primary or secondary text.
The book's material is worth one undergraduate course worth of credit;
for students who want graduate credit, I supplement the material in
the book with some research paper readings.

The book is still very much under development.

One common criticism I have heard of the text is that the writing is
too colloquial; it sounds too much like me standing in front of a room
and lecturing. However, I don't intend to change the voice of the
book (tighten it, of course; change it, no). I've been told this may
make it much harder to publish the book formally. Either way, I
intend to continue offering a full, free copy on the Web.

Comments welcome.

Shriram

Thant Tessman

unread,

Jun 9, 2004, 11:58:15 AM6/9/04

to

Shriram Krishnamurthi wrote:

[...]

> Separate from this vision is a goal. My goal is to not only teach
> students new material, but to also change the way they solve problems;
> as Marx wrote, "The philosophers have only interpreted the world in
> various ways; the point, however, is to change it." [...]

The change Marx had in mind was metaphysical (i.e. delusional). See
"Science, Politics, and Gnosticism," by Eric Voegelin, and "Karl Marx:
Communist as Religious Eschatologist," in the second volume of "Logic of
Action" by Murray Rothbard.

Sorry for the off-topic distraction, but I'm not one to pass up a chance
to dis Marx.

-thant

Shriram Krishnamurthi

unread,

Jun 9, 2004, 10:44:23 PM6/9/04

to

There was more I should have written about the comparison between my
book and EoPL. Warning: here I *will* be dissing EoPL a bit.

I think EoPL does a poor job on some crucial topics:

- type systems
- garbage collection
- domain-specific languages

The material on types is so caught up in mechanics (especially of type
inference) that I think it fails to provide very much insight into
types. There is little or no discussion of soundness, safety, etc. This
is a pity coming from authors who are masters of the topic, but it is
in general consistent with EoPL's "look ma"-ness.

Garbage collection isn't discussed at all in any meaningful way. I
find that students are generally woefully uninformed about garbage
collection, having lots of wrong ideas in their heads. The
programming languages course at most universities is the only chance
to rectify some of these misconceptions (especially since many
colleagues on faculty actively cause the misconceptions). This is one
place where I have found having them implement collectors, very much
in the spirit of EoPL, is actually really helpful; it takes a lot of
the mystery out of GC. I think we also have a responsibility to
discuss some of the systems aspects of GC, especially to provide a
meaningful comparison to manual memory management.

While the entire EoPL philosophy is built around "build your own
language" (as Abelson's preface also points out), EoPL doesn't reflect
on this practice. As such, many students can leave an EoPL course
unsure of what to do with the interpreters, eg, not knowing that
Scheme macros offer a great way to transplant what they've learned to
building their own languages. [In prehistoric times, EoPL was
actually written entirely through macros. I'm glad this practice did
not survive, but it's a pity that the book has swung entirely in the
opposite direction.]

All these points can also be read as positive statements about EoPL,
especially if you're a purist. So these should help you determine
which book is more appropriate for your studies, EoPL or PLAI.
Hopefully you will read both. But if you're going to insist on
reading only one, follow the advice in my Acknowledgments section --
read EoPL, a true classic.

Shriram

Bill Richter

unread,

Jun 10, 2004, 1:49:57 AM6/10/04

to

Shriram Krishnamurthi <s...@cs.brown.edu> wrote an excellent post about
his book (listed on schemers.org) and how it compares to EoPL!

we should regard languages as the ultimate form of abstraction

That's exciting, Shriram! You must mean some higher-level version of
what you write on p 235 of your book: "Scheme's map and filter are
also abstractions". So I read HtDP to try to learn about abstraction.
And I like HtDP's sec 21.5 "Designing Abstractions from Templates":
abstracting your design template for lists leads us to the abstract
function `fold' (actually in sec 22.2). I really see how HtDP
abstraction makes for better programmers (Plato for plumbers), and I
can maybe dimly glimpse how your "ultimate abstraction" helps.

I very much liked your (short) Ch 19 "Semantics". You write:

It would be convenient to have some common language for explaining
interpreters. We already have one: math!

We call this *big step operational semantics*. It's semantics
because it ascribes meaning to programs. [...] It's operational
because we aren't compiling the entire program into a mathematical
object and using fancy math to reduce it to an answer.

Good: semantics is Math. By Schmidt's definition, you're doing DS as
well. You're defining a semantic (or valuation) function

V: Expressions -> (Environments -> Values)

It's something I tried unsuccessfully to post 2 years ago. The fact
that you're not using fancy Math (CPOs & Scott models of LC) doesn't
mean it's not DS, by Schmidt's definition. I have some comments:

1) There's some unstated mathematical induction in your definition of
V. E.g. your rule on p 173 includes

b, E'[i <- a_v] ==> b_v

but of course you'll have to apply the rule recursively to do so, and
maybe it will not halt, and in that case (V exp E) = bottom, and that
shows V is not a computable function... This kind of induction isn't
hard, but it's all you need to define V. You don't need CPOs...

2) I'm not quite sure you define your mathematical set Values. But it
sure looks to me like you don't need Scott models of LC. The subset
Procedure-Values of Values just consists (p 172) of triples
<function-name, function-body text, evaluating environment>

So you don't run into the problem MB explained to me 2 years ago:

> E [the R5RS domain of Values] is "defined" as
>
> E = .... some expressions ultimately involving E ...

you only need Scott models (non-Hausdorff Cantor sets) if you define
Procedure-Values to be a set of functions on Values, while Values
contains Procedure-Values as a subset. You avoid this problem by
giving your (I say SICP-ish) triples definition of Procedure-Values.

3) You only fail to define an R5RS-like semantic function

Expressions -> (Env Store Cont -> Answers)

by not doing continuations (which you discussed at length 2 sections
earlier, so you'll probably do Cont when you expand Ch 19),
and by "conflating Store and Env", to use MB's great phrase. I
haven't read enough of your book to comment on your decision.

4) So I claim that your "big step operational semantics" is also DS,
by Schmidt's definition. It seems that folks only say DS if you're
doing CPOs & Scott models, but that's a cultural distinction.
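
To make points 1) and 2) concrete, here is a minimal sketch of such a valuation function, written in Python rather than Scheme purely for illustration. The tuple encoding of expressions and the name `V` are assumptions of this sketch, not anything taken from PLAI. Procedure values are plain tuples carrying <identifier, body, environment>, and V is defined by recursion on the expression, so a non-halting program simply never returns.

```python
from typing import Callable, Dict

# Hypothetical expression encoding (not PLAI's):
#   ("num", 7)  ("id", "x")  ("add", l, r)  ("fun", "x", body)  ("app", f, a)
Env = Dict[str, object]

def V(expr) -> Callable[[Env], object]:
    """Valuation function of shape Expressions -> (Env -> Values)."""
    tag = expr[0]
    if tag == "num":
        return lambda env: expr[1]
    if tag == "id":
        return lambda env: env[expr[1]]
    if tag == "add":
        return lambda env: V(expr[1])(env) + V(expr[2])(env)
    if tag == "fun":
        # A procedure value is just the triple <identifier, body, environment>,
        # tagged "closure" so it can be told apart from other values.
        return lambda env: ("closure", expr[1], expr[2], env)
    if tag == "app":
        def apply(env):
            _, ident, body, saved_env = V(expr[1])(env)
            arg = V(expr[2])(env)
            # Evaluate the body in the saved environment extended with ident.
            return V(body)({**saved_env, ident: arg})
        return apply
    raise ValueError(f"unknown expression: {expr!r}")

add_three = ("fun", "x", ("add", ("id", "x"), ("num", 3)))
twice = ("fun", "f", ("fun", "x",
         ("app", ("id", "f"), ("app", ("id", "f"), ("id", "x")))))
```

With these definitions, `V(("app", add_three, ("num", 7)))({})` evaluates to 10, and passing `add_three` to `twice` works as well, so procedures remain first-class values even though they are represented as triples rather than as functions on Values.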

Michael Sperber

unread,

Jun 10, 2004, 3:56:38 AM6/10/04

to

>>>>> "Shriram" == Shriram Krishnamurthi <s...@cs.brown.edu> writes:

Shriram> I think EoPL does a poor job on some crucial topics:

Shriram> - type systems
Shriram> - garbage collection
Shriram> - domain-specific languages

Concurrency?

--
Cheers =8-} Mike
Friede, Völkerverständigung und überhaupt blabla

Shriram Krishnamurthi

unread,

Jun 10, 2004, 7:55:38 AM6/10/04

to

Michael Sperber <spe...@informatik.uni-tuebingen.de> writes:

> Shriram> I think EoPL does a poor job on some crucial topics:
>
> [...]
>
> Concurrency?

Well, if we start going down this path, there's a lot more that one
could add. I was trying to limit myself to things that I do discuss
in some detail in my course/book.

I don't cover concurrency due to a peculiarity of Brown's curriculum,
which covers it very well in several other courses (or at least well
enough that I don't feel like expending precious time on the subject).

As an example of something that I *do* cover, I think it's important
to show that types are only the tip of the proof iceberg, and there
are interesting techniques for proving properties of programs; eg, I
discuss model checking.

Shriram

Joe Marshall

unread,

Jun 10, 2004, 11:52:11 AM6/10/04

to

ric...@math.northwestern.edu (Bill Richter) writes:

> 2) I'm not quite sure you define your mathematical set Values. But it
> sure looks to me like you don't need Scott models of LC. The subset
> Procedure-Values of Values just consists (p 172) of triples
> <function-name, function-body text, evaluating environment>
>
> So you don't run into the problem MB explained to me 2 years ago:
>
>> E [the R5RS domain of Values] is "defined" as
>>
>> E = .... some expressions ultimately involving E ...
>
> you only need Scott models (non-Hausdorff Cantor sets) if you define
> Procedure-Values to be a set of functions on Values, while Values
> contains Procedure-Values as a subset. You avoid this problem by
> giving your (I say SICP-ish) triples definition of Procedure-Values.

Right, but this removes first-class functions from your language.
Rather boring.

> 3) You only fail to define an R5RS-like semantic function
>
> Expressions -> (Env Store Cont -> Answers)
>
> by not doing continuations (which you discussed at length 2 sections
> earlier, so you'll probably do Cont when you expand Ch 19),
> and by "conflating Store and Env", to use MB's great phrase. I
> haven't read enough of your book to comment on your decision.

Remove continuations and you remove control flow. That's even less
interesting than a language with no functions.

Bill Richter

unread,

Jun 11, 2004, 1:07:21 AM6/11/04

to

Joe Marshall <j...@ccs.neu.edu> responded to me:

>
> > it sure looks to me like you don't need Scott models of LC. The
> > subset Procedure-Values of Values just consists (p 172) of triples
> > <function-name, function-body text, evaluating environment>

Joe, that's identifier, not function-name, so (lambda (x) body)
evaluates in environment E to the Procedure-Value triple <x, body, E>.
That's basically SICP, which you agree has first-class functions.

> > So you don't run into the problem MB explained to me 2 years ago:
> >
> >> E [the R5RS domain of Values] is "defined" as
> >>
> >> E = .... some expressions ultimately involving E ...
> >
> > So you only need Scott models if you define Procedure-Values (a
> > subset of Values) to be a set of functions on Values.

> Right, but this removes first-class functions from your language.
> Rather boring.

What? Shriram didn't change his language (approx. Scheme) at all, but
only the definition of the set Values in his semantic function

Expressions -> (Env -> Values)

These aren't my ideas. It's Shriram's book PLAI
<http://www.cs.brown.edu/~sk/Publications/Books/ProgLangs/PDF/all.pdf>
listed on schemers.org right after EoPL.

> > 3) You only fail to define an R5RS-like semantic function
> >
> > Expressions -> (Env Store Cont -> Answers)
> >

> > by not doing continuations, and by "conflating Store and Env", to
> > use MB's great phrase.

> Remove continuations and you remove control flow. That's even less
> interesting than a language with no functions.

Sure, but Part IX Semantics is only 3 pages long, so maybe it's under
construction. Part VI Continuations is 50+ pages long. So I think we
can conclude that Shriram knows how to add continuations to his
semantics without changing everything (adding in Scott models e.g.).

Marlene Miller

unread,

Jun 11, 2004, 2:52:53 AM6/11/04

to

Thank you very much Shriram. Thank you for taking the time to explain. Thank
you for sharing your perspective and insights and lots of interesting ideas.
Thank you Bill for presenting my question to Shriram.

I think I would learn much from reading both books.

Marlene, the plumber

Joe Marshall

unread,

Jun 11, 2004, 11:09:52 AM6/11/04

to

ric...@math.northwestern.edu (Bill Richter) writes:

> Joe Marshall <j...@ccs.neu.edu> responded to me:
>>
>> > it sure looks to me like you don't need Scott models of LC. The
>> > subset Procedure-Values of Values just consists (p 172) of triples
>> > <function-name, function-body text, evaluating environment>
>
> Joe, that's identifier, not function-name, so (lambda (x) body)
> evaluates in environment E to the Procedure-Value triple <x, body, E>.
> That's basically SICP, which you agree has first-class functions.

Die, horse, die!

Shriram is using *operational* semantics rather than *denotational*
semantics. The key difference is this: Denotational semantics
defines a function that maps programs to what they mean; operational
semantics defines a set of rules that *maintain* the meaning.

As an example, look at function application:

f, E => <i, b, E0>    a, E => av    b, E0[i <- av] => bv
-----------------------------------------------------------
                     {f a}, E => bv

This says that *provided that* f, E is the triple <i, b, E0> *and*
a, E reduces to av, *and* b, E0[i <- av] reduces to bv, *then* (f a),
E reduces to bv.
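
For anyone who prefers typeset notation, the same rule can be written as the LaTeX sketch below. The subscripted names $a_v$, $b_v$ and $E_0$ are only renderings of av, bv and E0 from the plain-text version; nothing else is added.

```latex
\[
\frac{f,\,E \Rightarrow \langle i,\, b,\, E_0 \rangle
      \qquad a,\,E \Rightarrow a_v
      \qquad b,\,E_0[i \leftarrow a_v] \Rightarrow b_v}
     {\{f\ a\},\,E \Rightarrow b_v}
\]
```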

There are a couple of unanswered questions:

1. Does there exist an f, E, i, b, E0, a, av, and bv that can
satisfy the antecedents? (In particular, it would be nice to
know if there exist non-empty sets of values for av and bv.)

2. Do the sets of values av and bv correspond to the kinds of
values we want to manipulate?

3. Does this reduction rule correspond to our intuitive notion of
function application? In particular, if I want to model the
`add three' function, does there exist an f (a program) that
will do that?

There are three approaches to answering these questions:

1. Assume a-priori that Shriram's semantics for reducing an
application correspond to the common notion of function
application.

2. *Prove* (or disprove) that Shriram's semantics do indeed mean
function application.

3. Treat the operational semantics as the rules for a meaningless
game.

If you don't wish to simply assume that the semantics work, then you
either need to prove they do or take the position that `function
application' as defined in the language is simply a complex syntactic
operation that may or may not have anything to do with mathematical
functions.

>> > So you only need Scott models if you define Procedure-Values (a
>> > subset of Values) to be a set of functions on Values.
>
>> Right, but this removes first-class functions from your language.
>> Rather boring.

Let me amend this: Without Scott domains I can only consider
Procedure-Values to be curiously formed tuples that have a complex
reduction rule. This isn't interesting.

> What? Shriram didn't change his language (approx. Scheme) at all, but
> only the definition of the set Values in his semantic function
>
> Expressions -> (Env -> Values)
>
> These aren't my ideas. It's Shriram's book PLAI
> <http://www.cs.brown.edu/~sk/Publications/Books/ProgLangs/PDF/all.pdf>
> listed on schemers.org right after EoPL.

Yes, but Shriram is discussing ``big-step operational semantics'' not
denotational semantics. Shriram's semantics will tell me that
((lambda (x) (+ x 3)) 7) => 10, but it has nothing to say about the
relationship between the Scheme expression `(lambda (x) (+ x 3))' and
the mathematical function `add three'.
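
A small sketch may help show the distinction being drawn here. It is in Python rather than Scheme, and the names `closure_of` and `denote` and the tuple encoding are hypothetical. Python functions stand in for mathematical functions, which sidesteps the domain-theoretic issues (self-application in particular) that a real denotational semantics has to face.

```python
def closure_of(expr, env):
    """Operational reading: a lambda evaluates to an inert triple."""
    _, ident, body = expr
    return (ident, body, env)

def denote(expr, env):
    """Denotational-style reading: a lambda denotes a function on values."""
    tag = expr[0]
    if tag == "num":
        return expr[1]
    if tag == "id":
        return env[expr[1]]
    if tag == "add":
        return denote(expr[1], env) + denote(expr[2], env)
    if tag == "fun":
        _, ident, body = expr
        # The meaning of the lambda is itself a (Python) function.
        return lambda v: denote(body, {**env, ident: v})
    if tag == "app":
        return denote(expr[1], env)(denote(expr[2], env))
    raise ValueError(f"unknown expression: {expr!r}")

# Two syntactically different ways of writing `add three`.
add_three_expr = ("fun", "x", ("add", ("id", "x"), ("num", 3)))
plus_three_expr = ("fun", "y", ("add", ("num", 3), ("id", "y")))
```

The two triples produced by `closure_of` are different objects of syntax, and nothing in the triples themselves says they are related. The two results of `denote` can both be compared against `lambda n: n + 3` on sample inputs. Agreement on finitely many inputs is of course only evidence, not a proof that they denote the same function.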

Marlene Miller

unread,

Jun 11, 2004, 3:58:24 PM6/11/04

to

Shriram, I live in Seattle, so I was curious to see what the University of
Washington uses for their programming languages course. They are using your
book. http://www.cs.washington.edu/education/courses/341/04sp/

Shriram Krishnamurthi

unread,

Jun 11, 2004, 9:56:35 PM6/11/04

to

Joe Marshall <j...@ccs.neu.edu> writes:

> Yes, but Shriram is discussing ``big-step operational semantics'' not
> denotational semantics. Shriram's semantics will tell me that
> ((lambda (x) (+ x 3)) 7) => 10, but it has nothing to say about the
> relationship between the Scheme expression `(lambda (x) (+ x 3))' and
> the mathematical function `add three'.

Quite right, though note that it will also tell you

((lambda (x) (+ x 3)) 7) => ^10^ [circumflex-10]

You could even possibly prove that, for all numbers ^N^,

((lambda (x) (+ x 3)) N) => ^N+3^

where N => ^N^.

Joe enumerates three approaches for Making Sense of my semantics:

> 1. Assume a-priori that Shriram's semantics for reducing an
> application correspond to the common notion of function
> application.
>
> 2. *Prove* (or disprove) that Shriram's semantics do indeed mean
> function application.
>
> 3. Treat the operational semantics as the rules for a meaningless
> game.

Since I do periodically allude in that section to the relationship
between the semantics and an interpreter, we can rule out both #3 (the
semantics doesn't live cut off from the universe) and #1 (though I'm
taking great liberties, I'm at least trying to offer a
justification). We are therefore left with two refinements of #2:

1. Prove that the semantics captures function application.

2. Prove that the semantics faithfully reflects the interpreter, and
prove that the interpreter captures function application.

Shriram

Bill Richter

unread,

Jun 13, 2004, 1:14:57 AM6/13/04

to

Joe Marshall <j...@ccs.neu.edu> responded to me:

> Shriram is using *operational* semantics rather than *denotational*
> semantics.

Let's be precise, Joe. Shriram calls it big-step OpS (p 173, PLAI).
But it's also DS by Schmidt's definition (p 3 of his DS book):

The DS method maps a program directly to its meaning, called its
denotation. The denotation is usually a mathematical value, such
as a number or a function. No interpreters are used, a valuation
function maps programs directly to its meaning.

So any mathematically defined function

V: Programs [or Expressions] ---> Some-Set

is a DS valuation function, by Schmidt's definition. So practically
any semantics is DS, once you mathematize it. (More DS talk below.)

> The key difference is this: Denotational semantics defines a
> function that maps programs to what they mean;

But Shriram did just that. He mapped programs (expressions even)
mathematically to a set, by a function (I'll give names here)

V: Expressions -> (Env -> Values)

In Schmidt's definition of DS, "meaning" just means a mathematical
value. If you don't think his subset Procedure-Values is interesting,
that's fine, but there's no mathematical point to argue about.

Let's keep going. You seemed to agree that Shriram is a good enough
semanticist that he could've expanded his semantics to a math function

W: Expressions -> (Env Store Cont -> Answers)

Now Shriram's W won't be the same as the R5RS DS semantic function
curly-E, because the target sets are different. Since Shriram has a
different definition of Values (using Procedure-Value triples to
bypass Scott models), his Store & Cont will differ from R5RS DS.

But Shriram's set Answers will be the same as R5RS DS, so we can ask a
more meaningful question. R5RS DS uses an initial

<rho_0, sigma_0, kappa_0> in Env x Store x Cont

Shriram's semantic function W will also use such initial values, and
let's call them by the same names, even though they live in different
sets. Then for any program P, I claim that

W[[ P ]] <rho_0, sigma_0, kappa_0>
=
curly-E[[ P ]] <rho_0, sigma_0, kappa_0> in Answers

By P here I mean the expression you get by wrapping P in a let form
with "undefined"s as R5RS DS does in sec 7.2. I think this would be
easy to prove. And I'd say that was a satisfactory "proof" of your

> 2. *Prove* (or disprove) that Shriram's semantics do indeed mean
> function application.

which is to say, that's what I think your "mean" should mean.

> [...] Shriram's semantics will tell me that
> ((lambda (x) (+ x 3)) 7) => 10, but it has nothing to say about the
> relationship between the Scheme expression `(lambda (x) (+ x 3))' and
> the mathematical function `add three'.

Sure, but that's a matter for proving theorems about observational
equivalence (which MFe has posted about). It's not a deficiency of
Shriram's semantic functions V & W. I don't see that Shriram's
semantics is at any disadvantage here with R5RS DS.

I think folks have posted that R5RS DS isn't particularly good for
observational equivalence, and I remember WC posting that R5RS DS is
actually too strict: there are expressions that we want to say are the
same, but curly-E distinguishes them because of some inconsequential
differences in what gets stored in what locations.

Maybe we should say that DS is a subject that humans work in, so DS
means whatever the DS humans are doing. I work in the subject of
"Homotopy Theory", but the meaning of "Homotopy Theory" has changed
quite a bit since I started 25 years ago. However, let's note that
Barendregt is much in agreement with Schmidt's definition above.
Barendregt's book is called
The Lambda Calculus: its Syntax and Semantics,
and he says LC Semantics include the term model (more or less standard
reduction) as well as the much harder Scott models.

Shriram's book comes close to a precise definition. In his short
Semantics section, he writes

It would be convenient to have some common language for explaining
interpreters. We already have one: math!

[...] It's semantics because it ascribes meanings to programs.

To me, that sounds like what Schmidt calls DS, Shriram calls S! How
about this: any mathematically defined function

V: Programs [or Expressions] ---> Some-Set

is an S valuation function. [Now don't tell me the previous sentence
here was "We call this a big-step OpS." It makes no difference.]

I'll abide by Shriram's definition of S, if others will accept it
and also reject Schmidt's definition. What about a definition of DS?

In an interesting private discussion, I think Shriram said he didn't
want to call something DS unless it made really serious and integrated
use of Scott models. Maybe that means if you only use Scott models to
solve the P = (P -> P) problem, it's not really DS. That's fine, as
long as we admit this is culture, and not Math. We can argue about
whether somebody's "really" using Scott models, just like we can argue
about whether some proof is "deep", or "trivial". The truth of our
theorems must be above such political discussions, or it's not Math.
Math isn't a `common language' if we won't use it precisely.

Joe Marshall

unread,

Jun 13, 2004, 12:45:19 PM6/13/04

to

ric...@math.northwestern.edu (Bill Richter) writes:

> Joe Marshall <j...@ccs.neu.edu> responded to me:
>
>> Shriram is using *operational* semantics rather than *denotational*
>> semantics.
>
> Let's be precise, Joe. Shriram calls it big-step OpS (p 173, PLAI).
> But it's also DS by Schmidt's definition (p 3 of his DS book):
>
> The DS method maps a program directly to its meaning, called its
> denotation. The denotation is usually a mathematical value, such
> as a number or a function. No interpreters are used, a valuation
> function maps programs directly to its meaning.

I don't see how this applies. Shriram's equations do *not* map
programs directly to their meaning. Shriram is assuming that a
mapping exists and shows how to reduce Scheme expressions while
preserving the mapping. In fact, if we restrict our Scheme to
lambda-expressions only, we can dispense with the right hand side.
We'd still have an operational semantics that reduced expressions, but
no denotation attached to it.

> So any mathematically defined function
>
> V: Programs [or Expressions] ---> Some-Set
>
> is a DS valuation function, by Schmidt's definition.

The set is supposed to be the meaning of the program. The unix `wc'
command (word count) maps program text to non-negative integers, but
it is not generally considered a DS valuation function.
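
As a quick illustration of why `wc` fails as a meaning function, consider the sketch below (Python; `word_count` and `value_of` are hypothetical helpers for a toy prefix-arithmetic language, not real tools). Two programs get the same word count while evaluating to different results, so the count cannot be what the programs mean.

```python
def word_count(program_text: str) -> int:
    """Count whitespace-separated words, roughly what `wc -w` reports."""
    return len(program_text.split())

def value_of(program_text: str) -> int:
    """Evaluate a flat prefix form such as "(+ 1 2)" (toy language only)."""
    op, *args = program_text.strip("()").split()
    numbers = [int(a) for a in args]
    if op == "+":
        return sum(numbers)
    if op == "*":
        result = 1
        for n in numbers:
            result *= n
        return result
    raise ValueError(f"unsupported operator: {op}")

p1 = "(+ 1 2)"
p2 = "(* 5 7)"
```

Here `word_count(p1)` and `word_count(p2)` are both 3, while `value_of(p1)` is 3 and `value_of(p2)` is 35.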

> So practically any semantics is DS, once you mathematize it. (More
> DS talk below.)

Nonsense. Denotational semantics involves finding a function that
maps programs directly to their meaning. There can be other
relationships between programs and meanings. Consider a `boolean
semantics' which maps (program * set) -> boolean (true if the
program's meaning is contained within the set, false otherwise). It's
a weird sort of semantics (and not entirely useless), but not
denotational.

>> The key difference is this: Denotational semantics defines a
>> function that maps programs to what they mean;
>
> But Shriram did just that.

Take a look at pg. 171. There are constructs like this:

l => ^lv^    r => ^rv^
-----------------------------
   {+ l r} => ^lv + rv^

These are called `judgements'. They are not terms in curly-E and
Shriram makes no claim that they are.

> He mapped programs (expressions even) mathematically to a set, by a
> function (I'll give names here)
>
> V: Expressions -> (Env -> Values)

Some of the rules in the operational semantics have no antecedents.
For example,
n,E => ^n^

These are taken as execution axioms.

> In Schmidt's definition of DS, "meaning" just means a mathematical
> value. If you don't think his subset Procedure-Values is interesting,
> that's fine, but there's no mathematical point to argue about.
>
> Let's keep going. You seemed to agree that Shriram is a good enough
> semanticist that he could've expanded his semantics to a math function
>
> W: Expressions -> (Env Store Cont -> Answers)

The reason Shriram didn't expand that is because he isn't trying to do
denotational semantics. On page 173 Shriram says explicitly:

``It's *operational* because ... we aren't compiling the entire
program into a mathematical object and using fancy math to reduce it
to an answer.''

> Now Shriram's W won't be the same as the R5RS DS semantic function
> curly-E, because the target sets are different.

Yes. This is why you don't try to expand the axioms.

> Since Shriram has a different definition of Values (using
> Procedure-Value triples to bypass Scott models), his Store & Cont
> will differ from R5RS DS.
>
> But Shriram's set Answers will be the same as R5RS DS,

That's a bold assertion.

> so we can ask a more meaningful question.
>
> R5RS DS uses an initial
>
> <rho_0, sigma_0, kappa_0> in Env x Store x Cont
>
> Shriram's semantic function W will also use such initial values, and
> let's call them by the same names, even though they live in different
> sets. Then for any program P, I claim that
>
> W[[ P ]] <rho_0, sigma_0, kappa_0>
> =
> curly-E[[ P ]] <rho_0, sigma_0, kappa_0> in Answers
>
> By P here I mean the expression you get by wrapping P in a let form
> with "undefined"s as R5RS DS does in sec 7.2. I think this would be
> easy to prove.

Feel free to provide the proof. Consider in particular the issue of
self-application. Hint:

http://www1.elsevier.com/homepage/sac/opit/24/article.pdf

Meyer and de Vink make heavy use of domains, though, so you'll have to
remove them from the proof.

> And I'd say that was a satisfactory "proof" of your
>
>> 2. *Prove* (or disprove) that Shriram's semantics do indeed mean
>> function application.
>
> which is to say, that's what I think your "mean" should mean.

I see no proof.

>> [...] Shriram's semantics will tell me that
>> ((lambda (x) (+ x 3)) 7) => 10, but it has nothing to say about the
>> relationship between the Scheme expression `(lambda (x) (+ x 3))' and
>> the mathematical function `add three'.
>
> Sure, but that's a matter for proving theorems about observational
> equivalence (which MFe has posted about).

Given that the denotational semantics *do* say that
`(lambda (x) (+ x 3))' means the mathematical function `add three',
this is a central point.

> It's not a deficiency of Shriram's semantic functions V & W. I
> don't see that Shriram's semantics is at any disadvantage here with
> R5RS DS.

I never said that Shriram's semantics were deficient, I said they were
operational rather than denotational. They approach the problem of
semantics in a different manner.

> Shriram's book comes close to a precise definition. In his short
> Semantics section, he writes
>
> It would be convenient to have some common language for explaining
> interpreters. We already have one: math!
> [...] It's semantics because it ascribes meanings to programs.
>
> To me, that sounds like what Schmidt calls DS, Shriram calls S!

Not to me. Semantics ascribes meanings to programs, but there are
many techniques for this. Denotational semantics attempts to define a
function over programs that maps them to meanings. Operational
semantics identifies the meaning of a program with the steps taken to
evaluate it. Both are mathematical.

--
~jrm

Daniel C. Wang

unread,

Jun 13, 2004, 3:25:17 PM6/13/04

to

Big-step Ops only define meanings for terminating programs. i.e. the
evaluation function is a partial function. DS requires the meaning function
be total for all programs.

The small steps operational semantics do not define any meaning for programs
but allows you to relate a program with valid reductions of it.

Big-step and DS are different end of story.
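
The partiality point can be seen in a small sketch (Python, with a hypothetical tuple encoding and a hypothetical `fuel` parameter that bounds the depth of the derivation). A plain big-step evaluator would simply never return on the self-application term below; the budget makes that observable by returning None. None here only means "budget exhausted", not a proof of non-termination, and it is not a bottom element in the domain-theoretic sense.

```python
def evaluate(expr, env, fuel):
    """Big-step evaluator with a depth budget; returns None when it runs out."""
    if fuel <= 0:
        return None
    tag = expr[0]
    if tag == "num":
        return expr[1]
    if tag == "id":
        return env[expr[1]]
    if tag == "fun":
        return ("closure", expr[1], expr[2], env)
    if tag == "app":
        f = evaluate(expr[1], env, fuel - 1)
        a = evaluate(expr[2], env, fuel - 1)
        if f is None or a is None:
            return None
        _, ident, body, saved_env = f
        return evaluate(body, {**saved_env, ident: a}, fuel - 1)
    raise ValueError(f"unknown expression: {expr!r}")

self_apply = ("fun", "x", ("app", ("id", "x"), ("id", "x")))
omega = ("app", self_apply, self_apply)       # ((lambda (x) (x x)) (lambda (x) (x x)))
identity_call = ("app", ("fun", "x", ("id", "x")), ("num", 7))
```

`evaluate(identity_call, {}, 50)` gives 7, while `evaluate(omega, {}, 50)` gives None for this budget, and would for any other finite one.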

Lauri Alanko

unread,

Jun 13, 2004, 3:22:49 PM6/13/04

to

In article <57189ce0.04061...@posting.google.com>,

Bill Richter <ric...@math.northwestern.edu> wrote:
> Let's be precise, Joe. Shriram calls it big-step OpS (p 173, PLAI).
> But it's also DS by Schmidt's definition (p 3 of his DS book):
>
> The DS method maps a program directly to its meaning, called its
> denotation. The denotation is usually a mathematical value, such
> as a number or a function. No interpreters are used, a valuation
> function maps programs directly to its meaning.
>
> So any mathematically defined function
>
> V: Programs [or Expressions] ---> Some-Set
>
> is a DS valuation function, by Schmidt's definition. So practically
> any semantics is DS, once you mathematize it. (More DS talk below.)

Since you feel so keen to interpret Schmidt, you may want to have a
look at <http://citeseer.ist.psu.edu/schmidt95programming.html>.
There Schmidt says, among other things:

Unlike denotational semantics, natural semantics does not
claim that the meaning of a program is necessarily
"mathematical."

Here "natural semantics" means big-step operational semantics, as you
can readily verify by reading the paper.

Lauri Alanko
l...@iki.fi

Daniel C. Wang

unread,

Jun 13, 2004, 3:44:04 PM6/13/04

to

Daniel C. Wang wrote:

> Big-step Ops only define meanings for terminating programs. i.e. the
> evaluation function is a partial function. DS requires the meaning
> function be total for all programs.
>
> The small steps operational semantics do not define any meaning for
> programs but allows you to relate a program with valid reductions of it.
>
> Big-step and DS are different end of story.

Let me clarify one subtle point. For programs or terms for which the
big-step operational semantics is total, one may in a rather perverse way
consider it a form of DS.

i.e. if we consider an OP-sem for simple arithmetic expressions without
non-terminating expressions. The DS for such a language would basically be
the same thing.

However, for Scheme and any non-trivial programming language they are not
the same.

Bill Richter

unread,

Jun 13, 2004, 10:27:37 PM6/13/04

to

Joe, there's a lot in your post, but IMO we need to straighten out
some basic stuff first. So please comment on the next 3 paragraphs:

Now I wrote that Shriram had mathematically defined a function

V: Expressions -> (Env -> Values)

Now maybe he didn't actually do so, and maybe that's the problem.
Shriram's short Semantics section in PLAI certainly doesn't use the
names V, Env, Values, or Procedure-Values. Is that an issue?

But I claim that Shriram could easily have done just that. That is,
whatever he wrote about judgments or antecedents, that Shriram could
easily have defined such a mathematical function V.

And then I claim that such a function V is what Schmidt calls a DS
valuation function, if we wish to say that V captures the semantics.

Now while I'm waiting for your response (or Lauri's), let me make
some quick comments (uh, 150 lines I mean :D) on your post:

Joe Marshall (prunes...@comcast.net) responded to me:

> > So any mathematically defined function
> >
> > V: Programs [or Expressions] ---> Some-Set
> >
> > is a DS valuation function, by Schmidt's definition.
>
> The set is supposed to be the meaning of the program. The unix `wc'
> command (word count) maps program text to non-negative integers, but
> it is not generally considered a DS valuation function.

Right, good point, & I tried to correct this above. We have to assert
that V captures the semantics of our language. `wc' certainly
doesn't! So how do we decide? We're trying to keep it to Math and
away from culture. I know what my criterion is: stick to programs,
which is all that Schmidt's quote refers to. For any program P,

V[[ P ]] in Some-Set

must be a mathematization of the actual interpreter output. Maybe
that even works for expressions. Shriram's (or my) value

V[[ (lambda (x) body) ]](E) in Values

strikes you as "meaningless", but Scheme prints something meaningless,
as you know. I just pasted a lambda expr (sol to Ex 22.2.3 of HtDP,
and I was quite proud of my solution) into the Interactions window of
DrScheme, hit RET, and here's my output:

> ;; fold : Y (X Y -> Y) -> ((listof X) -> Y)
(define (fold base combine)
  (local ((define (abs-fun aloX)
            (if (empty? aloX)
                base
                (combine (first aloX) (abs-fun (rest aloX))))))
    abs-fun))
>

Absolutely nothing!

> > R5RS DS uses an initial
> >
> > <rho_0, sigma_0, kappa_0> in Env x Store x Cont
> >
> > Shriram's semantic function W will also use such initial values, and
> > let's call them by the same names, even though they live in different
> > sets. Then for any program P, I claim that
> >
> > W[[ P ]] <rho_0, sigma_0, kappa_0>
> > =
> > curly-E[[ P ]] <rho_0, sigma_0, kappa_0> in Answers
> >
> > By P here I mean the expression you get by wrapping P in a let form
> > with "undefined"s as R5RS DS does in sec 7.2. I think this would be
> > easy to prove.
>
> Feel free to provide the proof.

I'd like to, Joe, but there's a serious communication problem. I
don't know even what part of this you think is hard. I'd say this is
obvious because R5RS DS is basically playing the SICP game, just with
domains. That is, curly-E[[ (lambda (x) body) ]] is the function that
you get by using the SICP rules for evaluation. I thought everyone agreed
that it made pretty good sense to think of R5RS DS as FP. Anton vS
even worked out WC's exercise, writing a DS->Scheme meta-interpreter.

> > Sure, but that's a matter for proving theorems about observational
> > equivalence (which MFe has posted about).
>
> Given that the denotational semantics *do* say that
> `(lambda (x) (+ x 3))' means the mathematical function `add three',
> this is a central point.

Maybe you're right, Joe, it sounds reasonable. My R5RS DS is pretty
rusty. But I want to stick to programs anyway if we're gonna decide
if some function is a DS valuation. And you didn't seem to
understand my point, so let me try again, with your example in mind:

We can say that Shriram's W is too strict on lambda expressions. Any
two lambda's will be distinguished by W unless they're identical.

We don't want that. We'd like our DS valuation function to identify
lambda's that are observationally equivalent.

But R5RS DS is too strict as well. I think WC posted lambda's that
are observationally equivalent, but separated by curly-E.

To me, that just shows that "the proof of the DS valuation pudding" is
in the programs, not the expressions.

> > Shriram's book comes close to a precise definition. In his short
> > Semantics section, he writes
> >
> > It would be convenient to have some common language for explaining
> > interpreters. We already have one: math!
> > [...] It's semantics because it ascribes meanings to programs.
> >
> > To me, that sounds like what Schmidt calls DS, Shriram calls S!
>
> Not to me. Semantics ascribes meanings to programs, but there are
> many techniques for this. Denotational semantics attempts to define a
> function over programs that maps them to meanings. Operational
> semantics identifies the meaning of a program with the steps taken to
> evaluate it. Both are mathematical.

Then I want an example of Ops that can't easily be turned into DS.
Let's suppose we have a mathematically defined function

omega: Programs ---> Machine-History

which takes a program to its entire evaluation history, a sequence
[state_1, state_2,... ]. Is that OpS?

Now I'll define a DS valuation function

V: Programs ---> Answers

which sends the program P to either

state_n, if omega[[ P ]] = [state_1, state_2,..., state_n ]

bottom, if omega[[ P ]] is an infinite sequence.

Now the only thing I read from your paper is that they defined all DS
valuation functions to be compositional, and this function V is not
compositional. But that's a matter of definition. I claim V is a
non-compositional DS valuation function, and I think I handled
Daniel's objection. Now I'm assuming above that state_n will be the
actual program output (if it exists).

There's certainly a gain in mathematical complexity. I'd bet that
there's a computable function Next-State that computes state_{i+1}
from state_i. But V is definitely not a computable function, by the
Halting problem. We need induction to define V from Next-State.

Shriram's W will be compositional, because that's the SICP way.
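
The construction of V from Next-State can be sketched as follows. Everything here is hypothetical: the toy machine (integer states, with 0 as the only final state) and the names final?, next-state, and run-bounded. Since V itself is not computable, the sketch iterates with a step budget and reports when the budget runs out instead of claiming bottom.

```scheme
;; Hypothetical toy machine: a state is an integer, 0 is the only
;; final state, and each step subtracts one. A negative starting
;; state therefore never reaches a final state.
(define (final? state) (= state 0))
(define (next-state state) (- state 1))

;; Iterate next-state at most `fuel` times. Returns the final state if
;; one is reached, or the symbol out-of-fuel otherwise. Running out of
;; fuel does not prove divergence; only the non-computable V could
;; answer bottom.
(define (run-bounded state fuel)
  (cond ((final? state) state)
        ((= fuel 0) 'out-of-fuel)
        (else (run-bounded (next-state state) (- fuel 1)))))

(run-bounded 5 100)   ; => 0
(run-bounded 5 3)     ; => out-of-fuel
(run-bounded -1 10)   ; => out-of-fuel
```

The second and third calls give the same answer even though only the third run truly diverges, which is where the computable step function and the mathematical V part ways.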

Lauri Alanko

Jun 14, 2004, 7:19:22 AM

In article <57189ce0.04061...@posting.google.com>,
Bill Richter <ric...@math.northwestern.edu> wrote:

> Now while I'm waiting for your response (or Laurie's)

I think that was just pretty illustrative. You are unable to take at
face value what people actually write, and make unwarranted
extrapolations, even though the things you assume have been explicitly
denied, as you might find out if you bothered to actually follow this
newsgroup and read what people say.

Lauri

Lauri Alanko

Jun 14, 2004, 7:40:55 AM

In article <57189ce0.04061...@posting.google.com>,
Bill Richter <ric...@math.northwestern.edu> wrote:

> Let's suppose we have a mathematically defined function
>
> omega: Programs ---> Machine-History
>
> which takes a program to its entire evaluation history, a sequence
> [state_1, state_2,... ]. Is that OpS?
>
> Now I'll define a DS valuation function
>
> V: Programs ---> Answers
>
> which sends the program P to either
>
> state_n, if omega[[ P ]] = [state_1, state_2,..., state_n ]
>
> bottom, if omega[[ P ]] is an infinite sequence.

Right. So the meaning of "(lambda (x) (+ x 3))" is
"(lambda (x) (+ x 3))". Mighty useful piece of information, that one.

(It _is_ useful for most of the practical purposes that CS folks use
calculi for. That's why op sem is so prevalent nowadays. But it does
not give any enlightenment about whether the term represents the
mathematical function that we intuitively associate it with.)

Lauri

Joe Marshall

Jun 14, 2004, 12:20:13 PM

ric...@math.northwestern.edu (Bill Richter) writes:

> Now I wrote that Shriram had mathematically defined a function
> V: Expressions -> (Env -> Values)
> Now maybe he didn't actually do so, and maybe that's the problem.
> Shriram's short Semantics section in PLAI certainly doesn't use the
> names V, Env, Values, or Procedure-Values. Is that an issue?

Yes.

> But I claim that Shriram could easily have done just that. That is,
> whatever he wrote about judgments or antecedents, that Shriram could
> easily have defined such a mathematical function V.

He *could* have, but he didn't.

> And then I claim that such a function V is what Schmidt calls a DS
> valuation function, if we wish to say that V captures the semantics.

Yes.

It is non-trivial to go from a set of judgements to a valuation
function. In order to determine the meaning of a program from the
judgements, you must construct a chain of judgements that
incrementally reduce the program to axioms. Yes, you do this with
induction, but in order for the induction to work you must have a base
case. If you apply a function to itself, however, you construct an
infinite chain of judgements. You can still attribute meaning to the
infinite chain if it approaches a limit, but it isn't easy.

But Shriram isn't doing this.

> Right, good point, & I tried to correct this above. We have to assert
> that V captures the semantics of our language. `wc' certainly
> doesn't! So how do we decide? We're trying to keep it to Math and
> away from culture. I know what my criterion is: stick to programs,
> which is all that Schmidt's quote refers to. For any program P,
>
> V[[ P ]] in Some-Set
>
> must be a mathematization of the actual interpreter output. Maybe
> that even works for expressions. Shriram's (or my) value
>
> V[[ (lambda (x) body) ]](E) in Values
>
> strikes you as "meaningless", but Scheme prints something meaningless,
> as you know.

Lambda expressions are meaningful, but they mean different things in
denotational semantics and operational semantics. In denotational
semantics, lambda expressions mean mathematical functions. In
operational semantics, lambda expressions mean to construct a
closure.
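
One way to picture the denotational reading is an evaluator in which a lambda expression is mapped straight to a procedure of the host language. This is only a sketch under assumptions: the mini-language, the association-list environments, and the name meaning are invented here, and a host Scheme procedure is merely standing in for the mathematical function that a real DS would construct in a Scott domain.

```scheme
;; Hypothetical mini-language: numbers, symbols, (lambda (x) body),
;; (+ e1 e2), and one-argument application (f e).
;; An environment is an association list of (name . value) pairs.
(define (meaning e env)
  (cond ((number? e) e)
        ((symbol? e) (cdr (assq e env)))
        ((eq? (car e) 'lambda)
         (let ((x (car (cadr e)))
               (body (caddr e)))
           ;; The meaning of a lambda is a host-language procedure,
           ;; not a piece of syntax paired with an environment.
           (lambda (v) (meaning body (cons (cons x v) env)))))
        ((eq? (car e) '+)
         (+ (meaning (cadr e) env) (meaning (caddr e) env)))
        (else
         ((meaning (car e) env) (meaning (cadr e) env)))))

(define add-three (meaning '(lambda (x) (+ x 3)) '()))
(add-three 4)   ; => 7
```

Here the result of interpreting the lambda can be applied directly, with no further interpreter step, which is the sense in which it "means" add three.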

>> > R5RS DS uses an initial
>> >
>> > <rho_0, sigma_0, kappa_0> in Env x Store x Cont
>> >
>> > Shriram's semantic function W will also use such initial values, and
>> > let's call them by the same names, even though they live in different
>> > sets. Then for any program P, I claim that
>> >
>> > W[[ P ]] <rho_0, sigma_0, kappa_0>
>> > =
>> > curly-E[[ P ]] <rho_0, sigma_0, kappa_0> in Answers
>> >
>> > By P here I mean the expression you get by wrapping P in a let form
>> > with "undefined"s as R5RS DS does in sec 7.2. I think this would be
>> > easy to prove.
>>
>> Feel free to provide the proof.
>
> I'd like to, Joe, but there's a serious communication problem. I
> don't know even what part of this you think is hard.

The hard part is establishing a one-to-one mapping between the tuples
of Operational Semantics and the domain of function values in
denotational semantics.

> Then I want an example of Ops that can't easily be turned into DS.

On the one hand, since it can be proven that operational semantics is
equivalent to denotational semantics, there can be no example. On the
other hand, you need to invoke domain theory to do it, so it's never
easy.

> Let's suppose we have a mathematically defined function
>
> omega: Programs ---> Machine-History
>
> which takes a program to its entire evaluation history, a sequence
> [state_1, state_2,... ]. Is that OpS?

Not quite. omega is defined by induction over the operational
semantics. OpS takes you from state_n to state_n+1, but no further.

> There's certainly a gain in mathematical complexity. I'd bet that
> there's a computable function Next-State that computes state_{i+1}
> from state_i. But V is definitely not a computable function, by the
> Halting problem. We need induction to define V from Next-State.

Exactly. By restricting ourselves to the state-transition function,
we avoid the computability problems associated with the denotational
approach. But this comes at a cost: we can no longer say that
the state transitions involved with function application and lambda
expressions `sum up' to `real' functions.

Bill Richter

Jun 14, 2004, 10:40:59 PM

Joe Marshall <j...@ccs.neu.edu> responded to me:
>

> > Now I wrote that Shriram had mathematically defined a function
> > V: Expressions -> (Env -> Values)
> > Now maybe he didn't actually do so, and maybe that's the problem.
> > Shriram's short Semantics section in PLAI certainly doesn't use
> > the names V, Env, Values, or Procedure-Values. Is that an issue?
>
> Yes.

Good! Joe, I think we're making progress.

> > But I claim that Shriram could easily have done just that. That
> > is, whatever he wrote about judgments or antecedents, that Shriram
> > could easily have defined such a mathematical function V.
>
> He *could* have, but he didn't.

Great. Now the question we're divided on is how hard it would be.
That's not gonna be real easy to settle, but let's keep working:

> > And then I claim that such a function V is what Schmidt calls a DS
> > valuation function, if we wish to say that V captures the semantics.
>
> Yes.

Great.

> It is non-trivial to go from a set of judgments to a valuation
> function.

That's what I dispute, and I think we're heading toward a resolution.

> [...] Yes, you do this with induction, [...]

Great!

> But Shriram isn't doing this.

Yeah, maybe. But if it's easy enough to pass to V, then I'm not way
off base to have misinterpreted Shriram this way. Now if it's hard,
then I wildly misinterpreted Shriram, and I owe him an apology.

> > Right, good point, & I tried to correct this above. We have to
> > assert that V captures the semantics of our language. `wc'
> > certainly doesn't! So how do we decide? We're trying to keep it
> > to Math and away from culture. I know what my criterion is: stick
> > to programs, which is all that Schmidt's quote refers to. For any
> > program P,
> >
> > V[[ P ]] in Some-Set
> >
> > must be a mathematization of the actual interpreter output.

Joe, can I get you to vote on this? You went on to my next point.

> > Maybe that even works for expressions. Shriram's (or my) value
> >
> > V[[ (lambda (x) body) ]](E) in Values
> >
> > strikes you as "meaningless", but Scheme prints something
> > meaningless, as you know.
>
> Lambda expressions are meaningful, but they mean different things in
> denotational semantics and operational semantics. In denotational
> semantics, lambda expressions mean mathematical functions.

Now here I claim you're bringing culture into Math. In the DS we've
seen, yes, you're right. There's no requirement though, in Schmidt's
definition, and you seemed to agree with me on this above.

> >> > R5RS DS uses an initial
> >> >
> >> > <rho_0, sigma_0, kappa_0> in Env x Store x Cont
> >> >
> >> > Shriram's semantic function W will also use such initial
> >> > values, and let's call them by the same names, even though they
> >> > live in different sets. Then for any program P, I claim that
> >> >
> >> > W[[ P ]] <rho_0, sigma_0, kappa_0>
> >> > =
> >> > curly-E[[ P ]] <rho_0, sigma_0, kappa_0> in Answers
> >> >
> >> > By P here I mean the expression you get by wrapping P in a let form
> >> > with "undefined"s as R5RS DS does in sec 7.2. I think this would be
> >> > easy to prove.
> >>
> >> Feel free to provide the proof.
> >
> > I'd like to, Joe, but there's a serious communication problem. I
> > don't know even what part of this you think is hard.
>
> The hard part is establishing a one-to-one mapping between the tuples
> of Operational Semantics and the domain of function values in
> denotational semantics.

Yeah, great. I think I can do that. I read R5RS DS quite carefully
and it really looked to me like curly-E[[ lambda expressions ]] was
just the obvious function you'd want to define. It took me a while to
decode R5RS DS, and now it looks impenetrable again.

Our eval order plt-scheme thread will help. From now on, I'm junking
the permute/unpermute part of R5RS DS, and going with left->right eval
order. I mean, I was doing that before anyway, but I felt guilty :D

> > Then I want an example of Ops that can't easily be turned into DS.
>
> On the one hand, since it can be proven that operational semantics
> is equivalent to denotational semantics, there can be no example.
> On the other hand, you need to invoke domain theory to do it, so
> it's never easy.

I think you contradicted yourself below, Joe. That is, if domain
theory means Scott models of LC. Let's go read it:

> > Let's suppose we have a mathematically defined function
> >
> > omega: Programs ---> Machine-History
> >
> > which takes a program to its entire evaluation history, a sequence
> > [state_1, state_2,... ]. Is that OpS?
>
> Not quite. omega is defined by induction over the operational
> semantics. OpS takes you from state_n to state_n+1, but no further.

Ah, thanks. So OpS is just my Next-State function below. And sure,
you'd need induction to even define omega. Great.

> > There's certainly a gain in mathematical complexity. I'd bet that
> > there's a computable function Next-State that computes state_{i+1}
> > from state_i. But V is definitely not a computable function, by
> > the Halting problem. We need induction to define V from
> > Next-State.
>
> Exactly. By restricting ourselves to the state-transition function,
> we avoid the computability problems associated with the denotational
> approach.

But this doesn't bother me a bit! Pure mathematicians rarely have
computable functions. Any time you bring in the real line you're not
computable, because (as Tom Bushnell posted), the real line isn't a
computable set!

This is where I thought you were contradicting yourself. Because this
V doesn't seem to use Scott models, and I said it was a
non-compositional DS valuation function. Hmm, you snipped that part :)

> But this comes at a cost: we can no longer say that the state
> transitions involved with function application and lambda
> expressions `sum up' to `real' functions.

Don't quite grok. Is this what you were going after Shriram for, that
his big-step OpS might be meaningless reduction rules? If so, that's
a real good point, and the reason I always want to produce the V.

Bill Richter

Jun 14, 2004, 10:56:56 PM

Lauri Alanko <l...@iki.fi> responded to me:

> > Let's suppose we have a mathematically defined function
> >
> > omega: Programs ---> Machine-History
> >
> > which takes a program to its entire evaluation history, a sequence
> > [state_1, state_2,... ]. Is that OpS?
> >
> > Now I'll define a DS valuation function
> >
> > V: Programs ---> Answers
> >
> > which sends the program P to either
> >
> > state_n, if omega[[ P ]] = [state_1, state_2,..., state_n ]
> >
> > bottom, if omega[[ P ]] is an infinite sequence.
>
> Right. So the meaning of "(lambda (x) (+ x 3))" is
> "(lambda (x) (+ x 3))". Mighty useful piece of information, that
> one.

But that's the answer you get, or even less, Lauri! V maps programs
to answers, mostly meaning the printed output of the interpreter.
That's how R5RS DS uses the name Answers. If your program was a
lambda expression, DrScheme gives no output at all.

> (It _is_ useful for most of the practical purposes that CS folks use
> calculi for. That's why op sem is so prevalent nowadays. But it does
> not give any enlightenment about whether the term represents the
> mathematical function that we intuitively associate it with.)

Yeah, and that must be why folks use Scott models in DS. But there's
no requirement (in Schmidt's definition at least) that the valuation
of lambda-expr is your mathematical function. It could be your
"mighty useful" (lambda (x) (+ x 3)). Don't you agree that the proof
of the DS pudding is in the programs? If I hand you a math function

V: Expressions -> (Env Store Cont -> Answers)

and if you're deciding if my V is really a DS valuation function,
then you're going to make your decision based on the restriction

V^p : Programs -> Answers

V^p( P ) = V(P)(rho_0, sigma_0, kappa_0)

All bets are off on the original V, unless we demand compositionality.

Lauri, you seem antagonistic, and maybe you remember how this went up
in smoke 2 years ago. But MFe is here now, and Joe & I have built up
some goodwill on plt-scheme... I'm willing to give it another try.

Joe Marshall

Jun 15, 2004, 4:41:13 AM

ric...@math.northwestern.edu (Bill Richter) writes:

>> > Right, good point, & I tried to correct this above. We have to
>> > assert that V captures the semantics of our language. `wc'
>> > certainly doesn't! So how do we decide? We're trying to keep it
>> > to Math and away from culture. I know what my criterion is: stick
>> > to programs, which is all that Schmidt's quote refers to. For any
>> > program P,
>> >
>> > V[[ P ]] in Some-Set
>> >
>> > must be a mathematization of the actual interpreter output.
>
> Joe, can I get you to vote on this? You went on to my next point.

I'm not quite sure what you are asserting here, but I'd say this:

If your interpreter is faithful to your denotational semantics, then

V [[ P ]] = V [[ interpreter output ]]

Note that this is different from operational semantics:

Op (P) => interpreter output

i.e., operational semantics applied to a program reduces to the
interpreter output.

>> > Maybe that even works for expressions. Shriram's (or my) value
>> >
>> > V[[ (lambda (x) body) ]](E) in Values
>> >
>> > strikes you as "meaningless", but Scheme prints something
>> > meaningless, as you know.
>>
>> Lambda expressions are meaningful, but they mean different things in
>> denotational semantics and operational semantics. In denotational
>> semantics, lambda expressions mean mathematical functions.
>
> Now here I claim you're bringing culture into Math. In the DS we've
> seen, yes, you're right. There's no requirement though, in Schmidt's
> definition, and you seemed to agree with me on this above.

Yes, we could be using `wc'.

>> But this comes at a cost: we can no longer say that the state
>> transitions involved with function application and lambda
>> expressions `sum up' to `real' functions.
>
> Don't quite grok. Is this what you were going after Shriram for, that
> his big-step OpS might be meaningless reduction rules? If so, that's
> a real good point, and the reason I always want to produce the V.

I wasn't really `going after' Shriram; I'm sure his operational
semantics are well-founded, and I believe that he chose operational
semantics over denotational semantics in order to avoid the problems
associated with the latter and to illustrate more closely the actions
of the interpreter.

But while Shriram's semantics may faithfully model what the
interpreter does, they do not provide (nor are they intended to
provide) a reason to believe that the program *as a whole* means what
we intend (i.e., the valuation function is implied through induction,
but there is no proof that the induction is valid).

For simple expressions that makes no difference, but suppose we
consider this one:

(((lambda (f)
    ((lambda (D) (D D))
     (lambda (x) (f (lambda () (x x))))))
  (lambda (f)
    (lambda (n)
      (if (zero? n)
          1
          (* n ((f) (- n 1))))))) 10)

Operational semantics can easily show that this reduces to 3628800,
but it could not show that this fragment:

(lambda (f)
  ((lambda (D) (D D))
   (lambda (x) (f (lambda () (x x))))))

when applied to this fragment:

(lambda (f)
  (lambda (n)
    (if (zero? n)
        1
        (* n ((f) (- n 1))))))

yields a (partial) function.
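
The two fragments can be named and applied separately, which is a quick way to check the claimed result. The names fix, fact-gen, and fact are labels added for this sketch; the bodies are the fragments as posted.

```scheme
;; The first fragment: a fixed-point operator that hands f a thunk
;; which, when called, rebuilds the recursive procedure.
(define fix
  (lambda (f)
    ((lambda (D) (D D))
     (lambda (x) (f (lambda () (x x)))))))

;; The second fragment: factorial written against that thunk,
;; so the recursive call is ((f) ...) and not (f ...).
(define fact-gen
  (lambda (f)
    (lambda (n)
      (if (zero? n)
          1
          (* n ((f) (- n 1)))))))

(define fact (fix fact-gen))
(fact 10)   ; => 3628800
(fact 0)    ; => 1
```

Running this only confirms individual reductions such as (fact 10). It does not by itself establish that (fix fact-gen) denotes a partial function on the naturals.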

--
~jrm

Bill Richter

Jun 15, 2004, 11:52:52 PM

Joe Marshall <prunes...@comcast.net> responds to me:

Joe, I didn't really follow your post, except for your (almost!) Y_v
combinator (see below), but I think I maybe see our problem:

I think there's 2 separate issues here, and I'd like your vote on both
the mathematical issue and the "real-world modeling" issue:

****** Math ******

I claimed I could easily turn Shriram's big-step OpS into a
mathematical function

V : Expressions -> (Env -> Values)

where V[[ (lambda (x) body) ]](E) = <x, body, E>

I said I didn't need Scott models or CPO's to define V, just some
induction. Now relating V to the curly-E of R5RS DS would be real
work, because curly-E is real work, as it involves Scott models.

I think maybe you're agreeing with me on this point, but you say I can't
call it a DS valuation function, on account of:

****** Real-World Modeling ******

When we make a mathematical model of a real-world phenomenon, we have to
ask if it's a "good" model. And that's partly a math question, but
it's partly a real-world question. What do we mean by good?

So a DS valuation/semantic function for our language

V : Expressions -> Meaning-Set

must satisfy us mathematically, but it must also satisfy us
intuitively, in a way that I think can't itself be mathematized.

So V[[ (lambda (x) body) ]] is supposed to mathematically codify the
"meaning" of (lambda (x) body). But what is the "meaning"?

I think we can legitimately give different answers to this question,
and we must define different semantic functions accordingly.

I say there's nothing wrong with the SICP solution, as Sussman &
Steele invented Scheme. SICP "conflates" the Store with the
Environment, as Shriram does, and SICP also declares the value of
(lambda (x) body) in an environment E to be just the lambda tagged
with E, i.e. Shriram's triple <x, body, E>.

You can say SICP is old-hat, but I say we can, and ought to, define
a SICP DS semantic function, and then we'll have to say that the
"meaning" of (lambda (x) body), i.e. V[[ (lambda (x) body) ]],
reflects the SICP biz, and then we'll have to say something like
Shriram's

V[[ (lambda (x) body) ]](E) = <x, body, E>

Now you can say, "No!" The "real" meaning of a lambda is some actual
function on Values, and that forces Scott models etc.

And I'd say, that's fine, but that's a different "semantic"
understanding of Scheme, so you need R5RS's curly-E, a different DS
semantic function. There's no right answer here. Heck, there are
folks who think that everything is a pointer in Scheme, and they also
need a DS semantic function that's different from curly-E.
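
For concreteness, the SICP-style reading of a lambda can be sketched in a few lines. The list representation of the triple and the names make-closure and v-lambda are assumptions for illustration, not Shriram's or SICP's code.

```scheme
;; Hypothetical representation: the triple <x, body, E> as a
;; three-element list.
(define (make-closure x body env) (list x body env))

;; V restricted to lambda expressions of the form (lambda (x) body):
;; tag the parameter and body text with the environment.
(define (v-lambda e env)
  (make-closure (car (cadr e)) (caddr e) env))

(define c1 (v-lambda '(lambda (x) (+ x 3)) '()))
(define c2 (v-lambda '(lambda (x) (+ 3 x)) '()))

c1               ; => (x (+ x 3) ())
(equal? c1 c2)   ; => #f
```

The two triples differ even though the two lambdas behave identically on numbers, which is the strictness on lambda expressions raised earlier in the thread.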

****** other issues: R5RS DS has real-world-fit problems ******

I think I remember what WC posted was wrong with the R5RS DS def of
procedure values. Let's forget Cont for now, and dumb down the R5RS
semantic function to the sort of call/cc-free DS semantic function

curly-E: Expressions -> (Store Env -> Store Value)

which Cartwright & Felleisen talk about on the top of page 2 of their
"Extensible Denotational" paper on PLT.

Then curly-E[[ (lambda (x) body) ]](sigma E) = (sigma f)

where f is an honest function. But I think what WC pointed out is
that meaningless changes to (lambda (x) body) will result in different
functions f. f is a function that changes the contents of locations,
and we can modify this location-behavior in meaningless ways by making
meaningless changes to (lambda (x) body). By meaningless, I'm making
a "real-world" distinction: they don't mean anything to us Scheme
programmers. But in the DS meaning of meaning, it's not meaningless!
What that means is that the (very nice IMO) curly-E doesn't really
reflect our real-world understanding of Scheme semantics. But it's
pretty good, and it's perfect on whole programs.

Why don't I also point out that my Shriram V semantic function is
compositional by the definition in "Extensible Denotational", p 6:

[The map from syntactic domains to semantic domains] satisfies the law
of compositionality: the interpretation of a phrase is a function of
the interpretation of the sub-phrases.

Since that's what SICP says, it's obviously true for the Shriram

V : Expressions -> (Env -> Values)

****** your interesting Y_v combinator ******

Your code looks pretty much like the Y_v combinator version of

(fact 10) => 3628800

but it looked odd enough I checked. Here's yours

(((lambda (f)
    ((lambda (D) (D D))
     (lambda (x)
       (f (lambda () (x x))))))
  (lambda (f)
    (lambda (n)
      (if (zero? n)
          1
          (* n ((f) (- n 1))))))) 10)

and here's the usual Y_v fact biz (Y_v courtesy of TLS):

(((lambda (f)
    ((lambda (D) (D D))
     (lambda (x)
       (f (lambda (p) ((x x) p))))))
  (lambda (f)
    (lambda (n)
      (if (zero? n)
          1
          (* n (f (- n 1))))))) 10)

I never saw your version. Finally I realized that your f is really
(f), so you don't quite have a version of Y_v. Pretty clever anyway!

Joe Marshall

Jun 16, 2004, 4:25:44 AM

ric...@math.northwestern.edu (Bill Richter) writes:

> You can say SICP is old-hat, but I say we can, and ought to, define
> a SICP DS semantic function, and then we'll have to say that the
> "meaning" of (lambda (x) body), i.e.
> V[[ (lambda (x) body) ]]
> reflects the SICP biz,
> and then we'll have to say something like Shriram's
>
> V[[ (lambda (x) body) ]](E) = <x, body, E>
>
> Now you can say, "No!" The "real" meaning of a lambda is some actual
> function on Values, and that forces Scott models etc.

More or less. I'm not insisting that you must model a function, but
if you choose not to model a function, then you cannot make sense of this:

((lambda (x) x) 42)

Because the lambda expression means a 3-tuple and you have not
provided a semantics for applying 3-tuples to arguments.

You have 3 options at this point:

1. Haul out the Scott domains

2. Outlaw applying closures to arguments

3. Punt and use Operational semantics to rewrite
((lambda (x) x) 42) => 42 and declare victory.
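
Option 3 amounts to adding one rule that says how a triple is applied. A minimal sketch, assuming a hypothetical mini-language and the invented names ev and apply-closure (neither is taken from PLAI or SICP):

```scheme
;; Hypothetical mini-language: numbers, symbols, (lambda (x) body),
;; and one-argument application (f e).
;; Environments are association lists of (name . value) pairs.
(define (ev e env)
  (cond ((number? e) e)
        ((symbol? e) (cdr (assq e env)))
        ((eq? (car e) 'lambda)
         ;; A lambda evaluates to the triple <x, body, E>.
         (list (car (cadr e)) (caddr e) env))
        (else
         (apply-closure (ev (car e) env) (ev (cadr e) env)))))

;; The extra rule: applying <x, body, E> to v evaluates body
;; in E extended with x bound to v.
(define (apply-closure c v)
  (let ((x (car c))
        (body (cadr c))
        (env (caddr c)))
    (ev body (cons (cons x v) env))))

(ev '((lambda (x) x) 42) '())   ; => 42
```

The triple only acquires a behaviour through apply-closure, which calls back into the evaluator, so this is an operational account of application and not a denotation for the triple itself.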

> Why don't I also point out that my Shriram V semantic function is
> compositional by the definition in "Extensible Denotational", p 6:
>
> [The map from syntactic domains to semantic domains] satisfies the law
> of compositionality: the interpretation of a phrase is a function of
> the interpretation of the sub-phrases.

But it isn't compositional (yet) because you haven't defined what the
application of 3-tuples mean.

> Your code looks pretty much like the Y_v combinator version of
>
> (fact 10) => 3628800
>
> but it looked odd enough I checked. Here's yours
>
> (((lambda (f)
>     ((lambda (D) (D D))
>      (lambda (x)
>        (f (lambda () (x x))))))
>   (lambda (f)
>     (lambda (n)
>       (if (zero? n)
>           1
>           (* n ((f) (- n 1))))))) 10)
>
> and here's the usual Y_v fact biz (Y_v courtesy of TLS):
>
> (((lambda (f)
>     ((lambda (D) (D D))
>      (lambda (x)
>        (f (lambda (p) ((x x) p))))))
>   (lambda (f)
>     (lambda (n)
>       (if (zero? n)
>           1
>           (* n (f (- n 1))))))) 10)
>
> I never saw your version. Finally I realized that your f is really
> (f), so you don't quite have a version of Y_v. Pretty clever anyway!

My Y is curried.

--
~jrm

Felix Klock

Jun 16, 2004, 9:43:44 AM

ric...@math.northwestern.edu (Bill Richter) wrote in message news:<57189ce0.04061...@posting.google.com>...

> You can say SICP is old-hat, but I say we can, and ought to, define
> a SICP DS semantic function, and then we'll have to say that the
> "meaning" of (lambda (x) body), i.e.
> V[[ (lambda (x) body) ]]
> reflects the SICP biz,
> and then we'll have to say something like Shriram's
>
> V[[ (lambda (x) body) ]](E) = <x, body, E>
>
> Now you can say, "No!" The "real" meaning of a lambda is some actual
> function on Values, and that forces Scott models etc.
>
> And I'd say, that's fine, but that's a different "semantic"
> understanding of Scheme, so you need R5RS's curly-E, a different DS
> semantic function. There's no right answer here. Heck, there are
> folks who think that everything is a pointer in Scheme, and they also
> need a DS semantic function that's different from curly-E.

It's not just a "different semantic understanding". It's an
understanding that has no more layers of evaluation on top of it.

Your V yields a <x, body, E> tuple. What *is* this, I ask you? You
might say, "hmmm... well, to understand what this tuple means, I need
to show you it running on some data in my interpreter." (And in fact,
different interpreters may yield different answers to this question)

But in Denotational Semantics, once I get the element of the
"Meaning-Set" as you call it, I'm done. That's it. There's no more
running it in an interpreter. The thing I get back will be a
function; a real mathematical function. I can't change what the
factorial or fibonacci functions *are*.

(I can't change your <x, body, E> tuple either, but I can change what
I think the 'body' inside of it means.)

Also, I suggest you go back and look at Joe's example at the end of
his last post. It sounds like you were trying to look at it and twist
it into something like the Y-combinator, and in the process you missed
the whole point that he was making: that Denotational Semantics gave
us *meanings* for the sub-expressions that weren't just abstract
tuples with some code in them.

Instead, the meanings were "just" abstract functions that are tricky
for us humans to make sense out of. Are these functions "better" than
your tuples for the purposes of reasoning about programs? I'm a
biased (and ignorant!) party, so I'm going to be political here and
just say: It depends on the structure and goals of your analysis.

Bill, if you want to try to understand why one approach might be
better or worse than another, you should stop posting on the
newsgroup, and go write some static analysis code. REAL CODE.
Something like a CFA would be a good exercise for you; go google for
"control-flow analysis scheme", do some reading, and spend three weeks
hacking something up.

Ray Blaak

Jun 16, 2004, 12:57:30 PM

pnkf...@gmail.com (Felix Klock) writes:
> It's not just a "different semantic understanding". It's an
> understanding that has no more layers of evaluation on top of it.
>
> Your V yields a <x, body, E> tuple. What *is* this, I ask you? You
> might say, "hmmm... well, to understand what this tuple means, I need
> to show you it running on some data in my interpreter." (And in fact,
> different interpreters may yield different answers to this question)
>
> But in Denotational Semantics, once I get the element of the
> "Meaning-Set" as you call it, I'm done. That's it. There's no more
> running it in an interpreter. The thing I get back will be a
> function; a real mathematical function. I can't change what the
> factorial or fibonacci functions *are*.

This is my problem with DS: who decides the semantics of the "Meaning-Set"?
Just math, we say. But that is just the original problem all over again.

Math is a system of symbols and axioms and rules for evaluating them, quite
like programs, really. It is just that people have internalized math quite
well over the years, and so there is immediate recognition and agreement as to
the meaning of things.

But ultimately the problem is the same: semantics comes about from how things
are used and how they interact; in programming terms: how things are interpreted.

In my view DS is unnecessarily complicated.

--
Cheers,                                        The Rhythm is around me,
                                               The Rhythm has control.
Ray Blaak                                      The Rhythm is inside me,
rAYb...@STRIPCAPStelus.net                     The Rhythm has my soul.

Joe Marshall

Jun 16, 2004, 1:25:12 PM

pnkf...@gmail.com (Felix Klock) writes:

> Also, I suggest you go back and look at Joe's example at the end of
> his last post. It sounds like you were trying to look at it and twist
> it into something like the Y-combinator, and in the process you missed
> the whole point that he was making: that Denotational Semantics gave
> us *meanings* for the sub-expressions that weren't just abstract
> tuples with some code in them.

The other point is that if you attempt to perform induction over the
operational semantics with this subform you will find yourself with no
base case. It isn't the Y combinator that's the problem, it's the
self-application of F that's going to do you in.

Bill Richter

Jun 16, 2004, 5:04:45 PM

Joe Marshall <prunes...@comcast.net> responds to me:
>

> > You can say SICP is old-hat, but I say we can, and ought to, define
> > a SICP DS semantic function, and then we'll have to say that the
> > "meaning" of (lambda (x) body), i.e.
> > V[[ (lambda (x) body) ]]
> > reflects the SICP biz,
> > and then we'll have to say something like Shriram's
> >
> > V[[ (lambda (x) body) ]](E) = <x, body, E>
> >
> > Now you can say, "No!" The "real" meaning of a lambda is some actual
> > function on Values, and that forces Scott models etc.
>
> More or less. I'm not insisting that you must model a function, but
> if you choose not to model a function, then you cannot make sense of
> this:
>
> ((lambda (x) x) 42)

Thanks, Joe, that clarifies your previous Y biz. I disagree:

> Because the lambda expression means a 3-tuple and you have not
> provided a semantics for applying 3-tuples to arguments.

Shriram wrote down the `semantics for applying 3-tuples to arguments'
in PLAI. It's the SICP rule, essentially. That's the reduction rule
you posted about. But yeah, we have to provide such semantics!

> > Why don't I also point out that my Shriram V semantic function is
> > compositional by the definition in "Extensible Denotational", p 6:
> >
> > [The map from syntactic domains to semantic domains] satisfies the law
> > of compositionality: the interpretation of a phrase is a function of
> > the interpretation of the sub-phrases.
>
> But it isn't compositional (yet) because you haven't defined what
> the application of 3-tuples means.

OK, sorry for not clarifying. Would you agree it's compositional now?

Maybe I should clarify about SICP. They stress: to evaluate a
combination, first evaluate the arguments, and the 1st must return a
procedure value (one of Shriram's triples), and then make a new
environment... Sounds like compositionality to me.

> My Y is curried.

Thanks! I'll hafta think about that.

Now Felix: I'm not claiming that my (or Shriram's) semantic function
is better in some calculational way. I'm in fact clueless about the
value of DS, except for what Shriram wrote in PLAI:

It would be convenient to have some common language for explaining
interpreters. We already have one: math!

I'm just thinking of DS/math as a way to clearly talk about
interpreters. I'm not a great programmer. I got into this because I
couldn't understand the text of R5RS, which they describe as the
"informal semantics", so I read R5RS DS to clarify, and it worked, but
I had huge misunderstandings about the Math...

Joe Marshall

Jun 16, 2004, 5:33:30 PM

ric...@math.northwestern.edu (Bill Richter) writes:

> Joe Marshall <prunes...@comcast.net> responds to me:
>

>> Because the lambda expression means a 3-tuple and you have not
>> provided a semantics for applying 3-tuples to arguments.
>
> Shriram wrote down the `semantics for applying 3-tuples to arguments'
> in PLAI. It's the SICP rule, essentially. That's the reduction rule
> you posted about. But yeah, we have to provide such semantics!

Unfortunately, Shriram's rule for reducing lambda expressions does not
give the semantics for applying 3-tuples. Instead, it gives us a
rewrite rule that allows us to change the application of a 3-tuple
into something else. But I can't use a rewrite rule as a definition
of a valuation function without identifying the rewrite operation as
being semantically meaningful in the valuation space.

>> > Why don't I also point out that my Shriram V semantic function is
>> > compositional by the definition in "Extensible Denotational", p 6:
>> >
>> > [The map from syntactic domains to semantic domains] satisfies the law
>> > of compositionality: the interpretation of a phrase is a function of
>> > the interpretation of the sub-phrases.
>>
>> But it isn't compositional (yet) because you haven't defined what
>> the application of 3-tuples means.
>
> OK, sorry for not clarifying. Would you agree it's compositional now?

No.

This would be compositional:
V [[ ((lambda (x) x) 3) ]] = V[[ (lambda (x) x) ]] (V[[3]])

But you aren't defining V[[ (lambda (x) x) ]], you are applying the
operational steps on the lambda expression:

OpS (OpS (lambda (x) x), OpS(3))

with the hope that

V[[ OpS (OpS (lambda (x) x), OpS(3)) ]] = V[[ (lambda (x) x) ]] (V[[3]])

But you have given no justification for asserting this.

> Maybe I should clarify about SICP. They stress: to evaluate a
> combination, first evaluate the arguments, and the 1st must return a
> procedure value (one of Shriram's triples), and then make a new
> environment... Sounds like compositionality to me.

Yes, a finite string of Operational semantics steps compose. That
isn't a valuation function.
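A minimal sketch of what the compositional equation above asks for, under loud assumptions: the tuple-based toy syntax and the name `denote` are invented for illustration, and Python functions stand in for mathematical functions (no Scott domains are constructed). Each clause builds the meaning of a phrase only from the meanings of its immediate sub-phrases, and application is plain function application:

```python
# Toy syntax (an assumption for this sketch):
#   int                      -> numeric literal
#   str                      -> variable reference
#   ("lambda", param, body)  -> one-argument lambda
#   ("app", rator, rand)     -> application

def denote(expr):
    """Return the meaning of expr: a function from environments to values."""
    if isinstance(expr, int):
        return lambda env: expr
    if isinstance(expr, str):
        return lambda env: env[expr]
    if expr[0] == "lambda":
        _, param, body = expr
        body_meaning = denote(body)            # meaning of the sub-phrase
        return lambda env: (lambda arg: body_meaning({**env, param: arg}))
    if expr[0] == "app":
        _, rator, rand = expr
        rator_meaning = denote(rator)
        rand_meaning = denote(rand)
        # V[[ (f a) ]] = V[[ f ]] (V[[ a ]]), threaded through the environment
        return lambda env: rator_meaning(env)(rand_meaning(env))
    raise ValueError(f"unknown expression: {expr!r}")

identity = ("lambda", "x", "x")
program = ("app", identity, 3)
result = denote(program)({})
print(result)   # 3
```

Note that `denote` recurses only on syntax, so building a meaning always terminates; whether the resulting function returns when called is a separate question.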

Lauri Alanko

Jun 16, 2004, 9:56:57 PM

In article <uekoft...@STRIPCAPStelus.net>,

Ray Blaak <rAYb...@STRIPCAPStelus.net> wrote:
> Math is a system of symbols and axioms and rules for evaluating them, quite
> like programs, really.

This is a formalist view, and probably doesn't reflect the attitude of
the majority of mathematicians, although it may come naturally to a
computer scientist (I know it comes naturally to me).

I think the prevailing view is that mathematical objects exist in some
platonic realm quite independently of any formal systems, and
mathematical truths are quite independent of the derivability of
anything (excepting, of course, those mathematical propositions that
explicitly concern derivability). We are supposed to have an intuitive
understanding of these mathematical objects and a formal system can
then be judged by how well it captures this intuition of ours.

In this context, the "meaning" that DS assigns to the Scheme program
(+ 2 2) is not simply a numeral "4", but actually "four", or just
_four_, fourness itself. What this _really_ means is then a question
for philosophers. Certainly, as Benacerraf has argued, it cannot mean
simply {{},{{}},{{},{{}}},{{},{{}},{{},{{}}}}} (here meaning the set
which that expression denotes, not the expression itself as a
syntactic object).
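The set written out above is the von Neumann encoding of 4. A small sketch of why it is hard to say that this set simply *is* four: the Zermelo encoding (brought in here as a contrasting assumption; the post only shows the first set) represents the same number with a different set, and the two disagree on set-theoretic questions that arithmetic never asks. The function names are made up for illustration:

```python
def von_neumann(n):
    """Encode n as the set {0, 1, ..., n-1}."""
    s = frozenset()
    for _ in range(n):
        s = s | frozenset([s])
    return s

def zermelo(n):
    """Encode 0 as {} and n+1 as {n}."""
    s = frozenset()
    for _ in range(n):
        s = frozenset([s])
    return s

# The literal set from the post, built by hand.
e = frozenset()
one = frozenset([e])
two = frozenset([e, one])
three = frozenset([e, one, two])
four = frozenset([e, one, two, three])

print(von_neumann(4) == four)               # True
print(von_neumann(4) == zermelo(4))         # False
print(von_neumann(2) in von_neumann(4))     # True
print(zermelo(2) in zermelo(4))             # False
```

Both encodings support the same arithmetic, yet "is 2 a member of 4?" gets different answers, which is the sort of question that makes identifying four with either set uncomfortable.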

In any case, even though there are philosophical problems about the
meaning of math, at least DS pushes the problem of interpretation to
the philosophers, and out from the computer scientists' shoulders. :)

Lauri Alanko
l...@iki.fi

Bill Richter

Jun 16, 2004, 10:06:49 PM

Joe Marshall <j...@ccs.neu.edu> responded to me:

> >> > Why don't I also point out that my Shriram V semantic function is
> >> > compositional by the definition in "Extensible Denotational", p 6:
> >>
> >> But it isn't compositional (yet) because you haven't defined what
> >> the application of 3-tuples means.
> >
> > OK, sorry for not clarifying. Would you agree it's compositional now?
>
> No.
>
> This would be compositional:
> V [[ ((lambda (x) x) 3) ]] = V[[ (lambda (x) x) ]] (V[[3]])

Ah, that looks like exactly our problem, Joe! I think I owe you an
apology for suggesting you were dragging culture into this. I see now
why you insisted that the meaning of a lambda-exp is a function.

You don't quite have the definition of compositionality. What you
wrote is an example of compositionality, but more generally, it's that
there's a function Phi s.t.

V [[ ((lambda (x) x) 3) ]] = Phi(V[[ (lambda (x) x) ]], V[[3]])

Your Phi is Phi(f, x) = f(x), which is fine, but that's not the only
one. As I posted here on 26 Jun 2002 (some editing):

As Cartwright and Felleisen's "Extensible Denotational" paper says:

[The map from syntactic domains to semantic domains] satisfies the
law of compositionality: the interpretation of a phrase is a
function of the interpretation of the sub-phrases.

This means that for a DS valuation function

curly-E: Expressions ---> M

compositionality means e.g. there must be a function

Phi: M x M ---> M

such that for a pair of expressions (X, Y),

curly-E[[ (X Y) ]] = Phi( curly-E[[ X ]], curly-E[[ Y ]] )

I don't see any trouble defining Phi, and I think I did so 2 years
ago. You just read it off SICP. And if we unconflate Store & Env,
it's not SICP, it's R5RS, 4.1.4 Procedures:

Semantics: A lambda expression evaluates to a procedure. The
environment in effect when the lambda expression was evaluated is
remembered as part of the procedure. When the procedure is later
called with some actual arguments, the environment in which the
lambda expression was evaluated will be extended by binding the
variables in the formal argument list to fresh locations, the
corresponding actual argument values will be stored in those
locations, and the expressions in the body of the lambda expression
will be evaluated sequentially in the extended environment.

It's easy to translate that to a Phi function for a DS function

curly-E: Expressions ---> (Env x Store -> Values x Store)

just like the (very similar) SICP prose yields a Phi for Shriram's

V: Expressions -> (Env -> Values)

Thus Shriram's V (obtained from his actual big-step OpS with a little
induction) is a compositional DS semantic function.

It's exciting to think we might settle this old argument! And getting
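back to Felix, my interest here is just that overly arcane math makes
for a bad `common language', per Shriram's great PLAI slogan:

It would be convenient to have some common language for explaining
interpreters. We already have one: math!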

Bill Richter

Jun 16, 2004, 10:17:54 PM

Ray Blaak responds to pnkf...@gmail.com (Felix Klock):

> > But in Denotational Semantics, once I get the element of the
> > "Meaning-Set" as you call it, I'm done. That's it. There's no
> > more running it in an interpreter. The thing I get back will be a
> > function; a real mathematical function. I can't change what the
> > factorial or fibonacci functions *are*.

Felix is accurately expressing the usual practice in DS, I think, but
there's no actual requirement to get `a real mathematical function'.

> Math is a system of symbols and axioms and rules for evaluating
> them, quite like programs, really. It is just that people have
> internalized math quite well over the years, and so there is
> immediate recognition and agreement as to the meaning of things.

There's an important difference, Ray! Math is much more powerful!
That is, the `math' that `people have internalized' includes powerful
axioms that allow you to, say, construct functions that you can't write
programs for. I like your `immediate recognition and agreement' :D

Shriram Krishnamurthi

Jun 16, 2004, 10:29:41 PM

ric...@math.northwestern.edu (Bill Richter) writes:

> It's exciting to think we might settle this old argument! And getting
> back to Felix, my interest here is just that overly arcane math makes
> for a bad `common language', per Shriram's great PLAI slogan:
>
> It would be convenient to have some common language for explaining
> interpreters. We already have one: math!

If only I'd known my throw-away comment would be quoted this many
times and in this context, I'd never have written it. (As I'm sure
Dave Schmidt might be regretting not paying more attention to his
introduction, either.)

Bill, I'm going to rewrite my text to say that a semantics is not
denotational unless it maps lambda to an actual mathematical function.
A mapping of lambda to tuples, or to any other kind of structure, is
not a denotational semantics. Then I'm going to quote the book ad
infinitum in response to your posts.

Shriram

Lauri Alanko

Jun 16, 2004, 10:37:08 PM

In article <57189ce0.0406...@posting.google.com>,

Bill Richter <ric...@math.northwestern.edu> wrote:
> just like the (very similar) SICP prose yields a Phi for Shriram's
>
> V: Expressions -> (Env -> Values)
>
> Thus Shriram's V (obtained from his actual big-step OpS with a little
> induction) is a compositional DS semantic function.

It is easy to say "it is easy to see that..." Since we're such
skeptics here, it would be easier for you to simply provide the
desired function immediately, since someone is going to ask for it anyway.

Now, as to why I for one am so skeptical, lessee... In Shriram's
manuscript on p. 173 we see a rule:

f,e => <i,b,e'>    a,e => a_v    b,e'[i<-a_v] => b_v
------------------------------------------------------
                   {f a},e => b_v

Now, if I try to turn this straightforwardly into a meaning function
like you suggest, I end up with something like this:

E[{f a}] e = let <i,b,e'> = E[f] e
             in E[b] e'[i<-(E[a] e)]
                ^^^^
See the problem?

If you have a _compositional_ meaning definition derived from the
above big-step rule, I'm sure everyone would be interested in seeing
it. Especially if you won't use any recursive domains since those
would bring along the host of problems that requires all those hairy
CPOs to solve, and you certainly don't need _those_, as you have
reiterated...
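The problem marked with carets can be made visible with a small instrumented sketch. The toy syntax, the `visited` log, and the helper `subphrases` are assumptions added for illustration; `E` is a direct transcription of the big-step rule that returns `<i, b, e'>` tuples for lambdas. Logging every phrase that `E` is asked about shows it being applied to a body that is not a sub-phrase of the application being evaluated:

```python
# Toy syntax (assumed): int, str (variable), ("lambda", i, b), ("app", f, a)

def subphrases(expr):
    """All syntactic sub-phrases of expr, including expr itself."""
    found = [expr]
    if isinstance(expr, tuple):
        if expr[0] == "lambda":
            found += subphrases(expr[2])
        elif expr[0] == "app":
            found += subphrases(expr[1]) + subphrases(expr[2])
    return found

def E(expr, env, visited):
    """Transcribed big-step rule; records each phrase it is asked to evaluate."""
    visited.append(expr)
    if isinstance(expr, int):
        return expr
    if isinstance(expr, str):
        return env[expr]
    if expr[0] == "lambda":
        _, i, b = expr
        return (i, b, env)                             # closure tuple <i, b, e'>
    if expr[0] == "app":
        _, f, a = expr
        i, b, closure_env = E(f, env, visited)
        a_v = E(a, env, visited)
        return E(b, {**closure_env, i: a_v}, visited)  # recursion on b
    raise ValueError(f"unknown expression: {expr!r}")

app = ("app", "f", 3)
env = {"f": ("y", "y", {})}       # f is bound to the closure <y, y, {}>
visited = []
value = E(app, env, visited)
outside = [p for p in visited if p not in subphrases(app)]
print(value)     # 3
print(outside)   # ['y']
```

The body `y` comes out of the value of `f`, not out of the text of `{f a}`, so this definition is not a structural recursion on syntax. The sketch says nothing about whether some cleverer definition could avoid that.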

Lauri Alanko
l...@iki.fi

Matthias Blume

Jun 16, 2004, 11:07:05 PM

Shriram Krishnamurthi <s...@cs.brown.edu> writes:

> ric...@math.northwestern.edu (Bill Richter) writes:
>
> > It's exciting to think we might settle this old argument! And getting
> > back to Felix, my interest here is just that overly arcane math makes
> > for a bad `common language', per Shriram's great PLAI slogan:
> >
> > It would be convenient to have some common language for explaining
> > interpreters. We already have one: math!
>
> If only I'd known my throw-away comment would be quoted this many
> times and in this context, I'd never have written it. (As I'm sure
> Dave Schmidt might be regretting not paying more attention to his
> introduction, either.)
>
> Bill, I'm going to rewrite my text to say that a semantics is not
> denotational unless it maps lambda to an actual mathematical function.

Uh, oh, Shriram. Be veeeeery careful!

The first problem with your throw-away sentence (no, not the first
one, this one!) is that almost anything can formally be turned into a
function. What you probably mean is that the denotation of a lambda
should be a function that, when applied to the denotation of an
argument, returns the denotation of the result.

But this stronger requirement would actually rule out some semantics
which are commonly considered denotational.

Matthias

Felix Klock

Jun 16, 2004, 11:24:01 PM

Ray Blaak <rAYb...@STRIPCAPStelus.net> wrote in message news:<uekoft...@STRIPCAPStelus.net>...

> pnkf...@gmail.com (Felix Klock) writes:
> > But in Denotational Semantics, once I get the element of the
> > "Meaning-Set" as you call it, I'm done. That's it. There's no more
> > running it in an interpreter. The thing I get back will be a
> > function; a real mathematical function. I can't change what the
> > factorial or fibonacci functions *are*.
>
> This is my problem with DS: who decides the semantics of the "Meaning-Set"?
> Just math, we say. But that is just the original problem all over again.
>
> Math is a system of symbols and axioms and rules for evaluating them, quite
> like programs, really. It is just that people have internalized math quite
> well over the years, and so there is immediate recognition and agreement as to
> the meaning of things.
>
> But ultimately the problem is the same: semantics comes about from how things
> are used and how they interact; in programming terms: how things are interpreted.

I agree with this. I said I was biased. I didn't say in which
direction (though I suspect my choice of challenge to Bill Richter
(developing a CFA) revealed which way I'm tilted towards).

But then again, I also said I was ignorant. I've taken only one class
that tried to cover Den.Sem., so I don't consider myself an expert in
its utility. Hopefully by next summer I'll have a greater
appreciation for the utility of Scott domains. Or even Category
Theory!

Daniel C. Wang

Jun 17, 2004, 1:15:47 AM

Felix Klock wrote:
{stuff deleted}

> But then again, I also said I was ignorant. I've taken only one class
> that tried to cover Den.Sem., so I don't consider myself an expert in
> its utility. Hopefully by next summer I'll have a greater
> appreciation for the utility of Scott domains. Or even Category
> Theory!

IMNHO the killer app for denotational techniques is when you want to prove
rich semantic equivalences between terms in your language. Especially when
the operational techniques are just too clunky. Unfortunately, for general
programming languages there are few if any general semantic equivalences
that your user may want to prove.

But imagine giving a denotational semantics to some interesting thing like a
declarative language for representing resolution independent pictures...
(i.e. SVG, Postscript, FLASH...) The Haskell School of Expression by Hudak
has lots of good examples of good uses of denotational reasoning of this flavor.

You would rather describe a circle as the set of points equidistant from a
point rather than with a particular algorithm used to rasterize it.
Especially if you want to prove you can collapse several coordinate
transformations into one transformation matrix. In this area the
mathematical denotation of the object is the most natural way of thinking
about it.

For programs most of the time, I feel reasoning about the operational
behavior of it is unfortunately more useful/natural/intuitive. So I think
the denotational approach and Scott domains are a bit of an overkill for
most things.

For example, it is nice to know that the iterative version of fib is in some
way semantically the same function as the naive exponential one. However,
most people also care that they are different in the sense that the
iterative version is more efficient. Programming requires reasoning about
correctness and operational behavior. I do not think you can realistically
ignore one or the other!
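A small sketch of the fib example, with a step counter added purely for illustration (the function names and the counter are assumptions, not anything from the post). The two definitions agree on every input tried, which is the denotational observation, while the work they do differs sharply, which is the operational one:

```python
def fib_naive(n, counter):
    """Doubly recursive Fibonacci; counter[0] counts calls."""
    counter[0] += 1
    if n < 2:
        return n
    return fib_naive(n - 1, counter) + fib_naive(n - 2, counter)

def fib_iter(n, counter):
    """Iterative Fibonacci; counter[0] counts loop iterations."""
    a, b = 0, 1
    for _ in range(n):
        counter[0] += 1
        a, b = b, a + b
    return a

# Same input/output behaviour on a sample of inputs...
same_values = all(fib_naive(n, [0]) == fib_iter(n, [0]) for n in range(15))

# ...but very different amounts of work for the same answer.
naive_steps, iter_steps = [0], [0]
fib_naive(15, naive_steps)
fib_iter(15, iter_steps)
print(same_values, naive_steps[0], iter_steps[0])
```

Agreement on sampled inputs is only evidence of equality, not a proof; proving the two equal as functions is exactly the kind of job the denotational view is suited to.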

To be fair, I think the operational techniques have become more common
because people have given up on reasoning about correctness, so the
operational techniques are just useful enough to prove the weaker theorems
people are interested in these days. One day, I hope, maybe people will start
worrying about correctness and Scott domains will be in vogue again.

Felix Klock

Jun 17, 2004, 2:12:29 AM

ric...@math.northwestern.edu (Bill Richter) wrote in message news:<57189ce0.04061...@posting.google.com>...

> Ray Blaak responds to pnkf...@gmail.com (Felix Klock):
>
> > > But in Denotational Semantics, once I get the element of the
> > > "Meaning-Set" as you call it, I'm done. That's it. There's no
> > > more running it in an interpreter. The thing I get back will be a
> > > function; a real mathematical function. I can't change what the
> > > factorial or fibonacci functions *are*.
>
> Felix is accurately expressing the usual practice in DS, I think, but
> there's no actual requirement to get `a real mathematical function'.

Okay, I think I understand now.

Bill Richter wants to give us a different kind of semantics... let's
not call it "Denotational Semantics", since that is easily confused
with the common (but clearly pointless) practice of making the range
of the valuation function be "real math stuff"...

Let's call it "A New Kind of Semantics", or NKS for short.

And this revolutionary NKS will have the ease of use of Denotational
Semantics, while providing the awesome power of reasoning in an
Operational Semantics.

After all, the V for an ideal NKS would clearly have properties like:

V[[ (lambda (x) ((lambda (y) (- x y)) 3)) ]] != V[[ (lambda (x) (- x 3)) ]]

and in NKS, we get exactly that, since these two applications of V
yield totally different tuples! Wonderful!

</sarcasm>
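The inequality above can be checked on a toy model. All names and the tuple-based syntax are assumptions made for this sketch, and Python functions are only stand-ins for mathematical functions. Under the tuple reading the two lambdas get different meanings; under a function reading they agree on every sampled argument:

```python
# Toy syntax (assumed): int, str, ("lambda", p, body), ("app", f, a), ("-", l, r)

def tuple_meaning(expr, env):
    """The <param, body, env> reading of a lambda expression."""
    _, param, body = expr
    return (param, body, env)

def function_meaning(expr):
    """A compositional reading: environments to values, lambdas to functions."""
    if isinstance(expr, int):
        return lambda env: expr
    if isinstance(expr, str):
        return lambda env: env[expr]
    tag = expr[0]
    if tag == "lambda":
        _, param, body = expr
        body_m = function_meaning(body)
        return lambda env: (lambda arg: body_m({**env, param: arg}))
    if tag == "app":
        f_m, a_m = function_meaning(expr[1]), function_meaning(expr[2])
        return lambda env: f_m(env)(a_m(env))
    if tag == "-":
        l_m, r_m = function_meaning(expr[1]), function_meaning(expr[2])
        return lambda env: l_m(env) - r_m(env)
    raise ValueError(f"unknown expression: {expr!r}")

long_form = ("lambda", "x", ("app", ("lambda", "y", ("-", "x", "y")), 3))
short_form = ("lambda", "x", ("-", "x", 3))

tuples_differ = tuple_meaning(long_form, {}) != tuple_meaning(short_form, {})
f = function_meaning(long_form)({})
g = function_meaning(short_form)({})
agree_on_samples = all(f(n) == g(n) for n in range(-5, 6))
print(tuples_differ, agree_on_samples)   # True True
```

Python cannot compare two functions for equality over all arguments, so the agreement shown is sampled; in a denotational model the two meanings would be the same function outright.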

Bill, please please please go try to develop a static analysis of some
sort. Yes, I read that your background is in math and not in computer
science, but that's simply an unacceptable excuse. We need people
with a math background in this area; there are too many compiler hackers
out there WITHOUT a reasonable background in math. Just write a
CFA-0, it shouldn't take more than a month. And I bet you'll find
many knowledgeable folk here to help you (that is, I bet there are lots
more CFA experts reading comp.lang.scheme than NKS experts).

Until you try to apply some of the things you've learned to a concrete
problem (some problem where you have the potential for automated
testing; and posting formulae to comp.lang.scheme does not count as
"automated testing"), I fear you're never going to get an insight for
why Computer Scientists have chosen particular frameworks for
particular tasks, and why theorems aren't freely interchangeable
between frameworks.

-Felix

p.s. if I misinterpreted what V[[ - ]] would do on the expressions
above, I apologize. However, please do not interpret this apology as
a request for an explanation for what your V[[ - ]] would yield nor
how it would do so.

Joe Marshall

Jun 17, 2004, 7:52:50 AM

ric...@math.northwestern.edu (Bill Richter) writes:

> Joe Marshall <j...@ccs.neu.edu> responded to me:
>>

>> This would be compositional:
>> V [[ ((lambda (x) x) 3) ]] = V[[ (lambda (x) x) ]] (V[[3]])
>
> Ah, that looks like exactly our problem, Joe! I think I owe you an
> apology for suggesting you were dragging culture into this. I see now
> why you insisted that the meaning of a lambda-exp is a function.
>
> You don't quite have the definition of compositionality. What you
> wrote is an example of compositionality, but more generally, it's that
> there's a function Phi s.t.
>
> V [[ ((lambda (x) x) 3) ]] = Phi(V[[ (lambda (x) x) ]], V[[3]])
>
> Your Phi is Phi(f, x) = f(x), which is fine, but that's not the only
> one. As I posted here on 26 Jun 2002 (some editing):
>
> As Cartwright and Felleisen's "Extensible Denotational" paper says:
>
> [The map from syntactic domains to semantic domains] satisfies the
> law of compositionality: the interpretation of a phrase is a
> function of the interpretation of the sub-phrases.
>
> This means that for a DS valuation function
>
> curly-E: Expressions ---> M
>
> compositionality means e.g. there must be a function
>
> Phi: M x M ---> M
>
> such that for a pair of expressions (X, Y),
>
> curly-E[[ (X Y) ]] = Phi( curly-E[[ X ]], curly-E[[ Y ]] )
>
> I don't see any trouble defining Phi, and I think I did so 2 years
> ago.

The appropriate Phi for curly-E is not going to work for OpS, so I'll
call your putative Phi `Phi1':

So you are at this stage:
V[[ ((lambda (x) x) 3) ]] = Phi1 (V[[ (lambda (x) x) ]], V[[3]])

Now I assume you wish your semantics to have the same semantics as
R5RS, so you expect this to hold:

Phi1 (<a 3-tuple>, <the number 3>) = Phi (<a function>, <the number 3>)

Or, since Phi (x, y) is defined as x(y),

Phi1 (<a 3-tuple>, <the number 3>) = <a function> (<the number 3>)

So Phi1 must map 3-tuples to functions. That is,

Phi1 (x, y) = Phi (3-tuple->function (x), y)

But you have not demonstrated that
3-tuple->function is definable over all (or almost all) 3-tuples,
or that 3-tuple->function exists for all (or almost all) appropriate 3-tuples,
or that 3-tuple->function is unique for those.
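One hedged way to picture the three obligations just listed: write `3-tuple->function` the obvious way, by handing the tuple's body back to an evaluator. The names `eval_tuple`, `tuple_to_function`, `phi`, and `phi1` and the toy syntax are assumptions for this sketch only. It works on the identity closure, but on a self-applying closure the conversion yields something that never returns, so writing the converter down does not by itself show it is defined where it needs to be:

```python
# Toy syntax (assumed): int, str, ("lambda", p, body), ("app", rator, rand)

def eval_tuple(expr, env):
    """Operational evaluator: lambdas evaluate to <param, body, env> tuples."""
    if isinstance(expr, int):
        return expr
    if isinstance(expr, str):
        return env[expr]
    if expr[0] == "lambda":
        _, param, body = expr
        return (param, body, env)
    _, rator, rand = expr
    return phi1(eval_tuple(rator, env), eval_tuple(rand, env))

def tuple_to_function(closure):
    """Candidate 3-tuple->function: defined only by running the evaluator."""
    param, body, env = closure
    return lambda arg: eval_tuple(body, {**env, param: arg})

def phi(f, x):
    """Phi(f, x) = f(x)."""
    return f(x)

def phi1(closure, x):
    """Phi1(x, y) = Phi(3-tuple->function(x), y)."""
    return phi(tuple_to_function(closure), x)

identity = eval_tuple(("lambda", "x", "x"), {})
print(phi1(identity, 3))    # 3

# Self-application: (lambda (x) (x x)) applied to itself.
omega = eval_tuple(("lambda", "x", ("app", "x", "x")), {})
try:
    phi1(omega, omega)
    diverged = False
except RecursionError:
    diverged = True
print(diverged)             # True
```

Python reports the runaway recursion as `RecursionError`; in the mathematical setting the corresponding question is whether the function exists at all, which is what the induction would have to establish.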

> It's easy to translate that to a Phi function for a DS function
>
> curly-E: Expressions ---> (Env x Store -> Values x Store)
>
> just like the (very similar) SICP prose yields a Phi for Shriram's
>
> V: Expressions -> (Env -> Values)
>
> Thus Shriram's V (obtained from his actual big-step OpS with a little
> induction) is a compositional DS semantic function.

Whoa, there's a lot of induction. If you expand out the Y operator
example you'll find more induction than you can shake a stick at! You
haven't shown that this induction will work.

--
~jrm

Shriram Krishnamurthi

Jun 17, 2004, 8:30:07 AM

Matthias Blume <fi...@my.address.elsewhere> writes:

> But this stronger requirement would actually rule out some semantics
> which are commonly considered denotational.

It would rule out decision-tree representations, some game-theoretic
models, and so on. But then Richter would be writing the authors of
those papers demanding to know why their purported "denotational"
semantics don't match the throw-away wording in my book, no?

Shriram

Shriram Krishnamurthi

Jun 17, 2004, 8:44:20 AM

"Daniel C. Wang" <danw...@hotmail.com> writes:

> To be fair I think, the operational techniques have become more common
> because people have given up on reasoning about correctness, so the
> operational techniques are just useful enough to prove the weaker
> theorems people are interested in these days. One day I hope, maybe
> people will start worrying about correctness and Scott domains will be
> in vogue again.

You can reason about correctness using other models, eg, Kripke
structures and temporal logic. Some of us do this all the time.
Correctness is not out of vogue, only this technique is.

Yes, you're missing a step of proof that the Kripke structure you
extract from a program corresponds to the program's "real" (ie,
denotational) meaning. But the extraction process is usually quite
straight-forward (not much different from, and often reusing, what a
compiler does), and lots of weaker theorems have been proven about it
to give the user additional confidence.

Shriram

Ray Blaak

Jun 17, 2004, 2:20:54 PM

Lauri Alanko <l...@iki.fi> writes:

[a nice response]

> I think the prevailing view is that mathematical objects exist in some
> platonic realm quite independently of any formal systems, and
> mathematical truths are quite independent of the derivability of
> anything (excepting, of course, those mathematical propositions that
> explicitly concern derivability). We are supposed to have an intuitive
> understanding of these mathematical objects and a formal system can
> then be judged by how well it captures this intuition of ours.

I don't strictly believe this myself, but have no real problem with this
view. Certainly I use my intuitive understanding of things to guide me.

My take on it is that ultimately it doesn't matter. Our attempts to refer to
such platonic objects run into the limitations of our notations and reasoning
abilities, meaning that even if there is a "true" semantics to be found, we
are not sure if we are achieving it.

Our practical semantics, then, is the result of how things interact, which
hopefully for our intuition would tend to correspond to what we believe the
"true" semantics are like.

> In this context, the "meaning" that DS assigns to the Scheme program
> (+ 2 2) is not simply a numeral "4", but actually "four", or just
> _four_, fourness itself. What this _really_ means is then a question
> for philosophers. Certainly, as Benacerraf has argued, it cannot mean
> simply {{},{{}},{{},{{}}},{{},{{}},{{},{{}}}}} (here meaning the set
> which that expression denotes, not the expression itself as a
> syntactic object).

I see at least 6 ways to refer to the notion of "four" here. Which is right?
Are they all different, all equivalent, or equivalent only sometimes?

Is the platonic set you are attempting to refer to the same as the platonic 4?

I think it all depends on context and how things are used, i.e. how things are
interpreted.

> In any case, even though there are philosophical problems about the
> meaning of math, at least DS pushes the problem of interpretation to
> the philosophers, and out from the computer scientists' shoulders. :)

I don't know if pushing the problem to the philosophers really solves
anything; they tend to argue a lot :-).

I do agree that it is useful to be able to push a problem to another standard
one, since that allows us to understand how problems can relate to each other,
and to determine if problems are equivalent. So, yes, I can see the usefulness
of DS in that respect.

Matthias Blume

Jun 17, 2004, 3:43:07 PM

Shriram Krishnamurthi <s...@cs.brown.edu> writes:

> Matthias Blume <fi...@my.address.elsewhere> writes:
>
> > But this stronger requirement would actually rule out some semantics
> > which are commonly considered denotational.
>
> It would rule out decision-tree representations, some game-theoretic
> models, and so on.

Exactly.

> But then Richter would be writing the authors of
> those papers demanding to know why their purported "denotational"
> semantics don't match the throw-away wording in my book, no?

I must have missed a smiley (actually present or implied), because I
don't know what you are trying to say. (Your sentence supports the
idea of requiring denotations of lambdas to be functions HOW?)

Matthias

Daniel C. Wang

Jun 17, 2004, 3:47:25 PM

Shriram Krishnamurthi wrote:
{stuff deleted}

> You can reason about correctness using other models, eg, Kripke
> structures and temporal logic. Some of us do this all the time.
> Correctness is not out of vogue, only this technique is.
>
> Yes, you're missing a step of proof that the Kripke structure you
> extract from a program corresponds to the program's "real" (ie,
> denotational) meaning. But the extraction process is usually quite
> straight-forward (not much different from, and often reusing, what a
> compiler does), and lots of weaker theorems have been proven about it
> to give the user additional confidence.

Having spent the last several months worrying about proofs at an
unbelievable level of pedantry and annoying detail, I find it absolutely
disturbing to "leave a step out" and claim to have a proof. Perhaps, you can
build Kripke structures directly from an operational semantics of a language
and show they are somehow sound, but I suspect a DS version would be a bit
more pleasant.

I would also note that if you wanted to actually mathematically prove that
what goes on in a compiler is sound, a DS with Scott models might be the most
obvious way to go about things. Of course not many people actually have the
time to mathematically prove that what their compiler does is sound.

On that related note, does anyone happen to have a pointer to a proof that
shows the CPS transformation is semantics preserving? Was such a proof done
via a DS or an operational approach? I imagine such a proof could be carried
out using either technique; I'm just curious how it actually was carried out.

Shriram Krishnamurthi

Jun 17, 2004, 8:12:46 PM

"Daniel C. Wang" <danw...@hotmail.com> writes:

> Having spent the last several months worrying about proofs at an
> unbelievable level of pedantry and annoying detail, I find it
> absolutely disturbing to "leave a step out" and claim to have a
> proof.

I'm amused that you would be "absolutely disturbed" about something
that I didn't say. I said that the proofs are about the Kripke
structures, and that we can have "confidence" in the relationship
between the structure and the program. I chose my words carefully.

Given that "verification" is mostly about debugging, not about proving
correctness, this is not only useful, but often about as much as you
can achieve. (Did the pedantry and annoying detail you dealt with
take into account quantum effects, hardware errors, etc? If not,
should we be absolutely disturbed that you assumed these away?)

Shriram

Daniel C. Wang

Jun 17, 2004, 10:37:29 PM

Shriram Krishnamurthi wrote:
{stuff deleted}

> I'm amused that you would be "absolutely disturbed" about something
> that I didn't say. I said that the proofs are about the Kripke
> structures, and that we can have "confidence" in the relationship
> between the structure and the program. I chose my words carefully.

I don't want to nitpick, but reading your original text carefully still
makes it seem ambiguous as to what you actually claimed.

> Given that "verification" is mostly about debugging, not about proving
> correctness, this is not only useful, but often about as much as you
> can achieve. (Did the pedantry and annoying detail you dealt with
> take into account quantum effects, hardware errors, etc? If not,
> should we be absolutely disturbed that you assumed these away?)

The issue is making a precise statement about what assumptions are made and
understanding how interesting the conclusion is with respect to the
assumptions. The right assumptions will allow you to prove anything.

If you are abstracting the program semantics to prove a property about the
program, one definitely should have a proof that the abstraction is sound
with respect to the program semantics. If however, you are merely saying
that you have a proof about your putative abstraction and make no formal
claims about program correctness, then fine. But in that case I do not know
what you mean by "you can reason about correctness ...." if you don't
establish the link between your abstraction and the real semantics.

Perhaps, I'm misreading "reason" as "prove" rather than "have confidence
about". In any case, I have confidence about many things that are totally
wrong.

Bill Richter

Jun 17, 2004, 11:14:42 PM

Lauri Alanko <l...@iki.fi> responded to me:

> E[{f a}] e = let <i,b,e'> = E[f] e
> in E[b] e'[i<-(E[a] e)]
> ^^^^
> See the problem?

Lauri, I don't quite follow you, but I do see a problem! Thanks!
It's maybe impossible to show compositionality for my SICP-like

E: Expressions -> (Env -> Values)

because evaluating the 1st argument in {f a} changes the env/store.
So let's unconflate Env & Store, to get a DS semantic function

E: Expressions ---> (Env x Store -> Value x Store)

of the sort discussed on page 2 of Cartwright & Felleisen "Extensible
Denotational" paper. My E uses left->right eval order, like mzscheme.

Let's translate Shriram's OpS example as this. We'll define

E[{f a}](e, s) = (b_v, s3), if

E[f](e, s) = (<i,b,e'>, s1)

E[a](e, s1) = (a_v, s2)

l = (new s2)

E[b](e'[i->l], s2[l->a_v]) = (b_v, s3), and

E[{f a}](e, s) = bottom, otherwise.

My R5RS-like `new' produces a fresh location, i.e. sent to bottom by
s2 in Store = (Location -> Value).
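
For anyone who wants to poke at the rule above, here is a minimal Python sketch of it. Everything about the encoding is an assumption of the sketch, not part of the rule: expressions are tagged tuples, closures are ("clo", i, b, e') tuples, Env and Store are dicts, and `new` just returns the next unused integer. Where the rule says "bottom, otherwise", the sketch simply raises a Python exception or fails to return.

```python
# Hypothetical encoding of expressions as tagged tuples:
#   ("num", n) | ("var", x) | ("fun", i, body) | ("app", f, a)

def new(store):
    """Return a location not bound in store (locations are 0, 1, 2, ...)."""
    return len(store)

def E(expr, env, store):
    """Big-step evaluation: returns (value, store').

    env maps identifiers to locations, store maps locations to values.
    """
    tag = expr[0]
    if tag == "num":
        return expr[1], store
    if tag == "var":
        return store[env[expr[1]]], store
    if tag == "fun":
        _, i, body = expr
        return ("clo", i, body, env), store
    if tag == "app":
        _, f, a = expr
        clo, s1 = E(f, env, store)      # E[f](e, s)  = (<i,b,e'>, s1)
        a_v, s2 = E(a, env, s1)         # E[a](e, s1) = (a_v, s2), left to right
        _, i, b, e1 = clo               # fails if the operator is not a closure
        l = new(s2)                     # l = (new s2)
        # E[b](e'[i->l], s2[l->a_v]): note the call is on the body b,
        # which is not a subterm of expr.
        return E(b, {**e1, i: l}, {**s2, l: a_v})
    raise ValueError(f"unknown expression: {expr!r}")

identity = ("fun", "x", ("var", "x"))
print(E(("app", identity, ("num", 7)), {}, {}))
```

The last call evaluates {{fun x => x} 7} in the empty environment and store and returns the value together with the store that now holds the argument.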

Was that your point, that I needed to unconflate to define E? Can you
check my stores? I stared at & fiddled with them for a long time, as
my State semantics is rusty. I've been coding just Lambda alg FP.

Now I'll answer Joe's question, and define (this different!) Phi,

Phi: (Env x Store -> Value x Store) x (Env x Store -> Value x Store)

-> (Env x Store -> Value x Store)

I claim that

E[{f a}] = Phi(E[f], E[a]) in (Env x Store -> Value x Store)

That's kinda complicated, but it's easy for schemers, who are excellent
at functions that take functions as arguments etc.

Given alpha, beta in (Env x Store -> Value x Store), we define

Phi(alpha, beta)(e, s) = (b_v, s3), if

alpha(e, s) = (<i,b,e'>, s1)

beta(e, s1) = (a_v, s2)

l = (new s2)

E[b](e'[i->l], s2[l->a_v]) = (b_v, s3), and

Phi(alpha, beta)(e, s) = bottom, otherwise.

As you see, it's a straightforward translation of the eval rule.

> If you have a _compositional_ meaning definition derived from the
> above big-step rule, I'm sure everyone would be interested in seeing
> it.

I think I posted my Phi 2+ years ago, and nobody said it was false, or
interesting, but only that I had violated a rule, as my Phi used E.

I said no DS author insists that Phi be defined independently of E,
and we can't mathematize such a notion, and given such a Phi depending
on E, we can then re-define E by structural induction, so this rule is
even satisfied! I didn't get a response, but folks were worn out.

As to the rest of the traffic: I think of Shriram as sortuva friend,
he tried to find me a job once, & I don't get riled at his (quite
funny!) sarcasm. PLAI (like HtDP) looks like a great book. Nice to
see some mild support from MB, who I learned quite a lot from 2+ yrs
ago. Felix: I'd be happy to know something useful about DS, perhaps
you could post something about CFA, or give a link, but what I'm
saying is really simple & really worth understanding, I think.

Lauri Alanko

Jun 18, 2004, 5:37:08 AM

In article <57189ce0.04061...@posting.google.com>,

Bill Richter <ric...@math.northwestern.edu> wrote:
> I claim that
>
> E[{f a}] = Phi(E[f], E[a]) in (Env x Store -> Value x Store)

> Given alpha, beta in (Env x Store -> Value x Store), we define

>
> Phi(alpha, beta)(e, s) = (b_v, s3), if
> alpha(e, s) = (<i,b,e'>, s1)
> beta(e, s1) = (a_v, s2)
> l = (new s2)
> E[b](e'[i->l], s2[l->a_v]) = (b_v, s3), and
>
> Phi(alpha, beta)(e, s) = bottom, otherwise.
>
> As you see, it's a straightforward translation of the eval rule.

This is not a definition. This is a constraint, an equation that may
or may not hold for some functions E. You yet need to construct some
function that satisfies this equation. Well, all right, _that_
is easy: just set Phi(alpha,beta)(e, s) = bottom. Or you can choose
any of an infinite number of other functions. But only a couple of
them are ones that we are _interested_ in (in the sense that bottom
accurately corresponds with nontermination). And you haven't shown
which ones are those. Scott has.

This is a recursive equation, and the usual way of dealing with those
is to get the least fixed point, but for that you need to have an
_ordering_ and that gets you again into the world of CPOs.
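
To make the least-fixed-point idea concrete, here is a small Python sketch of the usual iteration from bottom, using the factorial equation rather than E. The names `F`, `bottom`, `approx` and the use of None for "undefined" are assumptions of the sketch. Each approximation is defined on more inputs than the one before, and that "more defined than" relation is exactly the ordering mentioned above.

```python
BOTTOM = None  # stands in for "undefined"; an assumption of this sketch

def F(f):
    """One unfolding of the equation f(n) = 1 if n == 0 else n * f(n - 1)."""
    def g(n):
        if n == 0:
            return 1
        prev = f(n - 1)
        return BOTTOM if prev is BOTTOM else n * prev
    return g

def bottom(n):
    """The everywhere-undefined function, least element of the ordering."""
    return BOTTOM

def approx(k):
    """The k-th approximation F(F(...F(bottom)...)), with k applications of F."""
    f = bottom
    for _ in range(k):
        f = F(f)
    return f

for k in range(5):
    print(k, [approx(k)(n) for n in range(5)])
```

The k-th approximation answers for n < k and is undefined elsewhere; the least fixed point is the limit of this chain, and it is that limit, not just any solution of the equation, that the construction picks out.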

Incidentally, I don't think it's necessary to add stores (or
continuations) to this discussion. They just add complexity without
bringing any new insight.

> I think I posted my Phi 2+ years ago, and nobody said it was false, or
> interesting, but only that I had violated a rule, as my Phi used E.

Indeed. That's what makes it recursive, and that's why it's not a
definition.

> I said no DS author insists that Phi be defined independently of E,
> and we can't mathematize such a notion,

We don't _need_ to mathematize such a notion, because in math things
are by default defined independently of themselves. It's _recursive_
definitions that need to be mathematized.

I think usually the authors take for granted that the reader knows
that when making a definition you can't refer, even indirectly, to the
thing that you are defining.

In Schmidt, p. 51, the existence of the meaning functions is proven by
structural induction. That's what "compositional" means: defined with
structural induction. Your "definition" of E is recursive, so you have
to find out some other way of specifying exactly which function you
are talking about. You haven't done this.

> and given such a Phi depending on E, and we then re-define E by
> structural induction, so this rule is even satisfied!

By structural induction on what? As you have been told, you no longer
have a base case.

You have been given the omega example a gazillion times. Consider. Let's
find, using your technique, the value of:

E[{{fun x => {x x}} {fun x => {x x}}}](e0)

Now, the start is easy. I'll leave out the store. e0 stands for the
empty environment.

E[{fun x => {x x}}](e0) = <x,{x x},e0>

So, by your definition (simplified without the stores), the result is now
E[{x x}](e1) (where e1 = e0[x-><x,{x x},e0>])

So, we get

E[x](e1) = <x,{x x},e0>

and therefore

E[{x x}](e1) = E[{x x}](e1)

You call that a definition?

Now, as far as constraints go, that one is trivial. What all this
means is that your "definition" is satisfied by e.g. the meaning
function that assigns the value 42 to this example. But we don't want
that. We want _bottom_. And not just because we are being difficult,
but because when I say ((lambda (x) (x x)) (lambda (x) (x x))) to an
interpreter, it _doesn't terminate_!
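
Here is a rough Python sketch of the store-free rule, run on omega. The tuple encoding and the helper `diverges` are assumptions of the sketch, and hitting Python's recursion limit is only a finite stand-in for nontermination, not a proof of it.

```python
# Hypothetical encoding: ("var", x) | ("fun", i, body) | ("app", f, a);
# closures are ("clo", i, body, env) tuples, environments are dicts.

def E(expr, env):
    """Store-free version of the rule, read as a recursive program."""
    tag = expr[0]
    if tag == "var":
        return env[expr[1]]
    if tag == "fun":
        _, i, body = expr
        return ("clo", i, body, env)
    _, f, a = expr                    # ("app", f, a)
    _, i, b, e1 = E(f, env)           # E[f](e) = <i,b,e'>
    a_v = E(a, env)                   # E[a](e) = a_v
    return E(b, {**e1, i: a_v})       # recursion on the body, not on a subterm

half = ("fun", "x", ("app", ("var", "x"), ("var", "x")))
omega = ("app", half, half)

def diverges(expr):
    """True if evaluating expr exhausts Python's recursion limit."""
    try:
        E(expr, {})
        return False
    except RecursionError:
        return True

print(diverges(omega))
```

Read as a program, the rule picks out one behaviour (it loops). Read as an equation, it only says E[{x x}](e1) = E[{x x}](e1), which any value satisfies.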

Lauri Alanko
l...@iki.fi

Joe Marshall

Jun 18, 2004, 11:20:51 AM

ric...@math.northwestern.edu (Bill Richter) writes:

> Now I'll answer Joe's question, and define (this different!) Phi,
>
> Phi: (Env x Store -> Value x Store) x (Env x Store -> Value x Store)
>
> -> (Env x Store -> Value x Store)

Modeling the store is unnecessary to my argument (as you will run into
trouble long before you want to introduce side effects). Let's keep
it simple:

Phi: (Env -> Value) x (Env -> Value) -> (Env -> Value)

> I claim that
>
> E[{f a}] = Phi(E[f], E[a]) in (Env x Store -> Value x Store)
>
> That's kinda complicated, but it's easy for schemers, who are excellent
> at functions that take functions as arguments etc.

So if you have a Scheme form `(f a)', you claim that the denotation
may be derived as follows:

V [[ f ]] = <a 3-tuple of args, body, and environment>

V[[ (f a) ]] = Phi( V[[ f ]], V [[ a ]])

> Given alpha, beta in (Env x Store -> Value x Store), we define
>
> Phi(alpha, beta)(e, s) = (b_v, s3), if
>
> alpha(e, s) = (<i,b,e'>, s1)
>
> beta(e, s1) = (a_v, s2)
>
> l = (new s2)
>
> E[b](e'[i->l], s2[l->a_v]) = (b_v, s3), and
>
> Phi(alpha, beta)(e, s) = bottom, otherwise.

Leaving out the store,

Phi (alpha, beta) (e) = b_v if

alpha (e) = <i, b, e'>

beta (e) = a_v

E[b](e'[i->a_v]) = b_v
otherwise bottom.

Let's plug in our original equation:

V [[ (f a) ]] = V[[ b ]](e'[i -> V[[ a ]]]) where

<i, b, e'> = V [[ f ]]

> As you see, it's a straightforward translation of the eval rule.

Right, but for one thing.

If you look at the semantic functions in section 7.2.3 in R5RS, you
will see that curly-E is defined over the recursive decomposition of
an expression. Because expressions are finite, the denotation of an
expression can be given non-recursively.

Now look at your valuation function. It too is defined recursively,
but the recursion is over the *body* of the function, not simply the
subform `f'. Now suppose our function f is recursive factorial.

V [[ (f a) ]] =

V[[ (if (zero? x) x (* x (fact (- x 1)))) ]](e'[i -> V[[ a ]]]) where

<i, (if (zero? x) x (* x (fact (- x 1)))), e'> = V [[ f ]]

Since V is compositional, this term:

V[[ (if (zero? x) x (* x (fact (- x 1)))) ]]

must be equal to some function (I'll call it phi2) of the valuation of
subterms:

= phi2 (V [[ (zero? x) ]], V [[ x ]], V [[ (* x (fact (- x 1))) ]])

Since V is compositional, the third subterm must be equal to some
function (I'll call it phi3) of the valuation of *its* subterms:

V [[ (* x (fact (- x 1))) ]]

= phi3 (V [[ * ]], V [[ x ]], V [[ (fact (- x 1)) ]])

Since V is compositional, the last subterm must be equal to some
function (I'll call it phi4) of the valuation of *its* subterms:

V [[ (fact (- x 1)) ]]

= phi4 (V [[ fact ]], V [[ (- x 1) ]])

But wait a second. Phi4 defines the composition necessary for
function application, so phi4 must be the same as the phi way up near
the top. In fact, phi3 is this same function. So by back
substitution, we find that V [[ (fact a) ]] is some composition (I'll
call this one phi6) of these terms:

V [[ (fact a) ]] =

phi6 (V [[ zero? ]],
V [[ a ]],
V [[ 1 ]],
V [[ * ]],
V [[ (fact a') ]] where a' = V [[ (- a 1) ]])

That last term, V [[ (fact a') ]], can be expanded with the above
equation:

V [[ (fact a) ]] =

phi6 (V [[ zero? ]],
V [[ a ]],
V [[ 1 ]],
V [[ * ]],
phi6 (V [[ zero? ]],
V [[ a' ]],
V [[ 1 ]],
V [[ * ]],
V [[ (fact a'') ]]
where a'' = V [[ (- a' 1) ]])
where a' = V [[ (- a 1) ]])

That last term, V [[ (fact a'') ]], can be expanded:

V [[ (fact a) ]] =

phi6 (V [[ zero? ]],
V [[ a ]],
V [[ 1 ]],
V [[ * ]],
phi6 (V [[ zero? ]],
V [[ a' ]],
V [[ 1 ]],
V [[ * ]],
phi6 (V [[ zero? ]],
V [[ a'' ]],
V [[ 1 ]],
V [[ * ]],
V [[ (fact a''') ]]
where a''' = V [[ (- a'' 1) ]])
where a'' = V [[ (- a' 1) ]])
where a' = V [[ (- a 1) ]])

As you can see, this diverges. Your valuation function is not well
defined.

> I think I posted my Phi 2+ years ago, and nobody said it was false, or
> interesting, but only that I had violated a rule, as my Phi used E.
>
> I said no DS author insists that Phi be defined independently of E,
> and we can't mathematize such a notion, and given such a Phi depending
> on E, and we then re-define E by structural induction, so this rule is
> even satisfied! I didn't get a response, but folks were worn out.

It probably should have been explained more explicitly, but the
problem isn't that Phi can't use E, but that Phi's use of E must
converge. This is easily satisfiable by compositional induction over
a finite expression, but not by compositional induction over the
*value* of the expression.
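
To illustrate the difference, here is a Python sketch of a valuation that recurses only on subterms. The names `denote` and `phi` and the tuple encoding are assumptions of the sketch. The key change is that a lambda denotes a Python function rather than a triple containing syntax, so `phi` never needs to call the valuation again. The sketch leans on Python's own functions for the value domain, which quietly hides the domain-theoretic work of showing such a domain exists.

```python
# Hypothetical encoding:
#   ("num", n) | ("var", x) | ("fun", i, body) | ("app", f, a)

def phi(alpha, beta):
    """Composition for application: uses only the two denotations."""
    return lambda env: alpha(env)(beta(env))

def denote(expr):
    """Map syntax to a function Env -> Value, recursing only on subterms."""
    tag = expr[0]
    if tag == "num":
        n = expr[1]
        return lambda env: n
    if tag == "var":
        x = expr[1]
        return lambda env: env[x]
    if tag == "fun":
        _, i, body = expr
        d_body = denote(body)          # the body is a subterm of the lambda
        return lambda env: (lambda v: d_body({**env, i: v}))
    if tag == "app":
        _, f, a = expr
        return phi(denote(f), denote(a))
    raise ValueError(f"unknown expression: {expr!r}")

half = ("fun", "x", ("app", ("var", "x"), ("var", "x")))
omega = ("app", half, half)

d = denote(omega)   # finishes: the recursion is over a finite expression
print(callable(d))
```

Computing `denote(omega)` finishes because the expression is finite; it is only applying the resulting function to an environment that fails to return, which is where bottom belongs.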

Bill Richter

Jun 18, 2004, 11:08:41 PM

Lauri Alanko <l...@iki.fi> responded to me:

> E[{x x}](e1) = E[{x x}](e1)

>
> You call that a definition?

You caught me again! Thanks. And I'm very pleased to see that the
traffic today was just Math, from you & Joe.

I should've said I have some reduction rule, and if it doesn't return
a value in finite time, then we say

E[expr](e, s) = bottom

I'm drawing a blank on my reduction rule, so I'll think about it and
get back to you, after I unstick myself. My State Semantics is rusty.

But Lauri, I take strong exception to other things you said, and I can
deal with these now. The model for my (poorly defined) function

E: Expressions ---> (Env x Store -> Value x Store)

is Felleisen & Flatt's Standard Reduction function eval_s, defined on
p. 51 in <http://www.ccs.neu.edu/course/com3357/mono.ps>.

It seems to me that all the complaints you made about my E (other than
my goof!) apply to F&F's eval_s. I'd say that their function

eval_s : LC_v Expressions ---> LC_v Values

is well-defined and satisfying compositionality, even though they
don't do the various things you mention below. Or Barendregt's
Standard Reduction function in LC, which is not I think
compositional. Can you think about that, while I fix my E?

> This is a recursive equation, and the usual way of dealing with those
> is to get the least fixed point, but for that you need to have an
> _ordering_ and that gets you again into the world of CPOs.

I say no. Are you saying F-F & Barendregt needed CPOs? It's just
induction. Keep flailing away with the reduction rule until you terminate,
and if you don't, send it to bottom. Just because this function is
(as you say) the solution of a fixed point equation doesn't mean you
have to consider all possible solutions of the fixed point equation.
You just inductively define the one solution you want, and you don't
need the CPOs, which tell you the desired solution is the minimal one!
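
Roughly what I have in mind, as a Python sketch under my own assumptions (a store-free evaluator, a step budget called `fuel`, the names `ev` and `eval_bounded`): run the rule for at most so many steps, and say the expression goes to bottom exactly when no budget is ever enough. Note that a single run can only report "no answer yet", never bottom itself.

```python
class OutOfFuel(Exception):
    """Raised when the step budget is spent before a value is reached."""

# Hypothetical encoding: ("var", x) | ("fun", i, body) | ("app", f, a)

def ev(expr, env, fuel):
    """Return (value, fuel_left), spending one unit of fuel per step."""
    if fuel == 0:
        raise OutOfFuel
    tag = expr[0]
    if tag == "var":
        return env[expr[1]], fuel - 1
    if tag == "fun":
        return ("clo", expr[1], expr[2], env), fuel - 1
    _, f, a = expr                              # ("app", f, a)
    (_, i, b, e1), fuel = ev(f, env, fuel - 1)  # operator first
    a_v, fuel = ev(a, env, fuel)                # then operand
    return ev(b, {**e1, i: a_v}, fuel)          # then the closure body

def eval_bounded(expr, fuel):
    """Value of expr if reached within fuel steps, else None ("not yet")."""
    try:
        return ev(expr, {}, fuel)[0]
    except OutOfFuel:
        return None

identity = ("fun", "x", ("var", "x"))
half = ("fun", "x", ("app", ("var", "x"), ("var", "x")))
omega = ("app", half, half)

print(eval_bounded(("app", identity, half), 10))
print(eval_bounded(omega, 100))
```

The first call reaches a closure within the budget; the second runs out of fuel, as it would for any budget.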

> Incidentally, I don't think it's necessary to add stores (or
> continuations) to this discussion. They just add complexity without
> bringing any new insight.

You don't need stores to understand my goof! Plus, I goofed in
another way. The SICP conflation works fine if we tack on Env:

E: Expressions ---> (Env -> Value x Env)

> > I think I posted my Phi 2+ years ago, and nobody said it was
> > false, or interesting, but only that I had violated a rule, as my
> > Phi used E.
>
> Indeed. That's what makes it recursive, and that's why it's not a
> definition.

I say no, but that's a good question. I define E first by induction,
and then afterward I define Phi in terms of E. I don't define E &
Phi by simultaneous recursion.

> In Schmidt, p. 51, the existence of the meaning functions is proven
> by structural induction. That's what "compositional" means: defined
> with structural induction.

I say that's false, Lauri. Can you give me an exact quote? Here's my
quote again from Cartwright and Felleisen's "Extensible Denotational":

[The map from syntactic domains to semantic domains] satisfies the
law of compositionality: the interpretation of a phrase is a
function of the interpretation of the sub-phrases.

It's just what I said: there must exist such functions like my (poorly
defined) Phi. It doesn't matter how you construct the Phi functions.

So I assert that C-F & Schmidt do not make your definition, and
furthermore, it's impossible to make a mathematical formulation of
your definition of compositional. Can you check?

> Your "definition" of E is recursive, so you have to find out some
> other way of specifying exactly which function you are talking
> about. You haven't done this.

Yeah, I sure haven't! Many apologies.

> You have been given the omega example a gazillion times.

Thanks for giving it to me again.

Bill Richter

Jun 19, 2004, 12:36:08 AM

Joe Marshall <j...@ccs.neu.edu> responded to me:

> > I said no DS author insists that Phi be defined independently of
> > E, and we can't mathematize such a notion [...]

>
> It probably should have been explained more explicitly, but the
> problem isn't that Phi can't use E, but that Phi's use of E must
> converge. This is easily satisfiable by compositional induction
> over a finite expression, but not by compositional induction over
> the *value* of the expression.

Let's go with that, Joe. Thanks. That's a nice "mathematical" thing
to say. Now the burden is on me to define a converging E & Phi.

> > Phi: (Env x Store -> Value x Store) x (Env x Store -> Value x Store)
> >
> > -> (Env x Store -> Value x Store)
>
> Modeling the store is unnecessary to my argument (as you will run
> into trouble long before you want to introduce side effects). Let's
> keep it simple:
>
> Phi: (Env -> Value) x (Env -> Value) -> (Env -> Value)

I failed to define such a Phi last night, and therefore switched back
to (Env x Store -> Value x Store). The obvious problem is that
evaluating expressions can change the store/env with side effects.

But even with functional programs, you need side effects for the
recursion to make sense, at least in the R5RS way that I'm thinking.
We need define (which in R5RS DS is really set!), which I didn't
mention yet, but it's going to set up a binding

Ident -e'--> Locations -s--> Values

fact -> (e' fact) -> <i, (if (zero? x) x (* x (fact (- x 1)))), e'>

and later we'll add the bindings (let's say a_v = 6)

i -> l -> 6
i -> l' -> 5
i -> l'' -> 4

Your diverging factorial computation didn't seem to use that binding
for fact.

> So if you have a Scheme form `(f a)', you claim that the denotation
> may be derived as follows:
>
> V [[ f ]] = <a 3-tuple of args, body, and environment>

That's not quite right, and your post yesterday had this trouble too.
You're reverting my E to

V: Expressions -> (Env -> Values)

so you needed to say

V [[ f ]](this env) = <a 3-tuple of args, body, and environment>

Maybe that's what you mean, and that's probably fine for this
factorial computation. But in general, I can't define Phi without V
being a function on all env's. Let's do it my way:

E: Expressions -> (Env x Store -> Value x Store)

I can't define my Phi(alpha, beta)(e, s) with alpha & beta only
calling the initial argument (e, s).

Joe Marshall

Jun 19, 2004, 1:13:27 PM

ric...@math.northwestern.edu (Bill Richter) writes:

> Joe Marshall <j...@ccs.neu.edu> responded to me:
>
>> > I said no DS author insists that Phi be defined independently of
>> > E, and we can't mathematize such a notion [...]
>>
>> It probably should have been explained more explicitly, but the
>> problem isn't that Phi can't use E, but that Phi's use of E must
>> converge. This is easily satisfiable by compositional induction
>> over a finite expression, but not by compositional induction over
>> the *value* of the expression.
>
> Let's go with that, Joe. Thanks. That's a nice "mathematical" thing
> to say. Now the burden is on me to define a converging E & Phi.
>
>> > Phi: (Env x Store -> Value x Store) x (Env x Store -> Value x Store)
>> >
>> > -> (Env x Store -> Value x Store)
>>
>> Modeling the store is unnecessary to my argument (as you will run
>> into trouble long before you want to introduce side effects). Let's
>> keep it simple:
>>
>> Phi: (Env -> Value) x (Env -> Value) -> (Env -> Value)
>
> I failed to define such a Phi last night, and therefore switched back
> to (Env x Store -> Value x Store). The obvious problem is that
> evaluating expressions can change the store/env with side effects.
>
> But even with functional programs, you need side effects for the
> recursion to make sense, at least in the R5RS way that I'm thinking.

No, you don't. That's why I used the Y operator. You get recursion
out of self-application.
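
As a quick illustration in Python (my choice of language for the sketch, not anything from R5RS): because Python is strict, this uses the call-by-value variant of Y, usually called Z, and the names `Z` and `fact` are just for the example. There is no define, no set!, and no store anywhere.

```python
# Z combinator: the call-by-value fixed-point operator. The inner
# `lambda v: x(x)(v)` delays the self-application so a strict
# language does not loop while building the function.
Z = lambda f: (lambda x: f(lambda v: x(x)(v)))(lambda x: f(lambda v: x(x)(v)))

# Factorial with no named recursion: `self` is supplied by Z.
fact = Z(lambda self: lambda n: 1 if n == 0 else n * self(n - 1))

print(fact(5))
```

The recursive call to `self` is produced entirely by `x(x)`, so the binding for the recursive function lives in an ordinary argument, not in a mutable location.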

> We need define (which in R5RS DS is really set!), which I didn't
> mention yet, but it's going to set up a binding
>
> Ident -e'--> Locations -s--> Values
>
> fact -> (e' fact) -> <i, (if (zero? x) x (* x (fact (- x 1)))), e'>
>
> and later we'll add the bindings (let's say a_v = 6)
>
> i -> l -> 6
> i -> l' -> 5
> i -> l'' -> 4
>
> Your diverging factorial computation didn't seem to use that binding
> for fact.

I hope you want your factorial to work for other values than 6. But
let me point out that I'm *not* performing a factorial computation.
I'm performing an analysis of the factorial program and it is the
analysis that is diverging, not the program. The fault lies in the
analysis, which is not sufficiently powerful to model recursive
programs.

Look what happens even if you do substitute in a value:

V [[ (fact 2) ]] =
phi6 (V [[ zero? ]],
V [[ 2 ]],