[erlang-questions] Inconsistent and incomprehensible handling of variables in spec's by compiler

Robert Virding

unread,

Feb 17, 2012, 4:54:41 AM2/17/12

to Erlang Bugs, erlang-questions

If in my file I add the sepc:

-spec set_var(PrefixExp, Val, State) -> State.

I get the compiler errors, errors mind you, not warnings:

src/luerl_eval.erl:332: type variable 'PrefixExp' is only used once (is unbound)
src/luerl_eval.erl:332: type variable 'Val' is only used once (is unbound)

but if I write it as:

-spec set_var(_PrefixExp, _Val, State) -> State.

then the compiler is happy and is quiet. If I write it as:

-spec set_var(Val, Val, State) -> State.

(which is wrongly from my point of view) or go all verbose and write:

-spec set_var(PrefixExp::any(), Val::any(), State::any()) -> State::any().

(which doesn't really add any useful information) the compiler is again happy and silent.

This is inconsistent and does not make sense for variables in a function spec. Either it is wrong, for some incomprehensible reason, to have singleton variables in a function spec, or it shouldn't matter what the variable names are. And why should adding a type, even the most general one, make a difference to the variable name usage. And why should the compiler be bothered with the function spec at all seeing it is doesn't use it and it is not part of the language.

In my naive view of the the typing world my original spec says that set_var/3 takes three arguments of any type and returns a value of the same type as the third argument. Then why not just add type information and make the compiler happy? Well in my case it doesn't give me anything; it makes the spec much longer and there is nothing in a type which would give me more information than is in the variable names. Also dialyzer doesn't actually *need* the spec as it works it out by itself anyway. And the type of PrefixExp is a big hairy recursive type which I am happy to let dialyzer work out for me. Typer gives me a half-screen long type for it.

I get the same behaviour when running dialyzer on the different versions. Again I don't understand why it behaves like that.

Robert
_______________________________________________
erlang-questions mailing list
erlang-q...@erlang.org
http://erlang.org/mailman/listinfo/erlang-questions

Magnus Henoch

unread,

Feb 17, 2012, 6:49:09 AM2/17/12

to Robert Virding, erlang-questions, Erlang Bugs

There is a big difference between this:

> -spec set_var(PrefixExp, Val, State) -> State.

and this:

> -spec set_var(PrefixExp::any(), Val::any(), State::any()) -> State::any().

The argument and return value type specifications are either Type
or Name::Type, so the first declaration says that the arguments
are of _types_ PrefixExp, Val and State, while the second declaration
says that all three arguments are of type any(). (As far as I know,
argument names are ignored by the compiler and dialyzer, and only
used by edoc.) Specifying a singleton variable as the _type_ of an
argument is most likely not what the programmer meant, since it
doesn't add any information, so to me it makes sense to have the
compiler reject it.

That specification could be written like this:

-spec set_var(PrefixExp::any(), Val::any(), State::X) -> NewState::X.

which specifies that State and NewState must have the same type, no
matter which one.

Hope this helps,
--
Magnus Henoch
Erlang Solutions Ltd
http://www.erlang-solutions.com/

Robert Virding

unread,

Feb 17, 2012, 1:33:02 PM2/17/12

to Magnus Henoch, erlang-questions, Erlang Bugs

Yes, I know that. My comment was mainly about how variables are handled. It is not logical that the compiler complains about a variable being used once but not when used more than once. It also doesn't make sense that prefixing the variable name with a '_' makes the compiler keep quite. It seems like it is reusing some of the variable checking code for functions but generating errors instead of warnings.

In my case just using variable names does add information, it tells me what it is. In the respect the actual type is unimportant, if I need to type check my functions dialyzer will do it for me without a type spec. Just giving it an appropriate name is enough. It is also a way of giving long names in the "documentation" which allows me to use short names in the code. Reusing the same name tells me it is the same type. In this respect it gives me as much information as your alternative and is shorter and more concise.

Again the actual errors I get from the compiler don't make sense and I ask why the compiler should worry as it does not use the information at all. The type interpretation is actually very close to my usage and would work if it wasn't for the compiler errors, and the similar dialyzer errors.

Robert

Kostis Sagonas

unread,

Feb 17, 2012, 3:56:00 PM2/17/12

to Erlang Bugs, erlang-questions

As others mentioned, in their most basic form, specs should contain
*types*, not type variables. Continuing your example, the following is a
valid spec:

-spec set_var(exp(), val(), state()) -> state().

for some appropriate definitions of exp(), val() and state() types.
I recommend that you write such specs. I would even go as far as say
that you should stick to this kind of specs: specs that contain types
and types alone.

Of course, the benefits of writing the above spec are dependent on the
precision of your type declarations. If you are lazy and map all three
types to any(), something that the type language allows you to do, then
of course you cannot expect much from the tools that take these specs
into account, can you? (In such a case the spec is there mostly (only?)
for documentation purposes. Personally, I am not interested in this type
of specs; they have very little purpose.)

Now, for convenience and for compatibility with what the edoc @spec
language also allowed, we have retained the possibility that the user
provides, perhaps selectively, *names* to argument positions and return
results. So, you can write the above as:

-spec set_var(PrefixExp::exp(), ImpValue::val(), state()) -> state().

Note that PrefixExp and ImpValue, i.e. whatever comes to the left of the
:: symbol in the above, are *optional names* for arguments 1 and 2, not
type variables.

Type variables in specs come into the picture when you want to express
(subtype or equality) relations between different argument positions or
return values. (And by definition to express a relation you need to
somehow use the same variable twice; personally I cannot think of a case
where a singleton type variable is interesting.) For example, you may
want to state in a more emphatic way that the third argument and the
return result of the set_var/3 function are of the same type. You can do
that as follows:

-spec set_var(PrefixExp::exp(), ImpValue::val(), InState::ST) ->
OutState::ST when ST::state().

(Note that the only type variable in the above is ST.)

Another example of use of type variables is in functions like map/2.
Since the type system is based on subtyping, you would need to write its
spec as follows:

-spec map(F::fun((X) -> Y), InL::[XE]) -> OutL::[YE]
when X::term(), Y::term(), XE::term(), YE::term(), XL::X, YL::Y.

or, even stricter, as:

-spec map(F::fun((X) -> Y), InL::[X]) -> OutL::[Y]
when X::term(), Y::term().

but I guess most people would find the above two ways of giving a spec
for map/2 probably too verbose so we also allowed them to write simply:

-spec map(fun((X) -> Y), [X]) -> [Y].

with the implicit assumption that for all type variables T that are not
qualified by a subtype constraint, there is a constraint T::term() which
is implicit.

(Note that in this case, all type variables are not singletons.)

Also, because we had a hunch that there may be Erlang programmers who
have hard time grasping the subtleties of the type language (for
example, they might be confusing type variables with argument and
position names) and might make typos in specs, we decided to never allow
type variables that are singleton. As I explained, such variables are
not just unconstrained, but they are really not needed because they
serve no purpose; instead type names do.

I saw/see no reason for one to write a type variable just to name a
position; if you think otherwise, please provide an example that makes
sense. Till you do, you are not allowed to write MY_NAME for the type
any(); if you want the type any() please write any(); if you want to
name it appropriately, you are required to write MY_NAME::any() or its
shorthand MY_NAME::_.