kinding and stuff


John Skaller2

Oct 23, 2018, 11:08:50 PM
to felix google
So I have just upgraded the compiler, making it more robust and simpler,
and enforcing the kinding system more rigorously.

Some issues arise.

OVERLOADING
=============

One is that when you define a type function:

typedef f = fun (x:TYPE):TYPE => ….


the symbol f is a NonFunction symbol. This means there can only be one of them
in any scope. So you can define

typedef fun <= (T:TYPE):TYPE= ….

which defines <= but it doesn’t overload with <= defined for expressions. That’s
because “typedef fun f” means the same as “typedef f = fun …”, i.e. it works similarly
to

var f = fun (x:int) => …

No overloading. For fixed symbols like “<“, “<=“ etc we can fix this by mapping
them in the type grammar. So you define

typedef type_le = …

and the parser maps

T1 <= T2

to

type_le (T1, T2)

But that’s not really very satisfactory.

GENERIC TYPE MATCHES
======================

These are implemented. If you write

typematch x with … => TRUE | … => FALSE endmatch

for example, the compiler knows the match is kind

TYPE -> BOOL

Empty matches aren’t allowed. The compiler uses unification to find
the most specific kind of which the kind of each branch is a subkind.
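
A minimal OCaml sketch of such a join, using the subkind facts stated in a later post
(UNITSUM below COMPACTLINEAR below TYPE); placing BOOL directly below TYPE is my assumption:

type kind = KIND_unitsum | KIND_compactlinear | KIND_bool | KIND_type

let subkind a b =
  a = b ||
  match a, b with
  | KIND_unitsum, (KIND_compactlinear | KIND_type) -> true
  | KIND_compactlinear, KIND_type -> true
  | KIND_bool, KIND_type -> true
  | _ -> false

(* join two kinds: the most specific common superkind *)
let join a b =
  if subkind a b then b
  else if subkind b a then a
  else KIND_type

(* the kind of a whole match: fold the join over the branch kinds *)
let kind_of_match = function
  | [] -> invalid_arg "empty matches aren't allowed"
  | k :: ks -> List.fold_left join k ks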

UNIT
====

Felix can kind type tuples. For example

int, long

has kind

TYPE * TYPE

and of course

int : TYPE

But what about

()

This is an empty type tuple. It’s very important. There are no array kinds yet, but

TYPE ^ 0 = UNIT
TYPE ^ 1 = TYPE
TYPE ^ 2 = TYPE * TYPE

etc would make sense. Recall TYPE is the category of types and functions.
The UNIT category is extremely important because without it, there’s no
way to give a kind to a constant type function:

typedef boool = fun () => bool;

This should have kind

UNIT -> TYPE

but we cannot say that at the moment. There’s an issue here:

_typeop (“_staticbool_and”, T, BOOL)

because this function is a chain function: it “and”s together any number of expressions,
including none! The argument T has to be a type tuple of types of kind BOOL.
Of course if T = () we want the function to return TRUE.

In this case, staticbool’s and operator knows an empty type tuple must be
interpreted as

BOOL ^ 0

so the question arises how to handle UNIT: should there be one UNIT, or
should BOOL ^ 0 and TYPE ^ 0 be distinct?

It’s the same kind of question you’d ask if a type match were empty (no branches).
In that case it would be

TYPE -> VOID

Two nasty things happen here: I need to get the algebra right, and, to type the
result properly … WE NEED SORTS ON TOP OF THE KINDS.

To do generic kinds, we have to have KIND VARIABLES.

There is much confusion here. I am focussed more on BOOL and using
it for constraints, and on UNITSUM for polyadic linear arrays. Even the array
stuff is problematic because it ONLY works for linear arrays.

One has to remember that

((1,2,3),(4,5)) : int ^ 3 * int ^ 2

can be linearised with a type coercion to type int ^ 5. We note the following fact:
the type can also be written:

int ^ (3 + 2)

This is hard to grok, but it means that to index the array, you first pick which subarray
to access, the first or second one, then give the index within that one.
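
A minimal OCaml sketch of that two-step indexing, with hypothetical names: an index
of type 3 + 2 is a sum value, and linearising it just offsets the second case past
the first subarray.

type idx = First3 of int | Second2 of int   (* an index of type 3 + 2 *)

let linearise = function
  | First3 i -> i        (* i in 0..2: inside the first subarray *)
  | Second2 i -> 3 + i   (* i in 0..1: offset past the first subarray *)

let () =
  (* element (Second2 1) of the flattened int ^ 5 sits at position 4 *)
  assert (linearise (Second2 1) = 4)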

So what’s the problem?????

HERE:

Static assert failed
In /Users/skaller/felix/x.flx: line 10, cols 1 to 50
9:
10: static-assert type_eq (int^3 * int^2, int^(3+2));
**************************************************
11:

The problem is clear: the compiler doesn’t know how to use all the index laws.
This fails too:

static-assert type_eq (3 *+ int, int + int + int);

But this works:

static-assert type_eq (int^3, int * int * int);

The reason is CANONICAL FORMS. When the compiler sees the RHS above it
is translated to the LHS, so they test equal because they’re identical type terms.

But this isn’t done for everything.
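
A toy OCaml sketch of that canonicalisation, with a hypothetical type AST: a tuple
whose components are all the same type is rewritten to an array term, so
int * int * int and int ^ 3 become identical terms.

type ty = Int | Tuple of ty list | Array of ty * int

let rec canon t =
  match t with
  | Tuple ts ->
    let ts = List.map canon ts in
    (match ts with
     | hd :: tl when List.for_all (( = ) hd) tl -> Array (hd, List.length ts)
     | _ -> Tuple ts)
  | t -> t

let () = assert (canon (Tuple [Int; Int; Int]) = Array (Int, 3))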

Now here is the KEY THING!!!!

Programming languages are CHARACTERISED by the choice of which
isomorphisms are represented as identities.

A trivial example: in Felix int ^ 1 = int. There are no tuples of one element.
In other languages, like Python, there are tuples of one element:

(42,)

is how you write a tuple of one element in Python. This has advantages and
disadvantages. For example suppose we had a kind TUPLE, then

int * long : TUPLE
1 : TUPLE
int : ??

You see here, int isn’t a tuple, even though int ^ 1 is equal to int.
And here’s the hassle:

( (1,2), 3) : (int * int) * int

this is a tuple of two values, the first is a tuple also, the second isn’t.
So what about:

T * U

Now we don’t know if T is a TUPLE or not!

If we had tuples of one element the type system would be a lot more uniform.
The PROBLEM with that is that you then need two coercions:

untuple (1,) -> 1
entuple 1 -> (1,)

Entuple works on any value, but untuple requires a tuple of length 1.
What’s wrong with that?

There’s nothing *wrong* with it, but now these functions are distinct:

fun f[T] (x:T) => …
fun f[T] (x: T ^ 1) => …

It’s worse: if you make arrays distinct from tuples .. then you need to convert

int * int * int <—> int ^ 3

These types are isomorphic but not identical if you do this. At present in Felix,
a tuple of elements of the same type is an array. There are no arrays of length 1.
An array of length 0 is identical to (). In particular, the element type of the array
is lost.

On the other hand, this isomorphism:

2 * int === int + int

is NOT provided, instead we have to use:

2 *+ int

the “repeated sum”. The reason is that the “standard” representation of sums
is indeed

struct _uctor_ { uint64_t tag; void *data; };

and I should have used:

template<class T> … T data …

and then the layout would be the same and I could make the types identical.
But I can’t. And sum types have other efficient representation for example

variant opt[T] = None | Some of T;

is literally a NULL pointer for the first case and a pointer to T for the second one.
Other variants have other compact representations.

In fact the core one is OF COURSE compact linear types!

Because of these choices, the isomorphisms cannot be made into
identities. Of course

A * B === B * A

but that isomorphism is not one we want to be an identity!



John Skaller
ska...@internode.on.net





John Skaller2

Oct 23, 2018, 11:36:25 PM
to felix google
>
> The problem is clear: the compiler doesn’t know how to use all the index laws.
> This fails too:
>
> static-assert type_eq (3 *+ int, int + int + int);

I have no idea why because, checking, the canonical form IS enforced!
Debugs show the types are indeed identical.

The check above is done by type matches in the library, not using
type ops.



John Skaller
ska...@internode.on.net





John Skaller2

Oct 24, 2018, 12:01:14 AM
to felix google


> On 24 Oct 2018, at 14:36, John Skaller2 <ska...@internode.on.net> wrote:
>
>>
>> The problem is clear: the compiler doesn’t know how to use all the index laws.
>> This fails too:
>>
>> static-assert type_eq (3 *+ int, int + int + int);
>
> I have no idea why because, checking, the canonical form IS enforced!
> Debugs show the types are indeed identical.

Ah, no they don’t… trivial stupidity: forgot to put *+ operator in the
type grammar. Not sure why it parsed!


John Skaller
ska...@internode.on.net





John Skaller2

Oct 25, 2018, 10:25:58 PM
to felix google
RATIONALE
==========

A reduction is a rule like the one here:

class FloatAddgrp[t] {
  inherit Eq[t];
  virtual fun neg : t -> t;

  reduce inv(x:t): - (-x) => x;
  axiom sym (x:t,y:t): x+y == y+x;
}

It’s a special kind of axiom which is not checked; rather, it tells the compiler
to replace one expression by another. In this case it says

if you see an expression doubly negated, remove both negations

I removed processing of reductions some time ago but now I want to put them back.
Vital optimisations can be expressed with reductions which are hard to build into
the compiler with generality, or, indeed, at all!

A trivial analogue of the above:

rev (rev x) => x

reversing a list twice has no effect, so don’t! There are some VERY POWERFUL
optimisations which are less trivial. Here’s a FUSION reduction:

map f (map g x) ==> map (f \circ g) x

For example, instead of mapping g over a list, then mapping f
over the result, map the composite function f \circ g over the list.
Saving the construction and rescanning and deallocation of a temporary list.

What is even more interesting about this one is that

THE RULE APPLIES TO ALL FUNCTORS

Not just lists, trees, arrays, anything at all, as long as it is
a functor. In fact

THIS IS THE DEFINITION OF A FUNCTOR

The point is, map preserves the structure of the original data type, changing
only the values of the elements. And I might add this is one of the core
optimisations in Haskell. But it is not the only fusion optimisation.
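
The law is easy to check concretely; here it is for OCaml’s List.map (not Felix,
just an illustration of the rule):

let () =
  let compose f g x = f (g x) in
  let xs = [1; 2; 3] in
  (* two passes over the list versus one pass with the composite *)
  assert (List.map string_of_int (List.map succ xs)
          = List.map (compose string_of_int succ) xs)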

At present, Felix has no way to say: if this is a functor, maps over it can
be fused: but we CAN say for a particular functor like list to do the fusion,
and then again for array.

PROBLEMS
=========

Applying reduction is EXTREMELY EXPENSIVE. What Felix does is take
some code and match the LHS of EVERY reduction rule to EVERY expression,
and that includes EVERY SUBEXPRESSION OF EVERY EXPRESSION.
If a reduction is performed, a previous failure could now succeed so we might
do the matching again and keep going until there are no more reductions.
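
A toy OCaml fixpoint rewriter for the rev (rev x) => x rule shows why a successful
rewrite forces rescanning (the term type is hypothetical):

type term = Rev of term | Lit of int list

let rec reduce t =
  match t with
  | Rev (Rev x) -> reduce x   (* the reduction rule fires *)
  | Rev x ->
    (match reduce x with
     | Rev y -> y             (* reducing the child exposed a NEW redex *)
     | y -> Rev y)
  | Lit _ -> t

let () = assert (reduce (Rev (Rev (Rev (Lit [1; 2])))) = Rev (Lit [1; 2]))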

So it’s hellishly expensive. But that’s the least of the problems!
At present I’m working on a reduction pass that finds reductions
in your code AS YOU WROTE IT.

It’s very unlikely it will find any. Who is going to write

rev (rev mylist)

??? No one. Where we profit from reductions is applying them
AFTER INLINING. For example it’s common, when processing lists, to take
a list, do some processing on it, resulting in a backwards list, so
our function reverses it. Another function does the same, and another.
Then you chain them together. However a lot of these algorithms
work fine on reversed lists too, for example map. In fact rev_map
is faster than map, because map is literally defined as rev_map,
which is tail recursive, followed by rev.

So why does the programmer keep reversing the list?
The answer is: because it’s too dang hard to keep track of the order
of the list otherwise. You have to throw in a “rev” manually after
counting the number of operations if the number is odd.

The optimisation above, applied recursively AFTER inlining,
might do that for you. Although we need an axiom

rev (f x) = f (rev x)

to allow it.
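
In OCaml the same trick shows up with List.rev_map: map is rev_map followed by rev,
so chaining two rev_maps makes the two hidden reversals cancel, which is exactly the
rev (rev x) => x reduction applied after inlining:

let () =
  let xs = [1; 2; 3] in
  (* no reversal is ever performed on the left hand side *)
  assert (List.rev_map succ (List.rev_map succ xs)
          = List.map succ (List.map succ xs))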

But here’s the problem: inlining is now done AFTER monomorphisation,
and monomorphisation maps every instance of a polymorphic function
or type that is actually used to a brand new function or type.

And the reduction pattern matching won’t recognise these patterns.
[Unless (a) they were already monomorphic or (b) they’re C bindings
which are not monomorphised]

It isn’t possible to instantiate reductions in classes because we don’t
know what instances are needed. Also there is a real problem in that
if we do somehow find a match after monomorphisation, and replace
some expression with another, where do the symbols in the new expression
come from??? We didn’t know that instance of a symbol was going to be used
when we monomorphised.

Felix USED to do inlining on polymorphic functions, before monomorphisation,
in fact Felix never did monomorphisation! It just tracked required instances
and the monomorphisation was done “on the fly” in the code generator.

But that meant inlining had to not only replace parameters with arguments
it had to specialise types as well and it got really hard to understand the code
at all. So I decided to monomorphise first, THEN inline, so there were no type
variables or specialisations to worry about.

The good news is monomorphic reductions survive, as do polymorphic
ones applied to C bindings. So I can eliminate the others post-monomorphisation
provided I make sure the RHS symbols are retained by the symbol garbage collector.
After a pass, wipe all the reductions so the GC can clean out unused symbols.

I have no idea if all this will be effective though, remembering the cost of
checking if reduction rules actually apply.



John Skaller
ska...@internode.on.net





John Skaller2

Oct 26, 2018, 11:26:20 AM
to felix google
So .. monomorphic reductions are now first preserved, then discarded.
The check is that the symbols are all available in the monomorphisation table.
I think I can propagate them through to inlining by adding the reduction garbage
collection routine in the right place. Some monomorphic reductions will be lost
if their RHS contains unused symbols, even if applying the reduction would lead
to the symbol being used. In theory this situation can be improved, but even as it stands it may be useful.

And now I have a cool idea, though I don’t know for sure it will work.

Let’s add a new nominal data type rope, which contains a list of strings.
It stands for the concatenation of the strings. Adding ropes, or prepending or
appending a string to a rope is fast compared to the equivalent operations
on strings.

But, other operations won’t work, so we have to convert ropes back to strings
to do them.
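
A minimal OCaml sketch of the idea (the real Felix implementation appears a couple
of posts below): a rope is just the list of its pieces, kept in reverse order so
appending is O(1), and rendering joins the pieces once.

type rope = string list   (* pieces, in reverse order *)

let of_string s : rope = [s]
let append (r : rope) s : rope = s :: r           (* O(1) *)
let concat (a : rope) (b : rope) : rope = b @ a   (* a followed by b *)
let render (r : rope) = String.concat "" (List.rev r)

let () =
  let r = concat (of_string "Hello ") (of_string "world") in
  assert (render r = "Hello world")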

Now here is the FUN!!

Consider

reduce ropes (x:string,y:string): x + y => mkrope (x,y);
reduce ropes (x:rope, y:string): x + y => mkrope (x,y);
reduce ropes (x:string, y:rope): x + y => mkrope (x,y);

and three matching mkrope functions, one for each reduction:

fun mkrope(x:string,y:string) => …


Now, all string concatenation makes ropes and ropes add to strings.

So how do we get a string when we need one, for example:

search (s1, s2)

searches in s1 for s2. We don’t want to overload search so s1 and s2 can
be a string or a rope (that’s 4 functions already).

What we need is something like a C++ conversion operator.

And we have one!!!!!!!

supertype string (x:rope) => …

This says, a rope is a subtype of a string, so a rope can be passed to a function
with a string parameter. When that happens, the function above is used as a coercion.

If this works it’s magical, because it makes the use of ropes transparent!

The main problem is that this is ambiguous:

fun f(x:string) => …
fun f(x:rope) => …

because Felix does not weight subtyping coercions worse than no coercions.

At least I think this is what happens … hmm …



John Skaller
ska...@internode.on.net





John Skaller2

Oct 27, 2018, 3:50:11 PM
to felix google
OK so finally this works:

/////////
noinline fun sx3(x:string,y:string,z:string) {
  var a = "Sx3(" + x;
  a += y;
  a += z + ")";
  return a;
}

reduce s3(x:string, y:string, z:string): x + y + z => sx3(x,y,z);
proc B() {
  var p = "p ";
  var b = sx3(p,p,p);
  println$ b;
}

noinline proc A() {
  var q = "hello ";
  var chaos = "of chaos";
  val r = q + "World ";
  var s = r + chaos;
  println$ s;
}

A();
B();
///////////////

So, if I comment out the B() stuff, the reduction isn’t applied.
This is because Felix is conservative with reductions.

After monomorphisation and other operations, it strips the reductions down
to ones where every symbol on BOTH the left and right hand side of the reduction
exists.

If I take out B, sx3 is no longer directly called and is removed.
Then the s3 reduction, if applied, would fail. Of course I could keep all
the symbols on the RHS of every reduction until they’re applied for the
last time, then delete the reductions, and do another symbol GC pass.
But I want to be careful because reductions are EXPENSIVE.

The POINT however, is that the s3 reduction IS applied in A() as is seen
by the output:

Sx3(hello World of chaos)
Sx3(p p p )

It survives monomorphisation because it is a monomorphic reduction.
It cannot be applied in the polymorphic reduction phase because there is no
pattern

x + y + z

in A. However after monomorphisation and val folding the val q is replaced in the definition
of r and the result replaced in the definition of s, so s reads:

var s = “hello “ + “World “ + “of chaos”;

and now the pattern is present. It was important to make that work because most people
are not going to write reducible expressions; most of them will arise only after substitutions
as above, including inlining.

[Although in this case the s3 reduction applied 143 times to the library, which would have
made a complete mess if it had been used, since the result is a cheat: s3 doesn’t
actually just concatenate 3 strings]

Also, val folding currently ONLY works at --usage=production or --usage=hyperlight,
so the reduction won’t work at the --usage=prototype or --usage=debug optimisation levels.
[I should turn reductions off at lower optimisation levels perhaps]

What I actually wanted to experiment with was ropes!

The idea is that strings and ropes get added together to form ropes,
which convert back to strings for any operation other than addition
(concatenation).

Actually some operations would work fine on ropes.
For example, saving a rope to a file, you don’t need to join the
pieces into a string first. Iterators and maps also don’t require a string.
However any kind of indexing operation, search, etc etc probably does.

So the idea is that rope is a subtype of a string, and a coercion is provided.
The hard bit is something like:

fun + (x:rope, y:string) =>…
fun + (x:rope, y:rope) => …

Obviously, we want to use the second overload if possible. I think this works because
it is more specialised. The real nasty is this one:

fun + (x:string, y:string) => …

which is the original operation. Another issue is: coercions are introduced during binding.
But reductions are done after that .. :-)



John Skaller
ska...@internode.on.net





John Skaller2

Oct 27, 2018, 11:48:14 PM
to felix google
So below is a crude rope implementation.
And here are the performance results:

~/felix>flx --usage=hyperlight rope
Hyperlight optimisation on
String length = 40960
Len = 7255941120
Strings: Elapsed = 13.3477s
Rope concat: Elapsed = 5.71422s
Len = 7255941120
Ropes to string: Elapsed = 65.757s

The concatenation performance for ropes is surprising.
Not that it’s faster than the native C++ string concatenation ..
the fact it is so horribly SLOW.

If I take the “rev” out of the render routine it speeds up to 43 seconds.

What’s going on? The answer is COPYING. The C++ code can use
rvalue binders to avoid most copying. A native (written in C++) implementation
might even be faster.

Now, when a list is reversed, the data value of each node is copied
from one list to the next. There is an rvalue reverse that doesn’t even allocate
any new nodes, but removing the reverse is even faster. Then if you look
at the append, it’s a list concatenation, which internally has to reverse the
first list and then splice that onto the second list. So copying .. even though
the first list could be an “rvalue”. In fact internally the reversed list is known
to be an rvalue, which is how it can do a destructive splice safely. It knows
the reversed list is unique because it created it .. but it doesn’t use the rvalue
reverse (because it doesn’t know it’s an rvalue).

Internally, copying the data from one node to another doesn’t do a move
but a copy. Even though lists are immutable! So the strings are too.
But the routines don’t know that.

I’m going to try using pointers to heap allocated strings.
However I suspect that

(*ps).len

is going to make a copy of the object on the list, just to obtain its length.
This is because Felix has no references. For product types, overloaded projections
handle pointers to products efficiently. But methods on primitives only work with
pointers to them if you define the overloads yourself.

//////////////////////////
Implementation
/////////////////////////////
// define a rope as a list of strings in reverse order
struct rope {
  r: list[string];
}

// constructor from one string
ctor rope(s:string) => s.list[string].rope;

// appending a string just prepends it to the list
fun + (r:rope,s:string) => rope (s!r.r);

// appending a rope is just list concatenation
fun + (r:rope,s:rope) => rope (s.r + r.r);


// the length of a rope is the sum of the lengths of its pieces
fun len (r:rope): size =>
  fold_left (fun (acc:size) (s:string) => acc + s.len) 0uz r.r
;

fun render (r:rope): string {
  var out = "";
  reserve (&out, r.len + 1);
  iter (fun (s:string) { out += s; }) (rev r.r);
  return out;
}

proc test () {
  var s = "Hello";
  for i in 0..12 perform s += s;
  println$ "String length = " + s.len.str;
  begin
    var t = #time;
    var k = s;
    for i in 0 .. 10 perform k = k + k + k;
    println$ "Len = " + k.len.str;
    println$ "Strings: Elapsed = " + (#time - t).str + "s";
  end

  begin
    var t = #time;
    var k = rope s;
    for i in 0 .. 10 perform k = k + k + k;
    println$ "Rope concat: Elapsed = " + (#time - t).str + "s";
    #collect;
    t = #time;
    var q = render k;
    println$ "Len = " + q.len.str;
    println$ "Ropes to string: Elapsed = " + (#time - t).str + "s";
  end
}

test;


John Skaller
ska...@internode.on.net





John Skaller2

Oct 28, 2018, 2:17:37 AM
to felix google


> On 28 Oct 2018, at 14:48, John Skaller2 <ska...@internode.on.net> wrote:
>
> So below is a crud rope implementation.
> And here are the performance results:
>
> ~/felix>flx --usage=hyperlight rope
> Hyperlight optimisation on
> String length = 40960
> Len = 7255941120
> Strings: Elapsed = 13.3477s
> Rope concat: Elapsed = 5.71422s
> Len = 7255941120
> Ropes to string: Elapsed = 65.757s


Yes. Here is the implementation using pointers to heap allocated copies of the string:

~/felix>flx --usage=hyperlight rope
Hyperlight optimisation on
String length = 40960
Len = 7255941120
Strings: Elapsed = 12.192s
Rope concat: Elapsed = 0.56186s
Len = 7255941120
Ropes to string: Elapsed = 17.5594s

Now the rope concatenation utterly creams the string concatenation as it should.
However the rendering routine is still too slow.

I added this:

Rope len = 7255941120
Rope len time: Elapsed = 0.461086s

So it takes a long time just to add a few integers.



John Skaller
ska...@internode.on.net





John Skaller2

Oct 28, 2018, 5:40:37 AM
to felix google
So here is an idea. The list rev function has a special property: it ALWAYS returns
a unique list. Not all list functions do that; for example cons doesn’t: the tail could be
shared.

So the idea is this, roughly:

fun urev (x:list) => box( rev x);
fun urev (x: uniq list) => box (inplace_rev x);

The only hassle is we need urev as well as rev because if you write:

var u = urev x;
var a = u;
var b = u; // ERROR Uniqueness violation

You have to do

var u = unbox (urev x);

to discard the uniqueness explicitly. The advantage though is you can do:

fold_left (fun (acc:A) (elt:T) => .. _) init (urev x)

and it should work because uniq T is a subtype of T. Unfortunately, or not,
Felix ONLY allows the decay of uniq T to T when passing values to function parameters,
NOT in local variable initialisation; there you have to explicitly do an unbox to
discard it. Otherwise it would depend on the type of the variable:

var x : list = urev a; // x is type list
var x = urev a; // x is type uniq list

which might be a bit surprising. This can’t happen with parameters usually,
because the type has to be specified (except for generics ..)

If you write:

urev (urev x)

then the list is reversed in place twice.

I think Felix uniq types are more powerful than C++ rvalue binders BUT
they’re problematic. Temporaries (what ever that means) are always uniq.
Of course I’m not sure if that applies to data structures (like lists) or just
the head node. It’s confused by the fact algebraic lists are (supposedly) immutable.
Uniq is mainly for mutation.

I don’t like uniq. The algebra doesn’t feel right.



John Skaller
ska...@internode.on.net





John Skaller2

Oct 28, 2018, 12:36:43 PM
to felix google
LOL!

I did some hackery to ensure that Felix cannot be copying any strings.

I timed the Felix iter function without the string copy .. 2 seconds: BAD.
It’s 171K iterations though.

I rewrote the loop to get rid of the iter HOF:

fun render (r:rope): string {
  var out = "";
  reserve (&out, r.len + 1uz);
  var x = rev r.r;
next:>
  match x with
  | Empty => return out;
  | head ! tail =>
    x = tail;
    out += head;
    goto next;
  endmatch;
}

Still no go. Around 16 seconds and that’s JUST for the rendering to a string.
But,

IT SHOULD BE FASTER THAN C++.

Why? Because, I reserved the full length of the final string
so the string array is NEVER copied during the rendering.
(Although the “return” might be copying it .. … )

But then I thought ..

HEY .. my MACBOOK PRO HAS A FLASH DISK

Now, you have to remember I AM AN OLD PROGRAMMER.
I come from the days of Tape Drives and Floppy Disks.
I come from the days when a transistor radio next to the computer
was a debugging tool, picking up RF from the CPU.

These newfangled flash disks DO NOT MAKE ANY NOISE.

You getting me? Nope, you’re not old enough!!!!

So .. I swapped the order of the tests:

~/felix>flx --usage=hyperlight rope
Hyperlight optimisation on
String length = 40960
Rope concat: Elapsed = 0.559681s
Rope count = 177147
Rope count time: Elapsed = 0.464256s
Rope len = 7255941120
Rope len time: Elapsed = 0.46308s
Len = 7255941120
Ropes to string: Elapsed = 11.153s <======
Len = 7255941120
Strings: Elapsed = 27.234s <=========


Guess who’s faster now :-)

And now I returned a pointer to the final string in the ropes:

Ropes to string: Elapsed = 5.76164s
Ropes total: Elapsed = 7.25415s

So indeed, the function is copying the dang string.
So in this order, the ropes are 4 times faster than C++.

The reason .. VM. The final string is 7,255,941,120 bytes,
or about 7Gbyte. Half my computer’s RAM. So, the first
string isn’t deallocated and the second one is pushing
paging on my silent disk heavily. I just can’t hear the disk
rattling!

The funny thing is .. now I can’t get it to reproduce.
Returning a pointer to a string:

String length = 40960
Len = 7255941120
Strings: Elapsed = 12.3468s
Len = 7255941120
Ropes total: Elapsed = 6.54693s

and with the ropes running first:

String length = 40960
Len = 7255941120
Ropes total: Elapsed = 6.49097s
Len = 7255941120
Strings: Elapsed = 27.9665s

Either which way .. ropes are faster!
At least twice as fast.

If C++ doubles the length of the reserved store when it runs out,
this is AT MOST twice the amount of copying. Which roughly
agrees with the test result.

The collector runs, it says this:

Deleting collector total time = 35.04742 seconds, gc time = 1.25201 = 3.57%
It doesn’t run in the C++ version of the test.
Swapping the order:

Deleting collector total time = 18.68382 seconds, gc time = 1.27932 = 6.85%



John Skaller
ska...@internode.on.net





John Skaller2

Nov 2, 2018, 10:00:30 PM
to felix google
I’ve been thinking about how to make something like constexpr work.

I’m adding a new category of tests, unit tests.
I tried this:

typedef isd_t = int * string * double;
var x : isd_t = 1,"Hello",42.1;
var p0 = proj `0:3 of isd_t;



My first unit test failed. The grammar only allows a literal integer index for
tuple projections. Supporting the true index type is slightly tricky, because
Felix would have to check that the type is correct. The current AST terms
don’t allow that. Throwing out the type 3 can’t be allowed; it has to be checked.

But is it worthwhile?

My answer is: for literals only NO. On the other hand for a constexpr YES.
For example

constexpr val index = `0:3;
.. proj index of isd_t …

This is useful because the same “index” could index several things.
Maybe you could even have constexpr loops :-)

In Felix, as in C++, only some types would admit const exprs.
Literal strings, integers, and bools already get folded, however these
expressions don’t allow constexpr named values.

Currently “val” is almost a constexpr. However, for this to work, val substitution
would have to be done BEFORE binding.

On the other hand, macro substitution (and desugaring) currently handle folding.
And macros DO provide the feature we want like:

macro val index = `0:3;

except, it isn’t type checked.

Type checking is TRICKY in the presence of type variables!!!!
More precisely, if you have

`0:T + `1:T

assuming we could add unit sums, we still have to ensure T:UNITSUM.
If the addition were modular, we could not do the calculation until AFTER monomorphisation.

Any which way .. the grammar won’t even allow a macro name for a tuple projection.



John Skaller
ska...@internode.on.net





John Skaller2

Nov 4, 2018, 6:29:35 PM
to felix google
In order to check the compiler is doing what is expected, I’m adding a bunch
of unit tests to ensure correctness. By being systematic I’m hoping to provide
coverage. A number of omissions and errors have already been corrected.
There is a compiler refactoring in sight which could make a serious mess,
and these tests will help ensure the refactoring is correct.

The tests are also useful for tutorial purposes since they supposedly cover
all the cases with minimal examples.

The first batch of tests deals exclusively with projections.
The first one I wrote looks like this:

@title Stand Alone Tuple Value Integer (Constant) Projection Tuple domain
@felix
typedef d_t = int * string * double;
var x : d_t = 1,"Hello",42.1;
var p0 = proj 0 of d_t;
var p1 = proj 1 of d_t;
var p2 = proj 2 of d_t;
println$ x . p0;
println$ x . p1;
println$ x . p2;
@expect
1
Hello
42.1
@

This test deals with the CORE case of a stand alone tuple value projection.

Stand-alone means the projection is constructed as a first class function
before being applied. The argument of the projection operator “proj” here
must be a simple int literal. Note, typed integer literals are not allowed.
The argument of “proj” must be a constant because the codomain
of the projection varies with its domain value.

Then we have further tests of a similar nature for

* compact linear tuples
* arrays
* arrays with compact linear base

Also we have to check

* structs
* records

because they’re products with projections too, they just use the field name
as the projection instead of an integer.

Also singletons and unit tuples need to be handled right and checked!

Next, we have to check array projections:

@title Stand Alone Precise Array value projection
@felix
typedef a_t = int^3;
var x : a_t = 1,2,3;
for i in ..[3] do
var p = aproj i of a_t;
println$ x.p;
done
@expect
1
2
3
@

Unlike tuple projections, array projections accept a *variable* index.
We use “aproj” for array projections. They only apply to arrays, because
only arrays have the same codomain for all indices. Note that at
present, the index expression must be of the precisely correct type;
an integer will not do. In the above example the type is 3.

It’s a bit inconsistent, since tuple projections require an integer literal,
and won’t accept the precisely correct type!

So far, we’ve only dealt with value projections. Recall all projections
in Felix ALSO work for pointers. This is the secret power algebra that
allows Felix to get rid of references and lvalues.

So we have to repeat all the above tests, but this time using pointers
instead of values. And it’s not that simple, because there are THREE
modes of pointers:

* read/write pointers &x
* read only pointers &<x (called pointer to const in C++)
* write only pointers &>x (not available in C++)

I am thinking of adding read-modify-write pointers &<>x as well.
This works like a read/write pointer EXCEPT that it allows atomic
read/modify/write operations.

In addition, for compact linear types there are TWO kinds of pointers!!
The pointer to the top level type is an ordinary machine pointer.

However a projection of that pointer to a compact linear components
is a special kind of pointer, a compact linear pointer. It uses 3 machine
words. The first is the machine pointer to the outer compact linear object,
and then there is a divisor and modulus for extracting the value.
So we need an additional (batch of) tests in which the machine pointer
is projected to a compact linear pointer which is then projected again
to another compact linear pointer, to obtain coverage.
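
An OCaml sketch of the three-word pointer and of projecting it a second time; the
names are mine, and it assumes components nest so that projecting simply multiplies
the divisor:

type clptr = { p : int ref; divisor : int; modulus : int }

let get { p; divisor; modulus } = !p / divisor mod modulus

(* project a component given its divisor/modulus relative to the
   component this pointer already addresses *)
let project ptr ~divisor ~modulus =
  { ptr with divisor = ptr.divisor * divisor; modulus }

let () =
  (* one machine word holding a value of type 2 * 3 * 5: the tuple (1,2,4) *)
  let w = ref (((1 * 3) + 2) * 5 + 4) in
  let whole = { p = w; divisor = 1; modulus = 2 * 3 * 5 } in
  let tail = project whole ~divisor:1 ~modulus:15 in   (* the 3 * 5 part *)
  assert (get (project tail ~divisor:5 ~modulus:3) = 2)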

And of course compact linear pointers have read/write attributes too.
Don’t forget, we need both proj and aproj checks!

Are we done?

Not by a LONG SHOT!

Next, we have inline projections:

@title Inline Tuple Value Integer (Constant) Projection Compact Linear Array domain
@felix
typedef d_t = 5 ^ 3;
var x : d_t = `1:5,`2:5,`4:5;
println$ x . 0 . _strr;
println$ x . 1 . _strr;
println$ x . 2 . _strr;
@expect
case 1 of 5
case 2 of 5
case 4 of 5
@

With inline projections, they’re a direct application of the index type to the
product. And, we usually allow int as well as the precise type. And,
for arrays, it can be an expression. And they have to work with pointers too.

Done yet?

NOPE!

There are two further major extensions.

First, slices. A slice is a *range* of indices.
So all the stuff above has to be repeated for slices.
A slice is used to extract a subarray, which is a contiguous sequence
of components. Its just the concatenation of the results of a sequence
of projections.

And now for the BIG one! Arrays support a special isomorphism
of the index types which gives higher rank arrays. In particular
consider

(int ^ 3) ^ 2 :>> int ^ ( 2 * 3)

This isomorphism changes an array of length 2 of arrays of length 3
into a matrix, with dimensions 2 * 3. The difference is the matrix
takes a SINGLE index of type 2 * 3, that is, the index is a tuple.

Felix also allows you coerce that to

int ^ 6

which is a linear array again. In particular, it allows you to write
some rank independent operators. For example you can add two
matrices by linearising them, adding the linear arrays, and then
coercing back to a matrix.

Now the point is, the matrix has a projection with domain 2 * 3.
So we need to support (AND THIS IS NOT DONE YET):

proj (`1:2, `1:3) of int ^ (2*3)

and of course the aproj equivalent.

And we’re STILL not done. The above is of course recursively applicable.

BUT NOW WE NEED TO HANDLE SUM TYPES.

Sums have injections instead of projections. We have three core coproducts:

* sums (dual of tuples)
* variants (dual of structs)
* polymorphic variants (dual of records**)

and of course we have

* repeated sums (dual of arrays)

** Note, polymorphic variants are not quite the dual of records
because records allow scoped labels, whilst polymorphic variants
do not.

Also note, Felix has polyrecords, which are records with
“row polymorphism”. There should be a corresponding
“column polymorphism” for polymorphic variants but isn’t.

THIS IS A LOT OF TESTS!

Why do we need all this? Well consider:

((1,2,3),(4,5)): int^3 * int ^2

is isomorphic to

(1,2,3) + (4,5) // notation??

in other words

int ^ (3 + 2)

In other words, we first select the left or right array,
and then we have projections with domain 3 and 2 respectively.
We can linearise that to int ^ 5 of course.

And note, 3 + 2 is a sum type. So this is an alternate view of the
original tuple of tuples which, as for the matrix case, allows
a SINGLE index type.

What’s IMPORTANT in this calculus is to observe we can LINEARISE
the base without losing structure, by moving the structure INTO
the index.

Why is that important? Well it’s a FUNDAMENTAL RULE OF CATEGORY THEORY
that functors preserve structure. We can get POLYADIC array handling (and more),
by transforming ANY product composite into a SINGLE product with
a structured index. And then, any functorial operations on the structure
can be performed by:

(a) transform composite structure to index
(b) flatten index
(c) perform operation on linear array
(d) unflatten index
(e) transform back to original structure
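
A minimal OCaml sketch of steps (a)-(e) for one concrete case, adding two 2 x 3
matrices by flattening, operating linearly, and unflattening:

let flatten (m : int array array) = Array.concat (Array.to_list m)

let unflatten ~rows ~cols a =
  Array.init rows (fun i -> Array.sub a (i * cols) cols)

let madd a b =
  let fa = flatten a and fb = flatten b in
  let sum = Array.init (Array.length fa) (fun i -> fa.(i) + fb.(i)) in
  unflatten ~rows:(Array.length a) ~cols:(Array.length a.(0)) sum

let () =
  let a = [| [|1;2;3|]; [|4;5;6|] |] in
  assert (madd a a = [| [|2;4;6|]; [|8;10;12|] |])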


The ultimate power of this is enormous. It’s sometimes called RESHAPING.
The critical thing is:

(a) static type checking the reshaping operations
(b) OPTIMISING THE CALCULATIONS


All the calculations can be done exclusively at run time.
Python can do this with a well known extension.
But Felix can do compile time checks on the types,
and it can do the moral equivalent of constant folding
when constants are involved.

Constant folding like this has little use in quantum mechanical calculations
using matrices of double precision floats. But it is MANDATORY for cryptographic
work, signal processing, and other jobs involving heavily structured small scale
data.



John Skaller
ska...@internode.on.net





John Skaller2

Nov 5, 2018, 1:50:33 PM
to felix google
Ok, so I have a bit of a conflict here.
BTW: I added

var f = (ident of int);
println$ f 1;

i.e. an explicit identity function.


Given a record with duplicate fields,

(a=1,a=2) . a

returns the first field. But we need an operator that returns ALL of them.

The compiler actually has:

| BEXPR_prj of int * Flx_btype.t * Flx_btype.t
| BEXPR_rprj of string * int * Flx_btype.t * Flx_btype.t
| BEXPR_aprj of t * Flx_btype.t * Flx_btype.t

(* union/sum constructor wrapped as function *)
| BEXPR_inj of int * Flx_btype.t * Flx_btype.t
(* first arg = constructor index, second arg domain = ctor type, third arg codomain = union type *)

(* generalised injection, dual to array projection. *)
| BEXPR_ainj of t * Flx_btype.t * Flx_btype.t


The BEXPR_rprj is a record projection, but it takes TWO arguments,
the first argument is the field name as a string, and the second is the
sequence number. So in the example above

BEXPR_rprj (“a”,1,record_type, int)

where record_type is the record type, then the expression is the projection
of the SECOND “a” and this applied to the record returns 2.

BUT there is no syntax for this. The compiler has constructor functions:

let bexpr_rnprj name seq d c =
let bexpr_rprj name d c = bexpr_rnprj name 0 d c

so actually “rprj” just calls “rnprj” with the sequence number set to 0,
this gets the first field with the given name. This is what the user gets
when they use the name “a” as a projection.

There’s no syntax for rnprj.

HOWEVER I have a another idea, which works differently:

(a=1, a=42.7, c=“hello”) @ a —> (1,42.7)

i.e. a projection which returns the whole tuple. This is a bit tricky with
polyrecords but let’s leave that for the moment.

Now the thing is, this operator cannot fail:

var x = (a=1,a=42.7,c=“Hello”, 1,2,3);

x @ a —> (1,42.7)
x @ c —> “Hello”
x @ fred —> ()

and


x @ a . 0 —> 1
x @ a . 1 —> 42.7

In other words, @ field returns a tuple of all the fields with the given name,
which means () if the record doesn’t have that name, and then you can
apply the usual tuple projection to get the field you want. Note this ALSO means

(a=1, b=()) == (a=1)

that is, fields with value unit tuple are ignored because @ returns the same
value for both.
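
A toy OCaml model of @, treating a record as an association list (with homogeneous
values, which OCaml forces on us):

let at record name =
  List.filter_map (fun (k, v) -> if k = name then Some v else None) record

let () =
  let x = [ ("a", 1); ("a", 42); ("c", 7) ] in
  assert (at x "a" = [1; 42]);          (* all the “a” fields, in order *)
  assert (at x "fred" = []);            (* missing field: the empty tuple *)
  assert (List.nth (at x "a") 1 = 42)   (* x @ a . 1 *)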

I mentioned I added ident of T, for the identity function. This is important
because it is the projection of a 1 element tuple!

Its not the same as

proj 0 of T

because T might be a tuple! It IS the same as the compiler term

EXPR_proj (0,T,T)

even if T is a tuple, but the user syntax only specifies the domain of the
projection function; the codomain is implied by selecting the 0’th component
of the domain (which must be a tuple!) The notation makes sense if
T isn’t a tuple, but if it IS a tuple it would be ambiguous (did you mean
the identity function or the projection of the first component?)

That leaves the question: what is the projection of ()??

Since there are ZERO components… there are no projections.
Although .. the identity function is well defined.

That leads to an interesting issue which has an even more interesting
resolution!

() is a compact linear type. Compact linear values inside a composite
are extracted by the formula

x / divisor % modulus

For example the lowest bit is given by

divisor = 1, modulus = 2
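
A quick OCaml check of the formula on a compact linear 2 * 3, packed as
x2 * 3 + x3 (the layout is my assumption):

let get value ~divisor ~modulus = value / divisor mod modulus

let () =
  let v = 1 * 3 + 2 in                       (* the tuple (1, 2) : 2 * 3 *)
  assert (get v ~divisor:3 ~modulus:2 = 1);  (* first component *)
  assert (get v ~divisor:1 ~modulus:3 = 2)   (* second component *)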

For the compact linear type (), no information is required. It uses no space.
The divisor is undefined, it can be anything, but the modulus is 0.
That is, if you actually applied it to a compact linear type you’d be dividing
by 0!


SO the representation of

2 * 1 * 3 == 2 * 3

i.e. the unit type in the tuple can be thrown out. The problem is the index numbers.
If index 0 is the 2, index 1 is the 1, and index 2 is the 3, then we have a problem if we try
to extract index 1. Because there ain’t no value there!


The formula MUST BE RIGHT. It’s intuitive. This means the boundary conditions
(the representation of unit) must drop out. The only way to do this is to DELETE
the unit. Which means THERE IS NO INDEX 1. More correctly, index 1 is the 3,
there is no index 2, and there are only TWO components in the tuple. Not 3.
The unit is elided.

Felix already takes ANY function returning () and deletes its application,
replacing it with unit value. But what is that? A 64 bit unsigned integer with
NO values set into it? (Felix actually uses zero, but that’s not correct).

A pointer to a () is universally NULL. Which means you cannot dereference it.
But that’s COOL because dereferencing is a function, and it returns unit ..
SO IT IS ELIDED.


Note also, functions accepting a sole unit never have any parameters of type
unit. The unit is never passed. The function would have C signature

T f ()

So even though there is a value () of type unit, it is not representable!

Getting rid of unit values is not so easy. The PROBLEM is that if you have
a type

T * U

where T, U are type variables, you expect there are two components meaning
two projections. If T becomes () during monomorphisation and is elided,
the projection indices will be all messed up! I would have to adjust them
when the unit is deleted. It’s all a bit tricky when the unit is a parameter.
Felix does delete variables of type unit and functions returning unit
INCLUDING GENERATORS!!!

Note that unlike void, variables of type unit are just uninitialised.
They can still exist. It’s just that we can safely throw them out,
provided we replace all such variables in expressions with the value
unit.

I am not digressing! The question of the @ operator on records
is intimately tied up with this. The cool thing about the model is that
we can uniformly handle tuples of 1 value by using the identity
as the projection. In other words

x @ c . proj 0 = x @ c . ident of double = x @ c
= x.c

AND NOW THE KILLER:

x @ d = ()
x @ d . 0 = ERROR
x . d = ERROR

There is no d field in x.

the point is I’m thinking of just using

x . field

to return ALL the fields, instead of the first.
If there are two, and you expected 1, you’ll get an error.
If the field doesn’t exist, you get an error too, but not on
extracting the fields, that gives (), but when you try to select
the sole component with 0, since there is no component.


The main hassle is in polyrecords

r = (a=1 | w:T)

where T is not known, the

r @ a

is not well defined. In fact the reduction is:

1 + ( w @ a)

where + means “tuple concatenation”.



John Skaller
ska...@internode.on.net





John Skaller2

Nov 7, 2018, 7:12:06 PM
to felix google
Not sure why this didn’t fail before ..

The Felix compiler currently “knows” a bit about slices. It’s one of those messes where
it was first defined in the library, then I added compiler support for performance.

This test code is failing with an error at C++ compile time:

var x : int ^ 100;
for i in ..[100] perform x&.i <- caseno i;
var s : int ^ 99= x . (1..2000);
println$ s.0, s.2, s.8;

Felix thinks the array is compact linear. But it isn’t!
In this case, the compiler is delegating to the library routine “subarray”.

The compiler cannot handle large slices of arrays at the moment.
Small slices are constructed by expansion, that is, with an explicit tuple:

(a.p0, a.p1, a.p2, ….)

and we don’t want the number of terms to be too large. Arrays can be huge,
we don’t want to blow the compiler out of the water with terms of billions
of elements.

So cleverly we call this:

fun subarray[
  FIRST:UNITSUM,
  LEN:UNITSUM,
  T,
  N:UNITSUM,
  K:UNITSUM=_unitsum_min(LEN, N `- FIRST)
]
  (a:T^N) : T ^ K
=
{
  var o : T ^ K;
  for i in ..[K] do
    var first = Typing::arrayindexcount[FIRST].int;
    var outix = caseno i;
    var inpix = (first + outix) :>> N; // checked at run time?
    &o.i <- a.inpix;
  done
  return o;
}

The existence of this routine required years of work to support the constraint handling.
But there’s a bug!

The problem is, we don’t know if T is compact linear or not. And it matters!
The expression:

&o

is always a machine pointer. But the projection generated by compact linear index i used here:

&o . i

depends on whether the array o is compact linear or not. If it isn’t compact linear,
the projection is just the C addition of an integer; if it is compact linear, it’s a complicated
mess in which we make a pointer consisting of a machine pointer to the compact linear value,
a divisor, and a modulus. The pointer has three machine words in it.

Which code is generated depends on the TYPE OF THE PROJECTION. In this case
the domain is the same for both kinds of projection. But the codomain depends on
the base type of the array. For the non-compact linear case the function type of the
projection is

&(T^N) -> &T

which goes from a machine pointer to the array, to a machine pointer to an element.
whereas in the compact linear case it is:

&(T^N) -> _pclt< T^N, T >

The RHS there says the element type is T, but it is stored in a compact linear array
of type T^N. Note, this is a weirdo type, but you can write it; it is parsed. Values of this
Felix type have C++ type:

struct RTL_EXTERN clptr_t
{
  cl_t *p;
  cl_t divisor;
  cl_t modulus;
  clptr_t () : p(0), divisor(1), modulus(-1) {}
  clptr_t (cl_t *_p, cl_t d, cl_t m) : p(_p), divisor(d), modulus(m) {}

  // upgrade from ordinary pointer
  clptr_t (cl_t *_p, cl_t siz) : p(_p), divisor(1), modulus(siz) {}
};

whereas a non-compact linear pointer has type

T*

So why is there a bug? Well it’s here in the compiler:

let rec islinear_type bsym_table t =
  match t with
  | BTYP_void
  | BTYP_unitsum _ -> true
  | BTYP_type_var _ -> true (* THIS IS NEW AND WILL PROBABLY BREAK STUFF *)
  | BTYP_tuple ts
  | BTYP_sum ts -> List.fold_left (fun acc t -> acc && islinear_type bsym_table t) true ts
  | BTYP_rptsum (count,base) -> islinear_type bsym_table base (* coarray *)
  | BTYP_array (base,index) -> islinear_type bsym_table base
  | _ -> false

See the note???

The compiler assumes a type variable is compact linear if it is asked if a type is
compact linear.

In the test case .. this assumption is WRONG. So the compiler is typing the expression

&o

incorrectly.

Note: you may think, well, why not just wait until we know T?

#$%^&*( THAT IS C++ GARBAGE !@$%^&*()

FELIX TYPES POLYMORPHIC ENTITIES. THE TYPE HAS TO BE KNOWN

****** BEFORE MONOMORPHISATION *******

What’s the solution??

Logic DICTATES it. We HAVE TO KNOW.
Let’s have a look at the representation of type variables:

| BTYP_type_var of bid_t * kind

The first argument, bid_t is just an integer identifier for the variable.
The second argument, kind, is .. well .. the kind or meta type of the type variable.

What are the meta types??

type kind =
| KIND_type
| KIND_unitsum
| KIND_compactlinear
| KIND_bool
| KIND_nat
| KIND_tuple of kind list
| KIND_function of kind * kind

You can see, there is a kind for the compact linear case. It is a subkind of the kind
for types. Also, since unitsums are compact linear, they’re subkinds of compactlinear
and also type. So what we need is this:

fun subarray[
  FIRST:UNITSUM,
  LEN:UNITSUM,
  T:COMPACTLINEAR,
  N:UNITSUM,
  K:UNITSUM=_unitsum_min(LEN, N `- FIRST)
]
  (a:T^N) : T ^ K


fun subarray[
  FIRST:UNITSUM,
  LEN:UNITSUM,
  T:TYPE,
  N:UNITSUM,
  K:UNITSUM=_unitsum_min(LEN, N `- FIRST)
]
  (a:T^N) : T ^ K


That is, we need TWO routines, with the same bodies, but different kinding
of the type variable T. The more special case, the first one, must be selected
by overload resolution if the type argument T has kind COMPACTLINEAR
because it is more specialised.

If we had KIND VARIABLES we might get away with only one definition,
but we don’t have them at the moment!

UNFORTUNATELY you also cannot overload on kinds!

Technically, we want “kind classes” like “type classes” because that
defers the choice until after monomorphisation, but that requires
kind variables.

The solution is to name the two subarray cases differently.
UNFORTUNATELY the non-compact linear case has to have
the constraint

is a type but NOT compact linear

and there’s no way to say that! There’s no such kind, and, constraints
only apply to types, not kinds.

BTW: if you’re confused by the terminology .. it’s just what C++ calls concepts.
In Felix type constraints can be expressed with type classes.
However type classes cannot generalise over structural typing,
that’s what kinds are for. Type classes really only work for nominal typing.





John Skaller
ska...@internode.on.net





John Skaller2

Nov 8, 2018, 1:31:53 AM
to felix google
OMG … I’m an IDIOT!

The solution is obvious now. And BEAUTIFUL. Which means it’s right!

The idea that a normal pointer calculation depends on the base type
being NOT compact linear is clearly CRAP.

The CORRECT idea is that ordinary pointers are SUBTYPES of compact linear pointers.

In other words, for a variable of T:TYPE, the address is a plain pointer &T.
But a *projection* always yields a compact linear pointer.

If, after monomorphisation, we find for example that the divisor and modulus
are 1, then we can optimise the compact linear pointer to a machine pointer.

In fact, what we are saying is more general. Given a pointer to a variable
on the heap, when we find a product component, the BASE pointer
to the whole object should not be lost. It is actually a segment address.

An interior pointer is just the base pointer plus the divisor and modulus.
If we represent that at run time, array bounds checks are always possible.

The main issue is, for large arrays .. well we really aren’t going to divide
by billion digit numbers…

UNLESS … they’re powers of 256. I have of course said this before.
Dividing the value 256 is equivalent to subtracting 1 from the address.

So to be pragmatic a pointer has to have

(a) a base address (which is not mutable)
(b) an offset in bytes
(c) a divisor
(d) a modulus

The offset in bytes N is of course just a divisor

D = 256 ^ N

The tricky bit is that the divisor (c) has to be capped,
which requires introducing padding. In other words
a really big value cannot be compact.

We can dispense with (b) if we make (a) mutable.

The type involved, however, is already correct: the Felix
compact linear pointer representation is enough for all cases.

The only trick is normalising it. In other words, if we have
a base type which is a product of products of products of products,
and the whole thing is compact linear in theory, in practice,
we have instead to make the outer projections ordinary
pointer + offset operations, and move to compact linear mode
only when a product value fits in a single machine word.

Its a nice theory but there’s a problem: there’s no way to
calculate the size of a large value. Even with “big integers”.
I mean, the calculation can be done .. but it will take hours
and use heaps of memory.

Basically we need to calculate the maxium value (plus 1),
round it a bit, and divide by 2^ 256 to get the number of bytes.
So we actually need a fast way to avoid doing large multiplies
and divides.




John Skaller
ska...@internode.on.net





John Skaller2

Nov 8, 2018, 2:42:59 PM
to felix google


> On 8 Nov 2018, at 11:12, John Skaller2 <ska...@internode.on.net> wrote:
>
> let rec islinear_type bsym_table t =
> match t with
> | BTYP_void
> | BTYP_unitsum _ -> true
> | BTYP_type_var _ -> true (* THIS IS NEW AND WILL PROBABLY BREAK STUFF *)
> | BTYP_tuple ts
> | BTYP_sum ts -> List.fold_left (fun acc t -> acc && islinear_type bsym_table t) true ts
> | BTYP_rptsum (count,base) -> islinear_type bsym_table base (* coarray *)
> | BTYP_array (base,index) -> islinear_type bsym_table base
> | _ -> false

Actually it’s probably correct.

Hmmm. The thing is, you cannot take a projection of a type variable or pointer to one
because you don’t know it’s a product. However in

T * 2

you have to assume the type is compact linear or you get the wrong kind of pointer.

The problem is: let’s degrade

_pclt < T * 2, 2> with T set to int

to

&2

because after substitution the type isn’t compact linear. This one is correct.
Unfortunately, there isn’t enough information to do the degrade in general.

Suppose you have a product at level 0 of some components, say one of them
at level 1 is also a product, and again at levels 2 up to n-1.

At n-1, we also have a product but it’s now compact linear, so a product at
level n is compact linear, and likewise at n+1, n+2, through to n+m-1.

When you take a pointer at the outer level and project you get an ordinary
pointer. And again. You keep getting pointers until level n. At level n,
you have a pointer to a compact linear store. Now if you project, you get a compact
linear pointer. The base type (domain) is now invariant. Only the codomain changes.
I mean the type the *machine pointer* component points to.

Now all projections to level n+m-1 are compact linear with the same base.

This is what happens now with *nonpolymorphic* types. Exhaustive tests
have been done, but only for one level break (that is, two projections
that go across the type changing boundary at level n-1 to n+1).

The problem is that, if there’s a type variable in there, and we assume
it’s compact linear, then the “machine pointer” is set early. If later
we find T is not compact linear after all, there’s no way to calculate
what the machine pointer should be because we have LOST the projection
path through the product.

For example:

_pclt <2 * ( (2 * T) * (2* (2 * V * (2 * (2 * T))))), T>

Suppose V becomes int, and T becomes 2. The compact linear level break
is then after the V on the right branch of the tree, but it is at level 1 on
the left branch. Since the target is just a T, we don’t know which branch
the projection took. So there’s no way to know what the machine pointer
should be.

The only way to do this is to keep the projection path in the type.
So the argument becomes a list of types!

Then, after monomorphisation, we can track down the list until
we hit a level break.

BUT … there is a POWERFUL BENEFIT to this.

Instead of doing it at monomorphisation .. we wait until code generation
time, and EVEN IF we get a compact linear type, we keep using ordinary
pointers as we go down UNTIL THE SIZE IS SMALL ENOUGH TO FIT
INTO A MACHINE WORD.

For example, if you have an array length 512 of arrays length 32 of bool,
you get an ordinary C like array of length 512, whose components are
compact linear arrays of length 32. Because 32 <= 64 but 512 * 32 > 64.

Obviously in THAT case we can do better: an array of 256 pairs of
32 bit arrays. The pairs are compact linear so use one machine word (64 bits).

So what this all says is: we can have a SINGLE unified pointer type,
which takes a LIST of types representing the projection path.
You write the list from the whole type down to the detail left to right,
but the compiler stores it backwards, so in the compiler a dereference
has the type of the head of the list (in the written form it’s the last thing in the list).

Now we can do the decomposition which determines where the level
break occurs. This is where a projection requires switching from
a machine pointer (one machine word) to a compact linear pointer
(three machine words).

[Note this applies to value projections too! Because a value projection
is, underneath, the dereference of a pointer projection.]


The decomposition can also check sizes and keep going even if
we have a projection of a compact linear type in theory, if, in practice,
the compact representation is too large. “Padding” then gets added
automatically (if the type is strictly less than 64 bits).

Actually we can do even better if we apply some isomorphisms.
For example:

int * 2 * 3

is not compact linear but it is isomorphic to

int * (2 * 3)

which means we COULD store it in two machine words.
[Felix doesn’t know how long an int is ..]

In fact we could store

2 * int * 3

as

int * (2 * 3)

as long as we fiddle the projection indices.

Even better, for a compact linear type we can calculate the size
and round it up to the next of 8, 16, 32, or 64 bits.

The thing about projections is as follows:

(a) tuple projections require constants
(b) all projection chains have a finite depth

There is an IMPORTANT issue here. With arrays we can index
them with variables. Provided we do that exclusively, arrays
obey the same rules as tuples. We cannot track the exact
projection because that would require tracking the index value
which is impossible for arrays (because we’re writing a LOOP!)

However Felix arrays (not varrays!) have a statically known length,
so out-of-bounds accesses are impossible EVEN USING A LOOP.

The KEY point here is that there is no use for incrementable
pointers. You just use a pointer to the whole array and an index.
This is true even down multiple levels, you just use a pointer
and multiple indices to cross into deeper arrays.

In fact, the indexing always reduces to a SINGLE linear value
in theory. In practice you need TWO values, one for the machine
pointer level index, and one for the compact linear pointer index.
So you need two static constants: the size of the compact linear
component, and the size of the level break.
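
A sketch of the two-level indexing; clt_size here is the assumed number of
elements packed into one machine word (the level-break constant):

(* Split a flat element index at the level break. *)
let split_index ~clt_size i = (i / clt_size, i mod clt_size)

(* For bool[512][32] stored as 512 machine words of 32 bools each,
   flat element 100 lives in word 3, slot 4. *)
let () =
  let word, slot = split_index ~clt_size:32 100 in
  assert (word = 3 && slot = 4)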


John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 8, 2018, 10:34:17 PM11/8/18
to felix google


> On 9 Nov 2018, at 06:42, John Skaller2 <ska...@internode.on.net> wrote:
>
>
>
>> On 8 Nov 2018, at 11:12, John Skaller2 <ska...@internode.on.net> wrote:
>>
>> let rec islinear_type bsym_table t =
>> match t with
>> | BTYP_void
>> | BTYP_unitsum _ -> true
>> | BTYP_type_var _ -> true (* THIS IS NEW AND WILL PROBABLY BREAK STUFF *)
>> | BTYP_tuple ts
>> | BTYP_sum ts -> List.fold_left (fun acc t -> acc && islinear_type bsym_table t) true ts
>> | BTYP_rptsum (count,base) -> islinear_type bsym_table base (* coarray *)
>> | BTYP_array (base,index) -> islinear_type bsym_table base
>> | _ -> false
>
> Actually it’s probably correct.


If I use a list recording the projections .. what is the unification rule????


John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 9, 2018, 1:42:34 AM11/9/18
to felix google
Well .. I need to figure out how to fix the compiler.

The general form of a read-write pointer is:

BTYP_gptr of t list

This form uses a list of polymorphic types starting with the domain and progressing
down projection chains to the codomain. The domain is the type of the whole object,
the codomain is the final type of the sub-component being accessed.

If the domain and codomain are equal, all the types are equal, and the term
reduces to a machine pointer:

BTYP_pointer of t

In particular, a gptr can never have a single type in the list. A projection
of a gptr adds a new type to the end of the list. A projection of an
ordinary pointer yields a gptr with the ordinary pointer’s type at
the head of the list, and the codomain of the projection at the end.

The head of the gptr is always compact linear. Note, a type variable is
considered compact linear, since it might or might not be on substitution.
After monomorphisation, we can throw out gptrs and replace them with
compact linear pointers:

BTYP_pclt of t * t

However the term exists so to fix the compiler we have to deal with these
as well. So in general, a projection of a pclt produces a gptr with the list
being the two types of the pclt followed by the new codomain of the projection.

So the invariants are:

1. BTYP_pclt (D,C): D must be compact linear. D not equal to C.

2. BTYP_gptr Ts: Ts must be at least 3 elements. The head
must be compact linear.
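
A sketch of a smart constructor enforcing these invariants, with toy
constructors standing in for the real BTYP terms (a real version would
also verify compact linearity of the head):

type ty =
  | TVar of int
  | TInt
  | TPointer of ty          (* ordinary machine pointer *)
  | TPclt of ty * ty        (* compact linear domain, codomain *)
  | TGptr of ty list        (* >= 3 elements, head compact linear *)

(* Collapse a projection path to the smallest legal pointer term. *)
let btyp_gptr ts =
  match ts with
  | [] -> invalid_arg "empty projection path"
  | [t] -> TPointer t
  | [d; c] when d = c -> TPointer c
  | [d; c] -> TPclt (d, c)
  | _ -> TGptr ts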


To fix the compiler, I need to add a constructor for a gptr, the constructor
will actually make a pclt or pointer to preserve invariants.

Unfortunately this is not enough. To ensure the algorithms in the compiler
enforce the invariants, and to ensure I fix every line, I have to remove
the constructors for pointer and pclt.

It’s a bit unfortunate because code after monomorphisation doesn’t ever
need to create a gptr, it always knows to use either a pointer or pclt.
However, if I keep these constructors, there’s no way to find incorrect
code binding projections polymorphically.

I could do this if only BTYP were using polymorphic variants!

Another way to fix this is to JUST have gptr. Pattern matches know
how long lists are. Unfortunately, the gptr form has an issue:
the list can be longer than 2 elements .. in this case we’re ONLY
interested in the first and last elements. At code generation time,
we don’t need the intermediate types. So basically, at monomorphisation
time, we can throw them out.

This suggests a new invariant: all the intermediate elements must be polymorphic.

If I go with gptr only, I can also “throw in” a mode switch: RW, RO, WO,
which I wanted to do anyhow. At present, I have to write out three cases
for each of these because they use distinct constructors.




John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 9, 2018, 7:47:26 PM11/9/18
to felix google
Ok I started doing this, but I have to give up and rethink because the form of the
pointer type has to be followed by the form of pointer expressions.

The problem is, reducing a projection of a pointer. It’s all very well to
reduce the types, but that has to be sync’ed with tracking the projections
on the actual starting address.


John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 10, 2018, 8:12:17 AM11/10/18
to felix google


> On 10 Nov 2018, at 11:47, John Skaller2 <ska...@internode.on.net> wrote:
>
> Ok I started doing this, but I have to give up and rethink because the form of the
> pointer type has to be followed by the form of pointer expressions.
>
> The problem is, reducing a projection of a pointer. It’s all very well to
> reduce the types, but that has to be sync’ed with tracking the projections
> on the actual starting address.

More to the point .. I have no idea how to unify.


John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 10, 2018, 5:10:37 PM11/10/18
to felix google

Ok so I’m going to refactor the code in stages.

In phase 1, I add the new type term, and make the old term constructors
delegate to it. But I won’t bother enforcing invariants. I delete the old terms.
This means all existing constructions will work.

All pattern matches using the old terms will fail.
There is a mechanical rule for replacing them.

The main problem is implementing a program that does the
replacement. Doing it by hand will be a nightmare.

Six patterns need to be replaced. The problem is identifying
the arguments correctly. Regular expressions are bad for doing
this correctly because I have to find *matching* parens. I also have
to span line boundaries.

Doing it by hand is more reliable but it is going to take a while.
Unfortunately a partial automatic run is likely to make a mess.

But the final result should be identical behaviour to what Felix does now,
except for the diagnostic prints.



John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 11, 2018, 2:55:59 AM11/11/18
to felix google


> On 11 Nov 2018, at 09:10, John Skaller2 <ska...@internode.on.net> wrote:
>
>
> Ok so I’m going to refactor the code in stages.

It’s done. Sorting out the bugs, which are basically minor transcription
errors.

At this stage, I parameterised some of the code based on the pointer
mode RW,R,W.

However I haven’t yet generalised the pointer handling. There is now
only one pointer type:

BTYP_ptr of mode * t * t list

The mode is the R, W, or RW code (there’s an N code as well for pointers
that can’t read OR write).

The t is the target type.

The t list is the projection chain. For an ordinary pointer it is currently
an empty list. For a compact linear type pointer, the type of the containing
compact linear value is the sole entry in the list.

All the pattern matches have been hand adjusted to use the new pointer type:

(* machine pointer *)
BTYP_ptr (mode, T, [])

(* compact linear type mach, nested component T *)
BTYP_ptr (mode, T, [mach])

These will be the ONLY cases the back end will ever handle.
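
So the back-end dispatch becomes a two-case match; a sketch with stand-in
types (the string results are just placeholders for real code generation):

type mode = R | W | RW | N
type ty = TInt | TBool | TTuple of ty list

let codegen_ptr (_ : mode) (_ : ty) (path : ty list) =
  match path with
  | [] -> "ordinary machine pointer"
  | [_mach] -> "compact linear pointer (address, divisor, modulus)"
  | _ -> failwith "unreduced projection path reached the back end"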

The front end has to generalised to allow the list to be longer,
in case some of the projection chain is polymorphic.

I still don’t know exactly how to do the unification, but what is there should
be semantically equivalent to what was there before.


John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 12, 2018, 6:18:52 AM11/12/18
to felix google
Sorry I broke the build.

Not sure why a failed unit test stops the build (but failed regression tests don’t …)


John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 12, 2018, 7:07:38 AM11/12/18
to felix google
So here is the pointer code in Ocaml:

let ismonomorphic t =
  let f_btype t = match t with BTYP_type_var _ -> raise Not_found | _ -> () in
  try iter ~f_btype t; true
  with Not_found -> false

(* Drop leading non-compact-linear types from the projection path. *)
let rec strip_nonclt ts = match ts with
  | [] -> []
  | h :: t ->
    if not (islinear_type () h)
    then strip_nonclt t
    else ts

(* If the head is monomorphic, the rest of the path is redundant. *)
let throw_tail ts = match ts with
  | [] -> []
  | h :: _ ->
    if ismonomorphic h then [h]
    else ts

let btyp_ptr m t ts =
  BTYP_ptr (m,t,ts)

let reduce_ptr m t ts =
  let ts = throw_tail (strip_nonclt ts) in
  btyp_ptr m t ts
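
A worked example of how the reduction behaves, with toy records standing in
for bound types (the clt and mono flags are stand-ins for islinear_type and
ismonomorphic):

type toy = { name : string; clt : bool; mono : bool }

let rec strip ts = match ts with
  | [] -> []
  | h :: t -> if not h.clt then strip t else ts

let tail = function
  | h :: _ when h.mono -> [h]
  | ts -> ts

let () =
  (* path written whole-to-detail: int * (2 * 3), then 2 * 3, then 2 *)
  let path =
    [ { name = "int * (2 * 3)"; clt = false; mono = true };
      { name = "2 * 3"; clt = true; mono = true };
      { name = "2"; clt = true; mono = true } ] in
  match tail (strip path) with
  | [t] -> assert (t.name = "2 * 3")  (* sole entry: the containing clt *)
  | _ -> assert false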


Now, originally the reduce_ptr code was meant to preserve the
invariant by being in the btyp_ptr constructor function. But it failed.

The reason is really NASTY. Felix thinks a named type cannot be compact linear.

The problem is during binding, some really weird stuff happens with typedefs.
At first, typedefs get symbol table entries which are later removed.
The binding process goes through several passes.

The idea of the invariant is that the type of a pointer to a compact linear value consists
of the machine address of a containing compact linear type, and the
type pointed at. The reason is, an actual pointer consists of a pointer to a
machine word containing the whole compact linear type, plus a divisor
and modulus to extract the component. The point is the type of the “top level”
compact linear type matters.

To find the top level compact linear type we just run down the types of the
pointers from the starting product type until we hit one that is compact linear.
At that point, we can’t change the machine address so further projections
change the divisor and modulus instead.


As an example:

var x = true;
var y = true,false;
var px = &x;
var py1 = &y.1;

px is an ordinary pointer to a bool. py1 is a compact linear pointer,
consisting of a pointer to y AND the divisor 2 and modulus 2.

So the two pointers have different types even though they both target a bool.
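
A minimal sketch of that representation (the record and names are mine;
the real thing is a C++ struct in the generated code):

(* A compact linear pointer: a machine address plus two constants. *)
type clptr = { addr : int ref; divisor : int; modulus : int }

(* Extract a component from the packed word. *)
let clderef p = (!(p.addr) / p.divisor) mod p.modulus

(* py1 above becomes { addr = <address of y>; divisor = 2; modulus = 2 }:
   one ordinary pointer and two static constants. *)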

The PROBLEM is my reduction throws out non-compact linear types from
the projection list, but it thinks a typedef of a compact linear type isn’t
compact linear. So I cannot apply the reduction until the binding is fully
completed.

I have attempted to apply it during monomorphisation instead.

I haven’t actually done the full chaining yet.


John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 12, 2018, 1:13:05 PM11/12/18
to felix google
Wow! This works. I don’t even know why!


var x = (1,(2,(true, (`1:3,(`3:5,`7:8)))));
println$ x.1._strr;
println$ (x.1).0._strr;
println$ (x.1).1._strr;
println$ ((x.1).1).0._strr;
println$ ((x.1).1).1._strr;
println$ (((x.1).1).1).0._strr;
println$ (((x.1).1).1).1._strr;
println$ ((((x.1).1).1).1).0._strr;

println$ (*(&x.1))._strr;
println$ (*(&x.1).0)._strr;
println$ (*(&x.1).1)._strr;
println$ (*((&x.1).1).0)._strr;
println$ (*((&x.1).1).1)._strr;
println$ (*(((&x.1).1).1).0)._strr;
println$ (*(((&x.1).1).1).1)._strr;
println$ (*((((&x.1).1).1).1).0)._strr;
println$ (*((((&x.1).1).1).1).1)._strr;

Of course it’s not using stand-alone projections. Nonetheless, the second
batch of prints IS using pointer projections that dive down to every sub-component,
THROUGH the non-compact linear to compact linear barrier.

Here’s the output:

(2,(true,(case 1 of 3,(case 3 of 5,case 7 of 8))))
2
(true,(case 1 of 3,(case 3 of 5,case 7 of 8)))
true
(case 1 of 3,(case 3 of 5,case 7 of 8))
case 1 of 3
(case 3 of 5,case 7 of 8)
case 3 of 5
(2,(true,(case 1 of 3,(case 3 of 5,case 7 of 8))))
2
(true,(case 1 of 3,(case 3 of 5,case 7 of 8)))
true
(case 1 of 3,(case 3 of 5,case 7 of 8))
case 1 of 3
(case 3 of 5,case 7 of 8)
case 3 of 5
case 7 of 8

John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 14, 2018, 7:24:16 AM11/14/18
to felix google
So .. I think this is the way forward:

I think I will remove ALL handling of compact linear types except for the code generator,

So a pointer is a pointer, and a projection is a projection, and there are no special
terms for compact linear stuff at all. The detection is done in the back end.

What this means is that applications of projections to pointers are irreducible.

So all compact linear types have the same representation at the moment.
It’s a uint64_t. Pointers to *interior* compact linear types all use the same
C++ struct, but with different arguments to the constructor.

The *tricky* bit is that the type of a pointer no longer tells which representation
to use, because the top level compact linear type is pointed at by an ordinary
pointer.

If an ordinary pointer to a compact linear type is projected, the resulting pointer
is a compact linear pointer. If a compact linear pointer is projected, the result
is ALSO a compact linear pointer.
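
A sketch of that typing rule in OCaml, using the (target, path) part of the
BTYP_ptr form described below and a stand-in for islinear_type:

type ty = TInt | TBool | TTuple of ty list

(* Stand-in for the compiler's islinear_type. *)
let rec is_clt = function
  | TBool -> true
  | TTuple ts -> List.for_all is_clt ts
  | TInt -> false

(* Projection typing: given a pointer's (target, path) and the
   projection's codomain c, compute the resulting pointer type. *)
let project (target, path) c =
  match path with
  | [] when is_clt target -> (c, [target])  (* cross into the clt word *)
  | [] -> (c, [])                           (* still an ordinary pointer *)
  | mach :: _ -> (c, [mach])                (* same word, new divisor/modulus *)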

This means, what I first said will not work! The *type* of a pointer MUST indicate
which kind it is. The existing pointer type I recently implemented should do that.
It has the form

BTYP_ptr (mode, target, projectionlist)

where: mode is initially R, W, RW or N (N isn’t implemented yet but the code is there).
The target is what’s pointed at. The projection list is the forward-order list of domains
of the projections that have been applied to get to this pointer type. The codomain
of each is the domain indicated by the next element in the list, or the target for
the last element.

If, after reduction, the projection list is empty, it’s an ordinary pointer;
otherwise it has one element and it’s a compact linear pointer, with
the one element being the type of the containing compact linear type.

The reduction strips non-compact linear types from the list,
after monomorphisation. These will always be the head elements of
the list. If the list is non-empty, then the tail of the list can be discarded:
it represents the nesting of the compact linear type from the
machine value down to the target. We don’t need these in the type.
In the actual representation the chain of compact linear projections
can be reduced to a single projection.

Non-clt projections are just the usual C field accesses (or array accesses
for arrays). So the result will be

machinepointer = &(p->mem_n1.mem_n2.mem_n3 ….)
cltpointer = clptr_t (machinepointer,divisor,modulus)

The modulus is the size of the target. The divisor is the product of the
divisors of the individual projections. This requires knowing the projection
index and the size of the types stored “to the right” of that component.
The type information for that is in the projection itself. We threw that
information out of the type but not the expression term which still
contains the complete list of projections.
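
A sketch of that computation, assuming each component’s “size” is its number
of values and following the rule just stated, that a component’s divisor is
the product of the sizes of the components to its right:

(* sizes: component sizes of a compact linear tuple, leftmost first. *)
let divisor sizes i =
  List.filteri (fun j _ -> j > i) sizes |> List.fold_left ( * ) 1

let modulus sizes i = List.nth sizes i

let () =
  (* For 2 * 3 * 5, component 1 (the 3) has 5 values to its right:
     a packed word w decodes it as (w / 5) mod 3. *)
  let sizes = [2; 3; 5] in
  assert (divisor sizes 1 = 5 && modulus sizes 1 = 3)

For a chain of projections the divisors multiply, exactly as described above.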

More precisely, a chain of projections looks like

apply (prj3, apply (prj2, apply (prj1, pointer)))

Substitute “aprj” for array projections.



John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 15, 2018, 12:31:01 PM11/15/18
to felix google
So the problem is this: a &T has one piece of information and a compact linear
type pointer has two: _pclt<mach,target>.

So, if we have two routines:

fun deref[T] (p:&T) => _deref p;
fun deref[mach,target] (p: _pclt<mach,target>) => _deref p;

overload resolution has to pick one *before* we know the type of a pointer.
After monomorphisation we get the correct type, but its too late: the _deref
would work correctly in each function, but the wrong function may be chosen.


Given a type

T * bool

we don’t know if it’s compact linear until after monomorphisation.

If we use a typeclass, the choice is deferred until after monomorphisation,
but the pointed at value is lost unless we give TWO type parameters to
the class:

class deref[P,V] { virtual fun deref: P -> V; }

which can’t work: the instances would be

instance[T] deref[&T,T] ..
instance[mach,target] deref[_pclt<mach,target>,target] { … }

The problem is the virtual is given a pointer type P, but not V.

Now we should be able to fix this with:

class deref[P] { virtual type V; fun … }

There’s a bug in the routine I need to fix; it barfed with a recursive type
which means I forgot to alpha convert. But even if it worked it would
be useless because the virtual type isn’t fixed until monomorphisation.
We need to know the type of

*p = deref p

during binding to select overloads.

So at the moment it seems that automatic detection of compact linear types
can’t work, because it can only work too late: it works after monomorphisation
but we need the type information during binding, before monomorphisation.

One solution is to explicitly construct compact linear types with
specialised product, sum, and array index operators:

5 * 7 .. ordinary product
5 \* 7 .. compact product

etc. Then

T \* 7

is a compact product and this means T must be constrained to a compact linear type.

Another solution is to use a barrier term

compact< 5 * 7 >


which just means the operators inside are compact ones. It would need to be something more lightweight, like just

< 5 * 7 >

so that say arrays would be

int ^ <5 * 7>

Maybe [] could be used but by symmetry

T[K]

should be a type indexed by a kind. We don’t have these yet and probably won’t.
Note that

vector[T]

is a type, a C++ vector storing Ts. So this notation is no good. If we want to compare
types < and > aren’t so good either. But { } is available.

int ^ {5 * 7}
{ 3 * 2 + 1}

It basically means “pack the value of this type into a uint64_t”.

The point is now, the type tells us exactly where the boundary is.
In particular

2 ^ 3

is not compact linear but

{ 2 ^ 3 }

is. And I guess

{ T }

means a compact linear type variable:

T:COMPACTLINEAR


The PROBLEM is that we get something like:

T = { U }

to solve in unification, what do we do?






John Skaller
ska...@internode.on.net





John Skaller2

unread,
Nov 16, 2018, 10:29:11 AM11/16/18
to felix google

Well! This works as written:

/////////
instance Str[3] { fun str (x:3) => x._strr; }
instance Str[5] { fun str (x:5) => x._strr; }
instance Str[8] { fun str (x:8) => x._strr; }

proc test[T,C:COMPACTLINEAR] (i:T,j:C) {
var x = (1,(i,(true, (j,(`3:5,`7:8)))));
println$ x;
println$ x.1._strr;
println$ (x.1).0._strr;
println$ (x.1).1._strr;
println$ ((x.1).1).0._strr;
println$ ((x.1).1).1._strr;
println$ (((x.1).1).1).0._strr;
println$ (((x.1).1).1).1._strr;
println$ ((((x.1).1).1).1).0._strr;

println$ (*(&x.1))._strr;
println$ (*(&x.1).0)._strr;
println$ (*(&x.1).1)._strr;
/*
println$ (*((&x.1).1).0)._strr;
*/

println$ (*((&x.1).1).1)._strr;
println$ (*(((&x.1).1).1).0)._strr;
println$ (*(((&x.1).1).1).1)._strr;
println$ (*((((&x.1).1).1).1).0)._strr;
println$ (*((((&x.1).1).1).1).1)._strr;
}
test (2,`1:3);
/////////

First .. surprisingly, the instances work! However this:

instance[C:COMPACTLINEAR] Str[C] { fun str(x:C) => x._strr; }

does not. It fails with

Function 13100[bool,3 * (5 * 8)]
instance parent 13096[<T13097> * <T13098>]
instance vs= T<13097>:TYPE,U<13098>:TYPE
defined: /Users/skaller/felix/src/packages/core_type_constructors.fdoc: line 615, cols 4 to 55
614: instance[T,U] Str[T*U] {
615: fun str (t:T, u:U) => "("+str t + ", " + str u+")";
****************************************************
616: }

CLIENT ERROR
[flx_frontend/flx_typeclass.ml:741: E366] No most specialised instance!

Which seems correct. Interesting, _strr delegates to str if necessary,
but here, str is delegating back to _strr. :-)

Anyhow the KEY thing to see in the test case is that this fails:

println$ (*((&x.1).1).0)._strr;

[Flx_lookup.bind_expression] Inner bind expression failed binding (_strr (deref (0 (1 (1 &(x))))))

Flx_lookup:inner_bind_expression: unknown exception File “src/compiler/flx_core/flx_btype.ml”,
line 751, characters 33-39: Assertion failed

And the reason is simple enough: it’s trying to find the size of a type variable.
The interesting thing is that the other cases don’t fail BECAUSE the size isn’t required.

But that is BY LUCK! Because it happens the type variable comes first in the tuple.
If the order were reversed, the size would be needed for all the other projections
and pointer projections. The size isn’t needed for printing the actual type variable
because .. it prints like so:

(typevar?:2,(true,(typevar?:case 1 of 3,(case 3 of 5,case 7 of 8))))

So by LUCK all the value cases work.


Now here’s the thing: with an explicit chain of projections we CAN do the calculations
after monomorphisation, although _strr will NOT work. This is because its a HACK.
The bound type is analysed and _strr generates UNBOUND code which is then bound.
It shouuld generate BOUND code.

If the chain isn’t explicit, for example the projections are stand-alone and stored in variables
first, then applied, so we know the type of the projection but not the actual projection,
it should STILL WORK because we can do the calculations at run time. Although there’s
no way to generate compact linear values and projections at run time at the moment,
the C++ types can do the calculations. After all, it’s just division and modulus, which works
with arbitrary expressions not just constants. Of course .. the compiler generated stuff
is type checked.

The key problem is to pick the right deref function which picks _deref. The problem would
go away if the parser generated _deref. But at the moment, the parser generates
deref, which is overloaded, and both overloads call _deref, but with non-compact and
compact types, respectively.

Overload resolution picks the wrong function. I like the overload because I want the
syntax to work for abstract pointers too.

IN FACT .. it might be interesting to make the abstract pointer type used in all cases
and specialise it after monomorphisation (though, I don’t know how to do that).

There HAS to be a way to make this work by just delaying evaluations that can’t
be done prior to monomorphisation.



John Skaller
ska...@internode.on.net




