The intermediate representation is custom-tailored to ANI
programs. For example, there are no functions or function calls
anywhere -- only chunks of code that can be "dispatched" by the
runtime. We also have locking as a primitive in the IR, since proper
synchronization of data is such a big deal for the language. There are
some traditional things like expressions and assignments in there, but
since ANI's execution model flies in the face of conventional
imperative programming, I felt that an IR specially tailored to ANI
would be the easiest way to go.
>
> Out of interest, is there a reason you're not using something slightly
> higher level like LLVM? I know compiling it is a slog, but you get
> optimization and portability for free and good optimization frameworks
> are man-years of effort. From the looks of genner.h it wouldn't be
> difficult to translate between the different IRs. (LLVM uses SSA, but
> it's not pure, it has Load and Store instructions that you can use and
> then you can get it to translate into pure SSA, and then binary). It
> doesn't look like it would be that difficult to hack something
> together.
We're not compiling to LLVM or similar frameworks because they don't
give nearly the amount of control we need to make ANI programs run
with their massive parallelism. Not only do we need native
instruction-level granularity in our thread dispatching (which
compiling to LLVM wouldn't give us), but anic does some really
unconventional things to allow its level of multithreading -- no
existing framework I know of supports the stuff we're planning to do.
For example, we derive a logical partitioning scheme for the entire
memory space during compilation so that object data can be statically
proven never to collide, even under multithreading, and almost all
references are bound via offset rather than pointer. This means that
we need full control over how the program is linked and the exact
offset at which each component is linked relative to the rest. A
framework that takes this level of control away from us would ruin our
compilation design.
It's true that full optimizations aren't easy to do from scratch, but
there's really no other way to go. anic compiles quite differently
from other compilers, trading binary flexibility for statically safe
parallelism. Such an ambitious goal requires rethinking the execution
model as a whole, and every intermediate framework I've seen is
incompatible at the design level with what we're doing.
Adrian
(project lead)
I just looked at nanojit, and although it looks interesting, it
doesn't seem like a good fit for anic. We already have our own IR that
suffices and integrates well with the language, and I think the effort
of learning nanojit and molding our IR to its exact spec, plus the
danger of relying on a third-party codebase (and being impacted by its
bugs), is too great for a native compiler like this one.
Doing it ourselves will probably be quicker than learning nanojit's
prescribed format and integrating with it. Web browsers can afford to
crash or misbehave when incorrect code is produced once in a million
cases; anic producing subtly incorrect parallel code because of the
tiniest problem in nanojit would not only be unacceptable, but a
nightmare to debug and a publicity blow the language can't afford to
take.
Those are just my personal thoughts on the matter, but I reserve the
right to be convinced of the merits of integrating with a back-end
platform that proves acceptable -- I just haven't seen one that looks
like it would work well enough (yet).
>
> Thanks, I hope I'm not bothering you,
Don't worry -- not at all.
Adrian
Since all ANI programs can be represented as trees, we treat the
entire memory space as an array representing a tree, which very
closely corresponds to the layout of the data structures represented
in source. Of course, different parts of the memory space are
represented by trees of different arity (binary, ternary, etc.) --
anic handles the proper mapping. Threads are dispatched onto known
offsets into the correct data structures (this is all calculated by
the compiler), and since this is a top-down tree, dispatching properly
guarantees no data collisions. Large parts of the tree are passed
around by entangling the tree and swapping child nodes, avoiding the
overhead of ever actually copying any tree data.
It gets even more complicated when a child needs more memory: it
recursively looks to steal memory from its siblings.
I plan on writing a document about how all of this works in anic soon
(since I'm working on the partitioner/allocator right now).
Cheers,
Adrian
On 20 Aug., 00:25, Ultimus Freelance <ulti...@gmail.com> wrote:
lightning, maybe -- just simple cpp macros, no lib in between.
My self-written perl5 JIT is also super simple, but it starts to get
bigger and bigger the more CPUs and different data structures you have
to support.
And asmdump is just the first step.
You probably also need a second linking step, and to dump the COFF/ELF
header.
JIT'ting would be much easier.
Do you need PIC, label fixups?
Don't you need to link some runtime library?
Reini Urban