You can use a cast, and force one type in the pattern, then use the
other one in a Pat:
def VGETITEM:
[(set GPR:$rD, (extractelt (v4i32 VR:$rA), GPR:$rB))]
def: Pat<(extractelt (v4f32 VR:$rA), GPR:$rB)),
(VGETITEM VR:$rA, GPR:$rB)>;
-Krzysztof
--
Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum,
hosted by The Linux Foundation
_______________________________________________
LLVM Developers mailing list
llvm...@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
You can use a cast, and force one type in the pattern, then use the other one in a Pat:
def VGETITEM:
[(set GPR:$rD, (extractelt (v4i32 VR:$rA), GPR:$rB))]
def: Pat<(extractelt (v4f32 VR:$rA), GPR:$rB)),
(VGETITEM VR:$rA, GPR:$rB)>;
-Krzysztof
--
Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, hosted by The Linux Foundation
_______________________________________________
LLVM Developers mailing list
llvm...@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
That is kind of hard to answer satisfactorily. I had done compiler
development for 8 years before moving on to LLVM, so the understanding
of how compilers work was not a problem. The rest was essentially
reading the code and writing my own. The beginnings are slow and
painful, but the more information you absorb, the faster it becomes.
There are some general principles of compiler development, namely that
you start having a lot of high-level information about the program
structure, and then the "granularity" increases: the level of detail in
the representation increases at the cost of losing the high-level
information. For example, early on, loops and loop nests may be
structured nicely, making them easy to optimize, but then some branches
may become folded, or optimized and the CFG may no longer be so clear.
So, you perform loop nest optimizations before that happens. Then you
run passes that are not concerned with the high-level structures, then
you run passes that look into even more details, and so on. In case of
LLVM, first you have a bunch of passes that do target-independent things
on the LLVM IR, then the influence of target-dependent information (like
TTI) increases, then you have the selection DAG, then the DAG is
legalized, then instructions are selected. After that you have MI with
SSA, then register allocation begins and you have MI without SSA, then
the register allocation ends and you have physical registers. Then
machine functions get prolog and epilog, then the instructions are
lowered to the MC layer, then that is printed (in text format, or
encoded) into the output stream. Each of these stages has certain
properties and the passes that run there utilize (and usually preserve)
these properties. The actual details are basically only visible in the
sources, but if you have a general idea about what is happening, these
details will be fairly understandable.
The TableGen? That was a painstaking trial and error. :)
That is kind of hard to answer satisfactorily. I had done compiler development for 8 years before moving on to LLVM, so the understanding of how compilers work was not a problem. The rest was essentially reading the code and writing my own. The beginnings are slow and painful, but the more information you absorb, the faster it becomes.
There are some general principles of compiler development, namely that you start having a lot of high-level information about the program structure, and then the "granularity" increases: the level of detail in the representation increases at the cost of losing the high-level information. For example, early on, loops and loop nests may be structured nicely, making them easy to optimize, but then some branches may become folded, or optimized and the CFG may no longer be so clear. So, you perform loop nest optimizations before that happens. Then you run passes that are not concerned with the high-level structures, then you run passes that look into even more details, and so on. In case of LLVM, first you have a bunch of passes that do target-independent things on the LLVM IR, then the influence of target-dependent information (like TTI) increases, then you have the selection DAG, then the DAG is legalized, then instructions are selected. After that you have MI with SSA, then register allocation begins and you have MI without SSA, then the register allocation ends and you have physical registers. Then machine functions get prolog and epilog, then the instructions are lowered to the MC layer, then that is printed (in text format, or encoded) into the output stream. Each of these stages has certain properties and the passes that run there utilize (and usually preserve) these properties. The actual details are basically only visible in the sources, but if you have a general idea about what is happening, these details will be fairly understandable.
The TableGen? That was a painstaking trial and error. :)
-Krzysztof
--
Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, hosted by The Linux Foundation
def VGETITEM:
[(set GPR:$rD, (extractelt (v4i32 VR:$rA), GPR:$rB))]
def: Pat<(extractelt (v4f32 VR:$rA), GPR:$rB)),
(VGETITEM VR:$rA, GPR:$rB)>;
-Krzysztof
--
Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, hosted by The Linux Foundation
_______________________________________________
LLVM Developers mailing list
llvm...@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
Try changing this Pat:
> // Cast load of a floating point vector to use the same
> // operation as a load of an integer vector.
> def: Pat<(set (v4f32 VR:$rD), (load ADDRri:$src)),
> (VLWZ VR:$rD, ADDRri:$src)>;
to
def: Pat<(v4f32 (load ADDRri:$src)),
(VLWZ ADDRri:$src)>;
That is, remove the "set" from the input pattern and the output operand
from the output pattern.
Generally, Pats only contain input operands.
If you have an error message like the one above, one easy way to locate
the problem is to look at the list of all records generated by table-gen:
$ llvm-tblgen -print-records -I /path/to/llvm/lib/Target/<target> -I
/path/to/llvm/lib/Target -I /path/to/llvm/include
/path/to/lib/Target/<target>/<target>.td -o output_file
(This is the same invocation of table-gen as for any other purpose,
except for the "-print-records" option instead of the typical -gen-...
options.)
The output_file will then contain all the records that came out of the
.td files, for example:
def anonymous_1405 { // Pattern Pat T_CMP_pat
dag PatternToMatch = (i1 (seteq (i32 IntRegs:$src1), s10ImmPred:$src2));
list<dag> ResultInstrs = [(C2_cmpeqi IntRegs:$src1, s10ImmPred:$src2)];
list<Predicate> Predicates = [];
int AddedComplexity = 0;
string NAME = ?;
}
You can look for "anonymous_570" in your case and see where it came
from. The comment after "def anonymous..." is a list of classes from
which this definition was inherited.