Hi Palo,
1. Yes, this version uses the new register allocator that uses liveness analysis information. It should handle jumps/loops much better than previous one, but there are still areas for improvements, especially when it comes to 'modified'/'unmodified' state changes.
2. Yes, currently you can assign immediates to gp arguments (probably the most used), but I will extend for sure to allow float/mm/xmm arguments as well.
3. Well, from my private tests, Assembler speed doubled and Compiler speed is similar to the old asmjit, but in some cases it can be a bit slower if it uses look ahead extensively (there is look-ahead feature that looks if a variable will be allocated to a specific register at deciding step. I plan to put this to API so you can control it).
Hope that helps.
If you have a framework to test the performance and also the validity of the code generated I would be glad to see some real numbers. My benchmarks were mostly synthetic.