How are they not comparable? Do you mean in the same way that lz4 and lz4hc is not comparable? Which only means that they differ a lot, not that they are not actually comparable.
Which versions that are actually not compatible I don't know. What I do know is that the linux command line utility lzo is able to compress something that the linux kernel then can decompress. That indicates compatibility to me. I am sure there are lzo compressors that compress in a manner that the linux kernel cannot decompress, I have not tested those, nor do I think they are relevant.
Regarding the benchmark, you say it was done on pc, and you also say that unaligned access is most efficient on pc, but the tests chart does not indicate which de/compressors were run using it or not.
"1. ARMv7, 1.5GHz based board
Kernel: linux 3.4
Uncompressed Kernel Size: 14MB
Compressed Size Decompression Speed
LZO 6.7MB 20.1MB/s, 25.2MB/s(UA)
LZ4 7.3MB 29.1MB/s, 45.6MB/s(UA)
2. ARMv7, 1.7GHz based board
Kernel: linux 3.7
Uncompressed Kernel Size: 14MB
Compressed Size Decompression Speed
LZO 6.0MB 34.1MB/s, 52.2MB/s(UA)
LZ4 6.5MB 86.7MB/s
- UA: Unaligned memory Access support
- Latest patch set for LZO applied"
I agree performance is not the same using the above kind of arm architecture or a modern pc, but the relative performance increases you see from unaligned access is comparable I'd say.
Anyway, my main point is that just looking at the chart like I did, and then comparing results from my kernel compressed with lzo and lz4hc, it is inconsistent. The reason for that is that the chart uses another lzo compressor than the command line utility on most linuxes. That was what was unclear for me, until now. Please correct me if I'm still wrong! =)
As a side note, the actual command used to compress the lz4 kernel is lz4c -l -c1. I have not found any documentation for the -l switch just yet. (My guess is legacy format)