--
You received this message because you are subscribed to the Google Groups "java-serialization-benchmarking" group.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-be...@googlegroups.com.
To post to this group, send email to java-serializat...@googlegroups.com.
Visit this group at http://groups.google.com/group/java-serialization-benchmarking?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
Size of resulting messages is already included in results isn't it?
Or are you asking for something differnet?
-+ Tatu +-
On Fri, Apr 26, 2013 at 10:33 AM, diegocip . <die...@brturbo.com.br> wrote:
Hi,
would be of great interest to have the serialized content for each "technology" put on the results, considering binary to be put on a binary text representation like hexadecimal. Its unacceptable that text protocols, in particular XML based protocols, have less size of serialized content than XML ones, so its of importance to have the serialized data to do some analyses of the cons and pros. Hessian 4 will use 4 bytes for a 32 bit int, as its for Date support in minutes, and floating point numbers. On the other side, text based protocols, will have one byte (according to the charset, can be two), per character or digit of a number, and one more for the decimal point in decimal numbers. For date that would be (mm/yy/aaaa hh:mm) comparted to 4 bytes of Hessian or other binary protocol, I really like to see the serialized result without the need to run the test myself.
Thanks all,
Congrats,
Diego C Nascimento.
--
You received this message because you are subscribed to the Google Groups "java-serialization-benchmarking" group.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-benchmarking+unsubscribe@googlegroups.com.
To post to this group, send email to java-serialization-benchm...@googlegroups.com.
Thanks for repling. Yes it is, what I said is including the serialized content, to analise the pros and cons it is giving. A XML based protocol given less size than a binary one, and for Hessian 2, its really weird results, without compactation!
So giving the serialization result (text for text protocols and hexadecimal notation for example, for binary protocols) it
should be of great help to know why some protocols that may generate very small datasizes compared to any text based
one is generating bigger sizes. A test with a medium quantity list of the object in the test would be of great help, with random numbers, as with fixed numbers GZIP or other compactator would get great benefit on it.
Thanks,
Diego
On Friday, April 26, 2013 2:46:56 PM UTC-3, cowtowncoder wrote:Size of resulting messages is already included in results isn't it?Or are you asking for something differnet?-+ Tatu +-On Fri, Apr 26, 2013 at 10:33 AM, diegocip . <die...@brturbo.com.br> wrote:Hi,
would be of great interest to have the serialized content for each "technology" put on the results, considering binary to be put on a binary text representation like hexadecimal. Its unacceptable that text protocols, in particular XML based protocols, have less size of serialized content than XML ones, so its of importance to have the serialized data to do some analyses of the cons and pros. Hessian 4 will use 4 bytes for a 32 bit int, as its for Date support in minutes, and floating point numbers. On the other side, text based protocols, will have one byte (according to the charset, can be two), per character or digit of a number, and one more for the decimal point in decimal numbers. For date that would be (mm/yy/aaaa hh:mm) comparted to 4 bytes of Hessian or other binary protocol, I really like to see the serialized result without the need to run the test myself.
Thanks all,
Congrats,
Diego C Nascimento.
--
You received this message because you are subscribed to the Google Groups "java-serialization-benchmarking" group.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-benchmarking+unsubscribe@googlegroups.com.
To post to this group, send email to java-serialization-benchm...@googlegroups.com.
Visit this group at http://groups.google.com/group/java-serialization-benchmarking?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to the Google Groups "java-serialization-benchmarking" group.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-be...@googlegroups.com.
To post to this group, send email to java-serializat...@googlegroups.com.
When I first encountered the project I also thought it was important to go into a little more detail and created the first version of the feature comparison table.
I don't have strong objections to your idea, but it does seem like a lot of work that nobody here feels motivated to do :-P
If you were to write something up and send it to the list, I and others would probably be happy to provide feedback. If it looks like a fair comparison, we could put it on the site.
There's always the question of whether something like that will stay maintained, but I think if it ever gets too out of date we can just remove it.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-benchmarking+unsubscribe@googlegroups.com.
To post to this group, send email to java-serialization-benchm...@googlegroups.com.
I dont agree with your first argument, experient developers with caution on the bandwidth probably will have interest in this, and high level languages developers will have advantages for comparasion. In fact some developers my be interested and dont have posted their interest.
As I said, the results are not the expected, the expected is like this: http://census2.jamesward.com/ It even dont have hessian 2 that can perform better than AMF. So provinding the serialized content would help developers examine why this is happing. I dont have time to examine all the serialization protocols so this is the help it would provide.
Just taking like a 0-31 lenght string in Hessian it would be one byte plus the string, textual representation would need
quotes, that would be two bytes, and for special characters need one scape char for each, this need dont exist for
binary. XML would need "<" and ">" just to open and close tags, and is common to need a closing tag. So its expected that the Hessian have lower size.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-be...@googlegroups.com.
To post to this group, send email to java-serializat...@googlegroups.com.
To post to this group, send email to java-serialization-benchmarking...@googlegroups.com.
Visit this group at http://groups.google.com/group/java-serialization-benchmarking?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-be...@googlegroups.com.
To post to this group, send email to java-serializat...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to java-serialization-be...@googlegroups.com.
To post to this group, send email to java-serializat...@googlegroups.com.
I think I now see what Diego is getting at. The goal is not to explain the formats to someone who doesn't know. It is to help legitimize the testing methodology to someone who is confused by the results.
Case in point: Diego saw the results and was confused about how Hessian could produce larger output than some of the XML formats. This makes him doubt the legitimacy of the results.
If he was wondering why Hessian is bigger than "xml/fastinfo", then he could have looked at the output and realized that "xml/fastinfo" is actually a binary representation of XML. (Tangent: maybe the word "binary" should be somewhere in the name?)
Or maybe he was wondering why Hessian is bigger than "xml/xstream+c". And when I just went to look at it, I was also confused about why "xml/xstream+c" XML is smaller than the other XML formats. The "tool behavior" page explains why, but that page is no longer linked from the front page. (Tangent: turns out the "+c" means we used abbreviated tag/attribute names, which we stopped doing for JSON and should stop doing for XML.)
So having a single page with a hex+ascii dump of the serialized data makes sense to me. If you'd like to add an option to the benchmark runner that produces this, I'd be in favor of linking to it.
Would this be dump of canonical data, in sort of BNF-style form? That is, not really binary dump, but explanation of structures tests use.I assumed suggestions was to display hex dumps of all serialized output, with explanation, and I was not sure that was worth the effort (compared to project pages of tools, or format definition pages).
But if this would be explanation of the logical data model, yes, I could see this being useful. Along with links to more information, if such exists.I did realize (after responding) that likely starting point was indeed "but how could XML be more compact than Hessian" -- valid question of course -- and perhaps those kinds of FAQ would make sense to address?
-+ Tatu +