Thanks Scott. The RECORD* params definitely address the discrepancy between the data written and my observed disk space growth (I left them at their defaults, which the docs say is 1.2). So, setting these to 1 would essentially remove any fluff from the record. That's potentially good, and since it's settable on a per-cluster basis, is something that can be easily left to the database architect, based on their knowledge of how a particular class/cluster will be used (i.e. zero/few updates vs. lots of frequent updates, as well as they kind of updates).
In my particular case, at the time the record is initially written, the "whole" record will be known, but there will be two possible update scenarios: 1) Data updated "in place" with no size change (i.e. updating the value of an INTEGER); and 2) adding a lightweight edge. I'll assume that scenario #1 does not fragment the record, since it's storage size doesn't change (???), but what happens in the case of adding an edge (or several)? In that case, I assume that each edge will increase the size of the record (but by how much?)?
What might be ideal is a way to specify on a per-record (or per-cluster) basis a specific number of padding bytes, when this is known in advance. Here again, for many database applications, adding padding is probably a good idea (although MongoDB's 2X recommendation seems pretty wasteful), but for applications that are storing billions of records, that overhead adds up quick (disk space may be "cheap", but anything multiplied by 1B is still a lot).
Any thoughts on how COMPRESSION may help (or hurt) this? I assume it would be very efficient at removing fluff from a record, but I've also seen comments that would suggest that COMPRESSION isn't very efficient. My guess is that the "padding" is applied after the compression, since the point of the padding is to leave some free physical space in the disk storage (i.e. compressing after padding would result in most updated records taking up more physical space, which defeats the purpose). Can anyone from Orient explain this in more detail--specifically how COMPRESSION relates the physical disk storage and padding?
--Eric