Hi, I am curious how bup splits large files in which a portion near the beginning may change, grow, or shrink while most of the file stays the same.
An example would be a database dump where most of the dump is unchanged, but the first table that gets dumped has a few rows added. I am worried that bup will see the first portion as the same but generate new hashes for the rest of the file because the beginning of the file changed.
To illustrate my concern visually:
If I am backing up a file like:
1234567890
and bup splits it after every second character (the pipes mark the boundaries of the chunks that would be hashed)
12|34|56|78|90
and then the file has a 2 inserted after the first 2
12|23|45|67|89|0
then it will generate a new hash for every chunk except the first.
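
To make the concern concrete, here is a small Python sketch of the fixed-size splitting from the example above. This is only my illustration, not bup's code: fixed_chunks and chunk_hashes are names I made up, and SHA-1 is just a stand-in for whatever hash is actually used.

```python
import hashlib

def fixed_chunks(data, size=2):
    """Hypothetical splitter: cut the data every `size` bytes."""
    return [data[i:i + size] for i in range(0, len(data), size)]

def chunk_hashes(chunks):
    """Hash each chunk independently (SHA-1 is only a stand-in)."""
    return [hashlib.sha1(c).hexdigest() for c in chunks]

before = fixed_chunks(b"1234567890")    # 12|34|56|78|90
after = fixed_chunks(b"12234567890")    # 12|23|45|67|89|0

# Hashes that appear in both versions and so would not need storing again.
reused = set(chunk_hashes(before)) & set(chunk_hashes(after))

print(before)
print(after)
print(len(reused))  # 1: only the "12" chunk is shared
```

With fixed offsets, a single inserted byte shifts every later boundary, so only the chunk before the insertion is reused.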
My question is whether bup uses some sort of "look for me" text as markers that act as checkpoints where a new hashed section begins, similar to how Git/SVN split based on (\n|\r)+
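
Here is a rough Python sketch of the kind of marker-based splitting I have in mind. It is only my guess at the general idea and not a claim about what bup does: marker_chunks, the 16-byte window, the mask, and the random test data are all made up for illustration. A chunk ends wherever the sum of the last 16 bytes has its low 6 bits all zero, so the boundaries depend on nearby content rather than on the offset in the file.

```python
import random

def marker_chunks(data, window=16, mask=0x3F):
    """Hypothetical splitter: end a chunk after any byte where the sum of
    the last `window` bytes has all of the `mask` bits equal to zero."""
    chunks, start, rolling = [], 0, 0
    for i, byte in enumerate(data):
        rolling += byte
        if i >= window:
            rolling -= data[i - window]  # drop the byte that left the window
        if (rolling & mask) == 0:
            chunks.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        chunks.append(data[start:])  # trailing bytes after the last marker
    return chunks

# Made-up test data: 20000 pseudo-random bytes, then one byte inserted
# near the beginning, like the extra rows in the first dumped table.
rng = random.Random(0)
original = bytes(rng.getrandbits(8) for _ in range(20000))
modified = original[:5] + b"X" + original[5:]

old_chunks = marker_chunks(original)
new_chunks = marker_chunks(modified)

# Old chunks that no longer appear after the insertion.
changed = set(old_chunks) - set(new_chunks)
print(len(old_chunks), len(new_chunks), len(changed))
```

If splitting works anything like this, the boundaries line up again shortly after the inserted byte, so only the chunks around the change would get new hashes.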
I am sorry if I did not explain my question well. If I need to elaborate on any part of it, let me know.
Thanks,
-Nathan