I tried to dump the first bytes of the HTML as follows (hopefully without any charset conversion):
BufferedOutputStream bos = new BufferedOutputStream(fs.create(new Path(bytearrayDump), true));
0000000: 3c 21 44 4f 43 54 59 50 45 20 48 54 4d 4c 20 50 <!DOCTYPE HTML P
0000010: 55 42 4c 49 43 20 22 2d 2f 2f 57 33 43 2f 2f 44 UBLIC "-//W3C//D
0000020: 54 44 20 48 54 4d 4c 20 34 2e 30 31 2f 2f 45 4e TD HTML 4.01//EN
0000030: 22 20 22 68 74 74 70 3a 2f 2f 77 77 77 2e 77 33 " "http://www.w3
0000040: 2e 6f 72 67 2f 54 52 2f 68 74 6d 6c 34 2f 73 74 .org/TR/html4/st
0000050: 72 69 63 74 2e 64 74 64 22 3e 0a 3c 68 74 6d 6c rict.dtd">.<html
0000060: 20 78 6d 6c 6e 73 3a 66 62 3d 22 68 74 74 70 3a xmlns:fb="http:
0000070: 2f 2f 77 77 77 2e 66 61 63 65 62 6f 6f 6b 2e 63 //www.facebook.c
0000080: 6f 6d 2f 32 30 30 38 2f 66 62 6d 6c 22 20 78 6d om/2008/fbml" xm
0000090: 6c 6e 73 3a 6f 67 3d 22 68 74 74 70 3a 2f 2f 6f lns:og="http://o
00000a0: 70 65 6e 67 72 61 70 68 70 72 6f 74 6f 63 6f 6c pengraphprotocol
00000b0: 2e 6f 72 67 2f 73 63 68 65 6d 61 2f 22 3e 0a 3c .org/schema/">.<
00000c0: 68 65 61 64 3e 0a 3c 6d 65 74 61 20 68 74 74 70 head>.<meta http
00000d0: 2d 65 71 75 69 76 3d 22 63 6f 6e 74 65 6e 74 2d -equiv="content-
00000e0: 74 79 70 65 22 20 63 6f 6e 74 65 6e 74 3d 22 74 type" content="t
00000f0: 65 78 74 2f 68 74 6d 6c 3b 20 63 68 61 72 73 65 ext/html; charse
0000100: 74 3d 75 74 66 2d 38 22 2f 3e 0a 3c 6c 69 6e 6b t=utf-8"/>.<link
...
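For anyone who wants to reproduce a dump in this layout directly from a `byte[]`, here is a minimal sketch. It assumes plain Java with no Hadoop dependency, and the names `HexDump` and `dump` are hypothetical, not part of my job code. It works on the raw bytes only, so no charset decoding can alter them on the way out.

```java
import java.nio.charset.StandardCharsets;

final class HexDump {
    /** Formats bytes as xxd-style rows: 7-digit hex offset, up to 16 hex bytes, ASCII column. */
    static String dump(byte[] data) {
        StringBuilder out = new StringBuilder();
        for (int off = 0; off < data.length; off += 16) {
            out.append(String.format("%07x: ", off));
            StringBuilder ascii = new StringBuilder();
            for (int i = 0; i < 16; i++) {
                if (off + i < data.length) {
                    int b = data[off + i] & 0xff; // mask so negative bytes are not sign-extended
                    out.append(String.format("%02x ", b));
                    // Printable ASCII is shown as-is; every other byte is shown as '.'
                    ascii.append(b >= 0x20 && b < 0x7f ? (char) b : '.');
                } else {
                    out.append("   "); // pad a short final row so the ASCII column lines up
                }
            }
            out.append(ascii).append('\n');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Sample input only; in practice the bytes would come from the fetched page.
        byte[] sample = "<!DOCTYPE HTML PUBLIC".getBytes(StandardCharsets.US_ASCII);
        System.out.print(dump(sample));
    }
}
```

Comparing such a dump taken before the bytes are written with one taken after they are read back should show whether a conversion happens anywhere in between.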