Thanks. I do reference the JobDefs "DefaultNDMPJob" in the Job definition.
Here is an example:
Job {
Name = NDMP-NFS-boaz3-backup
Client = boaz3-ndmp
JobDefs = DefaultNDMPJob
Schedule = "WeeklyCycle"
Write Bootstrap = /var/lib/bareos/boaz3-ndmp.bsr
Full Backup Pool = Monthly
Incremental Backup Pool = Daily
Differential Backup Pool = Weekly
Fileset = NDMP-NFS-Fileset
}
The only difference I see in protocol, is that for the client, and the
storage, I specified NDMPv4. (bareos-dir -t -v -c
/etc/bareos/bareos-dir.conf fails otherwise). For the jobdefs, it gave
me an error with "Protocol = NDMPv4", so I used "Protocol = NDMP" like
the example in the bareos_ndmp_readme.txt.
29-Sep 13:39 bareos-dir: ERROR TERMINATION at dird_conf.c:2007
Config error: Expected a Protocol Type keyword, got: NDMPv4
: line 68, col 19 of file /etc/bareos/bareos-dir.conf
Protocol = NDMPv4
I commented the "Protocol = NDMPv4" line entirely, and now have a
different error.
*m
29-Sep 14:38 orlbacula-dir JobId 145729: No prior Full backup Job record
found.
29-Sep 14:38 orlbacula-dir JobId 145729: No prior or suitable Full
backup found in catalog. Doing FULL backup.
29-Sep 14:38 orlbacula-dir JobId 145729: Start Backup JobId 145729,
Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
29-Sep 14:38 orlbacula-dir JobId 145729: Using Device "Drive-0" to write.
29-Sep 14:38 orlbacula-dir JobId 145729: Fatal error: Unable to
authenticate with File daemon at "
10.10.10.52:10000". Possible causes:
Passwords or names not the same or
Maximum Concurrent Jobs exceeded on the FD or
FD networking messed up (restart daemon).
Please see
http://doc.bareos.org/master/html/bareos-manual-main-reference.html#AuthorizationErrors
for help.
29-Sep 14:38 orlbacula-dir JobId 145729: Error: getmsg.c:196 Malformed
message: T
29-Sep 14:38 orlbacula-dir JobId 145729: Fatal error: bsock.c:551 Packet
size too big from "Client: boaz4-ndmp:
10.10.10.52:10000. Terminating
connection.
29-Sep 14:38 orlbacula-dir JobId 145729: Fatal error: No Job status
returned from FD.
29-Sep 14:38 orlbacula-dir JobId 145729: Error: Bareos orlbacula-dir
13.2.2 (12Nov13):
Build OS: x86_64-unknown-linux-gnu redhat CentOS
release 6.2 (Final)
JobId: 145729
Job: NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
Backup Level: Full (upgraded from Incremental)
Client: "boaz4-ndmp"
FileSet: "NDMP-NFS-Fileset" 2014-09-03 16:46:01
Pool: "Monthly" (From Job FullPool override)
Catalog: "MyCatalog" (From Client resource)
Storage: "tape" (From Job resource)
Scheduled time: 29-Sep-2014 14:38:11
Start time: 29-Sep-2014 14:38:21
End time: 29-Sep-2014 14:38:26
Elapsed time: 5 secs
Priority: 10
FD Files Written: 0
SD Files Written: 0
FD Bytes Written: 0 (0 B)
SD Bytes Written: 0 (0 B)
Rate: 0.0 KB/s
Software Compression: None
VSS: no
Encryption: no
Accurate: no
Volume name(s):
Volume Session Id: 8
Volume Session Time: 1412014705
Last Volume Bytes: 0 (0 B)
Non-fatal FD errors: 2
SD Errors: 0
FD termination status: Error
SD termination status: Waiting on FD
Termination: *** Backup Error ***
*
Looking at the network traffic, I see the following.
ngrep -d any -nn "" "port 9103 or 9102 or 9101 or 10000" -s1514 -q -Wbyline
interface: any
filter: ( port 9103 or 9102 or 9101 or 10000 ) and (ip or ip6)
T
127.0.0.1:52911 ->
127.0.0.1:9101 [AP]
....yes
T
127.0.0.1:9101 ->
127.0.0.1:52911 [AP]
....Job queued. JobId=145729
T
127.0.0.1:9101 ->
127.0.0.1:52911 [AP]
....You have messages.
....
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
...%Hello Director orlbacula-dir calling
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
...9auth cram-md5 <1950766065.1412015901@orlbacula-sd> ssl=0
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
....password.
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
....1000 OK auth
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
...:auth cram-md5 <1501389973.1412015901@orlbacula-dir> ssl=0
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
....password.
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
....1000 OK auth
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
....3000 OK Hello
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
...^JobId=145729 job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
job_name=NDMP-NFS-boaz4-backup client_name=boaz4-ndmp type=66 level=70
FileSet=NDMP-NFS-Fileset NoAttr=0 SpoolAttr=0
FileSetMD5=669foH/s29+RP7+4a+Qu/C SpoolData=0 PreferMountedVols=1
SpoolSize=0 rerunning=0 VolSessionId=0 VolSessionTime=0 Quota=0
Protocol=0 BackupFormat=dump DumpLevel=0
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
...[3000 OK Job SDid=8 SDtime=1412014705
Authorization=IBFM-NIOA-MPKM-GGMC-MFHA-PMDA-CNBK-IPOB
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
...fuse storage=tape media_type=ULTRIUM-LTO-4 pool_name=Monthly
pool_type=Backup append=1 copy=0 stripe=0
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
....use device=Autochanger
........
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
...ZCatReq Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18 GetVolInfo
VolName=WBT022 write=1
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
...L1000 OK VolName=WBT022 VolJobs=2 VolFiles=221 VolBlocks=1696807
VolBytes=109464477696 VolMounts=2 VolErrors=0 VolWrites=1696808
MaxVolBytes=0 VolCapacityBytes=0 VolStatus=Append Slot=20 MaxVolJobs=0
MaxVolFiles=0 InChanger=1 VolReadTime=7560 VolWriteTime=13651952500
EndFile=219 EndBlock=7306 LabelType=0 MediaId=353 EncryptionKey=
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
..."3000 OK use device device=Drive-0
T
172.18.2.31:36318 ->
172.18.2.31:9103 [AP]
....run
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
...EStatus Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18 JobStatus=70
T
10.10.10.17:55335 ->
10.10.10.52:10000 [AP]
...%Hello Director orlbacula-dir calling
T
10.10.10.17:55335 ->
10.10.10.52:10000 [AP]
...%Hello Director orlbacula-dir calling
T
10.10.10.52:10000 ->
10.10.10.17:55335 [AP]
...<....T)..............................Connection successful...
T
172.18.2.31:36320 ->
172.18.2.31:9103 [AP]
...%Hello Director orlbacula-dir calling
T
172.18.2.31:9103 ->
172.18.2.31:36320 [AP]
...9auth cram-md5 <
2074485908.1412015906@orlbacula-sd> ssl=0
T
172.18.2.31:36320 ->
172.18.2.31:9103 [AP]
....password.
T
172.18.2.31:9103 ->
172.18.2.31:36320 [AP]
....1000 OK auth
T
172.18.2.31:36320 ->
172.18.2.31:9103 [AP]
...9auth cram-md5 <619509447.1412015906@orlbacula-dir> ssl=0
T
172.18.2.31:9103 ->
172.18.2.31:36320 [AP]
....password.
T
172.18.2.31:36320 ->
172.18.2.31:9103 [AP]
....1000 OK auth
T
172.18.2.31:9103 ->
172.18.2.31:36320 [AP]
....3000 OK Hello
T
172.18.2.31:36320 ->
172.18.2.31:9103 [AP]
...8cancel Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
T
172.18.2.31:9103 ->
172.18.2.31:36320 [AP]
...\3000 JobId=145729 Job="NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18"
marked to be canceled.
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
....
T
172.18.2.31:9103 ->
172.18.2.31:36318 [AP]
........
T
172.18.2.31:9103 ->
172.18.2.31:36320 [AP]
....
T
172.18.2.31:36320 ->
172.18.2.31:9103 [AP]
....
To me it looks like authentication is working, but then the job is
getting canceled by bareos. Perhaps this error is the revelant one:
"Packet size too big from".