Using warcvalidator

28 views
Skip to first unread message

Gary W

unread,
Aug 10, 2010, 11:23:25 AM8/10/10
to warc tool list
I am building a converter to change our
WebBase format files to WARC.
I built a WB->ARC converter before.
I picked up a warc record from the ISO doc and got:
<gary@WB6 WB2WARC 8:15am> WARC-tools/warc-tools/app/warcvalidator -f
outputs/iso-standard-doc-ex.warc -v
> debug: lib/private/wfsmhdl.c :585:"expecting a valid WARC ID"
error in FSM state address 0x80611a0, at offset 0 in The Warc File
invalid
<gary@WB6 WB2WARC 8:15am>

but that output is not very helpful to me.
I get the same thing whatever output file I test.

Am I using the right tool?
Does someone have a small example file I can validate
the validator with?

Gary
--

Nothing is more conducive to peace of mind than not having any opinions
at all.
-- Georg Christoph Lichtenberg

iso-standard-doc-ex.warc

Alex Osborne

unread,
Aug 10, 2010, 7:16:58 PM8/10/10
to warc-...@googlegroups.com
Hi Gary,

Gary W <ga...@db.stanford.edu> writes:

> > debug: lib/private/wfsmhdl.c :585:"expecting a valid WARC ID"
> error in FSM state address 0x80611a0, at offset 0 in The Warc File
> invalid
>

> Am I using the right tool?
> Does someone have a small example file I can validate
> the validator with?

I think Hanzo's warc-tools validate against a draft version (0.18) of
the specification. If you look in ./lib/private/wversion.h you'll
probably find this:

#define WARC_VERSION "WARC/0.18"

Try changing that to:

#define WARC_VERSION "WARC/1.0"

and see if you have any better luck with it.

Cheers,

Alex

Reply all
Reply to author
Forward
0 new messages