Hi James,
The bulk_extractor build in BitCurator is not (currently) compiled with lightgrep support, so it defaults to the RE2 engine. RE2 does not support lookarounds because it guarantees linear performance in the size of input pattern and text - a guarantee that could not (in its current form - there's some new-ish research out there about this) be made if accepting constructs that required backtracking. Lightgrep *also* does not support constructs that require backtracking, as it is similarly geared towards high performance, single-pass scans of large inputs.
One possible option would be to run the find scan with your regexes as-is (with an appropriately-sized context window) and then filter the results from the report with something like pcre2grep (found in pcre2-utils in Ubuntu), which *does* support lookarounds, backreferences, and non-capturing groups. It might be a good idea for us to just include pcre2-utils in BitCurator either way, in the future, as an additional set of tool options.
Others may have better suggestions. For a broadly scoped search like this where the context may be pretty varied and the result cannot be algorithmically verified (unlike the VIN example), I imagine there are better approaches I'm not thinking of right now.
Kam