Extraction error in jsonld format file from October 2023 Common Crawl Corpus

13 views
Skip to first unread message

Shivam Sharma

unread,
Mar 28, 2024, 11:16:38 AMMar 28
to Web Data Commons
Hi all,

I am facing an extraction error for this file. When using the `gzip` command on Linux to extract it using the following command:

```bash
gzip -dv dpef.html-embedded-jsonld.nq-01121.gz
``` 

I get the following error:
```bash
gzip: dpef.html-embedded-jsonld.nq-01121.gz: unexpected end of file
```
Any advises on an alternative? Thanks.
Reply all
Reply to author
Forward
0 new messages