Hi,
I'm working on a Gradle script that use WAT file to get WARC item. The items are filtered by mime type. I want to get a powerpoint .pptx files, the mime type for those files is application/vnd.openxmlformats-officedocument.presentationml.presentation. The files all open with the same background color, black and same font format, white little and pixelated.
So while testing and developing, I noticed that the mime type in WAT JSON is actually application/vnd.openxmlformats-officedocument.presentationml.persentation, the e and r are reversed. I assume it shouldn't have any influence on the readability of the file for the software and OS, but just making sure.
I tested with .ppt files and the files open normally with what seems to be there normal styles.
Thanks for all.
tl;dr
Is the header Content-Type influence how the crawler process response body? Should I worry if a content-type is misspelled?
--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl...@googlegroups.com.
To post to this group, send email to common...@googlegroups.com.
Visit this group at https://groups.google.com/group/common-crawl.
For more options, visit https://groups.google.com/d/optout.