Hey there,
do you have any guidance (tutorial, examples) on how you could use Common Crawl to insert data into MySQL? Let's say I want to have a database that I want to populate with the help of Common Crawl.
That database has a table where I want to insert:
- the title of a website
- <meta description="">
- <Hn>
- etc, basically extract some texts that are either in HTML tags or in header/footer
And from the multitude of given elements to scan and store, insert just the ones that can be found on a website and for the rest to add an "-"
Name: <title>
Website: main URL
Phone: (in footer if there is, grab it and place it in the table), if there's no phone number in footer, insert a "-" in the table
Email: same, grab it from the header or footer if there is
Description: <meta description="">
I hope it makes sense 😄