Hi all,I'm working on a script to start adding the GrypeDB data into GSD. You can see the start of my script here
It's going to output the GrypeDB data into OSV format.
And I've hit an interesting challenge. At the moment the script will spit out just the advisory types because I needed to figure something out.There are CVE IDs, and GHSA, but also a lot of ELSA (Oracle Linux) and ALSA (Amazon Linux) advisories.How do we want to handle these types overlapping identifiers? If there is a CVE or GHSA already, should we just add metadata to the existing ID? Should we let them have their own GSD that also has one or more related tags?
I see value in both approaches.The old way would be to overload one ID as the "primary" ID as much as possible. This would be CVE probably given it is both the most widely used and least flexible.Given we have a large number of available integers (nearly infinite), and we are targeting machines as the intended audience, it's also easy to say just give every possible identifier its own GSD ID.
--Thoughts?--Josh
You received this message because you are subscribed to the Google Groups "GSD Discussion Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gsd+uns...@groups.cloudsecurityalliance.org.
On Wed, 6 Apr 2022 at 06:48, Josh Bressers <jo...@bress.net> wrote:Hi all,I'm working on a script to start adding the GrypeDB data into GSD. You can see the start of my script hereIt's going to output the GrypeDB data into OSV format.That's awesome!! I see you've already filed https://github.com/ossf/osv-schema/issues/40, but let us know if you run into any other difficulties.Side question: Is there a license for GrypeDB data?And I've hit an interesting challenge. At the moment the script will spit out just the advisory types because I needed to figure something out.There are CVE IDs, and GHSA, but also a lot of ELSA (Oracle Linux) and ALSA (Amazon Linux) advisories.How do we want to handle these types overlapping identifiers? If there is a CVE or GHSA already, should we just add metadata to the existing ID? Should we let them have their own GSD that also has one or more related tags?I see value in both approaches.The old way would be to overload one ID as the "primary" ID as much as possible. This would be CVE probably given it is both the most widely used and least flexible.Given we have a large number of available integers (nearly infinite), and we are targeting machines as the intended audience, it's also easy to say just give every possible identifier its own GSD ID.I think we should pull advisories that have a well defined source (e.g. GHSA, ELSA, ALSA) from the original source only, as it's the most authoritative. This isn't convenient or easy for every source today, and GrypeDB will be a big help in filling in the gaps.
For any given ID, if there is in fact a direct source where this can be pulled from (e.g. GitHub's GHSA repo in OSV format), then this should be trusted over other sources that provide metadata for the same ID. If some other database wants to provide metadata for the same vulnerability, then they should get/use their own ID. This will make updates and keeping things in sync a lot easier.
CVEs are an exception, because they're rather overloaded -- if those are the primary key for a source, then perhaps it makes sense to assign them their own GSD to disambiguate them?
On Wed, 6 Apr 2022 at 06:48, Josh Bressers <jo...@bress.net> wrote:Hi all,I'm working on a script to start adding the GrypeDB data into GSD. You can see the start of my script hereIt's going to output the GrypeDB data into OSV format.That's awesome!! I see you've already filed https://github.com/ossf/osv-schema/issues/40, but let us know if you run into any other difficulties.
Side question: Is there a license for GrypeDB data?
The old way would be to overload one ID as the "primary" ID as much as possible. This would be CVE probably given it is both the most widely used and least flexible.Given we have a large number of available integers (nearly infinite), and we are targeting machines as the intended audience, it's also easy to say just give every possible identifier its own GSD ID.I think we should pull advisories that have a well defined source (e.g. GHSA, ELSA, ALSA) from the original source only, as it's the most authoritative. This isn't convenient or easy for every source today, and GrypeDB will be a big help in filling in the gaps.For any given ID, if there is in fact a direct source where this can be pulled from (e.g. GitHub's GHSA repo in OSV format), then this should be trusted over other sources that provide metadata for the same ID. If some other database wants to provide metadata for the same vulnerability, then they should get/use their own ID. This will make updates and keeping things in sync a lot easier.CVEs are an exception, because they're rather overloaded -- if those are the primary key for a source, then perhaps it makes sense to assign them their own GSD to disambiguate them?