Mining Maven Repository

17 views
Skip to first unread message

robertburrelldonkin

unread,
Aug 14, 2007, 5:05:21 PM8/14/07
to Discuss A Release Audit Tool
after fixing my gentoo issues, i had a play with the pom2rdf. it
doesn't do license and organisation data but this could be added. it
reads maven 2 POMs ATM so work would be needed if it needed to read
maven 1 ones.

- robert

Matthieu Riou

unread,
Aug 14, 2007, 5:53:42 PM8/14/07
to discuss-a-rele...@googlegroups.com
Cool :) I actually also have some code that does Maven repository browsing and POMs reading but it's in Ruby so I guess it wouldn't have been so useful anyway. Did you check that in rats? I could have a go in extending it for licenses info.

Matthieu

Matthieu Riou

unread,
Aug 17, 2007, 12:34:57 AM8/17/07
to discuss-a-rele...@googlegroups.com
Actually this being fairly isolated I've started from my Ruby scripts. Hope you don't mind, since I've started developing regularly in Ruby I find it hard to get back to Java ;) For now I have something that goes through the repository by scrapping it, reads the POMs and get the license from there. It's fairly easy then to stuff that into some XML. The only caveat for now is for project with parent POMs. I'll have to transitively check the parents for the license. That's coming next

So 2 questions:
  - Do you have some examples of the RDF you generate, so that I can reuse the same structure?
  - If I send you the code would you put it somewhere in RAT as part of the tooling?

Thanks,
Matthieu

Robert Burrell Donkin

unread,
Sep 18, 2007, 4:12:09 PM9/18/07
to discuss-a-rele...@googlegroups.com
(probably best to continue this in labs)

On 8/17/07, Matthieu Riou <matthi...@gmail.com> wrote:
> Actually this being fairly isolated I've started from my Ruby scripts. Hope
> you don't mind, since I've started developing regularly in Ruby I find it
> hard to get back to Java ;) For now I have something that goes through the
> repository by scrapping it, reads the POMs and get the license from there.
> It's fairly easy then to stuff that into some XML. The only caveat for now
> is for project with parent POMs. I'll have to transitively check the parents
> for the license. That's coming next

IIRC after some playing around i came to the conclusion that scraping
the xml was actually easier than messing around with maven

> So 2 questions:
> - Do you have some examples of the RDF you generate, so that I can reuse
> the same structure?

the structure had some interesting aspects but going forward i
recommend something more specialised. we can pick this up in the labs.

> - If I send you the code would you put it somewhere in RAT as part of the
> tooling?

i'd be glad to use this in RAT. probably the best place to talk about
this is in the lab.

- robert

Reply all
Reply to author
Forward
0 new messages