[VOTE] release 1.4

12 views
Skip to first unread message

Sebastian Nagel

unread,
Jul 13, 2023, 6:08:09 AM7/13/23
to crawler...@googlegroups.com
Hi,

a first release candidate for 1.4 is ready...

- changes:

https://github.com/crawler-commons/crawler-commons/blob/crawler-commons-1.4/CHANGES.txt

- git tag:

https://github.com/crawler-commons/crawler-commons/releases/tag/crawler-commons-1.4

- artifacts:
https://oss.sonatype.org/content/repositories/comgithubcrawler-commons-1019

- you can test the release candidate with Maven by inserting the following into
your pom.xml:

<dependencies>
<dependency>
<groupId>com.github.crawler-commons</groupId>
<artifactId>crawler-commons</artifactId>
<version>1.4</version>
</dependency>
...

<repositories>
<repository>
<id>crawler-commons-release-test</id>
<name>Testing crawler-commons 1.4 release candidate</name>

<url>https://oss.sonatype.org/content/repositories/comgithubcrawler-commons-1019</url>
</repository>
</repositories>


I'll update the Javadocs, README etc. once the vote has passed.


Here's my +1
- upgraded a sitemap parser test project to use crawler-commons 1.4 and
- parsed the sitemaps of a test collections without any regressions
- run the robot rules parser on a test collection of robots.txt files


Thanks,
Sebastian

Julien Nioche

unread,
Jul 14, 2023, 4:01:46 AM7/14/23
to crawler...@googlegroups.com
Thanks Sebastian

+1 from me
compiled StormCrawler using CC 1.4 and ran a small crawl without noticing any issues

--
You received this message because you are subscribed to the Google Groups "crawler-commons" group.
To unsubscribe from this group and stop receiving emails from it, send an email to crawler-commo...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/crawler-commons/a461cc3d-461c-0f5b-0f90-31ce283707b3%40googlemail.com.


--

Richard Zowalla

unread,
Jul 14, 2023, 4:34:22 AM7/14/23
to crawler...@googlegroups.com

Avi Hayun

unread,
Jul 14, 2023, 5:22:05 AM7/14/23
to crawler...@googlegroups.com

Aécio

unread,
Jul 14, 2023, 11:11:33 AM7/14/23
to crawler...@googlegroups.com

Sebastian Nagel

unread,
Jul 18, 2023, 6:38:41 AM7/18/23
to crawler...@googlegroups.com
Thanks for the reviews and the votes! I'll continue with the release...

+1 from
Aécio
Avi Hayun
Julien Nioche
Richard Zowalla
Sebastian

On 7/14/23 17:11, Aécio wrote:
> +1
>
> On Fri, Jul 14, 2023, 5:22 AM Avi Hayun <avra...@gmail.com
> <mailto:avra...@gmail.com>> wrote:
>
> +1
>
> On Fri, Jul 14, 2023 at 11:34 AM Richard Zowalla <fear...@gmail.com
> <mailto:fear...@gmail.com>> wrote:
>
> +1
>
> Julien Nioche <lists.dig...@gmail.com
> <mailto:lists.dig...@gmail.com>> schrieb am Fr., 14. Juli 2023,
> 10:01:
>
> Thanks Sebastian
>
> +1 from me
> compiled StormCrawler using CC 1.4 and ran a small crawl without
> noticing any issues
>
> On Thu, 13 Jul 2023 at 11:08, 'Sebastian Nagel' via crawler-commons
> <crawler...@googlegroups.com
> <mailto:crawler...@googlegroups.com>> wrote:
>
> Hi,
>
> a first release candidate for 1.4 is ready...
>
> - changes:
>
> https://github.com/crawler-commons/crawler-commons/blob/crawler-commons-1.4/CHANGES.txt <https://github.com/crawler-commons/crawler-commons/blob/crawler-commons-1.4/CHANGES.txt>
>
> - git tag:
>
> https://github.com/crawler-commons/crawler-commons/releases/tag/crawler-commons-1.4 <https://github.com/crawler-commons/crawler-commons/releases/tag/crawler-commons-1.4>
>
> - artifacts:
> https://oss.sonatype.org/content/repositories/comgithubcrawler-commons-1019 <https://oss.sonatype.org/content/repositories/comgithubcrawler-commons-1019>
>
> - you can test the release candidate with Maven by inserting the
> following into
> your pom.xml:
>
>    <dependencies>
>      <dependency>
>        <groupId>com.github.crawler-commons</groupId>
>        <artifactId>crawler-commons</artifactId>
>        <version>1.4</version>
>      </dependency>
>    ...
>
>    <repositories>
>      <repository>
>        <id>crawler-commons-release-test</id>
>        <name>Testing crawler-commons 1.4 release candidate</name>
>
> <url>https://oss.sonatype.org/content/repositories/comgithubcrawler-commons-1019 <https://oss.sonatype.org/content/repositories/comgithubcrawler-commons-1019></url>
>      </repository>
>    </repositories>
>
>
> I'll update the Javadocs, README etc. once the vote has passed.
>
>
> Here's my +1
> - upgraded a sitemap parser test project to use crawler-commons
> 1.4 and
> - parsed the sitemaps of a test collections without any regressions
> - run the robot rules parser on a test collection of robots.txt
> files
>
>
> Thanks,
> Sebastian
>
> --
> You received this message because you are subscribed to the
> Google Groups "crawler-commons" group.
> To unsubscribe from this group and stop receiving emails from
> it, send an email to
> crawler-commo...@googlegroups.com
> <mailto:crawler-commons%2Bunsu...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/crawler-commons/a461cc3d-461c-0f5b-0f90-31ce283707b3%40googlemail.com <https://groups.google.com/d/msgid/crawler-commons/a461cc3d-461c-0f5b-0f90-31ce283707b3%40googlemail.com>.
>
>
>
> --
> *
> */Open Source Solutions for Text Engineering/
> /
> /http://www.digitalpebble.com <http://www.digitalpebble.com/>
> http://digitalpebble.blogspot.com/ <http://digitalpebble.blogspot.com/>
> #digitalpebble <http://twitter.com/digitalpebble>
>
> --
> You received this message because you are subscribed to the Google
> Groups "crawler-commons" group.
> To unsubscribe from this group and stop receiving emails from it,
> send an email to crawler-commo...@googlegroups.com
> <mailto:crawler-commo...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/crawler-commons/CA%2B-fM0sXKANKZcQXNqgspGQFS0cZ2URjeGPxQawa4G%2BgPO0Rag%40mail.gmail.com <https://groups.google.com/d/msgid/crawler-commons/CA%2B-fM0sXKANKZcQXNqgspGQFS0cZ2URjeGPxQawa4G%2BgPO0Rag%40mail.gmail.com?utm_medium=email&utm_source=footer>.
>
> --
> You received this message because you are subscribed to the Google
> Groups "crawler-commons" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to crawler-commo...@googlegroups.com
> <mailto:crawler-commo...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/crawler-commons/CAHBhw7AVDZkAX58veFzShz52p7o6DEuZBcj4So-9SgCLfy68pg%40mail.gmail.com <https://groups.google.com/d/msgid/crawler-commons/CAHBhw7AVDZkAX58veFzShz52p7o6DEuZBcj4So-9SgCLfy68pg%40mail.gmail.com?utm_medium=email&utm_source=footer>.
>
> --
> You received this message because you are subscribed to the Google Groups
> "crawler-commons" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to crawler-commo...@googlegroups.com
> <mailto:crawler-commo...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/crawler-commons/CAKLtEQP_8T%2Bpaxzy8xMapqtTPQ7r2gqvZc8XhkTpgo5X_NrfjQ%40mail.gmail.com <https://groups.google.com/d/msgid/crawler-commons/CAKLtEQP_8T%2Bpaxzy8xMapqtTPQ7r2gqvZc8XhkTpgo5X_NrfjQ%40mail.gmail.com?utm_medium=email&utm_source=footer>.
>
> --
> You received this message because you are subscribed to the Google Groups
> "crawler-commons" group.
> To unsubscribe from this group and stop receiving emails from it, send an email
> to crawler-commo...@googlegroups.com
> <mailto:crawler-commo...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/crawler-commons/CAOrZYMi4fK309hU50KBB7fbGLhdpgUfruHSTMCEeiOnk6hpW6g%40mail.gmail.com <https://groups.google.com/d/msgid/crawler-commons/CAOrZYMi4fK309hU50KBB7fbGLhdpgUfruHSTMCEeiOnk6hpW6g%40mail.gmail.com?utm_medium=email&utm_source=footer>.

Julien Nioche

unread,
Jul 18, 2023, 8:34:19 AM7/18/23
to crawler...@googlegroups.com
Thanks for the release and all your hard work Sebastian. 

To unsubscribe from this group and stop receiving emails from it, send an email to crawler-commo...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/crawler-commons/c7083e9c-cd36-a943-3516-49c877c024d9%40googlemail.com.
Reply all
Reply to author
Forward
0 new messages