Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Bug#1057878: qa.debian.org: UDD upload_history has truncated email addresses

2 views
Skip to first unread message

Stuart Prescott

unread,
Dec 9, 2023, 8:20:04 PM12/9/23
to
Package: qa.debian.org
Severity: normal
X-Debbugs-Cc: stu...@debian.org

The 'maintainer' and 'maintainer_email' columns of the upload_history table
in UDD have truncated email addresses. Somewhere the 'maintainer' data
is being truncated and then the maintainer_email is consequently broken.

udd=> SELECT maintainer, maintainer_email FROM upload_history WHERE maintainer_email LIKE '%=' LIMIT 10;
maintainer | maintainer_email
----------------------------------------------------------------+----------------------------------------------
Maintainers of GStreamer packages <pkg-gstreamer-maintainers@= | pkg-gstreamer-maintainers@=
Maintainers of GStreamer packages <pkg-gstreamer-maintainers@= | pkg-gstreamer-maintainers@=
Zenoss Packaging Team <pkg-zen...@lists.alioth.debian.or= | pkg-zen...@lists.alioth.debian.or=
Debian GNOME Maintainers <pkg-gnome-...@lists.alioth.= | pkg-gnome-...@lists.alioth.=
Debian Perl Group <pkg-perl-m...@lists.alioth.debian.o= | pkg-perl-m...@lists.alioth.debian.o=
Debian VoIP Team <pkg-voip-m...@lists.alioth.debian.or= | pkg-voip-m...@lists.alioth.debian.or=
Debian Python Modules Team <python-mo...@lists.alioth.= | python-mo...@lists.alioth.=
Debian Python Modules Team <python-mo...@lists.alioth.= | python-mo...@lists.alioth.=
Debian Firebird Group <pkg-firebi...@lists.alioth.debi= | pkg-firebi...@lists.alioth.debi=
Debian Samba Maintainers <pkg-sam...@lists.alioth.debian= | pkg-sam...@lists.alioth.debian=
(10 rows)


The input data from the d-d-c mailing list looks fine in the web archive,
but I can imagine this being due to linewrappig in the mbox files.

Looking at one specific example:

https://lists.debian.org/debian-devel-changes/2007/12/msg00466.html

udd=> SELECT maintainer, maintainer_email FROM upload_history WHERE maintainer_email LIKE '%=' AND source = 'libxml-rss-perl' AND version = '1.31-3';
maintainer | maintainer_email
----------------------------------------------------------------+---------------------------------------------
Debian Perl Group <pkg-perl-m...@lists.alioth.debian.o= | pkg-perl-m...@lists.alioth.debian.o=
(1 row)

This particular example is quite old but the problem also exists in
recent uploads; as of writing the most recent one is libgetdata (0.11.0-9)
that was uploaded today.

Of the 850k rows in upload_history, this data issue is in 70k of them.

cheers
Stuart
0 new messages