[llvm-dev] IRC spam

Renato Golin via llvm-dev

unread,

Jun 16, 2020, 4:22:16 AM6/16/20

to LLVM Dev

Hey folks,

Those of us who are still connected to the IRC channel surely have
noticed spam accounts posting random (machine generated?) garbage.
Moderators ban the nicks, but obviously, random ones come back.

Keeping on banning won't work. At Linaro, the somewhat controversial
move of requiring only registered users to join worked as a deterrent,
but the annoyance of registering may make less savvy users avoid the
channel.

Moving on to another platform (like Discord) will surely alienate a
good part of the community (myself included), but so will letting the
spam accounts run free (I'm almost giving up on the channel).

I don't know automated spam filtering for IRC (banning bots?), but we
usually "speak garbage" ourselves (aka pasting code), so false
positive rates will be high.

Anyone got better ideas?

cheers,
--renato
_______________________________________________
LLVM Developers mailing list
llvm...@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev

Joerg Sonnenberger via llvm-dev

unread,

Jun 16, 2020, 10:06:41 AM6/16/20

to llvm...@lists.llvm.org

On Tue, Jun 16, 2020 at 09:21:50AM +0100, Renato Golin via llvm-dev wrote:
> Moving on to another platform (like Discord) will surely alienate a
> good part of the community (myself included), but so will letting the
> spam accounts run free (I'm almost giving up on the channel).

I also see little evidence that it would fix the problem of someone
having too much time on their hand and wants to be a nuisance.

Joerg

Renato Golin via llvm-dev

unread,

Jun 16, 2020, 10:28:00 AM6/16/20

to Joerg Sonnenberger, LLVM Dev

On Tue, 16 Jun 2020 at 15:06, Joerg Sonnenberger via llvm-dev
<llvm...@lists.llvm.org> wrote:
> I also see little evidence that it would fix the problem of someone
> having too much time on their hand and wants to be a nuisance.

AFAICS, this is not the problem.

The nicks are clearly randomly generated by smashing words together
and the content seems to be what comes out of a language model after
reading angry Facebook posts.

Registering is quick and easy and would filter 99% of the automated
accounts. Not 100% and not the people with too much time in their
hands. But those, nothing will.

It's like a bicycle lock: the bigger your lock, the more likely the
thief will steal someone else's bike. But someone wanting *your* bike
will take it, no matter the lock.

David Blaikie via llvm-dev

unread,

Jun 16, 2020, 12:35:23 PM6/16/20

to Renato Golin, LLVM Dev

On Tue, Jun 16, 2020 at 7:27 AM Renato Golin via llvm-dev
<llvm...@lists.llvm.org> wrote:
>
> On Tue, 16 Jun 2020 at 15:06, Joerg Sonnenberger via llvm-dev
> <llvm...@lists.llvm.org> wrote:
> > I also see little evidence that it would fix the problem of someone
> > having too much time on their hand and wants to be a nuisance.
>
> AFAICS, this is not the problem.
>
> The nicks are clearly randomly generated by smashing words together
> and the content seems to be what comes out of a language model after
> reading angry Facebook posts.

Perhaps, though I've seen them responding to people (not that that
rules out AI), etc. I'm with Joerg and Erich - I believe it's an
actual person. I actually saw them respond to a query yesterday - the
person seemed to get the answer they were looking through (reading
between the lines/etc of the strained grammar of this spammer person),
and their understanding/conclusion wasn't wrong & left before I could
clarify/provide further context - I worry about that happening more
regularly. Either this person scaring people off, and/or giving them
confusing/problematic advice, etc.

> Registering is quick and easy and would filter 99% of the automated
> accounts. Not 100% and not the people with too much time in their
> hands.

Yep - having to create a new email address/verify it/etc does add some
friction (for intended and unintended users of course) - discord and
the like have far more incentive & resources to implement anti-spam
functionality than IRC does, unfortunately.

Renato Golin via llvm-dev

unread,

Jun 16, 2020, 12:43:12 PM6/16/20

to David Blaikie, LLVM Dev

On Tue, 16 Jun 2020 at 17:35, David Blaikie <dbla...@gmail.com> wrote:
> Perhaps, though I've seen them responding to people (not that that
> rules out AI), etc. I'm with Joerg and Erich - I believe it's an
> actual person.

Yeah, saw it now, too. Sigh...

Johannes Doerfert via llvm-dev

unread,

Jun 16, 2020, 1:43:34 PM6/16/20

to Renato Golin, David Blaikie, LLVM Dev

Let's try more OPs and badword lists, might help :)

Tom Stellard via llvm-dev

unread,

Jun 17, 2020, 12:04:05 AM6/17/20

to Renato Golin, Joerg Sonnenberger, LLVM Dev

On 06/16/2020 07:27 AM, Renato Golin via llvm-dev wrote:
> On Tue, 16 Jun 2020 at 15:06, Joerg Sonnenberger via llvm-dev
> <llvm...@lists.llvm.org> wrote:
>> I also see little evidence that it would fix the problem of someone
>> having too much time on their hand and wants to be a nuisance.
>
> AFAICS, this is not the problem.
>
> The nicks are clearly randomly generated by smashing words together
> and the content seems to be what comes out of a language model after
> reading angry Facebook posts.
>
> Registering is quick and easy and would filter 99% of the automated
> accounts. Not 100% and not the people with too much time in their
> hands. But those, nothing will.
>
> It's like a bicycle lock: the bigger your lock, the more likely the
> thief will steal someone else's bike. But someone wanting *your* bike
> will take it, no matter the lock.

I'm fairly certain that this is the same person that has been spamming
the #dri-devel and #radeon channels on freenode for years. The pattern
of comments is they same and I've seen them mention people in comments on
#llvm who only join those other channels. #dri-devel requires users to identify
with the server and also uses extbans, but that hasn't really helped.

I'm not sure how to solve this, but I think having more ops
(spread across all timezones) would help.

-Tom

Ryan Houdek via llvm-dev

unread,

Jun 25, 2020, 8:03:54 PM6/25/20

to tste...@redhat.com, LLVM Dev

I'll comment from the perspective of someone that is in the Mesa, #dri-devel, #radeon channels myself and have watched their behaviour over the years. This is a real person that spams a load of information into a channel about their understanding of how hardware works.

I have no idea what their goal is for spamming this information, could be some desire for acceptance from perceived smartness. Or something as simple as wanting to be hired for their "brilliance". Hard to tell.

A major issue with their personality is that they will retaliate against anyone that tries to stop their ranting, and they become hostile with their phrasing very quickly because of it. Just check the logs for them retaliating against anyone that has kickbanned them.

Another issue is that depending on their mood of the day, they may be entirely lost to any form of reasoning, which makes it difficult for any communication.

So just to reiterate, they are a real person but are difficult to deal with.

On that note, they aren't completely impossible to work with in some cases, it just might require accepting getting attacked for a few weeks.

I'm a channel operator in one of the Mesa related IRC channels and have had success in communicating with them that their behaviour is not conducive to the environment that we were attempting to create in the channel.

This took a bit of coaxing on their "good" days, and communicating with them while being attacked for around a month on end. At the end of this month-long attack and communication I was able to get them to understand that they aren't welcome to the channel.

They no longer enter the channel that I moderate; I managed to get through to them on some level at least.

Sadly this sort of baby sitting of a user shouldn't be required and requiring some thick skin to get through their harsh comments is difficult.
More moderation will "work" but while they are rampaging, you're going to still have to watch the channel and you'll get a few lines of harassing text while an op takes a bit of time to see them (and sometimes even perceive them, on "good" days they make comments that make some sense initially).

Stephen Neuendorffer via llvm-dev

unread,

Jun 25, 2020, 8:11:46 PM6/25/20

to Ryan Houdek, LLVM Dev

Seems like a useful machine-learning recognition task.... Lots of data... useful to automate... Low cost of false-negatives...

Johannes Doerfert via llvm-dev

unread,

Jun 25, 2020, 8:43:43 PM6/25/20

to Stephen Neuendorffer, Ryan Houdek, LLVM Dev

We went with old school solutions, less sexy I know, but seems to be effective.

@Renato, do you still feel there is much spam?

P.S. I'm going to switch to your ML approach once ready ;)

Renato Golin via llvm-dev

unread,

Jun 26, 2020, 5:58:25 AM6/26/20

to Johannes Doerfert, LLVM Dev

On Fri, 26 Jun 2020 at 01:43, Johannes Doerfert via llvm-dev
<llvm...@lists.llvm.org> wrote:
> We went with old school solutions, less sexy I know, but seems to be effective.
> @Renato, do you still feel there is much spam?

No, it's good now, thanks to all that volunteered to moderate.

> P.S. I'm going to switch to your ML approach once ready ;)

This is how we start the robot uprising... :)

Renato Golin via llvm-dev

unread,

Jun 26, 2020, 6:23:49 AM6/26/20

to Ryan Houdek, LLVM Dev

On Fri, 26 Jun 2020 at 01:03, Ryan Houdek <sonica...@gmail.com> wrote:
> I'll comment from the perspective of someone that is in the Mesa, #dri-devel, #radeon channels myself and have watched their behaviour over the years. This is a real person that spams a load of information into a channel about their understanding of how hardware works.

Hi Ryan,

Yeah, half-way through I realised, as some other people pointed out,
it is a real person.

I identified as a robot for two main reasons:
1. Really confusing sentences, intermixing completely unrelated
subjects and starting a new phrase before the old one was finished.
This is very typical of markov chain or cheap language models trained
with a small subset of unrelated texts. It's also unfortunately common
in people who can't help but work on multiple trains of thought (like
me).
2. The random names and fast rejoin were consistent with either a
mindless bot, or a very persistent individual. I couldn't fathom why a
person would do that, so I assumed bot. That was on me for not seeing
it far enough.

> I have no idea what their goal is for spamming this information, could be some desire for acceptance from perceived smartness. Or something as simple as wanting to be hired for their "brilliance". Hard to tell.
> A major issue with their personality is that they will retaliate against anyone that tries to stop their ranting, and they become hostile with their phrasing very quickly because of it. Just check the logs for them retaliating against anyone that has kickbanned them.
> Another issue is that depending on their mood of the day, they may be entirely lost to any form of reasoning, which makes it difficult for any communication.

There are plenty of life situations that make smart people behave
erratically or crack entirely, most of them mundane to the majority of
people. It's not fun.

> On that note, they aren't completely impossible to work with in some cases, it just might require accepting getting attacked for a few weeks.
> I'm a channel operator in one of the Mesa related IRC channels and have had success in communicating with them that their behaviour is not conducive to the environment that we were attempting to create in the channel.
> This took a bit of coaxing on their "good" days, and communicating with them while being attacked for around a month on end. At the end of this month-long attack and communication I was able to get them to understand that they aren't welcome to the channel.
> They no longer enter the channel that I moderate; I managed to get through to them on some level at least.

I'm impressed with your care and stamina. Not many people I know would
have gone that far. Thank you for doing that.

> Sadly this sort of baby sitting of a user shouldn't be required and requiring some thick skin to get through their harsh comments is difficult.
> More moderation will "work" but while they are rampaging, you're going to still have to watch the channel and you'll get a few lines of harassing text while an op takes a bit of time to see them (and sometimes even perceive them, on "good" days they make comments that make some sense initially).

Any kind of barrier should be enough to get bots and persistent
individuals to stop. Moderators, registration, etc.

I think in the end it worked out well. People quickly realised those
words were meaningless (in the context of the particular channel), and
worked to stop the flood.

cheers,
--renato

Stefan Stipanovic via llvm-dev

unread,

Jun 28, 2020, 4:00:50 PM6/28/20

to Renato Golin, LLVM Dev

I'm aware of the efforts. As of today it looks like it is invite-only. I'm getting the error "473: #llvm Cannot join channel (+i)", which should mean it is invite only, right?

Thanks,

Stefan

On Sun, Jun 28, 2020 at 2:29 PM Renato Golin <reng...@gmail.com> wrote:

On Sun, 28 Jun 2020 at 13:01, Stefan Stipanovic <stefom...@gmail.com> wrote:
> Is the channel now purposefully invite-only? If so, how is one supposed to join?

It's not. More people were added as moderators to curb the problem
instead. Are you having trouble joining the channel?

Eric Christopher via llvm-dev

unread,

Jun 29, 2020, 4:20:50 PM6/29/20

to Stefan Stipanovic, LLVM Dev

I fixed this a few hours ago I think :)

_______________________________________________

Renato Golin via llvm-dev

unread,

Jun 29, 2020, 4:23:39 PM6/29/20

to Stefan Stipanovic, LLVM Dev

On Sun, 28 Jun 2020 at 13:01, Stefan Stipanovic <stefom...@gmail.com> wrote:
> Is the channel now purposefully invite-only? If so, how is one supposed to join?

It's not. More people were added as moderators to curb the problem
instead. Are you having trouble joining the channel?

Stefan Stipanovic via llvm-dev

unread,

Jun 29, 2020, 4:30:59 PM6/29/20

to Renato Golin, LLVM Dev

Is the channel now purposefully invite-only? If so, how is one supposed to join?

Thanks,

Stefan

Johannes Doerfert via llvm-dev

unread,

Jun 29, 2020, 5:14:47 PM6/29/20

to Stefan Stipanovic, Renato Golin, LLVM Dev

It was made invite only apparently. I was not online at the time.

IMHO, there are way better ways to make this work than invite only,

for example, we can require authenticated users. Invite only w/o a

well documented way to get the invite seems like the opposite of inclusive.

Reply all

Reply to author

Forward