Hi Fu Yiwen,
Thank you for coming to the mailing list to explain this.
It is not enough, though, for you to email here saying that you will be
transparent etc. There needs to be a discussion about whether
it is reasonable for you to open these kinds of issues at all.
For the benefit of others reading, this follows these issues:
https://github.com/sympy/sympy/issues/29358
https://github.com/sympy/sympy/issues/29360
https://github.com/sympy/sympy/issues/29361
https://github.com/sympy/sympy/issues/29416
https://github.com/sympy/sympy/issues/29417
https://github.com/sympy/sympy/issues/29418
https://github.com/sympy/sympy/issues/29419
Also there have been other issues in other repositories:
https://github.com/pylint-dev/pylint/issues/10910
https://github.com/pylint-dev/pylint/issues/10909
https://github.com/pylint-dev/pylint/issues/10907
https://github.com/pylint-dev/pylint/issues/10906
https://github.com/pylint-dev/pylint/issues/10905
I understand that you are a PhD student and that you are hoping that
your software will be helpful but it should be quite clear from the
linked issues that you either need to stop doing this or you need to
do it very differently.
Firstly, you are posting the output of an AI tool as if it is from
yourself. You need to make it very clear what is human communication
and what is AI-generated output because otherwise you look like an AI
bot and it isn't clear which human is actually in control of the bot.
Secondly, you are using an open source repo for software engineering
research but you have not sought any consent from the project. This
email from you here now is a statement from you saying that you will
improve your conduct going forwards but you have not asked the
question: does anyone consent to you doing this research?
Thirdly, you should not post issues like this based on your research
tool without vetting them very carefully as a human and it is clear
that you are not doing that to a high enough standard. It is not hard
for open source repos to end up buried under AI-generated rubbish so
you need to be much more careful about ensuring that the issues you
open are valid.
In principle I like the idea of using AI to identify issues (much more
than using AI to write PRs) but the signal-to-noise ratio needs to be
high and the issues need to be about important things. What I don't
like is wasting time filtering AI slop to provide data for someone's
research project.
Does Beihang University follow a research ethics process?
There is no way that an ethics panel in my university would have
authorised anything like the way that you have behaved in the linked
issues.
I am open to discussing how you could use your tool in a way that
might be actually useful to sympy. The emphasis needs to be on what is
useful for sympy though and that means that a very low false positive
rate is needed and any issues need to be communicated very clearly for
the benefit of anyone reading. Otherwise it is just not useful to get
lots of AI-generated bug reports of questionable quality.
--
Oscar