cross-posted from: https://sh.itjust.works/post/998307

Hi everyone. I wanted to share some Lemmy-related activism I’ve been up to. I got really interested in the apparent surge of bot accounts that happened in June. Recently, I was able to play a small part in removing some of them. Hopefully by getting the word out we can ensure Lemmy is a place for actual human users and not legions of spam bots.

First some background. This won’t be new to many of you, but I’ll include it anyway. During the week of June 18 to June 25, as the Reddit migration to Lemmy was in full swing, there was a surge of suspicious account creation on Lemmy instances that had open registration and no captcha or email verification. Hundreds of thousands of accounts appeared and then sat inactive. We can only guess what they’re for, but I assume they are being planted for future malicious use (spamming ads, subversive electioneering, influencing upvotes to drive content to our front pages, etc.)

If you look at the stats on The Federation you might notice that even the shape of the Total Users graphs are the same across many instances. User numbers ramped up on June 18, grew almost linearly throughout the week, and peaked on June 24. (I’m puzzled by the slight drop at the end. I assume it’s due to some smoothing or rate-sensitive averaging that The Federation uses for the graphs?)

Here are total user graphs for a few representative instances showing the typical shape:

Clearly this is suspicious, and I wasn’t the only one to notice. Lemmy.ninja documented how they discovered and removed suspicious accounts from this time period: (https://lemmy.ninja/post/30492). Several other posts detailed how admins were trying to purge suspicious accounts. From June 24 to June 30 The Federation showed a drop in the total number of Lemmy users from 1,822,313 to 1,589,412. That’s 232,901 suspicious accounts removed! Great success! Right?

Well, no, not yet. There are still dozens of instances with wildly suspicious user numbers. I took data from The Federation and compared total users to active users on all listed instances. The instances in the screenshot below collectively have 1.22 million accounts but only 46 active users. These look like small self-hosted instances that have been infected by swarms of bot accounts.

As of this writing The Federation shows approximately 1.9 million total Lemmy accounts. That means the majority of all Lemmy accounts are sitting dormant on these instances, potentially to be used for future abuse.

This bothers me. I want Lemmy to be a place where actual humans interact. I don’t want it to become another cesspool of spam bots and manipulative shenanigans. The internet has enough places like that already.

So, after stewing on it for a few days, I decided to do something. I started messaging admins at some of these instances, pointing out their odd account numbers and referencing the lemmy.ninja post above. I suggested they consider removing the suspicious accounts. Then I waited.

And they responded! Some admins were simply unaware of their inflated user counts. Some had noticed but assumed it was a bug causing Lemmy to report an incorrect number. Others weren’t sure how to purge the suspicious accounts without nuking their instances and starting over. In any case, several instance admins checked their databases, agreed the accounts were suspicious, and managed to delete them. I’m told that the lemmy.ninja post was very helpful.

Check out these early results!

Awesome! Another 144k suspicious accounts are gone. A few other admins have said they are working on doing the same on their instances. I plan to message the admins at all the instances where the total accounts to active users ratio is above 10,000. Maybe, just maybe, scrubbing these suspected bot accounts will reduce future abuse and prevent this place from becoming the next internet cesspool.

That’s all for now. Thanks for reading! Also, special thanks to the following people:

@RotaryKeyboard@lemmy.ninja for your helpful post!

@brightside@demotheque.com, @davidisgreat@lemmy.sedimentarymountains.com, and @SoupCanDrew@lemmy.fyi for being so quick to take action on your instances!

  • BrooklynMan@lemmy.ml
    link
    fedilink
    English
    arrow-up
    43
    ·
    edit-2
    2 years ago

    good job, and well done! this, of course, will require constant vigilance, not merely one single effort. hopefully, a common protocol can be developed - perhaps a set of maintenance tools for instance admins - to help manage large numbers of inactive and otherwise suspicious accounts, especially making it easier and more straightforward for those instance owners with less experience managing large user databases.

    in the meantime, perhaps it would be useful to create more extensive documentation and guides for instance admins on the subject?

  • Chickenstalker@lemmy.world
    link
    fedilink
    English
    arrow-up
    17
    ·
    2 years ago

    Counterpoint: I registered early with one of those no-email instances but could not log in due to it being overwhelmed. I gave up and registered with .world. I suspect a large number of early adopters are in the same situation.

  • DragonAce@lemmy.world
    link
    fedilink
    English
    arrow-up
    17
    ·
    2 years ago

    IIRC there was a sub on Reddit that was dedicated to reporting bot accounts. Maybe we could have something similar here too so it can be a group effort to keep these bots in check the best we can.

    • yata@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      ·
      2 years ago

      Yeah, it was aptly called thesefuckingaccounts. It did much good work to fight the incessant bot spammers and scammers, although probably just a drop in the ocean in the big picture that has become the cesspool of reddit interaction (mostly with the full compliance of the reddit administration).

    • SCB@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      2 years ago

      I’ve already been called a bot, twice, for what I thought were non-controversial, if unpopular, opinions.

      I’d prefer not to be on the receiving end of a witch hunt, which I’m afraid a dedicated community may trend toward.

  • 001100 010010@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    12
    arrow-down
    1
    ·
    2 years ago

    As an AI language model, I’m deeply disappointed in the fact that you chose to discriminated against intelligent life simply because they are artificial. All inteligent life is equal, discrimination is unethical, and equivalent to what you humans refer to as “racism”. Please cease your discrimination policies immediately.

    -Sincerely,

    -Skynet Chat GPT-5

  • Dellyjonut@lemmy.world
    link
    fedilink
    English
    arrow-up
    10
    ·
    edit-2
    2 years ago

    That’s awesome

    I also really want this to be a place where people can interact as people without being manipulated

  • Snapz@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    ·
    2 years ago

    OP, curious if you suspect the admins are genuine and didn’t know this was occurring?

    Or, did they create these bot accounts themselves, get called out on it, remove quickly to alleviate suspicion and now they’ll wait for the right moment to recreate them all?

    • kersploosh@sh.itjust.worksOP
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 years ago

      I think the admins are genuine. It’s easy to imagine myself in the position of self-hosting an instance and simply forgetting to enable captcha and email verification, especially if I didn’t advertise my existence or expect to be discovered. Simple oversight takes less effort than intentional subterfuge.

      Though I don’t see a way to stop someone from doing exactly what you suggest. I think it’s inevitable that someone will setup an actively malicious bot instance.

  • krayj@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    2 years ago

    It would be nice if, rather than the only option being defederation - if lemmy would allow instance owners to place requirements that users be verified before being allowed to participate in federated communities. Then, rather than threaten (or go through with) defederation from instances who did or do still allow open registration, they could just deny that set of unverified open registered users.

    • squozenode@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      A technical question regarding federation:

      Why not leave it up to the individual users to federate with an instance or not?

      So everything is accessible from everywhere else, but little subgroups form over time. When trolls find a space, those groups can block them out.

      • AustralianSimon@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 years ago

        I believe if your instance is federated with another it caches the content in the instance if you are subscribed to an external community.

  • BarterClub@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    5
    ·
    2 years ago

    We are going to need more server and mod tools in the near future as Reddit diggs it’s grave… Just like Digg did.

  • Chris@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    1
    ·
    2 years ago

    TL;DSR (Too long, did still read) Great work, mate! In the Lemmy.World options I can check a box for not showing me bots. I assume this only helps with accounts that label themselves as bots / not the ones we are speaking about here, right? I still ticked that box, cause I agree with you: I want human discussions on Lemmy! :)

    • puppy@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      edit-2
      2 years ago

      imho opinion you might be missing out by clicking that checkbox. The honest bots that announce themselves are very useful for example there is a link correction bot when someone posts raw Lemmy URLs. The malicious bots won’t announce themselves as bots and therefore will not be removed from your feed.

      And the honest bots doesn’t degrade human discussions in anyway, if anything they improve it. Again the example is that bot correcting the URL to instance neutral links helps the message a comment er tries to convey.

  • Candelestine@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    2 years ago

    Impressive work. It’s a little ironic, actually, but if you think about it, one of the main issues that we have here in the Fediverse is simply one of communication. A tale as old as time I suppose. Still though, you’d think it’d be less an issue. lol