Spam: Trends & Solutions

Vanilla Forums
edited December 2015 in Product

I want to start looking more critically at spam problems across Vanilla sites (both cloud and open source) to see if we can diagnose some patterns and find low-hanging solutions in the short term. To start, let's start collecting spam problems definitively and gathering more information.

Prerequisites for reporting a spam issue:

  • Open source: Using 2.2 or master.
  • Captcha is in use.
  • Registration option "Require users to confirm their email addresses" is checked.
  • Default roles are properly designated on Roles page ("default: unapproved" etc are showing under descriptions).
  • A "Unapproved" role exists AND has restricted permissions (NO *.Add or Profile.Edit).
  • Akismet is on (and using our cloud key if a client).
  • StopForumSpam is on and the threshold have been lowered appropriately (all in 5-15 range).
  • Ranks IS NOT set to Verify users with less than 10 posts.
  • Editing posts timeframe is restricted to a day or less (at least for new members if using Ranks).
  • Burner Blocker plugin is enabled or the spam is not from burner email domains.

Information to collect:

  • What is the RATE & RATIO of spam users registering (to legit users)?
  • What is the RATE & RATIO of live spam posts (to legit posts)?
  • What patterns can be found in the user profiles? "Status" (About), Photo, IP address, and email are all common vectors.
  • Do the users appear to be bots or humans?

How to differentiate a bot from a human:

  • Humans tend to register multiple accounts from the same IP address and email domain consecutively in a limited timespan.
  • Bots tend to register in sustained waves with no discernible pattern in IP or emails, often with longer periods between each individual registration.
  • Bots tend to use regimented username patterns (ex: 4-letter word + 2 digits + 3-letter word).
  • Humans tend to keyboard-mash usernames (jladsjksad) or be creative.
  • Bot usernames RARELY (if ever) match emails ("Bobby123" with email "Bobby123@hotmail.com").

Let me know if you have more good telltale signs of a bot, prerequisites, or good information you find.

The floor is open! Let's see some investigation results.

PLEASE re-check all the prerequisites before adding a case.

Comments

  • Alex Powell
    edited December 2015

    The only thing I have been seeing, is some spam bots (or humans) are getting around

    $Configuration['Garden']['Profile']['EditPhotos'] = false;

    This is Open Source, using Master from Github Version 2.2.101.3

  • Do we support the new captcha? Is it any better at stopping bots?

  • Alex Powell
    edited December 2015

    Came across this company called: http://www.claim.io/welcome/pricing.php, which spam bombed my site in creating multiple accounts(over 500+ in a 3 day period). They did not spam the community with content, but created numerous profiles, added the company logo and posted their story in activity page.

    I did meet all the requirements, in the checklist. This is just a good example to also ask customers to study the referrals too. Sometimes this can help.

    Solution I used was to turn off show activity, usually a vector for spammers:

    $Configuration['Garden']['Profile']['ShowActivities'] = false;
    

    They were still able to get around the editing of photos restriction.

    Now it's more an annoyance than anything for user count.

  • @Lvez said:
    Do we support the new captcha? Is it any better at stopping bots?

    Not yet, but that seems like something we'll tackle as a side project in Q1. I don't have information on it being more effective; it's primary goal to my knowledge was better user experience (which is significant). We want to implement the new one in a "generic" way so that we can support alternate captchas generally and not have 1 Google solution hard-coded.

  • Having multiple captchas (at least one non-Google option) would be great, particularly for China-based users.