Spam accounts on Phabricator

Hi folks,

We recently discovered a significant amount of spam user accounts on Phabricator. If anyone has an idea on how they could have been created despite requiring a Github/Google account is appreciated.

Best,
Christian

I don’t think a Github/Google account is actually required? Unless that changed… I know my phab account isn’t tied to github and I’m pretty sure it isn’t tied to a Google account. Honestly it was so long ago I don’t remember how I created it!

This changed because of the spam, back in October. But these account are Google accounts, so I turned on email verification for Google/GitHub accounts now!

Hi,
I’ve just got a spam comment on my commit.
See rG3227aa3aa834
What’s the recommended way to report these?

Can we disable logging in with Google and only allow Github logins? I think that would cut down a lot on the SPAM accounts.

We should first look into existing contributors with a Google login. Otherwise what may be more practical (but I don’t know if there is an option for this) would be to prevent only new users with Google account.

OK, who can do this?

As long as the instance is still hosted by Google and not the foundation / the IWG, we need someone at Google to run through the SQL database I think.

1 Like

OK, would you or someone else have time to work on this? Maybe we can setup a meeting to work through some of these issues?

I just had a quick look and disabled another 5 (or so) Phabricator accounts that were created today. It seems we still have a significant influx of new spam accounts.

I just ran some queries on the Conduit API:

  • We have 1330 new user accounts on Phabricator since 2021-12-01.
  • All of the ones I checked manually haven’t done anything on Phabricator.
  • Many of them have spam on the user account page.

I’m running into request limits on the Conduit API, so we do need someone to investigate on the SQL database…

1 Like

I’m less concerned about not accepting new Google account to register, I think that’s fine (but is it possible? Do we need to patch the code?).
The concern I have is more about removing Google auth entirely with respect to existing contributors using it. So the data on this to get from SQL would be “how many Google accounts are actively contributing” to gauge the impact of removing Google auth entirely (new and existing account).

Is there an approval mode for Phabricator accounts? I hate to add more barriers, but we had to do this with Bugzilla and Mailman basically because of Spam.

Ideally, it would be good to get rid of the really bad ones on the site. It sounds like deletion is not straight forward?

Deletion is fairly easy, but it requires collecting all the usernames, and then it is one command through SSH access. I’m happy to run the command provided the list of usernames!

I also just found the setting to disable registration for Google accounts and triggered it: there shouldn’t be any new Google account: we’re only gonna have registration of GitHub account if everything works as intended.

1 Like

I’m still seeing a lot of new Spam accounts. Are you able to check how they are registering?

GitHub it seems: ♟ businessnewsupdates is registered with businessnewsupdates · GitHub

I started reporting some of the accounts to GitHub, but it seems like we are going to need a different solution.

I expect these folks are doing this primarily for SEO benefit. Phabricator does not set rel=“ugc” or the older rel=“nofollow” on outgoing user-generated links (see google’s docs). So, we’re giving our search-engine ranking credit to all these spam sites.

I don’t know whether we’ve gotten onto a specific list of “good sites to spam for SEO ranking”, or if it’s a more general “phabricator sites are good for this” kind of thing. I wouldn’t be surprised by either.

It looks like right now there’s not too much comment spam, but there are a TON of accounts that have profiles with spam links in them. (This, at least, doesn’t actively spam users of the site, but we’re still giving those spammers SEO-juice, unfortunately.)

Unfortunately, phab doesn’t have an option to add that the rel=ugc attribute. I think we’ll likely need to continue removing spam accounts in any case, but hopefully it’ll help if we can patch the remarkup->html translator in phab to add the attribute for explicitly-specified link targets.

I patched the remarkup code on Phabricator to add “ugc nofollow” to all these links, maybe that’ll remove the incentive?

Well, we could require a manual review/approval step for new accounts. Phabricator has an option for that. However I’m not sure how we would want to weed out spam accounts and who would have time for that.

In parallel I’m in contact with Google’s identity team. They offered some help in identifying the Google-account based spam accounts…