A High Priority for Moving Away from Lemmy

Chris Remington@beehaw.org · 1 year ago

A High Priority for Moving Away from Lemmy

Gaywallet (they/it)@beehaw.org · 1 year ago

A few observations/thoughts.

There’s an awful lot of posts basically saying “this is a part of the job of moderation” and I don’t think that’s a particularly empathetic or useful observation. I’ve been on the internet and moderating for long enough to have been exposed to a lot of this, but this is not an inevitability. It’s an outcome of the system we’ve designed, of regulation and law that we have, and of not prioritizing this as a problem strongly enough. Being dismissive of an emotional experience and trauma isn’t particularly helpful.
I’m not technical enough to explain this, but there are technical and legal issues with CSAM and the lemmy platform that we’ve ran into. For one, there’s no automated scanning tools for this kind of content. My understanding is that even implementing or creating said tools would be difficult because of the way pict-rs and rust are storing images in the first place. You cannot turn off image federation, at all. At best, you can clear the content, but doing so may violate CSAM laws depending on the country and reporting requirements. Someone on the technical side can explain better than I can.
This isn’t a thread to discuss who’s to blame for CSAM. Please cease all discussions fighting about religion in the comments. I will be removing these comments.

PenguinCoder@beehaw.org · edit-2 1 year ago

You cannot turn off image federation, at all.

This is correct for Lemmy codebase; but a WIP by the pictrs dev and upstream Lemmy itself.

For now, Beehaw users can go to their settings via the website, and uncheck Show images if they’re so inclined. This should prevent all images in posts and comments from loading automatically for you. This does not translate to other instances, front-ends, or apps. Just the main website. EDIT: Because of the caching, you’ll need to CTRL +F5 after saving this setting, to see it take affect.

Intelligence_Gap@beehaw.org · 1 year ago

I’m not sure that’s possible with images being allowed. If Google, Facebook, Instagram, and YouTube all struggle with it I think it will be an issue anywhere images are allowed. Maybe there’s an opening for an AI to handle the task these days but any dataset for something like that could obviously be incredibly problematic

thanevim@kbin.social · 1 year ago

Yeah, the key problem here is that any open forum, of any considerable popularity, since the dawn of the Internet has had to deal with shit like CSAM. You don’t see it elsewhere because of moderators. Doing the very job Op does. It’s just now, Op, you’re in the position. Some people can, and have decided to, deal with moderating the horrors. It may very well not be something you, Op, can do.

d3Xt3r@beehaw.org · edit-2 1 year ago

The thing is though, with traditional forums you get a LOT of controls for filtering out the kind of users who post such content. For instance, most forums won’t even let you post until you complete an interactive tutorial first (reading the rules and replying to a bot indicating you’ve understood them etc).

And then, you can have various levels of restrictions, eg, someone with less than 100 posts, or an account less than a month old may not be able to post any links or images etc. Also, you can have a trust system on some forums, where a mod can mark your account as trusted or verified, granting you further rights. You can even make it so that a manual moderator approval is required, before image posting rights are granted. In this instance, a mod would review your posting history and ensure that your posts genuinely contributed to the community and you’re unlikely to be a troll/karma farmer account etc.

So, short of accounts getting compromised/hacked, it’s very difficult to have this sort of stuff happen on a traditional forum.

I used to be a mod on a couple of popular forums back in the day, and I even ran my own community for a few years (using Invision Power Board), and never once have I had to deal with such content.

The fact is Lemmy is woefully inadequate in it’s current state to deal with such content, and there are definitely better options out there. My heart goes out to @Chris and the staff for having to deal with this stuff, and I really hope that this drives the Beehaw team to move away from Lemmy ASAP.

In the meantime, I reckon some drastic actions would need to be taken, such as disabling new user registrations and stopping all federation completely, until the new community is ready.

Thevenin@beehaw.org · 1 year ago

So this just got posted on lemmy.dbzer0. They’ve got an AI-based CSAM screen up and running with promising initial results. The model was trained using CLIP, which as far as I understand it means they used written descriptions of what CSAM is or is not.

Could something like this work for Beehaw?

apis@beehaw.org · 1 year ago

Wonder whether in theory one could use a dataset of… everything else, have the AI exclude what it does not recognise, then run the exclusions against a dataset to see whether or not they contain children. There could be an additional layer of running the exclusions against a dataset of regular sexual content.

One issue is that admin of any site would still want to report any CSAM to authorities. That could be automated by an AI checker, but one would have to have a lot of faith that the AI was decently accurate and not generating many false reports. The workaround I described to avoid using datasets of abuse is unlikely to be particularly accurate - ok for the purposes of protecting admin, but leaves them in an odd spot when it comes to banning a user, especially where a user’s livelihood could be impacted, or things like paid online courses. I guess specialist police departments probably would have to use highly relevant datasets, along with review by humans, but still - nobody wants to inadvertently clog up that system with false reports.

liv@beehaw.org · edit-2 1 year ago

I just want to say, I am so so so sorry you had to see that.

I accidentally saw some CSAM in the 1990s and you are right, it is burnt into your mind. It’s the real limit case of “what has been seen cannot be unseen” - all I could do was learn to avoid accessing those memories.

If you can access counselling for this, that might be a good option. Vicarious trauma is a real phenomenon.

Chris Remington@beehaw.org · 1 year ago

If you can access counselling for this, that might be a good option. Vicarious trauma is a real phenomenon.

Thank you for the advice. I’m not sure that I’ll need counseling but I’m open to it if need be. Time will tell.

loops@beehaw.org · 1 year ago

Be sure to keep tabs on yourself, sometimes these things can really sneak up on you.

flatbield@beehaw.org · edit-2 1 year ago

People keep talking about going to another platform. Personally I think a better idea would be to develop lemmy to deal with these issues. This must be a fediverse wide problem. So some discussion with other admins and the developers is probably the way to go on many of these things. Moreover you work with https://opencollective.com/, can they help. Beyond this, especially CSAM, there must be large funding agencies where one could get a grant to get some real professional programming put into this problem. Perhaps we could raise funds ourselves to help with this too.

So frankly I would like to see Beehaw solve the issues with lemmy, rather then just move to some other platform that will have its own issues. The exception may be if the Beehaw people think that being a safe space creates too big a target that you have to leave the Threadiverse to be safe. That to me seems like letting the haters win. It is exactly what they want. My vote will always be to solve the threadiverse issues rather then run away.

Just my feeling. There may be more short term practical issues that take precedence and frankly it is all up to you guys where you want to take this project.

snowe@programming.dev · 1 year ago

The solution is to use an already existing software product that solves this, like CloudFlare’s CSAM Detection. I know people on the fediverse hate big companies, but they’ve solved this problem already numerous times before. They’re the only ones allowed access to CSAM hashes, lemmy devs and platforms will never get access to the hashes (for good reason).

flatbield@beehaw.org · 1 year ago

They will still need to have a developer set this up and presumably it should be added as an option to the main code base. I thought I heard the beehaw admins were not developers.

snowe@programming.dev · 1 year ago

Not sure what you mean. You do not need to be a developer to set up CloudFlare’s CSAM detection. You simply have email the NCMEC, get an account, then check a box in CF, input some information about your NCMEC account, and then you’re good to go.

flatbield@beehaw.org · 1 year ago

How does the scan happen? It has to be linked in some how. Are you saying that choosing cloudflair as your CDN that will flag at distribution time? Or at upload time?

snowe@programming.dev · 1 year ago

If you use CloudFlare as your proxy then all your instances traffic gets routed through CF before ever making it to your server. If someone tries to upload CSAM it will immediately be flagged (before ever making it to your server). CloudFlare then quarantines it and automatically files a report with the National Center for Missing and Exploited Children. There’s more to the prices, but the point is that putting it in the lemmy software is not a good solution, especially when industry standard proven solutions already exist. You don’t have to use CF. You can also use solutions from Google, FB, Microsoft, Thorn, etc.

flatbield@beehaw.org · 1 year ago

Interesting. Thanks.

thySatannic@beehaw.org · 1 year ago

Wait… why is no access to csam hashes a good thing? Wouldn’t it make it easier to detect if hashes were public?! I feel like I’m missing something here…

snowe@programming.dev · 1 year ago

Giving access to CSAM hashes means anyone wanting to avoid detection simply has to check what they’re about to upload against the db. If it matches then they simply modify the image until it doesn’t. It’s literally guaranteed to make the problem worse, not better.

thySatannic@beehaw.org · 1 year ago

Ah thanks, hadn’t thought of that!

sarmale@lemmy.zip · 11 months ago

Question, from what I saw it seems like every CSAM image ever is assigned a new hash. Isnt it unscalable to asign a separate hash for everything? does that mean that most CSAM images were detected before?

bermuda@beehaw.org · 1 year ago

I’d be fine with not hosting images entirely. I don’t think people come to beehaw primarily to look at pictures

Chobbes@beehaw.org · 1 year ago

I’ve been thinking lately that I kind of miss things like IRC where you couldn’t really post pictures in chat. With things like Discord and Slack the off topic channels often devolve into people just sharing random memes they found funny at the time, and not really talking to each other. I’m sure there’s value in that too, but I think it can take up a lot of oxygen in the social space, so I’m not sure it’s always a win. Different formats encourage different ways of interacting with each other, I guess, and it’s interesting!

lerba@beehaw.org · 1 year ago

This post seems highly reactive to me. I’m sorry to hear of you being exposed to such disturbing material, but I fail to see at true connection of that happening and using Lemmy as the platform. I absolutely agree that nobody should have to experience what you did, but I disagree with the platform change proposition.

potterman28wxcv@beehaw.org · 1 year ago

I don’t know of any software platform where that would not happen.

Even with a text-only platform people can still post URLs to unsafe content.

I think OP is referring to some kind of automated scanner but I’m not sure there are publicly available ones. I guess using them would come at a cost - either computational or $$. And even so, there can be false positives so you would probably still have to check the report anyway someday.

PreparaTusNalgasPorque@kbin.social · 1 year ago

I’m sure those repugnant assholes do it “for the lulz” and if they want to mess with you they’ll do it anywhere.

There’s this study that says playing Tetris helps ease recently acquired trauma https://www.ox.ac.uk/news/2017-03-28-tetris-used-prevent-post-traumatic-stress-symptoms

And the admin from his eponymous instance dbzero created an interesting script to get rid of CSAM without having to review it manually, take a look -> https://github.com/db0/lemmy-safety

renard_roux@beehaw.org · edit-2 1 year ago

Just tagging @admin in case they don’t see this ❤️

Edit: aaand I did it wrong 🙄 @admin@beehaw.org 👈 Better?

AndreTelevise@beehaw.org · edit-2 1 year ago

Lemm.ee, another instance I am in, isn’t hosting images anymore or letting people upload images directly due to this issue. When your platform is supposed to be 100% open source and decentralized, there are bound to be issues like this, and they should be dealt with, even if proprietary tech is necessary for it. I’m sorry to hear about this.

🇰 🌀 🇱 🇦 🇳 🇦 🇰 ℹ️@yiffit.net · edit-2 1 year ago

Sadly, the only 100% way to never have that kind of material ever touch your servers is to not allow image uploads from the public. Whether it’s on Lemmy or another social site, or something you control entirely on your own. Maybe sooner than we think, AI could deal with the moderation of it so a human never has to witness that filth, but it’s not quite there yet.

Kangie@lemmy.srcfiles.zip · 1 year ago

A software platform that makes it nearly impossible for Beehaw to host, in any way, CSAM.

I hate to say it, but you’ll need to find a text-only platform. Allowing any image uploads opens the door to things like this.

Besides that, if your concern is that no moderator should be exposed to anything like that, well on a text-only site you might have to deal with disguised spam links to gore, scam, etc. You’ll still have to click on links to effectively moderate.

Maybe you should consider if this is a position that you want to put yourself in again. It sounds like this may just not be for you.

Chobbes@beehaw.org · 1 year ago

This was my immediate thought as well. It’s unfortunate, but there will probably always be people who abuse online platforms like this. It’s totally okay if you’re not up to the task of moderating disturbing content like that — it sounds like it can be a really brutal job. I don’t know what the moderation tools on Lemmy are like, but maybe there’s a way to flag different kinds of moderation concerns for different moderators (so not everybody has to be exposed to this kind of stuff if they’re not comfortable with it). And maybe there could also be a system where if user’s flag the post it can be automatically marked as NSFW and images can be hidden by default so moderators and other users don’t have to be exposed to it without warning (though of course such a system could potentially be abused as well).

PenguinCoder@beehaw.org · 1 year ago

Does that mean a platform that does not allow any images to be uploaded? Or a platform that has better access control and remediation controls?

Chris Remington@beehaw.org · 1 year ago

I’d be willing to consider either and would love your, particular, feedback on this as well.

flatbield@beehaw.org · edit-2 1 year ago

By the way. I have always been surprised that Beehaw did host images. The extra cost (they are large and costly in both storage and bandwidth), added security and attack vector possibilities, IP issues, CSAM issues, etc.

flatbield@beehaw.org · 1 year ago

Also, I do not think this is a Lemmy specific issue. It is an image availability, and scale issue. Federation of course increases the scale a lot too.

Scary le Poo@beehaw.org · 1 year ago

Did you forget to log into your alts or are you unaware of how the edit button functions?

Storage is super cheap, fwiw.

flatbield@beehaw.org · 1 year ago

Now be nice. Of course I know about the edit button. The comments were not posted at the same time and generally later editing is discouraged. Nor are long comments or one comment on different topics great.

Why on earth would I have multiple accounts? I am sure people do, but that too is kind of strange behavior and perhaps abusive depending on how they are used.

flatbield@beehaw.org · 1 year ago

Not as cheap as you think at scale and your renting the bandwidth and space from a hosting company and most of the users are probably free loading. The whole challenge of FOSS and services is that there is no one to pay operating costs.

Rentlar@lemmy.ca · 1 year ago

May I gently suggest for next time that you reply to yourself in a chain, if you’d like to add something on, and if you are against editing your post? I have trouble reading the order of your posts from the default sorting method.

PenguinCoder@beehaw.org · 1 year ago

The RATE of storage both the increasing and the bandwidth transferring, is the expensive part.

flatbield@beehaw.org · edit-2 1 year ago

I think if a platform has image capabilities this is to be expected. I guess the only exception if there are filters that can be used, but this seems unlikely. So I think it is an image vs. no image decision. The other problem with images is they can be attack vectors from a security point of view. Any complex file format can be an attack vector as interpreters of complex file formats often have bugs.

Can you imagine that the large platforms have whole teams of people that have to look at this stuff all day and filter it out. Not sure how that works, but it is probably the reality. Notice R$ never hosted images.

Storksforlegs@beehaw.org · 1 year ago

As others have suggested, I think temporarily suspending images until you guys can settle on a safe alternative to lemmy is a good idea.

Im sorry you had to see something like this, i hope you are able to seek out some counceling asap, talk to someone about it. Even something like https://www.7cups.com/ might be helpful.

Sina@beehaw.org · 1 year ago

I think temporarily suspending images until you guys can settle on a safe alternative to lemmy is a good idea.

There is no such thing as a safer alternative to Lemmy. It’s very easy to say things like “use tools” to filter these things, but in actuality it’s anything but, it’s way beyond a foss project. (or Reddit for that matter, though they are trying and good gawd, I just remembered something I saw on reddit and have not thought of for years, damn it)

Storksforlegs@beehaw.org · edit-2 1 year ago

Well true, but I meant more like a forum with limited access (no images or links) until you meet certain requirements etc. So not totally safe, but a bit safer than the current setup

apis@beehaw.org · 1 year ago

So, so sorry you had to see that, and thank you for protecting the rest of us from seeing it.

On traditional forums, you’d have a lot of control over the posting of images.

If you don’t wish to block images entirely, you could block new members from uploading images, or even from sharing links. You could set things up so they’d have to earn the right to post by being active for a randomised amount of time, and have made a randomised number of posts/comments. You could add manual review to that, so that once a member has ostensibly been around long enough and participated enough, admin look at their activity pattern as well as their words to assess if they should be taken off probation or not… Members who have been inactive for a while could have image posting abilities revoked and be put through a similar probation if they return. You could totally block all members from sharing images & links via DM, and admin email accounts could be set to reject images.

It is probably possible to obtain the means to reject images which could contain any sexual content (checked against a database of sexual material which does not involve minors), and you could probably also reject images which could contain children and which might not be wholesome (checked against a database of normal images of children).

Aside from the topic in hand, a forum might decide to block all images of children, because children aren’t really in a position to consent to their images being shared online. That gets tricky when it comes to late teens & early 20s, but if you’ve successfully filtered out infants, young children, pre-teens & early teens as well as all sexual content, it is very unlikely that images of teenagers being abused would get through.

Insisting that images are not uploaded directly, but via links to image hosting sites, might give admin an extra layer of protection, as the hosting sites have their own anti-CSAM mechanisms. You’d probably want to whitelist permitted sites. You might also want a slight delay between the posting of an image link and the image appearing on Beehaw - this would allow time for the image hosting site to find & remove any problem images before they could appear on Beehaw (though I’d imagine these things are pretty damn fast by now).

You could also insist that members who wish to post images or links to images can only do so if they have their VPN and other privacy preserving methods disabled. Most members wouldn’t be super-enthused about this, until they’ve developed trust in the admin of the site, but anyone hoping to share images of children being abused or other illegal content will just go elsewhere.

Admin would probably need to be able to receive images of screenshots from members trying to report technical issues, but those should be relatively easy to whitelist with a bot of some sort? Or maybe there’s some nifty plugin for this?

Really though, blocking all images is going to be your best bet. I like the idea of just having the Beehaw bee drawings. You could possibly let us have access to a selection of avatars to pick, or have a little draw plugin so members can draw their own. On that note, those collaborative drawing plugin things can be a fun addition to a site… If someone is very keen for others to see a particular image, they can explain how to find it, or they can organise to connect with each other off Beehaw.

jarfil@beehaw.org · 1 year ago

block new members from uploading images

I’ve tried those methods something like 10 years ago. It didn’t work; people would pose as decent users, then suddenly switch to posting shit when allowed. I’m thinking nowadays, with the use of ChatGPT and similar, those methods would fail even more.

Modern filtering methods for images may be fine(-ish), but won’t stop NSFL and text based stuff.

Blocking VPN access, to a site intended as a safe space, seems contradictory.

anyone hoping to share […] illegal content will just go elsewhere

Like some else’s free WiFi. Wardriving is still a thing.

draw plugin so members can draw their own

That can be easily abused, either manually or through a bot. Reddit has the right idea there, where they have an avatar generator with pre-approved elements. Too bad they’re pretty stifling (and sell the interesting ones as NFTs).

apis@beehaw.org · 1 year ago

Yup, as it gets ever easier to overwhelm systems, there are no good solutions to the matter, aside from keeping it text only + Beehaw’s own drawings.

jarfil@beehaw.org · edit-2 1 year ago

Some text-only creepepastas are equally disturbing and illegal in some places. IIRC some Lemmy instance in Ireland had to close shop because their legislation applies to both “images” and “descriptions of images”.

apis@beehaw.org · 1 year ago

True, but this is assuming one wishes to have a place to communicate online at all.

And though text can be intensely disturbing, it is inherently different to images/footage of actual children actually being harmed.

jarfil@beehaw.org · 1 year ago

Yeah… you’ll have to excuse me, because while I’d love to delve deeper into the philosophy of perception, the art of rhetoric, or how the AIs can upend it all… I’ll have to leave it here, since I’ve been told in no uncertain terms that this is not the place to discuss this kind of stuff.

Maybe we could meet in some other safe space, focused on pure intellectual discussions, if such existed.

apis@beehaw.org · 1 year ago

That’s fair.

Not currently using other spaces, nor aware of any suited to the topic (gladly, I suspect).

Storksforlegs@beehaw.org · edit-2 1 year ago

I second everything you said here

👁️👄👁️@lemm.ee · 1 year ago

As long as you can post links or upload images, there is an avenue for CSAM to be spammed. Beehaw should probably start with a whitelist and slowly expand. Refuse to federate with anyone that has open registration.

Flax@feddit.uk · 1 year ago

How would moving platform help with this? Can’t someone just post them to whatever platform beehaw moves (and dies) on?

Zev@beehaw.org · edit-2 1 year ago

Removed by mod

Flax@feddit.uk · edit-2 1 year ago

Satanic is being used as a synonym for evil here. And there’s nothing evil about Christianity. Just evil satanic people claiming to be Christian

Zev@beehaw.org · edit-2 1 year ago

Removed by mod

sludge@beehaw.org · 1 year ago

Removed by mod

Zev@beehaw.org · 1 year ago

Removed by mod

Flax@feddit.uk · 1 year ago

How is it worse? Have you read the Old Testament in comparison to the New Testament?

sludge@beehaw.org · 1 year ago

Removed by mod

Flax@feddit.uk · edit-2 1 year ago

There are many different institutions based on Christianity scaling from the Roman Catholic Church, Church of England, Presbyterian Church of the United States of America to tiny little churches such as small baptist churches and nondenoms.

So if your criticism is specifically with the Roman Catholic Church, fair enough, but the teachings of Christianity aren’t evil at all and if everyone acted how Jesus acted the world would be a far better place

Flax@feddit.uk · 1 year ago

You say to read the Bible, but at what point does the New Testament condone slavery, paedophilia and war? It sounds like you haven’t read it

Zev@beehaw.org · 1 year ago

Removed by mod

Flax@feddit.uk · 1 year ago

Fair enough

Zev@beehaw.org · 1 year ago

Removed by mod

Flax@feddit.uk · 1 year ago

Paedophiles, hate mongerers, etc

PenguinCoder@beehaw.org · 1 year ago

Removed by mod