UPDATE: proof is at https://piefed.social/c/fediverse/p/2035409/proof-of-ai-assisted-political-profiling-by-unruffled-lemmy-dbzer0-com. The main instance is lemmy.dbzer0.com but anarchist.nexus and quokka.au share admin/mod teams so those two are suspect also. I recently discovered that some popular federated instances have been using LLM-assisted moderation tooling that evaluates whether someone has said something bannable. They do this by running a script/app that sends the […]
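(The script itself was never published, so the following is only a hypothetical sketch of what tooling like that could look like. The endpoint, model name, prompt wording, and function names are all my assumptions, not the actual code.)

```python
# Hypothetical sketch only -- the actual script was never published.
# Assumes an OpenAI-compatible chat-completions endpoint.
import requests

API_URL = "https://api.openai.com/v1/chat/completions"  # assumed endpoint
API_KEY = "sk-..."  # placeholder

def evaluate_user(posts: list[str]) -> str:
    """Ask an LLM whether a user's posts contain anything 'bannable'."""
    prompt = (
        "You are assisting a forum moderator. Review the following posts "
        "and say whether the author has said anything bannable:\n\n"
        + "\n---\n".join(posts)
    )
    resp = requests.post(
        API_URL,
        headers={"Authorization": f"Bearer {API_KEY}"},
        json={
            # Model is assumed; the modlog reportedly showed an OpenAI model name.
            "model": "gpt-4o-mini",
            "messages": [{"role": "user", "content": prompt}],
        },
        timeout=60,
    )
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]
```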
as seen here and here, some instances are feeding posts wholesale to prompts, for what seem like extremely unsound reasons to me
yeah, it’s ok because the LLM wasn’t hooked up directly to the ban API, you just used it systematically to not do the only fucking thing you’re supposed to be doing as a moderator
Can you fucking read? What “systematically” are you talking about? How did we use it to “not do the only fucking thing you’re supposed to be doing as a moderator”? Is summarizing “the thing you’re supposed to be doing as a moderator”? Is doing a summary once “systematically”? Why are you continuing to spread disinfo?
Once again. The admin in question DID NOT USE THE LLM TO DECIDE ON THE ADMIN ACTION. Can you understand this? Can you read this? Am I talking to a wall?! You are swallowing disinfo and then spitting your outrage mindlessly on people.
Once again. The admin in question DID NOT USE THE LLM TO DECIDE ON THE ADMIN ACTION. Can you understand this? Can you read this?
Literally no one believes this corpo-speak bullshit. They just coincidentally ran this unpublished Python tool, did their own work, then just happened to use an LLM to do the exact same work right after, totally innocently? That reads as absolute ass-covering and nothing more. This is the “I smelled weed” of cop stops, just filtered through nerdy fediverse bullshit.
Then, because the above totally happened like you said it did, as a one-off joke that no one would ever notice, the same admin opted to put a current OpenAI model name in the LLM field in an absolutely-not-tongue-in-cheek way for other admins to totally catch and joke about? Which of course happened, haha, y’all had a big laugh about it before this blew up, yeah?
Oh, and of course this only happened the one time, and never again. Of course no one on your team used this unpublished time-saving and thought-terminating tool again, of course not.
Come the fuck on.
no but you’re about to be talking to a defed unless you stop this all caps fake outrage shit. should I do you the favor of deleting the garbage post you just made so you can give it another go?
think very carefully on this because I’m not sure pretending to not understand what I’m asking is working out for you
assume I know and understand that the LLM did not literally do the banning
maybe there’s a large body of existing research on how even human-in-the-loop systems confirm and worsen biases? maybe it’s a bit obvious when you go through, in your head, the process the moderator took to get to their decision? slowly now, maybe you’ll get there
assume I know and understand that the LLM did not literally do the banning
I am telling you, again, that the human did not use the LLM to think for them either. The admin took the decision to ban the user irrespective of the LLM, and the rest of our admin team, and me specifically, would never let an admin become a “human in the loop”. The LLM was used just to summarize, as part of the test, with a misguided inside joke about using OpenAI tech.
I will readily admit that there were mistakes made by the admin. Not in their actions, but in the optics. Because those optics were spun to keep feeding this made-up controversy. We didn’t use the LLM to decide or even guide our decision, but it appeared like we did, and we already owned that.
The admin took the decision to ban the user irrespective of the LLM, and the rest of our admin team, and me specifically, would never let an admin become a “human in the loop”. The LLM was used just to summarize
you don’t appear to have much understanding of how a human-in-the-loop system works in practice. LLM summaries are used to confirm biases, especially when the prompt is something along the lines of “do these posts contain <negative term>?” though these systems are stochastic, so you’re going to get unpredictable biases regardless of the prompt.
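(To make that concrete, here is a hypothetical sketch of the difference; the prompt strings are illustrative, not the actual tool’s, and the <negative term> placeholder is kept as-is from the comment above.)

```python
# Hypothetical illustration of prompt-induced bias -- not the actual tool.
posts = ["first post text", "second post text"]  # placeholder content
posts_text = "\n---\n".join(posts)

# Leading prompt: names the suspected verdict up front, inviting the model
# to confirm whatever the moderator already believes.
leading_prompt = f"Do these posts contain <negative term>?\n\n{posts_text}"

# More neutral framing: asks for a summary without suggesting a verdict.
neutral_prompt = f"Summarize the following posts:\n\n{posts_text}"

# Either way the model is stochastic: rerunning the same prompt can yield
# a different "verdict", which is part of why it can't serve as evidence.
```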
I don’t accept that the LLM summary didn’t influence the decision, because the mod in question confirmed that he knew the LLM agreed with him (that’s bias, and also not something LLMs are actually capable of doing), and because if it didn’t influence anything, then the summary is worthless
which is why maybe you should just not have them in the future? just don’t touch LLMs when you’re doing mod work. either there’s no reason for it or you’re doing something monstrously wrong.
I don’t accept that the LLM summary didn’t influence the decision, because the mod in question confirmed that he knew the LLM agreed with him (that’s bias, and also not something LLMs are actually capable of doing), and because if it didn’t influence anything, then the summary is worthless
In this case, according to the admin in question, the LLM summary came after the decision, as a sort of test. I.e., the admin made a decision and wanted to see if an LLM would subsequently agree with it. In this specific case, it did, which is why they misguidedly decided to keep its summary in the modlog (opening us up to this whole shitstorm). But ultimately, that admin decided anyway that having LLMs in the mix is not good at all, which is why you never again saw an LLM summary in the modlog.
I can only put so much fault on a person for just testing shit out, yanno? I am not happy that they decided to use the output of the test, because they are not familiar with how quickly disinfo breeds, but ultimately they came to the right decision anyway. If they had concluded otherwise and raised the issue of using LLMs officially, they would have been shut down.
Having an LLM confirm a decision is the same thing as having the LLM make a decision and then checking whether the mod agrees with it. If the mod would have reconsidered the ruling had the LLM disagreed, then the LLM was part of the decision-making process. The order does not matter.
Having the LLM output something that implies a determination at any step automatically makes it part of the process.
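(One way to see the point, as a sketch of the logic rather than anyone’s actual code: if any branch of the final ruling reads the LLM’s output, the LLM is in the loop, whatever order things ran in.)

```python
# Sketch: ordering is irrelevant. If the outcome can vary with
# llm_verdict, the LLM is part of the decision process; if it can't,
# the verdict was pointless to generate in the first place.
def final_ruling(mod_decision: str, llm_verdict: str) -> str:
    if llm_verdict == mod_decision:
        return mod_decision  # "the LLM agreed" -- pure confirmation
    return "reconsider"      # disagreement would have changed the path
```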
this human-in-the-loop shit is how corporations absolve themselves of responsibility for decisions taken purely on the word of an LLM. it lets them fire a worker instead of an executive. you’re sure this is the route you want to go?
I completely agree with you. We have never and will never go that route.
Here’s the answer to the only question you posted, which should be obvious from everything else I’ve said and done.
cool! please make it clear to the mod in question that they shouldn’t be using an LLM for anything in the future, even summarizing posts. make it part of your instance’s policies.
How could an admin possibly enforce that? What if a mod created a summary locally and never shared it with anyone? The AI summary wasn’t used as evidence; that is already policy and has been explained to you multiple times. You are shifting the goalposts to the moon, and no policy change will ever satisfy you.
oh no the goalposts! think of the theoretical shitheads who might do this in secret!
please see the pinned post and fuck off
lol, lmao even
you do realize all this comes across as running interference while feeling real guilty? you could just, like, drop a link to a repo with the bits. you know, with commit history. like i’ve been asking you for. just simply linking to that would be a real quick way to settle the issue! it’s even some free code review for you!
You realize there might not be a repo, right?
you realize that mod probably shouldn’t be one, right?
seriously this just looks more and more embarrassing the more you post about it
As I said elsewhere, I’d rather this be a learning moment than a purging moment.
the two things can go together, and not removing said mod means that if they do not learn, they can still do plenty of damage elsewhere later on
and I say this from the position of having fucked up in a mod role before