I just found out reddit sold everything we wrote to AI companies… and honestly I don’t know how to feel

I just found out reddit sold everything we wrote to AI companies… and honestly I don’t know how to feel

So I was reading about Reddit’s API controversy from 2023 and fell down a rabbit hole.

Turns out every post, every comment, every opinion you’ve shared here - reddit licensed it to openai and google. No opt-out. No warning. Just. - done.

And that’s just reddit. Meanwhile Google, Meta, and basically every major platform are quietly building a profile on you — your interests, your political leanings, your daily routine, your insecurities. All from things you said or clicked on “anonymously.”

The wild part? We already knew this was happening. It’s not new. Yet here we all are, still posting.

So I’m genuinely curious — why do you still use reddit (or big tech in general) knowing this?

Is it because:

  • The alternatives (Lemmy- kbin- etc…) just aren’t there yet?
  • You’ve accepted it as the price of the internet
  • You actually don’t think it’s that big a deal?
  • Or you simply never thought about it until now?

Not judging anyone — I’m still here too. Just want to hear honest answers.

  • CodenameDarlen@lemmy.world
    link
    fedilink
    arrow-up
    26
    arrow-down
    2
    ·
    15 hours ago

    I commented on this post.

    I said it doesn’t make sense, anyone can scrap Reddit or Lemmy just fine to train on LLM.

    They’re just making it official.

    If you don’t want your data trained on LLMs just stop interacting on the internet.

    I’m not very into this subject but I think they can use proxies to bypass rate limitation by IPs while scrapping a lot of Reddit per second. Even Lemmy can be scrapped.

    Even your views might help LLMs training because your views tell which content is worth to focus and scrap.

    • JackbyDev@programming.dev
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 hours ago

      This is 100% why Reddit made the API changes the originally brought many of us over. AI companies scraped the web, made LLMs, and Reddit missed out. They wanted to make sure the next ones paid them.

    • Lag@piefed.world
      link
      fedilink
      English
      arrow-up
      20
      ·
      edit-2
      14 hours ago

      We switch to local meeting spots and pass around flashdrives of weekly memes. Refer to your local memester to turn in your flashdrive for a new weekly updated meme drive.

      I need to open a meme truck.

      • CodenameDarlen@lemmy.world
        link
        fedilink
        arrow-up
        9
        ·
        14 hours ago

        That was unironically my take when AI bots started taking over the internet forums.

        I even said to a friend, people will start to go back to small closed communities or groups where everybody knows each other, and everybody knows no AI crap will be shared in those groups. Local art will be more valuable because we know for sure that it’s human made and someone put real effort on it.

        Mainstream media will be always prone to be AI poisoned.

        • Tiresia@slrpnk.net
          link
          fedilink
          arrow-up
          1
          ·
          11 hours ago

          In many cases, it’s easier to have an online culture with an anti-AI policy than a local one. A bunch of people already insist on using AI when interacting with others irl, and many more are passively supportive of them doing so. (i.e. “they don’t care”, but in a very different way from how “they don’t care” about someone eating vegan).

          So an online group that has persistent identities where it’s hard to get a new account with a good status whose culture opposes AI is going to be much easier to keep AI-free than your local neighborhood third space.

          • REDACTED@infosec.pub
            link
            fedilink
            arrow-up
            1
            ·
            edit-2
            8 minutes ago

            Right, like how heavily Lemmy hates AI but then I see AI generated memes or shitposts constantly?

    • OwOarchist@pawb.social
      link
      fedilink
      English
      arrow-up
      5
      ·
      14 hours ago

      anyone can scrap Reddit or Lemmy just fine to train on LLM.

      Well, until you hit flood limits and reddit keeps giving you ‘prove you’re not a robot’ screens or just timegates you from loading new pages at all.

      Plus, it would be a lot more convenient to have API access to automatically provide your AI with only the text of posts and not have to scrape/strip entire pages.

      You could scrape reddit without paying for it, but it would probably be a much slower and more annoying process.

      • CodenameDarlen@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        12 hours ago

        You can use proxies to bypass rate limits of access, so it wouldn’t be slower just expensive. But the only ones doing it are bigger companies anyway, they’ve money to do it.

        • OwOarchist@pawb.social
          link
          fedilink
          English
          arrow-up
          1
          ·
          5 hours ago

          But the only ones doing it are bigger companies anyway, they’ve money to do it.

          Yeah… Compared to the expense of buying literally all the computer hardware in the world, paying reddit for API access is nothing.