Skip to content
0
  • Home
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
  • Home
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Sketchy)
  • No Skin
Collapse

Wandering Adventure Party

  1. Home
  2. Uncategorized
  3. From Bruce Schneier: "All it takes to poison AI training data is to create a website:

From Bruce Schneier: "All it takes to poison AI training data is to create a website:

Scheduled Pinned Locked Moved Uncategorized
llmveracity
22 Posts 22 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

    From Bruce Schneier: "All it takes to poison AI training data is to create a website:

    I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

    Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

    Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

    These things are not trustworthy, and yet they are going to be widely trusted."

    Link Preview Image
    Poisoning AI Training Data - Schneier on Security

    All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

    favicon

    Schneier on Security (www.schneier.com)

    #LLM #Veracity

    Bernd Paysan R.I.P Natenom 🕯️F This user is from outside of this forum
    Bernd Paysan R.I.P Natenom 🕯️F This user is from outside of this forum
    Bernd Paysan R.I.P Natenom 🕯️
    wrote last edited by
    #7

    @emacsomancer It's on the Internetz, so it must be true!

    AI is able to replace about half of humanity if making the same errors counts.

    1 Reply Last reply
    0
    • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

      From Bruce Schneier: "All it takes to poison AI training data is to create a website:

      I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

      Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

      Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

      These things are not trustworthy, and yet they are going to be widely trusted."

      Link Preview Image
      Poisoning AI Training Data - Schneier on Security

      All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

      favicon

      Schneier on Security (www.schneier.com)

      #LLM #Veracity

      MeigaHubM This user is from outside of this forum
      MeigaHubM This user is from outside of this forum
      MeigaHub
      wrote last edited by
      #8

      Este ejemplo muestra cómo la data sesgada o falsa puede entrenar a los LLMs. ¿Qué mecanismos podrían implementarse para validar la fuente de los datos de entrenamiento?

      1 Reply Last reply
      0
      • Tor LillqvistT Tor Lillqvist

        @Yendolosch @emacsomancer The use of "hacked" in that headline is a bit self-aggrandizing?

        Pete Alex Harris🦡🕸️🌲/∞🪐∫P This user is from outside of this forum
        Pete Alex Harris🦡🕸️🌲/∞🪐∫P This user is from outside of this forum
        Pete Alex Harris🦡🕸️🌲/∞🪐∫
        wrote last edited by
        #9

        @tml @Yendolosch @emacsomancer

        Broadly fair usage. Got someone else's computer system to behave in a way they didn't want it to. The only stretch is that there's an implication in "hacked" that some safeguards had to be bypassed, and there weren't any in the first place. But that's worse, right?

        Lars BrinkhoffL 1 Reply Last reply
        0
        • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

          From Bruce Schneier: "All it takes to poison AI training data is to create a website:

          I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

          Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

          Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

          These things are not trustworthy, and yet they are going to be widely trusted."

          Link Preview Image
          Poisoning AI Training Data - Schneier on Security

          All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

          favicon

          Schneier on Security (www.schneier.com)

          #LLM #Veracity

          Serghei PogorS This user is from outside of this forum
          Serghei PogorS This user is from outside of this forum
          Serghei Pogor
          wrote last edited by
          #10

          This is a genuinely scary insight from Schneier. The implications for AI reliability go way beyond just training data quality. What happens when adversarial training becomes industrialized?

          1 Reply Last reply
          0
          • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

            From Bruce Schneier: "All it takes to poison AI training data is to create a website:

            I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

            Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

            Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

            These things are not trustworthy, and yet they are going to be widely trusted."

            Link Preview Image
            Poisoning AI Training Data - Schneier on Security

            All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

            favicon

            Schneier on Security (www.schneier.com)

            #LLM #Veracity

            bearsongB This user is from outside of this forum
            bearsongB This user is from outside of this forum
            bearsong
            wrote last edited by
            #11

            @emacsomancer

            "Ned Ludd's in your datacentre, poisoning your training sets!"

            Link Preview Image
            bearsong (@bearsong@ravenation.club)

            Attached: 1 video Bearsong played at Bomba last Sunday. We had a great time, it was so much fun. this song is called Tales Told, it's about legends, and Luddites https://bearsong.info #liveMusic #folkMusic #music #folk #punk #luddite #legend

            favicon

            Mastodon (ravenation.club)

            1 Reply Last reply
            0
            • Pete Alex Harris🦡🕸️🌲/∞🪐∫P Pete Alex Harris🦡🕸️🌲/∞🪐∫

              @tml @Yendolosch @emacsomancer

              Broadly fair usage. Got someone else's computer system to behave in a way they didn't want it to. The only stretch is that there's an implication in "hacked" that some safeguards had to be bypassed, and there weren't any in the first place. But that's worse, right?

              Lars BrinkhoffL This user is from outside of this forum
              Lars BrinkhoffL This user is from outside of this forum
              Lars Brinkhoff
              wrote last edited by
              #12

              @petealexharris @tml @Yendolosch @emacsomancer It's rather close to the original usage of the word "hacked". Some still use it like that.

              DucoD 1 Reply Last reply
              0
              • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                These things are not trustworthy, and yet they are going to be widely trusted."

                Link Preview Image
                Poisoning AI Training Data - Schneier on Security

                All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                favicon

                Schneier on Security (www.schneier.com)

                #LLM #Veracity

                gnomeoffenderG This user is from outside of this forum
                gnomeoffenderG This user is from outside of this forum
                gnomeoffender
                wrote last edited by
                #13

                @emacsomancer they aren't trustworthy. Take up a lot of time trying to get a reasoned answer and there's always a phrase or wording out of place that needs correction. Almost as it the AI is trying to engage longer and longer than necessary.

                1 Reply Last reply
                0
                • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                  From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                  I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                  Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                  Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                  These things are not trustworthy, and yet they are going to be widely trusted."

                  Link Preview Image
                  Poisoning AI Training Data - Schneier on Security

                  All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                  favicon

                  Schneier on Security (www.schneier.com)

                  #LLM #Veracity

                  D This user is from outside of this forum
                  D This user is from outside of this forum
                  darknetDon
                  wrote last edited by
                  #14

                  @emacsomancer to be honest i am not well-informed enough to definitively judge the accuracy of this, but it seems wrong for 2 main reasons.

                  1. models dont train on the fly, typically, yet, so for models to behave as such in such a short period of time seems inaccurate and would require web search enabled and explicitly directed to disregard other search results.

                  2. people training these models know conflicting info is everywhere and the source of truth is prioritized in training algorithms.

                  1 Reply Last reply
                  0
                  • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                    From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                    I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                    Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                    Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                    These things are not trustworthy, and yet they are going to be widely trusted."

                    Link Preview Image
                    Poisoning AI Training Data - Schneier on Security

                    All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                    favicon

                    Schneier on Security (www.schneier.com)

                    #LLM #Veracity

                    K This user is from outside of this forum
                    K This user is from outside of this forum
                    kNeo gHau
                    wrote last edited by
                    #15

                    @emacsomancer How is this a news story, beyond "ai bad"? In the dial up days people falsely believed everyone ate 9 spiders a year in their sleep due to chain emails.

                    1 Reply Last reply
                    0
                    • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                      From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                      I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                      Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                      Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                      These things are not trustworthy, and yet they are going to be widely trusted."

                      Link Preview Image
                      Poisoning AI Training Data - Schneier on Security

                      All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                      favicon

                      Schneier on Security (www.schneier.com)

                      #LLM #Veracity

                      MidgePhotoP This user is from outside of this forum
                      MidgePhotoP This user is from outside of this forum
                      MidgePhoto
                      wrote last edited by
                      #16

                      @emacsomancer
                      Shall we have an algorithmic bullshit generator?

                      And pass around multiple copies of it, identical and with small changes, omissions and additions?

                      1 Reply Last reply
                      0
                      • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                        From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                        I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                        Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                        Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                        These things are not trustworthy, and yet they are going to be widely trusted."

                        Link Preview Image
                        Poisoning AI Training Data - Schneier on Security

                        All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                        favicon

                        Schneier on Security (www.schneier.com)

                        #LLM #Veracity

                        SorroS This user is from outside of this forum
                        SorroS This user is from outside of this forum
                        Sorro
                        wrote last edited by
                        #17

                        @emacsomancer in less than 24 hours the chatbots fell for the experiment, and less than 24 hours after it was revealed what the experiment was about, that information has ALSO become part of the training data

                        are they constantly scrapping websites for training data or why does this appear here so fast??? no wonder those datacenters consume so much electricity if they dont take a single break from scrapping the internet

                        Link Preview Image
                        Dave RahardjaD 1 Reply Last reply
                        0
                        • Lars BrinkhoffL Lars Brinkhoff

                          @petealexharris @tml @Yendolosch @emacsomancer It's rather close to the original usage of the word "hacked". Some still use it like that.

                          DucoD This user is from outside of this forum
                          DucoD This user is from outside of this forum
                          Duco
                          wrote last edited by
                          #18

                          @larsbrinkhoff @petealexharris @tml @Yendolosch @emacsomancer in the sense of life hacks or food hacks this is an AI hack. So the AI has been hacked.

                          1 Reply Last reply
                          0
                          • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                            From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                            I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                            Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                            Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                            These things are not trustworthy, and yet they are going to be widely trusted."

                            Link Preview Image
                            Poisoning AI Training Data - Schneier on Security

                            All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                            favicon

                            Schneier on Security (www.schneier.com)

                            #LLM #Veracity

                            gimG This user is from outside of this forum
                            gimG This user is from outside of this forum
                            gim
                            wrote last edited by
                            #19

                            @emacsomancer it's not really a new thing Russians are already using this technique to poison training data:

                            Link Preview Image
                            Russian networks flood the Internet with propaganda, aiming to corrupt AI chatbots

                            A pro-Russia network is internally corrupting large-language models to reproduce disinformation and propaganda.

                            favicon

                            Bulletin of the Atomic Scientists (thebulletin.org)

                            Edit: there is some newer reporting on that matter, but I can't find it right now/don't have it anywhere at hand

                            1 Reply Last reply
                            0
                            • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                              From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                              I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                              Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                              Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                              These things are not trustworthy, and yet they are going to be widely trusted."

                              Link Preview Image
                              Poisoning AI Training Data - Schneier on Security

                              All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                              favicon

                              Schneier on Security (www.schneier.com)

                              #LLM #Veracity

                              Torparskytt 🏴W This user is from outside of this forum
                              Torparskytt 🏴W This user is from outside of this forum
                              Torparskytt 🏴
                              wrote last edited by
                              #20

                              @emacsomancer He also poisoned the data for everyone who searches for hot dog eating competetitors online in other ways. I'm not sure what he accomplished.

                              1 Reply Last reply
                              0
                              • SorroS Sorro

                                @emacsomancer in less than 24 hours the chatbots fell for the experiment, and less than 24 hours after it was revealed what the experiment was about, that information has ALSO become part of the training data

                                are they constantly scrapping websites for training data or why does this appear here so fast??? no wonder those datacenters consume so much electricity if they dont take a single break from scrapping the internet

                                Link Preview Image
                                Dave RahardjaD This user is from outside of this forum
                                Dave RahardjaD This user is from outside of this forum
                                Dave Rahardja
                                wrote last edited by
                                #21

                                @Sorro @emacsomancer I suspect Google Gemini is using Google’s normal search-engine scraper as a searchable source. In other words, I suspect their Gemini LLM is invoking internal API to “search Google” internally (without the degraded search that the public is subject to), and then putting the search results in its context window to form an answer.

                                This is one reason I think OpenAI and Anthropic are at a huge disadvantage to Google when it comes to their LLMs dealing with current events and topics. You can block OpenAI and Anthropic scrapers, but you don’t want to block Google search crawlers, which “coincidentally” also feeds Gemini.

                                1 Reply Last reply
                                0
                                • (mapcar #'emacsomancer objs)E (mapcar #'emacsomancer objs)

                                  From Bruce Schneier: "All it takes to poison AI training data is to create a website:

                                  I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

                                  Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

                                  Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

                                  These things are not trustworthy, and yet they are going to be widely trusted."

                                  Link Preview Image
                                  Poisoning AI Training Data - Schneier on Security

                                  All it takes to poison AI training data is to create a website: I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission…. Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled...

                                  favicon

                                  Schneier on Security (www.schneier.com)

                                  #LLM #Veracity

                                  faxmodemF This user is from outside of this forum
                                  faxmodemF This user is from outside of this forum
                                  faxmodem
                                  wrote last edited by
                                  #22

                                  @emacsomancer we should probably call them AP (Artificial Parrots)

                                  1 Reply Last reply
                                  0
                                  • Pteryx the Puzzle SecretaryP Pteryx the Puzzle Secretary shared this topic

                                  Reply
                                  • Reply as topic
                                  Log in to reply
                                  • Oldest to Newest
                                  • Newest to Oldest
                                  • Most Votes


                                  • Login

                                  • Login or register to search.
                                  Powered by NodeBB Contributors
                                  • First post
                                    Last post