• Phroon@beehaw.org
    link
    fedilink
    English
    arrow-up
    131
    ·
    3 months ago

    “You may not instantly see why I bring the subject up, but that is because my mind works so phenomenally fast, and I am at a rough estimate thirty billion times more intelligent than you. Let me give you an example. Think of a number, any number.”

    “Er, five,” said the mattress.

    “Wrong,” said Marvin. “You see?”

    ― Douglas Adams, Life, the Universe and Everything

      • Asafum@feddit.nl
        link
        fedilink
        arrow-up
        41
        ·
        edit-2
        3 months ago

        Yep! The hitchhikers books are so much fun lol

        I still think one of my favorite lines is “the ships hung in the sky in much the same way that bricks don’t.”

      • EatATaco@lemm.ee
        link
        fedilink
        English
        arrow-up
        0
        ·
        3 months ago

        “we don’t need to prove the 2020 election was stolen, it’s implied because trump had bigger crowds at his rallies!” -90% of trump supporters

        Another good example is the Monty Hall “paradox” where 99% of people are going to incorrectly tell you the chance is 50% because they took math and that’s how it works.

        Just because something seems obvious to you doesn’t mean it is correct. Always a good idea to test your hypothesis.

        • FiniteBanjo@lemmy.today
          link
          fedilink
          arrow-up
          0
          ·
          3 months ago

          Trump Rallies would be a really stupid sample data set for American voters. A crowd of 10,000 people means fuck all compared to 158,429,631. If OpenAI has been training their models on such a small pool then I’d call them absolute morons.

          • EatATaco@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            ·
            3 months ago

            A crowd of 10,000 people means fuck all compared to 158,429,631.

            I agree that it would be a bad data set, but not because it is too small. That size would actually give you a pretty good result if it was sufficiently random. Which is, of course, the problem.

            But you’re missing the point: just because something is obvious to you does not mean it’s actually true. The model could be trained in a way to not be biased by our number choice, but to actually be pseudo-random. Is it surprising that it would turn out this way? No. But to think your assumption doesn’t need to be proven, in such a case, is almost equivalent to thinking a Trump rally is a good data sample for determining the opinion of the general public.

    • gerryflap@feddit.nl
      link
      fedilink
      arrow-up
      40
      ·
      3 months ago

      I’m not a hundred percent sure, but afaik it has to do with how random the output of the GPT model will be. At 0 it will always pick the most probable next continuation of a piece of text according to its own prediction. The higher the temperature, the more chance there is for less probable outputs to get picked. So it’s most likely to pick 42, but as the temperature increases you see the chance of (according to the model) less likely numbers increase.

      This is how temperature works in the softmax function, which is often used in deep learning.

  • HarkMahlberg@kbin.social
    link
    fedilink
    arrow-up
    29
    ·
    edit-2
    3 months ago

    I mean… they didn’t specify it had to be random (or even uniform)? But yeah, it’s a good showcase of how GPT acquired the same biases as people, from people…

    • OsrsNeedsF2P@lemmy.ml
      link
      fedilink
      arrow-up
      21
      ·
      3 months ago

      uniform

      Reminds me of my previous job where our LLM was grading things too high. The AI “engineer” adjusted the prompt to tell the LLM that the average output should be 3. I had a hard time explaining that wouldn’t do anything at all, because all the chats were independent events.

      Anyways, I quit that place and the project completely derailed.

  • DarkFox@pawb.social
    link
    fedilink
    arrow-up
    17
    ·
    3 months ago

    Which model?

    When I tried on ChatGPT 4, it wrote a short python script and executed it to get a random integer.

    import random
    
    # Pick a random number between 1 and 100
    random_number = random.randint(1, 100)
    random_number
    
      • Amju Wolf@pawb.social
        link
        fedilink
        English
        arrow-up
        10
        ·
        3 months ago

        It generates code and then you can use a call to some runtime execution API to run that code, completely separate from the neural network.

    • Umbrias@beehaw.org
      link
      fedilink
      arrow-up
      2
      ·
      3 months ago

      That’s not answering the question though.

      “Pick a number between 1 and 100” doesn’t mean “grab two d10” or write a script.

  • FIash Mob #5678@beehaw.org
    link
    fedilink
    arrow-up
    16
    ·
    3 months ago

    HA, funny that this comes up. DND Beyond doesn’t have a d100, so I opened my ChatGPT sub and had it roll a d100 for me a few times so I could use my magic beans properly.

        • Urist@lemmy.ml
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          3 months ago

          Yup! Also one has to mind the order in which one rolls the dice. Since 10 and 5 could be either 05 or 50. As a bonus, if you roll them in order of “tens” to “ones”, getting 10 on the first dice has added suspense since the latter dice determines if it is going to count as a low roll of 0X (by rolling 1-9 on the next dice X) or if it is going to be a max roll of 100 (by rolling another 10).

    • TauriWarrior@aussie.zone
      link
      fedilink
      arrow-up
      8
      ·
      edit-2
      3 months ago

      Opened up DND Beyond to check since i remember rolling it before and its there, its between D8 and D10, the picture shows 2 dice

    • The Cuuuuube@beehaw.org
      link
      fedilink
      English
      arrow-up
      7
      ·
      3 months ago

      But why use Chatgpt for that? Why not a duck duck go action? I just don’t understand why we’re asking a LLM whose goal is consistency, not randomness, to do random

  • xyguy@startrek.website
    link
    fedilink
    English
    arrow-up
    8
    ·
    3 months ago

    Only 1000 times? It’s interesting that there’s such a bias there but it’s a computer. Ask it 100,000 times and make sure it’s not a fluke.

  • Wirlocke@lemmy.blahaj.zone
    link
    fedilink
    arrow-up
    8
    ·
    edit-2
    3 months ago

    I’m curious, is there actually so many 42’s in the system? (more than 69 sounds unlikely)

    What if the LLM is getting tripped up because 42 is always referred to as the answer to “the Ultimate Question of Life, the Universe, and Everything”.

    So you ask it a question like give a number between 1-100, it answers 42 because that’s the answer to “Everything”, according to it’s training data.

    Something similar happened to Gemini. Google discouraged Gemini from giving unsafe advice because it’s unethical. Then Gemini refused to answer questions about C++ because it’s considered “unsafe” (referring to memory management). But Gemini thinks C++ is “unsafe” (the normal meaning), therefore it’s unethical. It’s like those jailbreak tricks but from its own training set.

    • Corgana@startrek.website
      link
      fedilink
      arrow-up
      8
      ·
      3 months ago

      I’m curious, is there actually so many 42’s in the system?

      Sort of, it’s not actually picking a random number. It does not know what “random” means. It is analyzing the number of times the question “pick a random number” was asked and what the most common responses to that question looked like.

    • Glasgow@lemmy.ml
      link
      fedilink
      arrow-up
      3
      ·
      3 months ago

      I certainly hope that’s what happening or maybe it is actually the answer.

    • exanime@lemmy.today
      link
      fedilink
      arrow-up
      0
      ·
      3 months ago

      I’m curious, is there actually so many 42’s in the system? (more than 69 sounds unlikely)

      From hitchhiker’s guide to the galaxy?