It would be very interesting to observe the generation with token probabilities shown as well. Because this is either the neural network itself generating (pseudo-)randomness, or it’s throwing out all number tokens in equal (ish) probabilities and letting the sampler (which is fed actual random numbers) do the picking.
2 Comments
We all know 50 isn’t very random. Good job Llama
It would be very interesting to observe the generation with token probabilities shown as well. Because this is either the neural network itself generating (pseudo-)randomness, or it’s throwing out all number tokens in equal (ish) probabilities and letting the sampler (which is fed actual random numbers) do the picking.