Bramblethorn
Sleep-deprived
- Joined
- Feb 16, 2012
- Posts
- 18,381
Does it? I believe that question depends on pure logic and nothing else. You cannot differentiate two identical sequences of words, so as long as both the AI and the Human are capable (and likely) to produce the same sequence for whatever reasons, you cannot really differentiate between them.
A lot hangs on the word "likely" there. When we're talking about documents the length of a Literotica story, any given story is extremely unlikely; what really matters is relative likelihood and our priors.
If we sit a monkey down at a typewriter and let it mash keys at random, it's capable of producing a perfect script of Hamlet on its first try, but the likelihood of it doing so is around 1 in 10^350,000. (Numbers are obviously very approximate; based on an estimate of 180,000 characters and a keyboard with 80 possible characters selected with equal probability.)
OTOH, if we take an English speaker who's unfamiliar with Hamlet and ask them to write whatever comes to mind, the likelihood is approximately 1 in 10^54,000. (Based on estimate of 180,000 characters and Shannon entropy of approximately one bit per character for English.)
If I stand outside the room where the monkey and the human are typing, and a manuscript flies out of the window that happens to be a perfect copy of Hamlet, I can estimate the probability that it was written by a human via Bayes' theorem:
P(human author) = L(human would write this)/(L(human would write this) + L(monkey would write this))
i.e. (10^-54000)/(10^-54000 + 10^-350000)
which works out at 99.99999...%, with about 296,000 nines after the decimal point.
So even though both the human and the monkey are capable of writing Hamlet, and both are extremely unlikely to do so, one is much more unlikely to do so, and that lets us be pretty confident that this one was written by a human.
When we're looking at a long sequence of characters (one Literotica page is approximately 20,000 characters), even small differences in the generating mechanism can become detectable, in the same way that even a slightly biased coin can be distinguished from a fair one if one tosses it many times.
Interesting read, but besides the point we are discussing. You are only talking about your own limitation here in expressing some knowledge your brain has acquired. Our brains are tailored to recognize shapes and especially faces that are familiar to us. Our vocabulary is not.
And likewise, our brains are "tailored" (as an atheist, I'd prefer "adapted" here) to deal with languages, to process a string of words and infer things about the speaker/author, without being adapted to articulate how they do that. There doesn't seem to be any obvious reason to assume that a person's inability to describe how they analyse language means they lack the capacity to analyse language.
If somebody is unable to explain to me what their mind and body are doing when they catch a ball or ride a bicycle, do I conclude from that that they're unable to do these things?
An interesting observation, but nothing to do with what you are trying to prove. All you've proven is that words are ill suited to accurately convey certain types of information.
...which is exactly the point.
Now you are talking about a living and evolving language and its rules. I didn't live back in Tolkien's days, but I know for a fact, that just 100 years ago, language was used quite differently. Go back 200 or 300 years and it could start to feel like a foreign language or a very different dialect from what you are used to.
Indeed! And by reading examples of older texts, one can develop the ability to distinguish between material written 200 years ago and material written today, even without being able to write out a comprehensive list of the differences.
I'm not claiming that I or any other human would achieve 100% accuracy on picking AI-generated from human-written, nor that I personally am highly skilled in doing so.
Sooo.. you cannot tell me what makes something feel like it's been written by AI, but you can tell if its written by AI and not someone who is not a native speaker of the language or someone who is too meticulous when it comes to grammar and proper sentence structure.
Excellent reading comprehension. 10/10. That's definitely, absolutely, exactly what I said there.