I don’t necessarily expect anyone to know the answer to this; maybe there isn’t one exactly. But I have a better chance here than anywhere else!
I love random poetry generators. Ever since the first computer-written poetry was publicized (back in the 80’s, I think) I have been fascinated with the idea. I like the ones that compose original poetry far more than the ones that simply take great poets’ work and cut-and-paste it.
My favorite, Random Verse Lab, has a lot of cool features. For one thing, it lets you modify the lexicon any way you like – add to it, delete words, have separate subsets of words in a separate word set. The way it works is, the words are separated into parts of speech; you make a template in a grid where each word in each line is represented by a box into which you enter a part of speech. It even has a few blank part-of-speech categories; which is fortunate because it works much better if you don’t have to lump the articles in with the adjectives.
My question is this: It’s pretty much only by chance that an occasional poem will sound like natural language. The rest of the time, it sounds like phrasebook English. Are the rules of construction in English simply so complex and with so many exceptions that it’s just not practical to write so many rules into a program?
I’ve noticed that translation engines seem to have the same problem; as if they can’t look at the inputted text as a whole and translate it, but rather they do it one word at a time.