Okay, just some brainstorming along the lines of “what if the clues don’t matter?” Or rather, “what if the meaning of the clues doesn’t matter?”
Each clue/answer pair is a string of words followed by a string of numbers. Some words are capitalized; let’s assume those are the only ones we care about. For example:
A. Former Name of Transylvania: [69, 95, 55, 174, 147, 11, 165, 137, 30, 91, 102, 181, 110]
There are 13 numbers in the answer. There are three important words in the clue. These words have 6, 4 and 12 letters. (As far as I can tell, none of the clue words have more letters than there are numbers in the answer.) So maybe the 6, 4, 12 maps to the 6th, 4th and 12th number in the answer:
11-174-181
Similarly you could go through the rest of the clues, and you’d end up with a string of numbers that you could then work on decoding. The vast majority of the numbers in the answers, and the meanings of the clues, are all just smokescreens.
Now I don’t know where you go from there, but it could be that some kind of scheme like this would let us cut out all of the red herrings and focus on the real message.
One advantage of a scheme like this is that it’s relatively simple to encode. You just write your clues (any clue will do) and then put the numbers you care about in the correct positions, filling in the rest with garbage numbers.
I don’t know how the Sentence part fits in, though.