First of all, quantum mechanics does not violate classical logic. This is straightforwardly demonstrated through the possibility of so-called ‘hidden variable’-interpretations, like Bohmian mechanics. They do have unusual properties (non-locality, contextuality, etc.), and they essentially refer to ‘surplus structure’, i.e. unobservable stuff that the OP (rightly) has a problem with, but it’s always at least possible, if perhaps inconvenient. In such interpretations, one can always reason classically about every observable, and weirdnesses of quantum theory are indeed only apparent, and due to our ignorance of the hidden variables.
Nevertheless, you can also choose to look at quantum theory using a logic different from classical logic; however, in this case, it’s not the principle of the excluded middle that goes, but rather, distributivity. This is known as quantum logic (and while it has been proposed that it should replace classical logic as the ‘proper’ logic, because of the above argument, that doesn’t actually follow).
The reason for this is so-called complementarity. Two propositions are complementary roughly if both can’t simultaneously be known exactly. This is the basis of the uncertainty principle, which thus provides the classic example of why quantum logic does not distribute. First of all, the principle itself says that the accuracy of both position and momentum can’t exceed a specific threshold – the formalization is typically: ΔxΔp > h, where Δx and Δp are the uncertainties in position and momentum, respectively. One can translate this into the following proposition:
z: The particle’s momentum is in the interval Δp, and its position lies in the interval Δx.
Now we can think about the following three propositions:
p: The particle’s momentum is in Δp,
q: The particle’s position is in Δx[sub]1[/sub]
r: The particle’s position is in Δx[sub]2[/sub]
Where Δx[sub]1[/sub] and Δx[sub]2[/sub] are just the, say, left and right halves of the interval Δx. Then the following proposition: p ˄ (q ˅ r) (where ˄ denotes the logical and, and ˅ denotes the logical or) is clearly true, because it is equivalent to z, and thus, just a restatement of the uncertainty principle. However, the proposition: (p ˄ q) ˅ (q ˄ r), which is equivalent in classical logic, fails to be true, as both p ˄ q and q ˄ r are false; each asserts a proposition incompatible with the uncertainty principle, e.g. ‘the particle’s momentum is in Δp and the particle’s position is in Δx[sub]1[/sub]’, which would mean Δx[sub]1[/sub]Δp < h.
Now it is in this complementarity that the key to superposition lies. Let’s say there’s a property, which can obtain either of two values (typically, this is a particle’s spin along some axis), which is complementary to a similar property (spin along another axis, for example). If the particle thus has a definite spin along one axis, its spin along the other axis must be fully indeterminate – which means, it must be in a superposition of both possible values. Now Schrödinger’s cat just consists in coupling the welfare of a macroscopic being to such a superposition – in principle, it’s a straightforward consequence of the theory.
The superposition is then not of the logical form ‘A and not-A’, i.e. it’s not a violation of the principle of the excluded middle. Rather, if ‘A’ is ‘alive’, it’s just ‘not-A’ – the cat’s two-valued property doesn’t have a definite value along the alive-dead axis (so it’s certainly not alive, but also not dead – one way of viewing this is simply that alive and dead don’t exhaust the possibilities in the quantum world, but one can get rather tangled up thinking about it this way).
It’s interesting that almost all of quantum theory follows from the existence of complementary propositions – the way you can ‘build’ ordinary probability theory on classical logic, you can build quantum mechanics on quantum logic, too. I’ve written up a sketch of how this works (it will contain a lot of redundancy if you’re familiar with classical logic, set theory, and probability, so you can probably skim those parts), and also of how thinking about quantum mechanics in terms of such a novel theory of probability illustrates the origin (and meaning) of interference effects, such as in the classical double slit experiment. And for the complete three-fer, I discuss superpositions in the beginning of this bit on quantum entanglement in the light of what’s known as ‘Zeilinger’s principle’ (who, in a word, considers the origin of complementarity to be the finiteness of information in any given system – once something about it is known perfectly, there simply isn’t enough information to decide its other properties). In the interest of openness, however, I should mention that my perspectives probably can’t be considered entirely mainstream.
So much for shameless self-promotion… 