I think the main point of quantum mechanics on phase space is that you can essentially conceive of it as a deformation (about what exactly that means, later) of classical (statistical) mechanics. JWT Kottekoe has done a good job at explaining the relevant notions, I’ll just try and add some more things from this perspective.
First, if you do classical mechanics on phase space, your whole system is represented by a point, whose coordinates give you the positions and momenta of all the constituents of your system. So, in general, if you have a system of N point particles, this means 6N coordinates in a three-dimensional position space: three coordinates for the position of each of the N particles, and again three for the momenta. The evolution of your system then is a trajectory through phase space—in JWT Kottekoe’s example, for instance, a circle (as depicted on this wiki page).
Now, the phase space area for the harmonic oscillator has dimensions of [positionmomentum]=[energytime]=Js, which is not coincidentally the dimension of Planck’s constant ℏ. As you may know, the uncertainty principle constrains the accuracy with which one can know both position and momentum of a quantum system simultaneously. Roughly, the product of the uncertainty in each quantity, ΔxΔp, must always be at least ℏ or greater. But this means that your system can’t be represented by a point in phase space anymore, but rather, must be constrained to a spot of area at least ℏ.
There is another situation in physics in which a system does not occupy a definite spot in phase space: statistical mechanics. There, the precise state of the system is simply not known; rather, what is known is a probability distribution over possible states. In general, and in principle, one can always ‘sharpen’ this distribution to a point, by measuring all parts of the system. This is not possible in quantum mechanics, due to the uncertainty principle, as we have seen. This emphasizes the irreducible nature of probability in quantum mechanics: it doesn’t just quantify our ignorance (as it does in the statistical mechanics case), but rather, is fundamental.
But still, it is natural to suppose that, if ℏ were vanishingly small, quantum mechanics on phase space will transition to ordinary statistical mechanics. This is indeed the case, and is the reason why I have called quantum mechanics a deformation of phase space statistical mechanics. In the mathematical sense, a deformation of some object is a related object that goes over into the original object upon the vanishing of some deformation parameter. For instance, an ellipse is a deformation of a circle in this sense: if the eccentricity goes to zero, an ellipse simply becomes a circle.
The same is now the case with phase space quantum mechanics: here, the deformation parameter is simply ℏ, and if that tends to zero, we get classical statistical mechanics back out. In particular, the Wigner function is simply a deformed version of the probability distribution on phase space (known as the Liouville distribution); it gives you exactly the same information as, for instance, the quantum wave function or density matrix (to the latter of which it is related by the Weyl-Wigner transform).
Doing quantum mechanics in this way has some advantages, for instance, in quantum optics; additionally, it is sometimes useful to be able to do quantum mechanics in the same arena as classical mechanics (rather than more abstract ones such as Hilbert spaces). But I think the greatest gain is conceptual: while nothing in the discussion I gave should be mistaken for rigorous, the intuition it builds through being able to conceptualize quantum systems, I think, can be very useful (the same point, incidentally, as is being raised by the article linked in the OP).