And, here we see the crux of the problem:
(1) You are setting standards for what you believe to be correct practice in science that are not the standards that actually exist. Maybe they are the standards that you think ought to exist…but then, who are you to decide?
(2) You are listening to only one side when complaints arise about people refusing to provide data or code or whatever.
As one example of the first issue above: You have arbitrarily decided that you will only accept evidence of testing on future data, which conveniently means we will have to wait a long time for verification since the future is…well…the future and even intention will tell you that because of fluctuations it takes more than a decade’s worth of data…probably more like two…to do any sort of meaningful comparison.
And, when it is pointed out that there has been enough time to see how Hansen’s predictions from 20 years ago fared, you come up with excuses not to believe the results. (E.g., you demand that the original code be re-run rather than looking at the results that were produced back then and picking the emissions scenario that came closest to the actual forcings that materialized.)
As a second example, you have determined that everyone must release their source codes even though this is simply not standard practice in most fields. Replication in the physical sciences means that you follow the procedures that the original paper has outlined and see if you get similar results. It has never to my knowledge meant that you check the other person’s code line-by-line and/or requiring that the procedure be precisely mathematically duplicated. [As it turns out, I believe that Mann has now released his code although the NSF was very clear in stating that there was absolutely no requirement on him whatsoever to do so.]
One other point I should make here since you keep making references to what I assume to be Michael Mann’s behavior: It is worth noting that Mann’s temperature reconstruction is not what we are (or at least I am) talking about when we refer to climate modeling. It is not a climate model; it is a proxy temperature reconstruction. As such, it is much more of a statistical process of fitting to data and issues such as tests of robustness to having certain data included or not included do become much more important. (However, despite what others have claimed, Mann et al. have always discussed these issues, even going back to this paper.)