How does Microsoft keep anything in Windows a secret?

aldiboronti · June 11, 2006, 6:03pm

OK, I just know this is a dumb question, but I need some ignorance dispelled here.

I recall instances where some proprietary part of Windows has been leaked on to the net, allowing hackers to attack vulnerabilities, etc.

My question (and here comes the dumb bit): how is anything in Windows a secret when the code is all there on any computer that uses it, and can presumably be disassembled and examined at leisure.

What am I missing?

Revtim · June 11, 2006, 6:14pm

What gets leaked is source code, which is different from the runnable software that the computer actually executes. Software source code is written in higher-level languages such as C, C++, C#, etc., which are much easier to understand than the executable code they are compiled into.

The runnable software can be reverse-engineered into source code, but the result of that process is still pretty hard to understand.

ultrafilter · June 11, 2006, 6:34pm

Here’s a short source code sample:


#include <cstdio>   // printf
#include <cstdlib>  // abs
#include <tchar.h>

int Tally[6] = { 0, 0, 0, 0, 0, 0 };

int min( int a, int b )
{
	return ( a + b - abs( a - b ) ) / 2;
}

void FindMin( int a, int b, int c, int d )
{
	Tally[ min( min( a, b ), min( c, d ) ) - 1 ]++;
}

int _tmain( int argc, _TCHAR* argv[] )
{
	for ( int i = 1; i < 7; ++i )
	{
		for ( int j = 1; j < 7; ++j )
		{
			for ( int k = 1; k < 7; ++k )
			{
				for ( int l = 1; l < 7; ++l )
				{
					FindMin( i, j, k, l );
				}
			}
		}
	}

	for ( int n = 1; n < 7; ++n )
	{
		printf( "%d: %d\n", n, Tally[n - 1] );
	}

	return 0;
}

And here’s what that actually looks like as an executable:


MZ       ÿÿ  ¸       @                                   è   º ´	Í!¸LÍ!This program cannot be run in DOS mode.$       æ¯ "¢ÎNq¢ÎNq¢ÎNq§Âq¡ÎNq§Â.q£ÎNq§ÂAq£ÎNq§Âq¯ÎNq!ÆqÎNq¢ÎOq¶ÎNq§Â*qÎNq§Âq£ÎNqRich¢ÎNq                PE  L ´aŒD        à a
                      @                      @                                       d!  <                                   `                              ¸   H               X                           .text   „                          `.rdata  *                        @  @.data   L    0                    @  À                                                                                                                                                                                                                                                                                                                                                                                                                                        j(h€ @ èt  3ÿWÿ  @ f8MZu‹H<È9PE  u·A=  t=  t‰}äë'ƒ¹„   vò3À9¹ø   ëƒytvâ3À9¹è   •À‰Eä‰}üjÿ8 @ Yƒ
D0@ ÿƒ
H0@ ÿÿ4 @ ‹
$0@ ‰ÿ0 @ ‹
 0@ ‰¡, @ ‹ £@0@ è.  èÉ  9=0@ uh|@ ÿL @ Yèž  h0@ h0@ è‰  h @ èå   ¡0@ ‰EÜEÜPÿ50@ EàPEØPEÔPÿ  @ ƒÄ ‰EÌ;Ç}jè„   Yh0@ h 0@ è:  ÿ @ ‹Mà‰ÿuàÿuØÿuÔèà  ƒÄ‹ð‰uÈ9}äuaVÿ @ ÿ @ ë-‹Eì‹‹	‰MÐPQè(   YYÃ‹eè‹uÐƒ}ä uaVÿ @ ÿ( @ ƒMüÿ‹Æè$  Ãÿ% @ ÿ% @ ƒ=H0@ ÿuÿ%D @ hD0@ hH0@ ÿt$è  ƒÄÃÿt$èÑÿÿÿ÷ØeÀ÷ØYHÃjh @ è˜   ÇEäX!@ }äX!@ s"ƒeü ‹Eä‹ …ÀtÿÐëa3À@Ã‹eèƒMüÿƒEäëÕèœ   Ãjh @ èT   ÇEä`!@ }ä`!@ s"ƒeü ‹Eä‹ …ÀtÿÐëa3À@Ã‹eèƒMüÿƒEäëÕèX   Ãÿ%$ @ h   h   è_   YYÃ3ÀÃÌhÌ@ d¡    P‹D$‰l$l$+àSVW‹Eø‰eèP‹EüÇEüÿÿÿÿ‰EøEðd£    Ã‹Mðd‰
    Y_^[ÉQÃÿ%< @ ÿ%@ @ ÿ%H @ ‹Æ+Á™W‹ø3ú+ú+Ç™+ÂÑø_ÃVW‹ðèßÿÿÿ‹L$‹t$‹øèÐÿÿÿ‹Ï‹ðèÇÿÿÿ…$0@ ÿ _^ÃSUV3íWE3ÛC3ÿG3öFSU‹Î‹Çè¸ÿÿÿFƒþaYY|íGƒÿa|äCƒûa|ÛEƒýa|Ò3ÿG¾(0@ ÿ6Wh¬ @ ÿP @ ƒÄƒÆGþ@0@ |ã_^]3À[Ã                                                                                                                            #      "   "  ."  8"  @"  P"  ^"  n"  "  Ž"  ž"  ®"  ¼"  Î"  â"  ð"  ú"  z"  ø!                  ´aŒD       A    !   	      ÿÿÿÿ_@ s@     ÿÿÿÿ	@ 
@     ÿÿÿÿM@ Q@ %d: %d
     H                                                           0@ P!@    RSDS©½PÍp B¹¹ŒýhEÿ   d:\sandbox\dicetest\Release\dicetest.pdb                Ì                  ¨!          "     !          #                          #      "   "  ."  8"  @"  P"  ^"  n"  "  Ž"  ž"  ®"  ¼"  Î"  â"  ð"  ú"  z"  ø!      ìprintf  MSVCR71.dll Ê _c_exit ú _exit K _XcptFilter Í _cexit  —exit  | __p___initenv Â _amsg_exit  n __getmainargs ?_initterm Ÿ __setusermatherr  » _adjust_fdiv  ‚ __p__commode  ‡ __p__fmode  œ __set_app_type  ñ _except_handler3  k __dllonexit ¸_onexit Û _controlfp  wGetModuleHandleA  KERNEL32.dll                                                                                                                                                                                                                                           Næ@»

I wrote that code as a test of some kind, and I don’t really remember what it does. Figuring it out from the executable is significantly more difficult, as you might imagine.

chrisk · June 11, 2006, 6:43pm

Just to add this: ‘disassembly’ of an executable is a very basic transform of the machine code instructions into marginally human-readable commands, taking the form of a huge mass of instructions like ‘load from memory address 55323 into register 17. Add registers 12 and 19 into register 3. If register 7 is anything but zero, branch ahead 234 lines. Store from register 22 into memory address 28341. Now jump to line 48124.’

This will tell you, kind of, the stuff that a program is actually doing and how it is doing it, but in such a disorganized way that it’s hard to track the significance of anything that’s going on… you can see plenty of trees, but can’t get the lay of the forest.
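To make that concrete, here is a minimal sketch of what a disassembler does at its core: it maps raw bytes to mnemonics one instruction at a time, with no idea what the program is for. The 3-byte instruction format and the opcode numbers below are made up for illustration; this is not x86 or any real instruction set, and the function names are hypothetical.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Toy encoding (an assumption for this sketch): every instruction is
// three bytes: opcode, operand A, operand B.
std::string disassemble_one(std::uint8_t op, std::uint8_t a, std::uint8_t b) {
    const std::string sa = std::to_string(static_cast<int>(a));
    const std::string sb = std::to_string(static_cast<int>(b));
    switch (op) {
        case 0x01: return "load  r" + sa + ", [" + sb + "]";  // memory -> register
        case 0x02: return "add   r" + sa + ", r" + sb;        // register += register
        case 0x03: return "jnz   r" + sa + ", +" + sb;        // branch if non-zero
        case 0x04: return "store [" + sb + "], r" + sa;       // register -> memory
        default:   return "db    " + std::to_string(static_cast<int>(op));  // unknown byte
    }
}

// Translate a byte buffer into one text line per complete instruction.
// Trailing bytes that do not form a full instruction are ignored.
std::vector<std::string> disassemble(const std::vector<std::uint8_t>& code) {
    std::vector<std::string> lines;
    for (std::size_t i = 0; i + 2 < code.size(); i += 3) {
        lines.push_back(disassemble_one(code[i], code[i + 1], code[i + 2]));
    }
    return lines;
}
```

Feeding it the bytes `{0x01, 17, 55, 0x02, 3, 12}` gives two lines, a load and an add. Each line is accurate on its own, but nothing in the output says why those registers matter.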

Some programming languages have had decompilers worked out, which can go one step further than disassembly and make a guess at expression statements, if-then and looping structures, functions and subprocedures. However, certain elements of the source code never make it into the compiled program, such as programmer comments, variable names, and function and subprocedure names. Thus, decompilers can’t even see the comments, and they make up their own variable names etcetera. This means that it can still be enormously difficult to figure out what the decompiled source code was trying to do.
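As a rough illustration of the naming problem, here is the helper from ultrafilter’s sample next to the sort of thing a decompiler might hand back. The second function is a hand-written guess at the style, not output from any real tool, and the names `smaller_of`, `sub_401000`, `a1`, `v1` and `check_equivalence` are all invented for this sketch.

```cpp
#include <cassert>
#include <cstdlib>

// As a programmer might write it: the name says what the function is for.
int smaller_of(int a, int b) {
    return (a + b - std::abs(a - b)) / 2;
}

// Hypothetical decompiler-style rendering of the same logic. The
// arithmetic survives; the meaning carried by the names does not.
int sub_401000(int a1, int a2) {
    int v1 = a1 - a2;
    int v2 = (v1 < 0) ? -v1 : v1;  // absolute value, spelled out
    return (a1 + a2 - v2) / 2;
}

// Both versions agree on every pair of values from 1 to 6.
void check_equivalence() {
    for (int a = 1; a <= 6; ++a)
        for (int b = 1; b <= 6; ++b)
            assert(smaller_of(a, b) == sub_401000(a, b));
}
```

The two functions compute the same thing, but only the first one tells you it is picking the smaller of two numbers.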

(Heck, sometimes it’s even hard to tell that with the original source code, depending on how good the author was as far as writing self-documenting code - choosing variable names that mean something, etcetera.)
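For comparison, here is a sketch of how ultrafilter’s sample might read with names that mean something. It assumes, from the loop bounds of 1 to 6 and the `dicetest` name visible in the executable dump, that the program tallies the smallest face over every roll of four six-sided dice; that reading is a guess, and the function and variable names are invented for illustration.

```cpp
#include <algorithm>
#include <array>

// Count, for every possible roll of four six-sided dice, which face was
// the smallest one showing. Index 0 holds the count for a minimum of 1,
// index 5 the count for a minimum of 6.
std::array<int, 6> tally_minimum_of_four_dice() {
    std::array<int, 6> rolls_with_minimum{};  // all counts start at zero
    for (int die1 = 1; die1 <= 6; ++die1)
        for (int die2 = 1; die2 <= 6; ++die2)
            for (int die3 = 1; die3 <= 6; ++die3)
                for (int die4 = 1; die4 <= 6; ++die4) {
                    const int smallest = std::min({die1, die2, die3, die4});
                    ++rolls_with_minimum[smallest - 1];
                }
    return rolls_with_minimum;
}
```

The logic is the same as in the original, and the six counts add up to 6 × 6 × 6 × 6 = 1296 rolls, but here the names do the documenting.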

Hope that helps.

aldiboronti · June 11, 2006, 7:37pm

Whew!

Considering the millions of lines of code there must be in Windows, and if it all presents in the same way as ultrafilter’s example, I can see one might have a slight problem disentangling it!

KlondikeGeoff · June 11, 2006, 7:49pm

Furthermore, people have a big problem just getting the damn OS to run properly.

SmackFu · June 11, 2006, 7:57pm

Well, you might actually be able to figure out ultrafilter’s code. As shown, the compiled version is gibberish, but if you format it properly, it makes a lot more sense. Since the logic is pretty simple, you might be able to follow it.

That said, the logic in Windows is not simple. It’s not simple what it does and it’s not simple how it does it, since the logic is spread across dozens of libraries.
