The Internet archive just crawls sites that have a robots.txt file present, basically sites that want to be found. It used the same fundamental concept of how search engines found websites in the old Internet before they got more algorithmic.
There may be easy “tap points” where you could just have a few locations tapped and you magically get every phone call (I don’t know enough about the telecommunications system), but it’s fundamentally different from how the Internet archive works either way.
There was a discussion in Cafe Society recently about Terminator 1, where someone claimed the Terminator should have just hacked the phone system and waited until Sarah Connor made a phone call in order to track her (as opposed to the method that actually worked that he ended up using–it was a silly thread), and someone piped in with information on how the nature of how phone calls works makes that essentially and impossibility. Maybe now that the old POTS network is mostly (entirely?) replaced by fiber it’s a different beast, though.
But yeah, the guy who wrote a web crawler to crawl sites that specifically wanted crawled would in fact not impress me as an expert on how to record every phone call in the world that passes through the U.S. physical infrastructure (which is apparently 80% of all global calls.) And in fact, he doesn’t attempt to explain the collection at all, he simply estimates (using true “envelope” math of no particular validity, in which he assumes total calling time based on his personal family’s monthly calling minutes) the amount of storage it would require and then quotes standard pricing for that amount of storage.
So again, doesn’t address my point. But maybe if you try to repeat it a third time without addressing my point I’ll buy it at that point and agree with you, so go ahead and try that out.