Where is the data on the dark web stored?

Saffer · February 8, 2013, 5:59am

Motivated by Cecil’s recent column:
Link

Specifically, what servers store the data from the hidden wiki/silk road etc.
Someone must be providing a facility to store this stuff.
The Tor browser knows how to get there, how does it do it?
I get that the user’s connection enters the network, is bounced around for a bit, and then exits at the ultimate destination, but surely the browser has to know what that destination is. Can’t the authorities work out where these websites are coming from?

Derleth · February 8, 2013, 6:09am

Nobody knows. It could be in Bangladesh, it could be next door.

True.

By asking a computer which doesn’t know, but can ask another computer, which can ask another, and so on, until, finally, it reaches the right one.

No. As I alluded to above, the whole point of Tor is onion routing, which works like this:

The Tor Project:

The idea is similar to using a twisty, hard-to-follow route in order to throw off somebody who is tailing you — and then periodically erasing your footprints. Instead of taking a direct route from source to destination, data packets on the Tor network take a random pathway through several relays that cover your tracks so no observer at any single point can tell where the data came from or where it’s going.

To create a private network pathway with Tor, the user’s software or client incrementally builds a circuit of encrypted connections through relays on the network. The circuit is extended one hop at a time, and each relay along the way knows only which relay gave it data and which relay it is giving data to. No individual relay ever knows the complete path that a data packet has taken. The client negotiates a separate set of encryption keys for each hop along the circuit to ensure that each hop can’t trace these connections as they pass through.

It’s like going from Chicago to New York City via Dallas, Los Angeles, Seattle, and Miami to cover your tracks, picking a different identity and a new credit card for each flight. In addition, the data itself (the web page, in this case) is encrypted, so someone monitoring your connection can’t tell what you’re doing.

If they could, they’d be shut down by now.

Jragon · February 8, 2013, 6:12am

I’ve heard that some of the sites are distributed, so the whole doesn’t come from any single server (and if one server is taken down another can pick up the slack).

ETA: Not that this is particularly uncommon in the web world. Hell, Youtube is distributed across multiple servers, it’s just that for Youtube it’s load management more than obfuscation.

grude · February 8, 2013, 6:48am

Silkroad is hosted on a normal web server somewhere, just like all websites. The difference is it is not serving pages over HTTP protocol on port 80 like normal websites, instead it only serves pages over the .tor protocol which obfuscates both ends of the transaction so not only are you not able to know the servers IP it also cannot see yours.

There is a weakness in this though, if enough nodes on the .tor network were set up by the DEA they might be able to use traffic analysis to start getting an idea of where Silkroad is hosted. One also wonders if the data center where it is hosted could figure it out, they would have a web server with an incredible amount of traffic mostly to .tor nodes. Even if the disk is totally encrypted the amount of traffic only to .tor nodes could be suspicious.

EDIT:This assumes Silkroad is one server, it very well could be several.

Saffer · February 8, 2013, 1:20pm

Let’s continue this analogue if possible. In the above I would still need to know that I am going to NYC, and I need to know where NYC is. Can you explain how I obtain this information.

Hail_Ants · February 8, 2013, 6:59pm

[small hijack]

Is this really a new column by Cecil? Cause I know I’d read all about this (Dark Web) before and I could have sworn it was his column. Maybe it was a post here a while back?

Der_Trihs · February 8, 2013, 7:45pm

I vaguely recall reading about it on this site (or from a link I got here?), but don’t recall this specific column. It must have been something else.

Drum_God · February 8, 2013, 8:02pm

Not a computer expert, just continuing the analogy. Why would you need to know where NYC is? You just need to know how to find an airplane going to your next destination. The pilot needs to know where the destination is, but he doesn’t need to know that you’re on the plane or what your business is when you arrive at the destination. Ultimately, you board a flight bound for NYC. The pilot knows where NYC is, but he doesn’t know why you want to go there.

black_rabbit · February 8, 2013, 8:33pm

Broadly speaking, .onion sites run their services bound to localhost, and proxy those services through a plain old Tor stack that’s remarkably similar to the one you use as a client. There are a few other steps related to registering the service “address” with the shared directory service, but since everybody’s speaking Tor, nobody knows for sure which request is coming from an endpoint and which is coming from a relay.

engineer_comp_geek · February 8, 2013, 8:45pm

You may be thinking of this thread:

njtt · February 8, 2013, 8:48pm

Would there really be “an incredible amount of traffic”? Most people have never heard of Silk Road, and I should imagine that most people who have heard of it (such as me - admittedly I only learned of it from another thread on this board, last year) don’t use it (and probably don’t know how to reach it). Even most of those who do use it probably only do so occasionally, as is the case with legitimate e-commerce sites.

Der_Trihs · February 8, 2013, 9:24pm

I think that was what I recall.

tellyworth · February 8, 2013, 9:38pm

I believe that to the data centre, a Tor hidden service would look just like a regular Tor node.

With some further work the operators could distribute parts of the service amongst several data centres.

Saffer · February 9, 2013, 6:26am

There’s a link in my first post.

Saffer · February 9, 2013, 6:34am

I don’t think there is a pilot in this analogy. Unless you consider each node along the way to be a pilot, but personally I think that stretches the analogy.

Let me hazard a guess:
I need to get to destination A.
I randomly pick a node B and ask are you A or do you know A. No, try C.
Ask C the same questions, then D, E, F … until some node says yes.

This is clearly wrong because it does not guarantee that I will find B and even if I do, I will know where B is. Which defeats the objective.

grude · February 9, 2013, 6:43am

Well I was speaking relatively, obviously it is minuscule compared to the traffic ebay.com generates. They could never know for sure they could just suspect.

How many .tor hidden service sites are out there? What is the average amount of traffic they generate? If Silkroad is the most popular by far…

moriah · February 9, 2013, 4:17pm

If you’re a .tor hidden service, please speak up so you can be counted.

Hail_Ants · February 11, 2013, 1:41am

Yeah, I think it was one of the linked articles in it that reminded me of the current Cecil column…

sisu · February 11, 2013, 2:38am

Yeah it’s hard to say how much traffic there would be but I guess it would so small it would be hard to track given what I have read there was $22m in sales last year. That’s nothing in the scheme of things really…

snailboy · February 11, 2013, 4:34am

How about a different analogy? You have a group of people with some people looking for information and some people offering information. The people offering information don’t want to make it clear to everyone what they’re offering because it’s illegal information and the same with the people looking for information. So you write a message with a certain request for information and give it to a random person. Unless that person happens to be the person with the information, they will then give the message to a random other person, and so on. Eventually the message makes its way to the person with the information, so they write a response and give it back to the person who handed it to them, as does everyone else until it gets back to you. No one who passes the message has anyway of knowing if the person they received it from is actually the person who wrote the message or just another person passing it along, and they can’t know if the person they give it to is the intended recipient or just someone that’s going to pass it further along. Still, you asked for certain information and it came back from the person who was supposed to respond, even though you don’t know who that person is and he doesn’t know who you are.

Topic		Replies	Views
Underground/hidden internet Factual Questions	68	49769	August 31, 2012
Uses for Tor and Darkwebs? Cecil's Columns/Staff Reports	14	13592	February 25, 2013
How Do I Access The "Deep Web""TOR"technology Factual Questions	19	8395	November 14, 2013
Tell me about Tor / the deep web Factual Questions	10	6545	August 15, 2013
Do you surf the "Deep Web" ? In My Humble Opinion	27	4513	June 14, 2016

Where is the data on the dark web stored?

Related topics