Add tcpquic.md
This commit is contained in:
commit
6ef6ccecaa
|
@ -0,0 +1,228 @@
|
|||
# Connection Latency
|
||||
|
||||
Hi! Let's say you go to visit your favorite website, and it takes 3 seconds to show up. You run a speed test, and your
|
||||
connection is doing fine at around 100 Mbps. So how come it took 3 seconds to show up? That's what connection latency is.
|
||||
|
||||
Let's say our network has 3 nodes:
|
||||
|
||||
```
|
||||
your phone ---------> our router ---------> the server
|
||||
<--------- <---------
|
||||
```
|
||||
|
||||
(This is a rather simplistic model, real networks are a lot more complicated. But it'll work for this demonstration.)
|
||||
|
||||
Let's say sending a packet between two adjacent nodes takes n seconds. This means that for a packet to go from you to
|
||||
the server, it takes 2\*n seconds (n seconds to go from you to our router, then another n to go from our router to the
|
||||
server). As such, we want to keep this n very low. Ideally, it'd be 0, but in practice we're limited by things like the
|
||||
speed of light.
|
||||
|
||||
```
|
||||
your phone ---------> our router ---------> the server
|
||||
<--------- <--DATA---
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<--DATA--- <---------
|
||||
```
|
||||
|
||||
So you might be thinking, "it takes 3 seconds for the server to send me the page?! n is 1.5s?!" No, not quite.
|
||||
|
||||
Before the server can send you anything, it first needs to know what you want. You need to tell the server what you want.
|
||||
So our n is no longer 1.5s but 0.75s instead.
|
||||
|
||||
```
|
||||
your phone ---HTTP--> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---HTTP--> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<--------- <--DATA---
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<--DATA--- <---------
|
||||
```
|
||||
|
||||
|
||||
But that's not all! You can't just ask the server to send you stuff and have it send you stuff! There are things like
|
||||
spoofing that we need to be concerned about, otherwise we'd get massive DDoS amplification attacks! Instead, meet TCP:
|
||||
|
||||
## TCP Connection
|
||||
|
||||
Transmission Control Protocol, or TCP, is the protocol used to prevent evil hackers from bringing down the internet. It
|
||||
accomplishes that by employing a 3-way handshake. So, how does it work? Well, first, you ask for a connection. This is
|
||||
called a SYN in TCP:
|
||||
|
||||
```
|
||||
your phone ---SYN---> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---SYN---> the server
|
||||
<--------- <---------
|
||||
```
|
||||
|
||||
This lets the server know you want to send data.
|
||||
|
||||
When the server receives the SYN, it then tells you that it got the SYN, and asks *you* for a connection. This is called
|
||||
a SYN-ACK:
|
||||
|
||||
```
|
||||
your phone ---------> our router ---------> the server
|
||||
<--------- <-SYN-ACK-
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<-SYN-ACK- <---------
|
||||
```
|
||||
|
||||
This lets you know the server wants to send data, and acknowledges that you want to send data. But we're not quite done yet.
|
||||
|
||||
We still need to acknowledge that the server wants to send data. So, we send an ACK:
|
||||
|
||||
|
||||
```
|
||||
your phone ---ACK---> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---ACK---> the server
|
||||
<--------- <---------
|
||||
```
|
||||
|
||||
*Now you can get your data.*
|
||||
|
||||
We took 6n to get a connection, and 2n to get our data... and another 2n to request the data. As such, 10n = 3s, or
|
||||
n = 0.3s... So if we were to simply send a packet to the server and get a similar packet back, it'd take about 1.2s. However,
|
||||
we're not quite done yet. Before your phone can talk to the server, it needs to know where the server is. When you type an
|
||||
address into the browser's address bar, that's only the name of the server - we need instructions to get the packets there.
|
||||
|
||||
This is where DNS comes in:
|
||||
|
||||
## DNS Queries
|
||||
|
||||
Domain Name System, or DNS, is the protocol that takes a domain name and converts it into an IP address - the latter is
|
||||
basically a map/instructions on how to get the packets to the destination.
|
||||
|
||||
Thankfully, DNS is usually stored in the router. Additionally, it doesn't use TCP, so there's no 3-way handshake.
|
||||
|
||||
```
|
||||
your phone ---NAME--> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<---IP---- <---------
|
||||
```
|
||||
|
||||
If the router doesn't know a name, it has to ask another router about it. However, this generally only happens once every few
|
||||
hours, so it's not something we have to worry about.
|
||||
|
||||
This adds another 2n to our time. We're up to 12n = 3s, or n = 0.25s. It only takes 1 second to send a packet to the server
|
||||
and get it back! TCP is awful! ... Not so fast, tho. You might've noticed that the network is busy with only one packet at a
|
||||
time. Maybe we can do something to improve this. Okay, we can't improve the DNS query, as it's required to happen before we
|
||||
can do anything. But can we improve the TCP? What if we terminate the TCP at the router?
|
||||
|
||||
## Terminating the TCP at the router
|
||||
|
||||
While not strictly allowed by the internet specifications, it's not strictly disallowed either. If implemented, our flow can
|
||||
look like this:
|
||||
|
||||
```
|
||||
your phone ---NAME--> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<---IP---- <---------
|
||||
|
||||
your phone ---SYN---> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---SYN---> the server
|
||||
<-SYN-ACK- <---------
|
||||
|
||||
your phone ---ACK---> our router ---------> the server
|
||||
<--------- <-SYN-ACK-
|
||||
|
||||
your phone ---HTTP--> our router ---ACK---> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---HTTP--> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<--------- <--DATA---
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<--DATA--- <---------
|
||||
```
|
||||
|
||||
We're down from 12n to only 9n! With our n = 0.25s, we've shaved off 0.75s from our original 3s! This is a noticeable
|
||||
improvement.
|
||||
|
||||
However, you might've noticed I've been talking about `HTTP` so far. Additionally, you can have both an ACK and an HTTP in
|
||||
transit at the same time, this shaves off 1n from both our original 12n and our 9n, so we have 11n = 3s and an improvement
|
||||
of approximately 0.81s. So it's even slightly better.
|
||||
|
||||
HTTPS, on the other hand, also has its own handshake after TCP's. I don't wanna get into this, because you can probably see
|
||||
how ridiculous it's getting by now. This handshake can also be partially terminated by the router, so *it* can also be
|
||||
optimized slightly, and we can shave off more n's.
|
||||
|
||||
But let's look at QUIC real quick:
|
||||
|
||||
## QUIC
|
||||
|
||||
(I don't know what QUIC stands for.)
|
||||
|
||||
QUIC is a protocol that does something similar to TCP, with one major difference: it uses UDP.
|
||||
|
||||
User Datagram Protocol, or UDP, is also used by DNS (see above). This means it has no handshake. QUIC implements its own
|
||||
handshake, on top of UDP. This means QUIC is basically like TCP, but it comes with a serious caveat: being UDP-based, it
|
||||
DOESN'T benefit from our TCP optimization from earlier!
|
||||
|
||||
As such, going QUIC over existing networks has one serious drawback: it adds back those 3n that we were able to shave off!
|
||||
And if we optimize for QUIC in addition to TCP, we still only manage to shave off those 3n again.
|
||||
|
||||
So, is there any room for improvement? Can we shave off more n's?
|
||||
|
||||
... Maybe. It would require some changes to the web. More specifically, what if the router could serve some of the content
|
||||
directly, without ever reaching the server?
|
||||
|
||||
That's where we need to change the protocols slightly:
|
||||
|
||||
## Terminating "HTTP" at the router
|
||||
|
||||
Rather than terminating just TCP at the router, can we go one step further?
|
||||
|
||||
Can we create a protocol such that the great majority of the connections look more like this:
|
||||
|
||||
```
|
||||
your phone ---NAME--> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<---IP---- <---------
|
||||
|
||||
your phone ---SYN---> our router ---------> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---SYN---> the server
|
||||
<-SYN-ACK- <---------
|
||||
|
||||
your phone ---ACK---> our router ---------> the server
|
||||
<--------- <-SYN-ACK-
|
||||
|
||||
your phone --NHTTP--> our router ---ACK---> the server
|
||||
<--------- <---------
|
||||
|
||||
your phone ---------> our router ---------> the server
|
||||
<--DATA--- <---------
|
||||
```
|
||||
|
||||
(shave off another n if you combine the ACK and the NHTTP)
|
||||
|
||||
We just managed to shave off another 2n! While this requires extensive changes to the existing infrastructure, the load
|
||||
times go from the 2.25s/2.19s from our "Terminating TCP at the router" to an even lower 1.64s! This is almost half the
|
||||
original 3s! However, this improvement is not as perfect as our "Terminating TCP at the router" and "Partially terminating
|
||||
HTTPS at the router" - you want your private data to go encrypted all the way to the server, so anything dealing with
|
||||
private data would be back to the original 3s/2.25s/2.19s depending on optimizations. This is okay tho, as most data on the
|
||||
web - images, videos, HTML (page layout/behaviour), CSS (also page layout), Javascript (also page behaviour) - are generally
|
||||
not private. For example, your neighbor probably watches the same videos as you - thus the videos are not private - but your
|
||||
bank statement is exclusive to you - and as such, private.
|
Loading…
Reference in New Issue