Principles of Computer Systems

Autumn 2019

Stanford University

Computer Science Department

Instructors: Chris Gregg

Philip Levis

PDF of this presentation

Lecture 16: Networks, HTTP, Proxies and MapReduce

Computer networks are a type of computer system
Most exciting systems and applications today are networked
Today we'll look under the covers a bit about how networks are designed and work
- Layering and encapsulation
- The 7 OSI layers
This lecture will just skim the surface: if you want to learn more, take CS144

Computer Networks

Computer networks mean your application is no longer limited to the data that it has locally on your own machine
- It can receive data from and send data to other machines
- This data can be interactive (people), not just documents
Today, when we talk about computer networks we generally mean the Internet
- There were many networking technologies before and concurrent with the Internet
- The Internet won! Thankfully. Ever heard of IPX, SNA, AppleTalk?
Operating systems provide two basic abstractions to the Internet
- A reliable, in-order byte stream to an application running on another computer
  - Provided by TCP (Transmission Control Protocol)
  - Used for: ssh, http, email, NFS, etc.
- An unreliable message (finite datagrams) service to an application running on another computer
  - Provided by UDP (User Datagram Protocol)
  - Used for DNS, some latency-sensitive applications (voice, video, gaming)

The Basic Benefit and Two Abstractions

Computer networks mean your application is no longer limited to the data that it has locally on your own machine
- It can receive data from and send data to other machines
- This data can be interactive (people), not just documents
Today, when we talk about computer networks we generally mean the Internet
- There were many networking technologies before and concurrent with the Internet
- The Internet won! Thankfully. Ever heard of IPX, SNA, AppleTalk?
Operating systems provide two basic abstractions to the Internet
- A reliable, in-order byte stream to an application running on another computer
  - Provided by TCP (Transmission Control Protocol)
  - Used for: ssh, http, email, NFS, etc.
- An unreliable message (finite datagrams) service to an application running on another computer
  - Provided by UDP (User Datagram Protocol)
  - Used for DNS, some latency-sensitive applications (voice, video, gaming)

The Basic Benefit and Two Abstractions

Reliable, In-Order Byte Stream

A TCP connection behaves much like a bidirectional pipe
- Both sides can read, both sides can write
- Establishing a connection has nothing to do with forking: two programs can be on different computers across the globe
- Usually, one computer (a server) waits for connection requests from clients
  - You've seen this with connect and accept
  - Other approaches are for edge cases and we won't worry about them
Making TCP work as well as it does has taken 30 years of continual research and engineering: people still publish papers on it (e.g., Keith Winstein, Balaji Prabhakar)
- One example: TCP was originally designed for wide area networks, where nodes are milliseconds apart; making it work well in the datacenter, where nodes are microseconds apart, required changing a few of its algorithms (Balaji).
- Another example: TCP is designed to be extremely robust and work in all kinds of network conditions. Can you tune TCP to perform better in certain network settings (e.g., LTE vs. WiFi vs. wired) and how much improvement will you see (Keith)?

Web: HTTP over Reliable, In-Order Byte Stream

Internet

Web: HTTP over Reliable, In-Order Byte Stream

Internet

open a connection

Web: HTTP over Reliable, In-Order Byte Stream

Internet

 GET / HTTP/1.1

write a request

Web: HTTP over Reliable, In-Order Byte Stream

Internet

 GET / HTTP/1.1

read the request

Web: HTTP over Reliable, In-Order Byte Stream

Internet

 HTTP/1.1 200 OK
 document data

write the response

Web: HTTP over Reliable, In-Order Byte Stream

Internet

 HTTP/1.1 200 OK
 document data

read the response

BitTorrent: Organizing a Swarm

Internet

Client learns locations of nodes storing "chunks" of file
Opens connections to these nodes, uses a "tit-for-tat"
algorithm to share data
Each connection is a reliable, in-order byte stream
- Might not actually be TCP for some low-level reasons
- Modularity and layering: has identical API to
  TCP, just behaves a little differently so it doesn't
  wreck your video chat

BitTorrent peers

Client

Internet: Under the Covers

Internet

It's not a magic cloud (although it sometimes seems to be)
When you make a web request, there are four layers of name in play
- Application layer: name of document (e.g., index.html)
- Transport layer: port identifying what service (e.g., HTTP on port 80)
- Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
- Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Internet in 4 Layers

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Internet in 4 Layers

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Internet in 4 Layers

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Internet in 4 Layers

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Internet in 4 Layers

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Internet in 4 Layers

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Internet in 4 Layers

Internet

Application layer: name of document (e.g., index.html)
Transport layer: port identifying what service (e.g., HTTP on port 80)
Network layer: global IP address identifying which computer (e.g., 171.67.76.12)
Link layer: MAC address specifying which device on a link (e.g., 00:25:90:e7:10:48)

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Hop by Hop

Internet

A packet from one node to another takes many hops to get to its destination
At each hop, the packet is sent to the link layer address of the next hop
Each hop checks the IP (network) address to see if it's the final destination
- If no, forward to next hop using a new link layer destination
- If yes, bubble up to transport and potentially application layers

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

Hop by Hop

Internet

A packet from one node to another takes many hops to get to its destination
At each hop, the packet is sent to the link layer address of the next hop
Each hop checks the IP (network) address to see if it's the final destination
- If no, forward to next hop using a new link layer destination
- If yes, bubble up to transport and potentially application layers

Application

Transport

Network

Link

Network

Link

Network

Link

Network

Link

Application

Transport

Network

Link

How many hops do you think it is from here to MIT? UC Berkeley?

7 Layer Model of Networks

Application

Presentation

Session

Transport

Network

Link

Physical

Application level data: HTTP

Let's not worry about this one

A communication session: TLS

Application: TCP, UDP ports

Node: IP address

A network interface: MAC

How you encode bits

Layering and Modularity

One of the seven principles of system design
Layering: decomposing a system into a well defined set of services, each with a clear interface it provides above and an interface it uses below
- Internet and network protocols are the canonical example
Transmission Control Protocol (TCP)
- Provides a bidirectional, reliable, in-order byte stream above
- Uses an unreliable datagram protocol below
If layers meet these interfaces very precisely, they are modular: they can be interchanged
- TCP has many variants: NewReno, CUBIC, Compound, DCTCP
- Your application can use any of them, they all provide the same interface
- They all use the Internet Protocol: IP
- They are interchangeable and so are modular
- TLS (Transport Layer Security) is a session layer protocol that secures TCP: it looks likes a bidrectional, reliable, in-order byte stream, so it can often be swapped in for TCP (e.g., https: instead of http:)

AMA about networks

Example Layer 7 Protocol: HyperText Transfer Protocol (HTTP)

HTTP/1.x Properties

HTTP is an ASCII protocol: protocol is in text, not binary values
- In contrast to IP and TCP, which use binary: a 32-bit IP address is a 32-bit number
- Makes everything much simpler and easier to debug!
- ASCII is expensive, but if you're sending large files it's a small overhead
- HTTP/2.0 is binary, trying to optimize things for mobile devices
Basic model: request, response

HTTP/1.x Request

Three principal methods
- GET: fetch a resource (page, image, etc.)
- HEAD: fetch only the headers (useful to see if the resource has changed)
- POST: send data

HTTP/1.x Response

Five classes of status codes
- 1xx: hold on
- 2xx: here you go
- 3xx: go away
- 4xx: you screwed up
- 5xx: I screwed up

Let's Look at a Request/Reponse in telnet and Firefox

partysaur:~ pal$ telnet www.scs.stanford.edu 80
Trying 2001:470:806d:1::9...
Connected to www.scs.stanford.edu.
Escape character is '^]'.
GET / HTTP/1.1
Host: www.scs.stanford.edu

HTTP/1.1 200 OK
Connection: keep-alive
Content-Length: 5624

HTTP Web Proxy

A proxy server acts as a go-between a node and other nodes on the Internet
- Clients opt-in to a proxy server (we'll see in a moment)
- A web proxy handles client's HTTP requests and responds to them
Proxy servers can be useful
- Cache requests for static data (e.g., images) so it can later serve local copies rather than re-request from the web (happens a lot in mobile devices), welcome to 1999
- Intercept image requests to scale them down/make them more efficient (e.g., Google Flywheel)
If clients don't opt-in (it happens automatically), it's called a middlebox
- Redirect to a paywall (e.g., what happens at airports)
- Require agreeing to terms of service (e.g., Stanford Visitor)
- Block access to certain websites (e.g., YouTube)
- Block access to certain documents (big documents, .zip files, video streams.)

Web Proxy

Internet

Proxy Server

Client

Web Server

Client explicitly asks proxy server to fetch web page for it
- Proxy requests it from the server (or maybe another proxy server), maybe modifies it
- Or, proxy gives a local cached copy

Middlebox

Client

Web Server

Client thinks it's requesting from the web server
- Middlebox interposes, responds to request looking like web server
The difference between proxy and middlebox is the IP address to which the client sends its request. With a proxy, the client sends requests to the proxy. With a middlebox, it sends requests to the web server and the middlebox masquerades as the web server.

Middlebox

Client

Web Server

Client thinks it's requesting from the web server
- Middlebox interposes, responds to request looking like web server
The difference between proxy and middlebox is the IP address to which the client sends its request. With a proxy, the client sends requests to the proxy. With a middlebox, it sends requests to the web server and the middlebox masquerades as the web server.

We have built a very basic proxy for you, which you are going to turn into a full web proxy
To test it, you need to set up Firefox (or another browser, but we suggest Firefox) to forward all web requests to the proxy:

Next Assignment: HTTP Web Proxy

To set up Firefox, go to Firefox->Preferences, then type "proxy" in the search box, then click on settings. You should have a window as above. Then, click on "Manual proxy configuration," and type in the myth machine and port number you get after starting your proxy:

$ ./proxy
Listening for all incoming traffic on port 19419.

Make sure you also select the checkbox for "Use this proxy server for all protocols."

Proxy requests use the full URL:
When a proxy server receives such as request, it forwards the request to www.cornell.edu, with the first line rewritten as follows:

Web Proxy Requests

GET http://www.cornell.edu/research/ HTTP/1.1

GET /research/ HTTP/1.1

Internet

Proxy Server

Client

Web Server

GET http://www.cornell.edu/research/ HTTP/1.1

GET /research/ HTTP/1.1

Properly proxying requires a couple of extra headers
x-forwarded-for: allows the proxy to tell the server who is making the request
- Should be filled in with the IP address of the requesting client
- If x-forwarded-for is already present, extend its value into a comma-separated chain of IP addresses the request has passed through before arriving at your proxy. The IP address of the machine you’re directly hearing from would be appended to the end.
  - E.g., if your proxy receives a request from 172.27.76.12 with
  - it will rewrite it to be
x-forwarded-proto: allows a server sitting behind a load-balancing proxy to be sure that the client request to the proxy was secure (https: instead of http:)
- We're not going to worry about this use case, but we have to support it
- Set its value to be http. If x-forwarded-proto is already included in the request header, then simply add it again.
- Here's a nice explanation

HTTP Web Proxy Headers

x-forwarded-for: 172.27.76.96

x-forwarded-for: 172.27.76.96, 172.27.76.12

AMA about HTTP and proxies

The Genesis of Datacenter Computing: MapReduce

Problem for large service providers such as Google: computation requires hundreds or thousands of computers
- How do you distribute the computation?
- Distributing the computation boils down to distributing data
- Nodes fail, some nodes are slow, load balancing: difficult to make it work well
System came from observation that many early Google systems had a common pattern
Designing a computing system around this pattern
- allows the system (written once) to do all of the hard systems work (fault tolerance, load balancing, parallelization)
- allows tens of thousands of programmers to just write their computation
Canonical example of how the right abstraction revolutionized computing
- An open source version immediately appeared: Hadoop

Core Data Abstraction: Key/Value Pairs

Take a huge amount of data (more than can fit in the memory of 1000 machines)
Write two functions:
Using these two functions, MapReduce parallelizes the computation across thousands of machines, automatically load balancing, recovering from failures, and producing the correct result.
You can string together MapReduce programs: output of reduce becomes input to map.
Simple example of word count (wc):

map(k1, v1) -> list(k2, v2)

reduce(k2, list(v2)) -> list(v2)

map(String key, String value):
  // key: document name
  // value: document contents
  for word w in value:
    EmitIntermediate(w,"1")
    
reduce(String key, List values):
  // key: a word
  // values: a list of counts
  int result = 0
  for v in values:
    result += ParseInt(v)
  Emit(AsString(result))

"The number of partitions (R) and the partitioning function are specified by the user."

("the", "1") , ("the", "1"), ("The", "1"), ("of", 1),

(number, "1"), ...

"the", ("1", "1") -> "2"

"The", ("1") -> "1"

"of", ("1") -> "1"

"number", ("1") -> "1"

map output

input

reduce

Key/Value Pairs: How and Where

Keys allow MapReduce to distribute and parallelize load
In the original paper...
- Each mapper writes a local file for each key in k2, and reports its files to a master node
- The master node tells the reducer for k2 where the all of the k2 files are
- The reducer reads all of the k2 files from the nodes that ran the mappers and writes its own output locally, reporting this to the master node
- There have been lots of optimizations since
Core abstraction: data can be partitioned by key, there is no locality between keys
One problem: how much do you parallelize load?
- One map per input file, one reduce per key, is too fine grained (small input)
- Split the key space into R partitions: partition = (hash(key) % R)
- Jeff Dean rule of thumb: if you have N nodes, R >= 10N

map(k1, v1) -> list(k2, v2)

reduce(k2, list(v2)) -> list(v2)

MapReduce System Architecture

Table From Paper (August 2004)

Google's IPO was September 2004

AMA about MapReduce

CS110 Lecture 16: Networks, HTTP, Proxies, and MapReduce

By philip_levis

CS110 Lecture 16: Networks, HTTP, Proxies, and MapReduce

2,844

Lecture 16: Networks, HTTP, Proxies and MapReduce

Computer Networks

The Basic Benefit and Two Abstractions

The Basic Benefit and Two Abstractions

Reliable, In-Order Byte Stream

Web: HTTP over Reliable, In-Order Byte Stream

Web: HTTP over Reliable, In-Order Byte Stream

Web: HTTP over Reliable, In-Order Byte Stream

Web: HTTP over Reliable, In-Order Byte Stream

Web: HTTP over Reliable, In-Order Byte Stream

Web: HTTP over Reliable, In-Order Byte Stream

BitTorrent: Organizing a Swarm

Internet: Under the Covers

Internet in 4 Layers

Internet in 4 Layers

Internet in 4 Layers

Internet in 4 Layers

Internet in 4 Layers

Internet in 4 Layers

Internet in 4 Layers

Internet in 4 Layers

Hop by Hop

Hop by Hop

7 Layer Model of Networks

Layering and Modularity

AMA about networks

Example Layer 7 Protocol: HyperText Transfer Protocol (HTTP)

HTTP/1.x Properties

HTTP/1.x Request

HTTP/1.x Response

Let's Look at a Request/Reponse in telnet and Firefox

HTTP Web Proxy

Web Proxy

Middlebox

Middlebox

Next Assignment: HTTP Web Proxy

Web Proxy Requests

HTTP Web Proxy Headers

AMA about HTTP and proxies

The Genesis of Datacenter Computing: MapReduce

The Genesis of Datacenter Computing: MapReduce

Core Data Abstraction: Key/Value Pairs

Key/Value Pairs: How and Where

MapReduce System Architecture

Table From Paper (August 2004)

AMA about MapReduce

CS110 Lecture 16: Networks, HTTP, Proxies, and MapReduce

More from philip_levis