Load balancer
What
They’re the magic sauce that makes horizontal scaling possible. They route incoming requests to one of many application servers (typically clones / mirror images of each other) and send the response from the app server back to the client.
Any one of them should process the request the same way so it’s just a matter of distributing the requests across the set of servers so none of them are overloaded.
Problem: When a server receives a lot of requests at once, it can slow down (throughput drops, latency rises). After a point it may even fail (no availability).
Solution: You can give the server more muscle power (vertical scaling) or you can add more servers (horizontal scaling).
With horizontal scaling, you have to work out how incoming requests get distributed to the various servers: which requests get routed to which servers, and how do you ensure none of them gets overloaded?
In other words, how do you balance and allocate the request load?
Load balancers are like traffic managers who direct traffic.
Load balancers can be thought of as reverse proxies.
How
A load balancer's job is to sit between the client and server (though there are other places it can be inserted) and work out how to distribute incoming requests across multiple servers, so that the end user's (client's) experience is consistently fast, smooth and reliable.
Why
Used to maintain availability and throughput.
Server Selection Strategies
How it decides how to route and allocate request traffic
Every time you add a server, you need to let your load balancer know that there is one more candidate for it to route traffic to.
If you remove a server, the load balancer needs to know that too.
The load balancer's configuration ensures that it knows how many servers it has in its go-to list and which ones are available.
The load balancer can also be kept informed of each server's load level, status, availability, current task and so on.
Approaches
A naive approach is for the load balancer to just randomly pick a server and direct each incoming request that way.
But randomness can cause problems and "unbalanced" allocations, where some servers get more heavily loaded than others, which can negatively affect the performance of the overall system.
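To make this concrete, here is a minimal sketch (Python, with made-up server addresses) of a balancer that keeps a registry of available servers and picks one at random:

```python
import random

class LoadBalancer:
    """Minimal registry: the balancer must know which servers exist."""

    def __init__(self):
        self.servers = []  # the current go-to list of available servers

    def add_server(self, address):
        # Called whenever a new server is brought up.
        self.servers.append(address)

    def remove_server(self, address):
        # Called when a server is taken down or fails a health check.
        self.servers.remove(address)

    def pick_random(self):
        # Naive strategy: pick any available server at random.
        return random.choice(self.servers)

lb = LoadBalancer()
lb.add_server("10.0.0.1")
lb.add_server("10.0.0.2")
print(lb.pick_random())  # either server; no guarantee of an even spread
```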
Round Robin and Weighted Round Robin
How we process lists that loop
You start at the first item in the list, move down it in sequence, and when you're done with the last item you loop back up to the top and start working down the list again.
Load is distributed pretty evenly across your servers, in a simple-to-understand and predictable pattern.
Adding weights to the servers
Gives preference to some servers over others.
The total traffic is split up in proportion to those weights, so each server is allocated a volume of requests proportionate to its power.
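A rough sketch of both strategies, reusing illustrative server names. Note that this naive weighted version clusters a server's turns together, whereas real implementations (nginx's smooth weighted round robin, for example) interleave them, though the proportions come out the same:

```python
import itertools

def round_robin(servers):
    # Cycle through the list forever: a, b, c, a, b, c, ...
    return itertools.cycle(servers)

def weighted_round_robin(weighted_servers):
    # Repeat each server `weight` times in the cycle, so traffic is
    # split in proportion to the weights.
    expanded = [server for server, weight in weighted_servers
                for _ in range(weight)]
    return itertools.cycle(expanded)

rr = round_robin(["a", "b", "c"])
print([next(rr) for _ in range(6)])   # ['a', 'b', 'c', 'a', 'b', 'c']

wrr = weighted_round_robin([("big", 3), ("small", 1)])
print([next(wrr) for _ in range(8)])  # "big" gets 3 of every 4 requests
```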
Load-based server selection
Work out the current capacity, performance and load of each server in the go-to list, and allocate requests dynamically according to current loads and calculations of which server will give the highest throughput, lowest latency, etc.
This is done by monitoring the performance of each server and deciding which ones can and cannot handle new requests.
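One common load-based policy is "least connections": send each new request to the server handling the fewest in-flight requests. A tiny sketch, assuming the balancer keeps a live count per server:

```python
def least_loaded(active_requests):
    # Pick the server with the fewest in-flight requests right now.
    # The counts would be kept up to date by the balancer itself
    # (incremented on dispatch, decremented on response).
    return min(active_requests, key=active_requests.get)

current = {"10.0.0.1": 12, "10.0.0.2": 3, "10.0.0.3": 7}
print(least_loaded(current))  # 10.0.0.2
```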
IP Hashing based selection
The idea is to hash the IP address of each incoming request and use the hash value to determine which server to direct the request to.
i.e. If I had 5 servers available, the hash function would be designed to return one of five values, so one of the servers definitely gets nominated to process the request.
This is useful where you want requests from a certain country or region to get data from a server best suited to address the needs of that region,
or where your servers cache requests so that they can be processed fast.
In the caching case, you want to ensure that a request goes to a server that has previously cached the same request, as this will improve speed and performance in processing and responding to that request.
If your servers each maintain independent caches and your load balancer does not consistently send identical requests to the same server, you will end up with servers re-doing work that has already been done in a previous request to another server, and you lose the optimization that comes with caching data.
Path or Service based selection
The idea is to route requests based on their "path", or the function or service being provided.
i.e. If you're buying flowers from an online florist, requests to load the "Bouquets on Special" page may be sent to one server, and credit card payments may be sent to another server.
If only one in twenty visitors actually buys flowers, you could have a smaller server processing the payments and a bigger one handling all the browsing traffic.
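A sketch of this idea, with hypothetical path prefixes and pool names for the florist example:

```python
# Hypothetical mapping of URL path prefixes to server pools, sized
# for their share of the traffic: browsing is the bulk of requests,
# payments are comparatively rare.
ROUTES = {
    "/bouquets": ["browse-1.example", "browse-2.example", "browse-3.example"],
    "/checkout": ["payments-1.example"],
}

def route_by_path(path):
    for prefix, pool in ROUTES.items():
        if path.startswith(prefix):
            return pool
    return ROUTES["/bouquets"]  # default to the browsing pool

print(route_by_path("/bouquets/on-special"))  # big browsing pool
print(route_by_path("/checkout/pay"))         # small payments pool
```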
Consistent hashing
https://www.cs.cmu.edu/~adamchik/15-121/lectures/Hashing/hashing.html
The key idea is that hashing converts an input into a fixed-size value, often an integer (the hash).
One of the key principles of a good hashing algorithm is that the function must be deterministic, which is a fancy way of saying that identical inputs will generate identical outputs when passed into the function.
Sometimes the hashing function generates the same hash for more than one input; this is not the end of the world and there are ways to deal with it.
When more than one input deterministically generates the same output, it's called a "collision".
Example
Let's say you have 5 servers to allocate load across. An easy-to-understand method is to hash each incoming request (maybe by IP address, or some other client detail) and then apply the modulo operator to that hash, where the right operand is the number of servers.
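As pseudo-code (sketched here as runnable Python; hashlib is used because Python's built-in hash() is randomized between runs for strings, and a load balancer's hash must be deterministic):

```python
import hashlib

SERVERS = ["s0", "s1", "s2", "s3", "s4"]  # 5 servers

def pick_server(client_ip):
    # Deterministically hash the request's identifying detail
    # (here, the client IP) into a big integer...
    h = int(hashlib.md5(client_ip.encode()).hexdigest(), 16)
    # ...then modulo by the server count to land on an index 0-4.
    return SERVERS[h % len(SERVERS)]

print(pick_server("203.0.113.7"))  # the same IP always lands on the same server
```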
Adding Servers, and Handling Failing Servers
Issues
We could add a sixth server, but it would never get any traffic: the modulo operand is still 5, so hash % 5 only yields values 0-4 and never selects the newly added 6th server.
The hashing function (refer to the pseudo-code snippet above) still thinks there are 5 servers, and the modulo operator generates a range of 0-4. But we only have 4 servers now that one has failed, and we are still sending the dead one traffic.
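A quick experiment shows how badly the mapping reshuffles when the server count changes: hash % 5 and hash % 6 rarely agree, so going from 5 servers to 6 remaps most requests:

```python
import hashlib

def bucket(key, n):
    return int(hashlib.md5(key.encode()).hexdigest(), 16) % n

keys = [f"client-{i}" for i in range(1000)]
moved = sum(bucket(k, 5) != bucket(k, 6) for k in keys)
print(f"{moved}/1000 keys map to a different server")
# Roughly 5 out of every 6 keys move (only hashes with h % 30 < 5
# keep their server), so almost every cached mapping is invalidated.
```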
Solution: consistent hashing
Consistent hashing does not eliminate these problems, especially when adding new servers, but it reduces them a lot. The underlying downside still exists, yes, but to a much smaller extent, and that itself is a valuable improvement in very large scale systems.
Consistent hashing applies a hash function to incoming requests and the servers. The resulting outputs therefore fall in a set range (continuum) of values. This detail is very important.
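A sketch of a hash ring, using virtual nodes ("replicas") to spread each server around the continuum; the server names and replica count are illustrative:

```python
import bisect
import hashlib

class ConsistentHashRing:
    """Servers and request keys are hashed into the same range (the
    continuum). Each key belongs to the first server at or after its
    hash position, so adding or removing one server only remaps the
    keys in that server's slice of the ring."""

    def __init__(self, servers=(), replicas=100):
        self.replicas = replicas  # virtual nodes smooth the distribution
        self.ring = []            # sorted hash positions
        self.owner = {}           # hash position -> server name
        for server in servers:
            self.add(server)

    def _hash(self, key):
        return int(hashlib.md5(key.encode()).hexdigest(), 16)

    def add(self, server):
        for i in range(self.replicas):
            pos = self._hash(f"{server}#{i}")
            bisect.insort(self.ring, pos)
            self.owner[pos] = server

    def remove(self, server):
        for i in range(self.replicas):
            pos = self._hash(f"{server}#{i}")
            self.ring.remove(pos)
            del self.owner[pos]

    def get(self, key):
        # First position at or after the key's hash, wrapping around.
        idx = bisect.bisect(self.ring, self._hash(key)) % len(self.ring)
        return self.owner[self.ring[idx]]

ring = ConsistentHashRing(["s0", "s1", "s2", "s3", "s4"])
before = {f"client-{i}": ring.get(f"client-{i}") for i in range(1000)}
ring.add("s5")  # add a sixth server
moved = sum(ring.get(k) != v for k, v in before.items())
print(f"{moved}/1000 keys moved")  # roughly 1/6 of keys, versus 5/6 with mod
```

With n servers, adding one more moves only about 1/(n+1) of the keys, which is exactly the "much smaller extent" mentioned above.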
https://www.youtube.com/watch?v=tHEyzVbl4bg
Types
Layer 4
Layer 4 load balancing, operating at the transport level, manages traffic based on network information such as application ports and protocols without visibility into the actual content of messages. This is an effective approach for simple packet-level load balancing. The fact that messages are neither inspected nor decrypted allows them to be forwarded quickly, efficiently, and securely. On the other hand, because layer 4 load balancing is unable to make decisions based on content, it’s not possible to route traffic based on media type, localization rules, or other criteria beyond simple algorithms such as round-robin routing.
Pros
Ideal for simple packet-level load balancing
Because it doesn’t consider the data, it’s fast and efficient.
More secure because packets aren’t inspected; even if the load balancer is compromised, no one can see the data.
Does not need to decrypt the content; it merely forwards packets.
Uses NAT
Maintains only one (NATed) connection between client and server, so your load balancer can serve a maximum number of TCP connections equal to (number of servers * max connections per server).
Cons
Not capable of smart load balancing based on the content
Can’t do real microservices
Needs to be sticky because TCP is a stateful protocol: once a connection is established, it stays with one backend server, and all packets on that connection go to that server. The next connection then picks another server based on the algorithm.
Layer 7
Layer 7 load balancing operates at the application level, using protocols such as HTTP and SMTP to make decisions based on the actual content of each message. Instead of merely forwarding traffic unread, a layer 7 load balancer terminates network traffic, performs decryption as needed, inspects messages, makes content-based routing decisions, initiates a new TCP connection to the appropriate upstream server, and writes the request to the server.
Layer 7 load balancing allows more intelligent load balancing decisions and content optimizations. By viewing or actively injecting cookies, the load balancer can identify unique client sessions to provide server persistence, or “sticky sessions,” sending all client requests to the same server for greater efficiency. Packet-level visibility allows content caching to be used, holding frequently accessed items in memory for easy retrieval. Importantly for modern organizations, layer 7 load balancing provides the intelligence to handle protocols that piggyback or multiplex requests onto a single connection to optimize traffic and reduce overhead.
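As a sketch of just the sticky-session decision (the cookie name and backend names are made up; a real layer 7 balancer would make this choice while proxying the decrypted HTTP exchange):

```python
import itertools

BACKENDS = ["app-1", "app-2", "app-3"]
_rotation = itertools.cycle(BACKENDS)

def pick_backend(cookies):
    # An L7 balancer can read cookies because it has decrypted and
    # parsed the HTTP message. Returns (backend, cookie_to_set).
    backend = cookies.get("lb_backend")
    if backend in BACKENDS:
        return backend, None          # sticky: same server as last time
    backend = next(_rotation)         # first visit: pick a server...
    return backend, f"lb_backend={backend}"  # ...and pin the client to it

print(pick_backend({}))                       # ('app-1', 'lb_backend=app-1')
print(pick_backend({"lb_backend": "app-2"}))  # ('app-2', None)
```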
Pros
Offers smart routing based on the URL
Provides caching
Cons
More expensive
Requires decrypting
In terms of security, you have to share your certificate with the load balancers. If an attacker gets access to the load balancer, they automatically have access to all your data.
Its proxy creates multiple connections (client to proxy, proxy to server), so you are bounded by the maximum TCP connections on your load balancer.
Links
https://blog.containership.io/7-things-that-nobody-told-you-about-load-balancers-that-you-ought-to-know
Nginx & Proxy servers
https://nginx.org/en/
https://www.nginx.com/resources/wiki/
https://en.wikipedia.org/wiki/Nginx
https://en.wikipedia.org/wiki/Proxy_server