Redirect

A lightweight scalable URL Shortener system

graph TD
  user((User)):::blue
  user --- |HTTP Request| proxy
  proxy --- replica1
  proxy --- replica2
  subgraph proxy["Proxy Service"]
  note([Note: Data is partitioned between replicas <br> and replicated within replicas]) -.-
  orchestrator(Multithreaded Orchestrator) --- cache[(<br>Server Response <br> Cache)]
  end
  subgraph monitor["Monitor Service"]
  health((Health & Recovery))
  end
  monitor --- proxy
  monitor -.- host1
  monitor -.- host2
  monitor -.- host3
  monitor -.- host4
  monitor -.- server1
  monitor -.- server2
  monitor -.- server3
  monitor -.- server4
  subgraph replica2["Replica 2"]
  host3([Host 3]) --- server3(Multithreaded Server)
  server3 --- cache3[(Url Cache)]
  server3 --- |Buffer| replica2db1[(<br> replica 2 Data <br><br>)]

  host4([Host 4]) --- server4(Multithreaded Server)
  server4 --- cache4[(Url Cache)]
  server4 --- |Buffer| replica2db2[(<br> replica 2 Data <br><br>)]
  end
  subgraph replica1["Replica 1"]
  host1([Host 1]) --- server1(Multithreaded Server)
  server1 --- cache1[(Url Cache)]
  server1 --- |Buffer| replica1db1[(<br> replica 1 Data <br><br>)]

  host2([Host 2]) --- server2(Multithreaded Server)
  server2 --- cache2[(Url Cache)]
  server2 --- |Buffer| replica1db2[(<br> replica 1 Data <br><br>)]
  end

%% Colors %%
classDef blue fill:#2374f7,stroke:#000,stroke-width:2px,color:#fff

The mermaid diagram above describes the system architecture. View it in a markdown viewer that supports mermaid diagrams, such as GitHub.

Architecture
- Components
  - GET Data Flow
  - PUT Data Flow
- Code Overview
Running The System
Testing The System
- Performance Testing
  - Read Test
  - Write Test
- Correctness Testing
Analysis

Architecture

Components

The system structure is composed of the following components:

Proxy: The proxy service is responsible for receiving HTTP requests from the user and forwarding them to the appropriate server(s). It will load balance the requests between the servers with the orchestrator. It will also cache the responses from the servers to reduce the load on the servers. The proxy is multithreaded and will handle requests concurrently.
Orchestrator: The orchestrator is responsible for managing the grouping of hosts into replicas, along with the load balancing of requests between the replicas and servers. The strategy used for load balancing is consistent hashing (ring pattern), so most keys keep their assignment when the number of hosts scales up. By default, the orchestrator will hash the short url to a replica of 2 hosts.
Replica: The replica is an abstraction of a group of hosts and servers. Data is partitioned between the replicas and replicated within the replicas. This way, if a host or server fails, the data will still be available on another host within the replica. Replica groupings are different for every hash key, and each replica size is configurable, defaulting to 2 hosts. The replica size can be adjusted to increase the replication factor.
Host: The host is responsible for running the server and storing the data. Servers are multithreaded and will handle requests concurrently. For writes, the server writes to a buffer to minimize the number of writes to the database. For reads, the server will first check the cache for the short url and its long url before checking the database.
Monitor: The monitor service is responsible for monitoring the health of the system. It will check on the health of all hosts and servers every 5 seconds by default. If a host or server is down, it will recover by spawning a new host within the same replica as the failed host for each hash. It will also notify the orchestrator of the new host so that it can be used in place of the failed node.

GET Data Flow

1. The user sends a GET request to the proxy with a short url.
2. The proxy checks its cache for the short url and returns the cached server response if found.
3. The proxy selects a replica to use for the request by hashing the short url.
4. The proxy forwards the request to a host in the replica. If that host is unreachable, the proxy retries on a different host in the replica until a response is received.
5. The host server checks its own cache for the short url and returns the long url if found.
6. Otherwise, the host server checks the database for the short url and returns the long url if found.
7. The host server caches the short and long url pair and returns the url to the proxy.
8. The proxy caches the short url and server response pair.
9. The proxy returns the server response to the user.

graph TD
  user((User)):::blue
  user --> |1. GET Request| proxy
  proxy --> |3 & 4. Forward Request to Host 1 Server| replica1
  proxy --- replica2
  proxy --> |9. Redirected URL Response| user
  subgraph proxy["Proxy Service"]
  cache --> |2. Server Response| orchestrator
  orchestrator(Multithreaded Orchestrator):::blue --> |2. Check Cache| cache[(<br>Server Response <br> Cache)]:::blue
  orchestrator --> |8. Cache Short & Server Response| cache
  end
  subgraph replica2["Replica 2"]
  host3([Host 3]) --- server3(Multithreaded Server)
  server3 --- cache3[(Url Cache)]
  server3 --- |Buffer| replica2db1[(<br> replica 2 Data <br><br>)]
  host4([Host 4]) --- server4(Multithreaded Server)
  server4 --- cache4[(Url Cache)]
  server4 --- |Buffer| replica2db2[(<br> replica 2 Data <br><br>)]
  end
  subgraph replica1["Replica 1"]
  host1([Host 1]):::blue --> server1(Multithreaded Server):::blue
  server1 --> |7. Cache Short & Long URL| cache1
  cache1 --> |5. Long URL Response| server1
  server1 --> |5. Check Cache| cache1[(Url Cache)]:::blue
  replica1db1 --> |6. DB Response| server1
  server1 --> |6. Check DB| replica1db1[(<br> replica 1 Data <br><br>)]:::blue
  host2([Host 2]) --- server2(Multithreaded Server)
  server2 --- cache2[(Url Cache)]
  server2 --- |Buffer| replica1db2[(<br> replica 1 Data <br><br>)]
  end
  replica1 --> |7. URL Response| proxy

%% Colors %%
classDef blue fill:#2374f7,stroke:#000,stroke-width:2px,color:#fff

The mermaid diagram above shows the GET request data flow in blue. View it in a markdown viewer that supports mermaid diagrams, such as GitHub.
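The proxy side of the GET flow can be sketched as follows. This is a minimal Python illustration, not the project's code: handle_get, HostUnreachable, pick_replica and fetch_from_host are hypothetical names, and the response cache is a plain unbounded dict for brevity.

```python
from typing import Callable, Dict, List, Optional


class HostUnreachable(Exception):
    """Raised by the (hypothetical) transport when a host does not answer."""


def handle_get(
    short_url: str,
    response_cache: Dict[str, str],
    pick_replica: Callable[[str], List[str]],
    fetch_from_host: Callable[[str, str], str],
) -> Optional[str]:
    # 2. Serve from the proxy's response cache when possible.
    cached = response_cache.get(short_url)
    if cached is not None:
        return cached

    # 3. Hash the short url to a replica (an ordered list of hosts).
    for host in pick_replica(short_url):
        try:
            # 4-7. Forward to one host; the server checks its cache, then its DB.
            response = fetch_from_host(host, short_url)
        except HostUnreachable:
            # Host is down: retry on the next host in the same replica.
            continue
        # 8. Remember the response so the next GET skips the network hop.
        response_cache[short_url] = response
        # 9. Return the server response to the user.
        return response

    # Every host in the replica was unreachable.
    return None
```

Passing the replica picker and the transport in as functions keeps the sketch independent of how hashing and HTTP forwarding are actually implemented in the proxy.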

PUT Data Flow

1. The user sends a PUT request to the proxy with a short and long url.
2. The proxy selects a replica to use for the request by hashing the short url.
3. The proxy forwards the request to all hosts in the replica.
4. Each host server writes the short and long url pair to its own buffer.
5. Each host server's buffer is flushed to the database when it reaches a certain size or time limit.
6. Each host server caches the short and long url pair.
7. Each host server notifies the proxy that the write was successful.
8. The proxy returns a success response to the user.

graph TD
  user((User)):::blue
  user --> |1. PUT Request| proxy
  proxy --> |2 & 3. Forward Request to Host 1 & 2 Server| replica1
  proxy --- replica2
  subgraph proxy["Proxy Service"]
  orchestrator(Multithreaded Orchestrator):::blue --- cache[(<br>Server Response <br> Cache)]
  end
  subgraph replica2["Replica 2"]
  host3([Host 3]) --- server3(Multithreaded Server)
  server3 --- cache3[(Url Cache)]
  server3 --- |Buffer| replica2db1[(<br> replica 2 Data <br><br>)]
  host4([Host 4]) --- server4(Multithreaded Server)
  server4 --- cache4[(Url Cache)]
  server4 --- |Buffer| replica2db2[(<br> replica 2 Data <br><br>)]
  end
  subgraph replica1["Replica 1"]
  host1([Host 1]):::blue --> server1(Multithreaded Server):::blue
  server1 --> |6. Cache URL Pair| cache1[(Url Cache)]:::blue
  server1 --> |4 & 5. Buffer writing to DB| replica1db1[(<br> replica 1 Data <br><br>)]:::blue
  host2([Host 2]):::blue --> server2(Multithreaded Server):::blue
  server2 --> |6. Cache URL Pair| cache2[(Url Cache)]:::blue
  server2 --> |4 & 5. Buffer writing to DB| replica1db2[(<br> replica 1 Data <br><br>)]:::blue
  end
  replica1 --> |7. Successfully Saved| proxy
  proxy --> |8. Got it Response| user

%% Colors %%
classDef blue fill:#2374f7,stroke:#000,stroke-width:2px,color:#fff

The mermaid diagram above shows the PUT request data flow in blue. View it in a markdown viewer that supports mermaid diagrams, such as GitHub.
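The buffering in the PUT flow (write to a buffer, flush on a size or time limit) can be sketched as follows. This is a minimal Python illustration under assumptions: WriteBuffer, flush_to_db, max_size and max_age_seconds are hypothetical names, and the real server may batch and flush differently.

```python
import threading
import time


class WriteBuffer:
    """Collects url pairs and hands them to the database in batches."""

    def __init__(self, flush_to_db, max_size=100, max_age_seconds=5.0):
        self._flush_to_db = flush_to_db  # callable taking a dict of pairs
        self._max_size = max_size
        self._max_age = max_age_seconds
        self._pending = {}
        self._last_flush = time.monotonic()
        self._lock = threading.Lock()    # server threads share one buffer

    def put(self, short_url, long_url):
        """Buffer one pair; flush immediately if the size limit is reached."""
        with self._lock:
            self._pending[short_url] = long_url
            if len(self._pending) >= self._max_size:
                self._flush_locked()

    def tick(self):
        """Call periodically; flushes when the oldest batch is too old."""
        with self._lock:
            age = time.monotonic() - self._last_flush
            if self._pending and age >= self._max_age:
                self._flush_locked()

    def _flush_locked(self):
        # Swap the batch out first so a failed flush cannot be re-sent twice.
        batch, self._pending = self._pending, {}
        self._flush_to_db(batch)
        self._last_flush = time.monotonic()
```

A background thread calling tick() at a fixed interval would play the role that SLEEP_DURATION appears to play in the server configuration. Note that pairs still in the buffer are lost if the process dies before a flush, which is the usual trade-off of write buffering.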

Code Overview

The system code is organized into the following directories:

orchestration: The orchestration directory contains the scripts for the proxy, orchestrator, and monitor services. It will handle the orchestration of the system along with the recovery of the system.
server: The server directory contains the code for the server. It will handle the requests from the proxy and respond with the appropriate response along with writing to the database.
storage: The storage directory contains the code and drivers to set up the database. It can optionally populate the database with data when setting up the system.

Running The System

Initial Setup

On the system, edit the ~/.bashrc file to include the following:

export JAVA_HOME="/opt/jdk-20.0.1"
export PATH="/opt/jdk-20.0.1/bin:$PATH"

Run the confirmAllHosts.bash script to accept new ssh connections from the hosts so that there are no prompts when running the system.

./confirmAllHosts.bash

Run the following command from the root folder to build the system:

./make.bash

Configuration

The system will initially use the hosts in the HOSTS file on the port found in the PORT file.

Server configurations can be adjusted in orchestration/runServerLocal.bash. The configurations are as follows:

IS_VERBOSE: toggle log statements
HOSTPORT: port which the server runs on
CACHE_SIZE: size of cache used to store URLs obtained from GET and PUT requests
NUM_THREADS: number of threads running in the server
WRITE_BUFFER_SIZE: size of the write buffer, which holds the results of client PUT requests and is periodically flushed to the database
SLEEP_DURATION: interval to check for write buffer flushing

Proxy configurations can be adjusted in orchestration/proxy/runProxyLocal.bash. The configurations are as follows:

IS_VERBOSE: toggle log statements
PROXYPORT: port which the proxy runs on
CACHE_SIZE: size of cache used to store server responses
NUM_THREADS: number of threads running in the proxy
REPLICATION_FACTOR: number of hosts to replicate data to

Usage

Run the following command from the root folder to run the system:

./dostuff.bash

Once the system is running, the following commands can be used to interact with the system:

Sample PUT:
curl -X PUT "http://localhost:{PROXYPORT}?short=arnold&long=http://google.com"

Sample GET:
curl "http://localhost:{PROXYPORT}/arnold"

Scaling Up

If we want to add a host to the system while it's running, we can run the following command:

./orchestration/addHost.bash

We can optionally pass in arguments to the script, where the first argument is a host we want to replace, and the second argument is the host we want to clone data from.

Scaling Down

If we want to remove a host from the system, we can run the following command:

./orchestration/removeHost.bash

We can optionally pass in an argument to the script, where the argument is the host we want to remove.

Testing The System

Performance Testing

For performance testing, we used ab (ApacheBench). Our usage of this tool was very simple, as it was just a load test.

We can run the following command to run the performance test after starting the system:

./testing/plotting/plotAll.bash

Read Test

From the read test, we can see that the system averages about 20ms for a read request.

The following table contains the timing results of sending 4000 read requests to the proxy.

Host count	Time to complete all requests
1	6.045 seconds
2	5.255 seconds
3	4.849 seconds
4	4.532 seconds

Write Test

From the write test, we can see that the system averages about 50ms for a write request.

The following table contains the timing results of sending 4000 write requests to the proxy.

Host count	Time to complete all requests
1	12.812 seconds
2	11.255 seconds
3	15.008 seconds
4	14.605 seconds
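To compare the two tables, the totals can be converted into aggregate throughput and average wall-clock time per request. The following Python sketch only does arithmetic on the numbers above; summarize is a hypothetical helper name. The concurrency level used with ab is not stated, so the per-request figure is total time divided by request count, not the latency of an individual request.

```python
TOTAL_REQUESTS = 4000

# Seconds to complete all requests, keyed by host count (from the tables).
read_seconds = {1: 6.045, 2: 5.255, 3: 4.849, 4: 4.532}
write_seconds = {1: 12.812, 2: 11.255, 3: 15.008, 4: 14.605}


def summarize(seconds_by_hosts):
    """Return {host_count: (requests_per_second, wall_clock_ms_per_request)}."""
    return {
        hosts: (TOTAL_REQUESTS / secs, 1000.0 * secs / TOTAL_REQUESTS)
        for hosts, secs in seconds_by_hosts.items()
    }


for label, table in (("read", read_seconds), ("write", write_seconds)):
    for hosts, (rps, ms) in summarize(table).items():
        print(f"{label:5s} hosts={hosts} {rps:7.1f} req/s {ms:.2f} ms/request")
```

Read time falls steadily as hosts are added, while write time is lowest at 2 hosts. A plausible reading, not confirmed by the tests, is that every write is forwarded to all hosts in a replica, so adding hosts does not reduce the work per write.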

Correctness Testing

For correctness tests, we used curl to send requests and bash to validate the responses automatically.

We can run the following command to run the correctness test after starting the system:

$ ./testing/correctness/correctTest.bash
All tests passed!

Analysis

Load Balancing

The proxy uses consistent hashing to load balance. The hash space is partitioned into 360 slots, with each server claiming 3 slots. The short URL from each request is hashed onto this ring, and its data is replicated across the two servers whose slots come next after the hash. Writes go to both servers. Reads go to one of the available servers and fall back to the other if it is not available.
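The ring described above can be sketched as follows. This is a minimal Python illustration under assumptions, not the project's code: the MD5 hash, the "host#i" slot naming, and the linear probing on slot collisions are all choices made for the example, and HashRing, slot_of and replica_for are hypothetical names. Only the 360 slots, 3 slots per server and 2 hosts per replica come from the text.

```python
import hashlib
from bisect import bisect_right

RING_SLOTS = 360        # size of the hash space (from the text)
SLOTS_PER_SERVER = 3    # slots claimed by each server (from the text)
REPLICATION_FACTOR = 2  # hosts per replica (from the text)


def slot_of(key: str) -> int:
    """Map a string to a ring slot. MD5 is an assumption; any stable hash works."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % RING_SLOTS


class HashRing:
    def __init__(self, hosts):
        self._ring = {}  # slot -> host
        for host in hosts:
            self.add_host(host)

    def add_host(self, host):
        """Claim SLOTS_PER_SERVER slots; assumes far fewer than 120 hosts."""
        for i in range(SLOTS_PER_SERVER):
            slot = slot_of(f"{host}#{i}")
            while slot in self._ring:  # collision: probe the next slot
                slot = (slot + 1) % RING_SLOTS
            self._ring[slot] = host

    def remove_host(self, host):
        self._ring = {s: h for s, h in self._ring.items() if h != host}

    def replica_for(self, short_url, size=REPLICATION_FACTOR):
        """Walk clockwise from the key's slot, collecting distinct hosts."""
        slots = sorted(self._ring)
        start = bisect_right(slots, slot_of(short_url))
        replica = []
        for offset in range(len(slots)):
            host = self._ring[slots[(start + offset) % len(slots)]]
            if host not in replica:
                replica.append(host)
            if len(replica) == size:
                break
        return replica
```

With this layout, removing a host only changes the replicas that contained it, and the surviving member of such a replica keeps its place. That is the property that lets a replacement host clone its data from the remaining member.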

Caching

Caches exist for both the proxy and server. The proxy cache maps URLs to responses sent by the servers, so they can be returned without having to contact the server again. This reduces the amount of time communicating over the network, which can be a significant bottleneck.

The server cache stores short and long URLs to avoid creating another database connection. Database IO is also a significant bottleneck.
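The server-side lookup (cache first, then database) can be sketched as follows. This is a minimal Python illustration under assumptions: the eviction policy is not stated in this document, so least-recently-used eviction is assumed here, and UrlCache, lookup and query_db are hypothetical names.

```python
from collections import OrderedDict


class UrlCache:
    """Bounded short url -> long url cache with LRU eviction (assumed policy)."""

    def __init__(self, capacity):
        self._capacity = capacity
        self._entries = OrderedDict()

    def get(self, short_url):
        if short_url not in self._entries:
            return None
        self._entries.move_to_end(short_url)  # mark as most recently used
        return self._entries[short_url]

    def put(self, short_url, long_url):
        self._entries[short_url] = long_url
        self._entries.move_to_end(short_url)
        if len(self._entries) > self._capacity:
            self._entries.popitem(last=False)  # evict least recently used


def lookup(short_url, cache, query_db):
    """Return the long url, touching the database only on a cache miss."""
    long_url = cache.get(short_url)
    if long_url is None:
        long_url = query_db(short_url)
        if long_url is not None:
            cache.put(short_url, long_url)
    return long_url
```

The capacity argument corresponds to what CACHE_SIZE would control. Unknown short urls are not cached in this sketch, so repeated lookups of a missing key still reach the database.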

Scalability

Horizontal Scalability

The system is highly scalable as increasing the number of hosts and servers will increase the capacity of the system. The durability of the system will also increase, since the data is partitioned across more hosts and servers and less data is lost on a host or server failure. The system can also scale up and down dynamically. If we want to add a host to the system while it's running, we can run the ./orchestration/addHost.bash script. We can optionally pass in arguments to the script, where the first argument is a host we want to replace, and the second argument is the host we want to clone data from. If we want to remove a host from the system, we can run the ./orchestration/removeHost.bash script. We can optionally pass in an argument to the script, where the argument is the host we want to remove.

Vertical Scalability

The application is capable of scaling with processing power and memory through configurations. The thread count can be adjusted to take advantage of the number and speed of available processors. Cache and write buffer size can be increased if necessary.

Latency

The read latency of the application is 7.9 ms. The write latency of the application is 19.7 ms.

Throughput

The read throughput is 504 requests / second. The write throughput is 203 requests / second.

Availability

The system is highly available as it is replicated within the replicas. If a host or server fails, the data will still be available on another host within the replica. If a host is unresponsive, the proxy will retry the request on another host within the replica. The system will also recover from failures by spawning a new host within the same replica as the failed host for each hash. It will also notify the orchestrator of the new host so that it can be used in replacement of the failed node. This is a result of the ring pattern (consistent hashing) used for load balancing and data partitioning. On the first host failure, no data will be lost as all of its data is partitioned throughout the system in various replicas. On a second host failure, minimal data will be lost since the data is partitioned and replicated within the replicas.

Durability

The durability of the system is strong since each url pair is replicated on every host in its replica (2 hosts by default). Since the replica is different for each hash, data is partitioned across all nodes in the system rather than replicated on every host. This means that if a host goes down (first host failure), no data will be lost.

Health Check

The application periodically pings each host and checks the status of the server on each host. If a node goes down, the system will spawn another node and start the process within 5 seconds. While the service is down, requests to other nodes still operate as normal.
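One monitoring pass can be sketched as follows. This is a minimal Python illustration under assumptions: check_once, monitor_forever, is_healthy, spawn_replacement and notify_orchestrator are hypothetical names, and the real monitor may probe and recover hosts differently.

```python
import time


def check_once(hosts, is_healthy, spawn_replacement, notify_orchestrator):
    """Run one monitoring pass and return the updated host list."""
    updated = []
    for host in hosts:
        if is_healthy(host):
            updated.append(host)
            continue
        # Host or server is down: start a replacement in its place and
        # tell the orchestrator to route to the new host instead.
        replacement = spawn_replacement(host)
        notify_orchestrator(failed=host, replacement=replacement)
        updated.append(replacement)
    return updated


def monitor_forever(hosts, is_healthy, spawn_replacement, notify_orchestrator,
                    interval_seconds=5.0, sleep=time.sleep):
    """Repeat the check at a fixed interval (5 seconds by default)."""
    while True:
        hosts = check_once(hosts, is_healthy, spawn_replacement,
                           notify_orchestrator)
        sleep(interval_seconds)
```

Keeping the single pass separate from the loop makes the recovery logic easy to exercise with fake probes, without waiting on real timers or hosts.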

anthonytedja / redirectv1