Key-Value-Store

Functional Spec: Design a scalable and efficient key-value store with immutable data. Data will be loaded at the initialization and it won't be modified during the service. User will give a key and the server should return the value.

Designing scalable KV store involves answering at least two primary questions:

How to store the data: Before we answer this, we first need to answer what kind of queries the data storage layer should expect from the client?

a) Is there regular expression or range queries? or is it just a key ?

This will help in deciding the data structure, do we want to use LSM, b+ trees or just hash maps. Random reads are more costly than the sequential ones hence we need to carefully choose data structure.

b) Will there be lots of deletes ?

If deletes are permitted then we have to consider how to handle memory fragmentation or compression?

c) what would be the size of the value ? is it just int or is it in M/G/TB's ?

This will answer should we keep key-value in-memory or store values in HDD/SDD and just store the mapping in-memory.

Since our problem statement just says "read" queries, we will use hashmaps and to make our map light weight, instead of storing values in the map, we will store pointers to the corresponding values.

How to handle the connections?

a) Do we have a million clients? or each client will send millions of requests? or both?

This will help in determining how we set up our socket resources, if clients are in millions then probably they will not send requests so frequently, Hence we don't have to spin up a dedicated thread and keep waiting for the request. Instead we can use epoll to handle those clients and we will get notified when the request arrives. On the other hand, if we have a limited number of clients and very high request rate, we can have active threads which will keep churning the requests as soon as they land but again that will depend on the following question too.

b) Requests are cpu intensive or I/O intensive?

If a job involves lots of I/O ops, we don't really have to hold on to the cpu resources. We can again use an event driven approach to get notification when anything arrives on the I/O resource endpoint.

In our cases, requests are I/O bound and we are not limited by number of clients or requests, hence we used Boost::asio to handle the sockets (which internally uses epoll).

Performance

Machine configuration:

    CPU: 3 vCPU - Intel(R) Core(TM) i7-1068NG7 CPU @ 2.30GHz
    RAM: 8G

No of key-value	Avg Latency(Microsecond)	Mem after initialization	Mem during test
10k	100 to 200, median 130	16MB (0.2%)	16MB (0.2%)
1 Million	100 to 200, median 130	224MB(2.8%)	224MB(2.8%)

Execution

Generate data
Based on data size requirement, edit data_generator.py and modify the value of "data_size".

python3 data_generator.py

Build

make

Run the server with above generated data file.

./Server testdata_1M.data

Now run the client. Client will also take same data file as input as it will randomly pick a key and query the server. In kv_store_client.cpp, we have two paramter to control concurrency (NUM_CLIENT) and total number of requests (NUM_REQUEST). Currently set to 10 clients and 1 Million requests.

./Client testdata_1M.data

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
data_generator.py		data_generator.py
key_value_store.cpp		key_value_store.cpp
key_value_store.h		key_value_store.h
kv_store_client.cpp		kv_store_client.cpp
kv_store_server.cpp		kv_store_server.cpp
kv_store_server.h		kv_store_server.h
utils.hpp		utils.hpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Key-Value-Store

Performance

Execution

Generate data

Build

About

Uh oh!

Releases

Packages

Languages

License

veerkumar/Key-Value-Store

Folders and files

Latest commit

History

Repository files navigation

Key-Value-Store

Performance

Execution

Generate data

Build

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages