Lock-free & memory-efficient network communications using queues

```python
# server.py
import net_queue as nq

with nq.new(purpose=nq.Purpose.SERVER) as queue:
    message = queue.get()
    queue.put("Hello, Client!")
```

```python
# client.py
import net_queue as nq

with nq.new(purpose=nq.Purpose.CLIENT) as queue:
    queue.put("Hello, Server!")
    message = queue.get()
```

| Configuration | Used |
|---|---|
| OS | Debian GNU/Linux 13 (trixie) |
| CPU | 13th Gen Intel® Core™ i5-13400 × 16 |
| RAM | 64 GB |

| Test | Transfer | Operations | Executed |
|---|---|---|---|
| Sync | 17.18 GB | 8.0 K | python test/iops.py proto peer sync --step-size 0 --max-size 21 --reps 4_000 |
| Async | 17.18 GB | 8.0 K | python test/iops.py proto peer async --step-size 0 --max-size 21 --reps 4_000 |
| Mix | 8.59 GB | 8.19 K | python test/iops.py proto peer async --min-size 8 --step-size 2 --step-expo 0.5 --max-size 32 |

| Time | TCP | MQTT | gRPC |
|---|---|---|---|
| Sync | 5.3 s | 36.2 s | 23.1 s |
| Async | 8.5 s | 27.0 s | 20.9 s |
| Mix | 9.2 s | 23.8 s | 20.2 s |

| Transfer | TCP | MQTT | gRPC |
|---|---|---|---|
| Sync | 25.24 Gbps | 3.71 Gbps | 5.81 Gbps |
| Async | 15.77 Gbps | 4.98 Gbps | 6.43 Gbps |
| Mix | 14.91 Gbps | 5.78 Gbps | 6.79 Gbps |

| Operations | TCP | MQTT | gRPC |
|---|---|---|---|
| Sync | 1500.00 IOPS | 220.85 IOPS | 346.37 IOPS |
| Async | 939.73 IOPS | 296.78 IOPS | 383.04 IOPS |
| Mix | 1780.00 IOPS | 689.41 IOPS | 809.01 IOPS |

| Memory | TCP | MQTT | gRPC |
|---|---|---|---|
| Sync | 21.94 MB | 8377.33 MB | 30.68 MB |
| Async | 33.39 MB | 7585.78 MB | 41.46 MB |
| Mix | 8639.32 MB | 10481.62 MB | 8642.59 MB |

```bash
pip install net-queue
```

Or, for an editable install from source:

```bash
git clone https://github.com/hpca-uji/net-queue.git
cd net-queue
pip install -e .
```

- `Protocol`: Communication protocol
  - `TCP`
  - `MQTT` (requires an external broker)
  - `GRPC`
- `Purpose`: Communication purpose
  - `CLIENT`
  - `SERVER`
- `CommunicatorOptions(...)`: Communicator options
  - `id: uuid.UUID = uuid.uuid4()` (random)
  - `netloc: NetworkLocation = NetworkLocation('127.0.0.1', 51966)`
  - `workers: int = 1`: Maximum number of threads to use for connection handling. Depending on the protocol, 1-3 more may be used, but they will be idle most of the time. For high-throughput applications or high-latency networks this may need increasing.
  - `connection: ConnectionOptions = ConnectionOptions()`
  - `serialization: SerializationOptions = SerializationOptions()`
  - `security: SecurityOptions | None = None`
- `ConnectionOptions(...)`: Connection options
  There are no universally correct values; they depend on the use case, the OS/stack/version, and the link specs. It is recommended to test your configuration (or, preferably, set the values dynamically). The rules of thumb below only serve as a baseline.
  - `max_size: int = 4 * 1024 ** 2` (4 MiB): Maximum message size to send to the underlying protocol before splitting.
    Selection: bandwidth-delay product of the network (smaller favors streaming, larger favors bursty traffic). Default: typical connection (80 Mbps @ 50 ms).
  - `merge_size: int = max_size`: Maximum message size to merge up to when chunks are too small to send efficiently. Internally, a buffer of this size is preallocated on construction.
    Selection: max size, tunable for RAM/CPU usage (smaller: less RAM, more CPU; larger: more RAM, less CPU). Default: balanced RAM/CPU usage.
  - `efficient_size: int = max_size / 64`: Minimum message size for a send to be considered efficient before attempting merging. If no more messages are queued, the message is sent as-is.
    Selection: amortizes the overhead of abstractions/syscalls/headers (smaller: lower latency; larger: higher efficiency). Default: maximum TCP segment size.
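As a rough illustration of the selection rules above, the sketch below derives `max_size` from the bandwidth-delay product of an assumed 1 Gbps / 20 ms link; the link figures, the power-of-two rounding, and the assumption that `ConnectionOptions` is importable from the package top level are all illustrative, not values from the library.

```python
import net_queue as nq  # assumption: ConnectionOptions is exposed by the package

# Hypothetical link characteristics used only for this example.
bandwidth_bps = 1_000_000_000      # 1 Gbps
rtt_s = 0.020                      # 20 ms round-trip time

# Bandwidth-delay product: bytes that can be "in flight" on the link.
bdp_bytes = int(bandwidth_bps / 8 * rtt_s)        # ~2.5 MB

# Round up to the next power of two (illustrative choice, not a requirement).
max_size = 1 << (bdp_bytes - 1).bit_length()      # 4 MiB here

connection = nq.ConnectionOptions(
    max_size=max_size,
    merge_size=max_size,             # balanced RAM/CPU, as in the documented default
    efficient_size=max_size // 64,   # same ratio as the documented default
)
```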
- `SerializationOptions(...)`: Serialization options
  - `message_size: int = -1` (unlimited): Maximum message size to deserialize before attempting splitting.
  - `queue_size: int = -1` (unlimited): Maximum number of queued messages before incoming messages are dropped.
  - `load: Callable[[Stream], Any] = PickleSerializer().load`: Message deserialization handler.
  - `dump: Callable[[Any], Stream] = PickleSerializer().dump`: Message serialization handler.
- `SecurityOptions(...)`: Security options
  - `key: Path | None = None`: Server's private key.
    Required for servers; for clients always `None`.
  - `certificate: Path | None = None`: Server's certificate chain or client's trust chain.
    Required for servers; for clients, if not provided, it defaults to the system's trust chain.
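A minimal TLS sketch under the options above; the file paths are placeholders, and it is assumed that `SecurityOptions` is importable from the package top level.

```python
import net_queue as nq
from pathlib import Path

# Server side: both the private key and the certificate chain are required.
# The paths are placeholders.
server_security = nq.SecurityOptions(
    key=Path("/etc/net-queue/server.key"),
    certificate=Path("/etc/net-queue/server.pem"),
)

# Client side: the key stays None; if the certificate is omitted,
# the system trust chain is used instead.
client_security = nq.SecurityOptions(
    certificate=Path("/etc/net-queue/server.pem"),
)
```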
- `NetworkLocation(...)`: Network location
  Extends: `NamedTuple`
  - `host: str = "127.0.0.1"`
  - `port: int = 51966`
- `new(protocol, purpose, options)`: Create a communicator.
  - `protocol: Protocol = Protocol.TCP`
  - `purpose: Purpose = Purpose.CLIENT`
  - `options: CommunicatorOptions = CommunicatorOptions()`
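Putting the options together, here is a hedged sketch of creating a server with explicit settings; it only uses names documented above, assumes they are importable from the `net_queue` package, and the address, port, and worker count are illustrative.

```python
import net_queue as nq

# All values below are illustrative; see the option descriptions above.
options = nq.CommunicatorOptions(
    netloc=nq.NetworkLocation("127.0.0.1", 50000),           # example address/port
    workers=4,                                                # extra connection-handling threads
    connection=nq.ConnectionOptions(max_size=8 * 1024 ** 2),
)

with nq.new(nq.Protocol.TCP, nq.Purpose.SERVER, options) as server:
    message = server.get()     # block until some client sends a message
    server.put("Hello!")       # no peers given: broadcast to all clients
```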
- `Communicator(options)`: Communicator implementation
  Operations are thread-safe. Communicator has `with` support.
  - `options: CommunicatorOptions = CommunicatorOptions()`
  - `id: uuid.UUID` (helper for `options.id`)
  - `options: CommunicatorOptions`
  - `put(data: Any, *peers: uuid.UUID) -> Future[None]`: Publish data to peers.
    For clients, if no peers are given, data is sent to the server. For servers, if no peers are given, data is sent to all clients.
    It is preferred to specify multiple peers instead of issuing multiple puts, as the data will only be serialized once and protocols may use optimized routes.
    Note: Only servers can send to a particular client.
    The future is resolved when the data is safe to mutate again. The future may raise `ResourceClosed(uuid.UUID)` if the peer or the communicator itself is closed, and may raise protocol-specific exceptions.
  - `get(*peers: uuid.UUID) -> Any`: Get data from peers.
    If no peers are given, data is returned from the first available peer.
    Note: Currently peers cannot be specified.
  - `close() -> None`: Close the communicator.
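As an illustration of the `put` future semantics above, a hedged server-side sketch; it assumes the returned future behaves like a `concurrent.futures.Future`, and the payload is made up.

```python
import net_queue as nq

with nq.new(nq.Protocol.TCP, nq.Purpose.SERVER) as server:
    request = server.get()            # blocks until any client sends data

    payload = {"answer": 42}          # illustrative payload
    future = server.put(payload)      # no peers given: broadcast to all clients

    # The future resolves once the data is safe to mutate again and the
    # peers acknowledged reception (assuming a concurrent.futures-style API).
    future.result()
    payload["answer"] = 0             # safe to mutate now
```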
- `{protocol}.{purpose}.Communicator(options)`: Concrete communicator implementation for the given protocol and purpose.
- `stream.Stream()`: Zero-copy, non-blocking, pipe-like stream
  The interface mimics a non-blocking `BufferedRWPair`, but operations return memoryviews instead of bytes.
  Operations are not thread-safe. The reader is responsible for releasing chunks; the writer hands off responsibility over chunks.
  Stream has `with` and `bytes` support. Stream has `copy.copy()` support, but it does not support `copy.deepcopy()`.
  Extends: `BufferedIOBase`
  - `nchunks -> int`: Number of chunks held in the stream.
  - `nbytes -> int`: Number of bytes held in the stream.
  - `empty() -> bool`: Is the stream empty (would a read block).
  - `readchunk() -> memoryview`: Read a chunk from the stream.
  - `unreadchunk(chunk: memoryview) -> int`: Unread a chunk back into the stream.
  - `unwritechunk() -> memoryview`: Unwrite a chunk from the stream.
  - `writechunk(chunk: memoryview) -> int`: Write a chunk into the stream.
  - `peekchunk() -> memoryview`: Peek at a chunk in the stream.
  - `readchunks() -> Iterable[memoryview]`: Read all chunks from the stream.
  - `writechunks(chunks: Iterable[memoryview]) -> int`: Write many chunks into the stream.
  - `update(bs: Iterable[Buffer]) -> int`: Write many buffers into the stream.
  - `clear() -> None`: Release all chunks.
  - `copy() -> Stream`: Shallow copy of the stream.
  - `tobytes() -> bytes`: Transform the stream to bytes (will copy).
  - `frombytes(b: Buffer) -> Stream`: Construct a stream from bytes.
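A hedged sketch of the chunk interface above. The import path `net_queue.stream` is inferred from the `stream.` prefix used in this reference, and `nchunks`/`nbytes` are treated as properties because they are listed without parentheses; both are assumptions.

```python
from net_queue.stream import Stream  # import path is an assumption

s = Stream()
s.writechunk(memoryview(b"hello "))   # the writer hands off responsibility for the chunk
s.writechunk(memoryview(b"world"))
print(s.nchunks, s.nbytes)            # 2 chunks, 11 bytes

chunk = s.readchunk()                 # zero-copy view of the first chunk
print(bytes(chunk))                   # b'hello '
chunk.release()                       # the reader is responsible for releasing chunks

print(s.tobytes())                    # remaining content, copied: b'world'
```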
- `stream.PickleSerializer(...)`: Pickle-stream serializer
  Warning: The `pickle` module is not secure. Only unpickle data you trust.
  - `restrict: Iterable[str] | None = None`: If defined, it limits the range of trusted types.
    Example: `["builtins"]` for a whole module. Example: `["uuid.UUID"]` for a single class.
    Note: Some builtins may be implicitly allowed due to optimizations.
  - `load(data: Any) -> Stream`: Transform data into a stream.
  - `dump(data: Stream) -> Any`: Transform a stream into useful data.
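A hedged sketch of restricting the pickle serializer to a narrow set of trusted types and wiring it into the serialization options; the import paths are assumptions based on the names used in this reference.

```python
import net_queue as nq
from net_queue.stream import PickleSerializer  # import paths are assumptions

# Only trust the builtins module and uuid.UUID when unpickling incoming data.
serializer = PickleSerializer(restrict=["builtins", "uuid.UUID"])

options = nq.CommunicatorOptions(
    serialization=nq.SerializationOptions(
        load=serializer.load,
        dump=serializer.dump,
    ),
)
```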
- `stream.BytesSerializer(...)`: Bytes-stream serializer
  - `load(data: bytes) -> Stream`: Transform bytes into a stream.
  - `dump(data: Stream) -> bytes`: Transform a stream into bytes.
- ini: connection start (identify)
- fin: connection stop (flush)
- com: message exchange (generic)
- c2s: message exchange (client -> server)
- s2c: message exchange (server -> client)
Ini:
- Server & client send their IDs
- Server & client wait for the other's ID
- Server creates a new session or continues an existing one

Fin:
- Server & client flush their message queues
- Server & client send their IDs
- Server & client wait for the other's ID

Ini:
- Must be done on the first connection or when the connection changes

Fin:
- Must be done at session end (not at connection end)
Constructor
- Never blocks
- Only one communicator per ID
- Reusing an ID retains the server-side queues

Put
- Never blocks
- Communication will not modify the object
- The consumer must not modify the object until the future is resolved
- A resolved future acknowledges peer reception
- A future failing with ResourceClosed indicates the peer disconnected

Get
- Always blocks
- Returns a message or raises ResourceClosed
- Once closed, it keeps returning messages until exhausted, then raises ResourceClosed (see the sketch below)

Close
- Always blocks
- The server waits for peers to disconnect
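A hedged client-side sketch of the Get semantics: keep draining messages until the communicator reports it is closed. It assumes `ResourceClosed` is exposed at the package level; the actual location of the exception is not documented here.

```python
import net_queue as nq

# Assumption: ResourceClosed is importable from the package; adjust as needed.
with nq.new(nq.Protocol.TCP, nq.Purpose.CLIENT) as client:
    while True:
        try:
            message = client.get()   # always blocks until a message is available
        except nq.ResourceClosed:
            break                    # peer closed and the queue is exhausted
        print(message)               # application-specific handling goes here
```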
- Library: `socket`
- Parallelism: thread pool (n+1+1 threads)
- Library: `paho-mqtt`
- Options: TCP transport, QoS 0, MQTT 3.1.1
- Parallelism: single-threaded (1+1+1 threads)
MQTT broker implementations are not common, so the server provided here is actually just another client. Therefore, the address and port given to both the client and the server should be those of the actual broker, not of the machine where the server is running.
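For example, a hedged sketch where both sides are pointed at the broker; the broker address is illustrative, and the option classes are assumed to be importable from the package top level.

```python
import net_queue as nq

# Both the "server" and the client connect to the broker's address,
# not to each other. The address below is illustrative.
broker = nq.NetworkLocation("broker.example.org", 1883)

server = nq.new(nq.Protocol.MQTT, nq.Purpose.SERVER, nq.CommunicatorOptions(netloc=broker))
client = nq.new(nq.Protocol.MQTT, nq.Purpose.CLIENT, nq.CommunicatorOptions(netloc=broker))
```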
The MQTT library handles communications single-threaded, so the work done in the related callbacks is limited to pushing or pulling data from queues without blocking; all callback operations are minimal and fast.
Peer groups and global communications are not optimized.
First, chunked-message ordering must be resolved: single-chunk ordering is guaranteed by the protocol, even across different topics. Second, peer groups could be implemented with grouping requests that generate a new UUID per group. This would also reduce the load on the broker.
- Library: `grpcio`
- Options: compression disabled, protobuf disabled
- Parallelism: thread pool (n+1+? threads)
gRPC does not map well onto an asynchronous send & receive model: it expects remote procedure calls to be issued, processed, and responded to. To simulate this model, we created a bidirectional streaming procedure. Sent data is queued at the server; received data is polled until available.
This also means there is no eager client reception or eager server send, so polling is required.
Polling is implemented with an exponential backoff time and a limit. The gRPC library queues requests, so requests are always replied to in a timely manner, but we do not want to hog the CPU or the network with useless requests.
It is important not to hold the procedures indefinitely, since this could starve the server of threads. Additionally, if a streaming direction was already closed, messages could end up queued forever if the procedure is not restarted.
To alleviate network latency, queues are flushed unidirectionally in turns instead of interleaving directions. However, on high-throughput applications this can lead to a very bursty receive pattern.
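The polling strategy described above (exponential backoff with an upper limit) roughly corresponds to the sketch below; the helper, its constants, and the `poll` callable are illustrative, not the actual values or internals used by the library.

```python
import time

def poll_with_backoff(poll, initial=0.001, factor=2.0, limit=0.5):
    """Call poll() until it returns a message, sleeping with capped
    exponential backoff between empty polls. All constants are illustrative."""
    delay = initial
    while True:
        message = poll()
        if message is not None:
            return message
        time.sleep(delay)
        delay = min(delay * factor, limit)   # back off, but never beyond the limit
```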
Implement reconnection support. The protocol already supports it; server-side support is done, and clients can reconnect but cannot yet disconnect without flushing.
Implement two-way connection expiration and keep-alives. There is no reliable way to track connection drops across the communication implementations, and most approaches end up with memory leaks. If desired, expiration periods could be long and automatic client reconnection could be allowed, enabling MQTT-like reliability without the cost.
Implement message cancelling support. It is already plausible to cancel a message while it is queued but not yet buffered. However, changing the future from the pending state to running would require acquiring a lock.
The library has been partially supported by:
- Project PID2023-146569NB-C22 "Inteligencia sostenible en el Borde-UJI" funded by the Spanish Ministry of Science, Innovation and Universities.
- Project C121/23 Convenio "CIBERseguridad post-Cuántica para el Aprendizaje FEderado en procesadores de bajo consumo y aceleradores (CIBER-CAFE)" funded by the Spanish National Cybersecurity Institute (INCIBE).