Skip to content

Latest commit

 

History

History
159 lines (97 loc) · 8.3 KB

README.md

File metadata and controls

159 lines (97 loc) · 8.3 KB

HydRand

This is a python implementation of the HydRand protocol for generating publicly-verifiable randomness in a distributed fashion. The corresponding research paper is accepted for the IEEE S&P conference and available online at https://doi.ieeecomputersociety.org/10.1109/SP40000.2020.00003. This work is a joint research project by my team members Aljosha Judmayer and Nicholas Stifter and myself, conducted at SBA Research and Vienna University of Technology under the supervision of Edgar Weippl.

Security notice

Currently, the open source implementation provided in this repository is a proof-of-concept implementation and mainly used for performance evaluation. While the main functionally, including signatures checks and verification procedures of the messages, have been fully implemented the code is still considered a prototype and NOT SAFE TO USE IN PRODUCTION. Please contact me or one of my team members at SBA research if you are interested in running HydRand in production.

Flowchart

(Somewhat simplified) figure of the internals of the Hydrand protocol, from the view of an individual Hydrand node.

throughput

Message Format

Overview of format of the messages exchanged by HydRand nodes.

throughput

Performance evaluation

We tested the performance of the protocol using Amazon Web Services (AWS) with up to 128 EC2 instances of type t2.micro (1GB of RAM, 1 virtual CPU core, and a 60-80 Mbit/s internet connection) in different globally distributed datacenters. We discuss our evaluation in the research paper https://eprint.iacr.org/2018/319. In the following we provide additional metrics obtained during our performance tests. The corresponding raw data is provided upon request (the collected files are bigger than the limit of this free git repository).

Throughput

The following figure shows the troughput of the HydRand protocol benchmark runs for different configurations. The round duration was experimentally derived. We tested the protocol in two settings: for the first setting we run the HydRand software as is and ensured that every single node was able to complete the protocol run successfully. For the second settings we deliberately stopped f nodes during the execution of the protocol. In this case all remaining node are able to finished the protocol run successfully.

throughput

Network bandwidth

The following figure shows the average bandwidth used by the HydRand nodes, again with and without simulated failures.

bandwidth

CPU utilization

It can be observed, that the limiting resource in our evaluation was the vCPU of the AWS instances. The following figure shows the overall CPU utilization in percent for different runs with the respective number of nodes on the x axis.

cpuruns

The following figure shows a normal run with 128 nodes.

cpuall

Memory utilization

The following figure shows the memory utilization of the AWS instances in MiB for different runs with the respective number of nodes on the x axis.

memruns

The following figure shows a normal run with 128 nodes.

memall

Network utilization

The following figures shows the sent data in Mbit/s for different runs with the respective number of nodes on the x axis.

sendruns

The following figure shows the detailed data received in a normal run with 128 nodes for every node. The bursts show that almost all nodes have been selected as leader and broadcasted a propose message.

sendall

The following figure shows the data received in a normal run with 128 nodes for every node.

recvall

Running HydRand on a single machine

To try HydRand and for development purposes it is useful to run the protocol on a single machine. In the following, we give additional instructions on running Hydrand for this purpose.

Dependencies

Ensure you have python3.7 and the package pyzmq installed.
You can install pyzmq using the command python3.7 -m pip install pyzmq.

Network configuration

HydRand automatically determines which network configuration to use. It uses the config/network/localhost.txt configuration file per default. Without modification of this file the ports 5000, 5001, 5002 ... are used by the nodes with id 0, 1, 2, ..., respectively. Ensure that these ports are not used by any other software running on the system or change the configuration file to use different ports. (The file config/network/amazon.txt is used if HydRand is executed on a Amazon EC2 instance, all nodes are listening on port 5000 per default).

Cryptographic Keys and Initial PVSS Shares

In the config/005, config/007, config/008 and config/016, we provide a pre generated set of keypairs used to sign and verfify the sent messages as well as the initial pvss shares for scenarios with 5, 7, 8 and 16 nodes.

If you want to test the software with a different number of nodes (e.g. 4) nodes you need to generate these files for your specifiy setting. For this purpose have a look at the file config.py, it contains a function which creates a sample configuration. From the root directory of the respositity you can invoke it direclty by executing
python3.7 -c "from hydrand.config import generate_sample_config; generate_sample_config(n=4, write_to_disk=True)".
It generates the folder config/004 with all required files.

Starting HydRand nodes

To test the protocol with e.g. 5 nodes locally, open up 5 terminal windows and switch the root directory of this reposity.

In terminal window A execute python3.7 -m hydrand 0 -n 5 --start-at "2019-04-17 09:09:50" --num-rounds 20
In terminal window B execute python3.7 -m hydrand 1 -n 5 --start-at "2019-04-17 09:09:50" --num-rounds 20
...
In terminal window E execute python3.7 -m hydrand 4 -n 5 --start-at "2019-04-17 09:09:50" --num-rounds 20

The first argument (i.e. 0, 1, ..., 4 in the example) specifies the HydRand node id (a zero-based index).
The parameter -n 5 tells all nodes to use a configuration with 5 nodes in total.

Finally, --start-at 2019-04-17 09:09:50 is used to specifiy the common time the protocol should start. This timestamp MUST BE SPECIFIED IN UTC TIME. Ensure that the timestamp is sufficiently far in the future so that you can start all 5 nodes before the start time is reached. (Failing to do so triggers a recovery of the unstarted nodes.)

--num-rounds 20 instructs the nodes to automatically shutdown after completing 20 protocol rounds.

You call also execute python3.7 -m hydrand --help to get additional inforamation on the available command line options.

Notes

Depencencies

For running the software, python3.7 and with the package pyzmq is required. For executing all test cases we additionally require the pytest and sympy packages. For developement we provide dependency files for the use with pipenv.

See also ./aws/setup-instance.sh for a script which installs all the required dependencies to run HydRand on an Amazon EC2 instance.

Configuration

Configuration files FOR TESTING ONLY can be generated for differnent number of nodes using the pyhton file ./hydrand/config.py. Notice that the script generates key files for all nodes and stores them in the ./config directory. It is NOT SECURE to use the script for setting up the protocol IN PRODUCTION.

In addition the scripts provided in the ./aws folder automatically generate a network configuration file.

Tesing locally

The available unit tests can be executed using pytest as a test runner.

For testing a high number of nodes on a single local machine the ulimit has to be increased, e.g. ulimit -n 4096.
See also /etc/security/limits.conf.

Acknowlegdements

I would like to express my very great appreciation to my co-authors Aljosha Judmayer and Nicholas Stifter for the excellent collabortion and support throughout the design and implementation of this project, a variety of critical discussions, and their valuable contributions to the paper.