Skip to content

Kostia-Zuev/Critical-Infrastructure-Networks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

74 Commits
 
 
 
 
 
 

Repository files navigation

Critical Infrastructure Networks

Network data collection

DOI

Table of Contents

Data Format

Each network is available in three different formats: .mat, .cvs, and .txt.

  • .mat file contains at least three variables:
    • A is a sparse n-by-n adjacency matrix, where n is the number of nodes (vertices). If a network is undirected and unweighted, then A is symmetric, A(i,j)=1 if there is a link (edge) between nodes i and j, and A(i,j)=0 otherwise. If a network is directed, then A is not necessarily symmetric and A(i,j)=1 if there is a link from i to j (and zero otherwise). If a network is weighted, then A(i,j) is the weight of the link from i to j (a zero weight means no link).
    • E is an edge list. If a network is undirected and unweighted, then E is m-by-2, where m is the number of links and a row [i,j] represents a link between nodes i and j. If a network is directed, then a row [i,j] represents a link from i to j. If a network is weighted, then E is m-by-3, where the 3rd column contains weights.
    • G is a MATLAB graph object representing the network.
  • .cvs and .txt files contain an edge list, which is the same as E above.

Power Grids

1. Western States Power Grid of the United States

  • Path: Critical-Infrastructure-Networks/power-grids/us-western-states/

  • Description: An undirected, unweighted network representing the topology of the Western States Power Grid of the United States. Nodes represent transformers, substations, and generators. Links represent high-voltage transmission lines.

  • Remark: The transmission lines can be directed and weighted (where weight describe their capacity), but this information is not available.

  • Data: Adjacency matrix A, edge list E, and MATLAB graph object G, as described in Data Format.

  • Data Source: Originally compiled by D. Watts and S. Strogatz and used in D.J. Watts and S.H. Strogatz (1998) "Collective dynamics of “small-world” networks," Nature 393, 440-442. Availalbe online at: https://toreopsahl.com/datasets/#uspowergrid

  • Basic Network Properties

Property Value
Number of nodes 4941
Number of links 6594
Density 5.403e-04
Maximum degree 19
Average degree 2.669
Average clustering coefficient 0.106
Number of connected components 1
Diameter 46
Average shortest path length 18.989

alt tag

2. North American Power Grid

  • Path: Critical-Infrastructure-Networks/power-grids/north-american/

  • Description: An undirected, unweighted network representing the topology and geography of the North American Power Grid. Nodes represent transformers, substations, and generators. Links represent high-voltage transmission lines.

  • Data: In addition to the adjacency matrix A, edge list E, and MATLAB graph object G (see Data Format), file powergridNA.mat contains an n-by-3 matrix V describing the geographical coordinates of network nodes: V(i,1)=i is the node ID, V(i,2) is the latitude of node i, and V(i,3) is its longitude. Files powergridNA_E and powergridNA_V contain E and V in both .cvs and .txt formats.

  • Data Source: The network was extracted on March 14, 2016 uing GridKit, a power grid extraction toolkit.

  • Basic Network Properties

Property Value
Number of nodes 14990
Number of links 18804
Density 1.674e-04
Maximum degree 22
Average degree 2.509
Average clustering coefficient 0.103
Number of connected components 1
Diameter 138
Average shortest path length 47.504

alt tag alt tag

3. European Power Grid

  • Path: Critical-Infrastructure-Networks/power-grids/european/

  • Description: An undirected, unweighted network representing the topology and geography of the European Power Grid. Nodes represent transformers, substations, and generators. Links represent high-voltage transmission lines.

  • Data: In addition to the adjacency matrix A, edge list E, and MATLAB graph object G (see Data Format), file powergridEU.mat contains an n-by-3 matrix V describing the geographical coordinates of network nodes: V(i,1)=i is the node ID, V(i,2) is the latitude of node i, and V(i,3) is its longitude. Files powergridEU_E and powergridEU_V contain E and V in both .cvs and .txt formats.

  • Data Source: The network was extracted on March 14, 2016 uing GridKit, a power grid extraction toolkit.

  • Basic Network Properties

Property Value
Number of nodes 13478
Number of links 16922
Density 1.8632e-04
Maximum degree 16
Average degree 2.511
Average clustering coefficient 0.111
Number of connected components 1
Diameter 147
Average shortest path length 49.505

alt tag alt tag

Citing CIN repository

If you find this network data collection useful in your work, I encourage you to cite the repository. You can use the following BibTeX citation:

@misc{KZ_CIN_2017_583631,
  author       = {Konstantin M. Zuev},
  title        = {Kostia-Zuev/Critical-Infrastructure-Networks v1.3},
  month        = may,
  year         = 2017,
  doi          = {10.5281/zenodo.583631},
  url          = {https://doi.org/10.5281/zenodo.583631}
}