Militant Map Knowledge Graph

Stanford's Mapping Militants Project (MMP) in Neo4j

Data Model

The data is stored under a labeled property graph model.

Militant groups are represented by nodes labeled MilitantGroup and are connected to other groups through relationships, which hold the bulk of the data. Each edge represents a relation between two groups: an alliance, neutrality, or an enemy declaration.
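As a rough illustration of this model, the query below returns a handful of relations with the properties stored on each edge. It assumes the graph has already been loaded as described under "Creating the Knowledge Graph", and it uses the property names set by those load statements (name, relation_type, date).

```cypher
// Sample five relations to inspect the node and edge properties.
// Assumes the load steps in "Creating the Knowledge Graph" have been run.
MATCH (a:MilitantGroup)-[r:hasRelation]->(b:MilitantGroup)
RETURN a.name AS source,
       r.relation_type AS relationType,
       r.date AS date,
       b.name AS target
LIMIT 5
```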

Data

The data from MMP isn't in an analysis-ready format (invalid CSV formatting, duplicate data, typos) and requires extensive cleaning. The cleaned data files used by this project can be found in the data/ folder.

Creating the Knowledge Graph

Create a new Neo4j project with version 5.10.0.
Copy the data files from data/ to the Neo4j import directory.
Load the militant groups into the graph with:

LOAD CSV WITH HEADERS FROM 'file:///Militant Group Key.csv' AS row
CREATE (:MilitantGroup {name: row.name, group_id: row.group_id})

Add the relations between groups

LOAD CSV WITH HEADERS FROM 'file:///Militant-Group-Relationships-clean.csv' AS row
MATCH (group1:MilitantGroup {group_id:row.group1_id}), (group2:MilitantGroup {group_id:row.group2_id})
CREATE (group1)-[r:hasRelation]->(group2)
SET r.date=row.date, r.description=row.description, r.relation_type=row.type
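After both loads finish, a quick sanity check can confirm that the nodes and relations arrived as expected. The following is a minimal sketch. The counts depend on the cleaned files, so no expected values are given here.

```cypher
// Count the loaded groups.
MATCH (g:MilitantGroup)
RETURN count(g) AS groupCount;

// Count relations per relation type (the value copied from row.type).
MATCH (:MilitantGroup)-[r:hasRelation]->(:MilitantGroup)
RETURN r.relation_type AS relationType, count(r) AS relationCount
ORDER BY relationCount DESC;
```

If relationCount is lower than the number of rows in the relationships file, some group1_id or group2_id values probably did not match a loaded group_id.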

Network Analysis

A few interesting things can be seen from this small dataset.

Neo4j's graph data science library requires a graph projection of the existing graph. To create the projection,

CALL gds.graph.project('militant-projection', 'MilitantGroup', 'hasRelation')
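Projections live in memory and must be recreated after a database restart. Projecting under a name that already exists raises an error. The sketch below shows one way to inspect the projection and, only when rebuilding it, to drop it first.

```cypher
// List the projection and its size.
CALL gds.graph.list('militant-projection')
YIELD graphName, nodeCount, relationshipCount
RETURN graphName, nodeCount, relationshipCount;

// Only when rebuilding: drop the projection before projecting again.
// The second argument (false) avoids an error if the graph does not exist.
CALL gds.graph.drop('militant-projection', false)
YIELD graphName
RETURN graphName;
```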

Centrality

High centrality can be summarized as groups that have many relations (alliances or enemies) to others.

Low centrality groups on the other hand have fewer relations to others and thus may be less important to the larger terror topology.

Betweenness

In this context, betweenness is used to identify influential nodes based on how critical they are for connecting other nodes. Whether the connections are alliances or enemies is ignored in this analysis.

CALL gds.betweenness.stream('militant-projection') YIELD nodeId, score
RETURN gds.util.asNode(nodeId).name AS militantGroup, score
ORDER BY score DESC LIMIT 10;
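To put the top ten scores in context, it can help to look at the overall distribution of betweenness scores. The sketch below uses the stats mode of the same algorithm. Which summary values to return is a choice made here for illustration.

```cypher
// Summarize the betweenness score distribution across all groups.
CALL gds.betweenness.stats('militant-projection')
YIELD centralityDistribution
RETURN centralityDistribution.min AS minScore,
       centralityDistribution.mean AS meanScore,
       centralityDistribution.max AS maxScore
```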

Community Detection

The underlying goals of terrorist organizations vary from group to group, and groups with similar goals tend to be more connected. For example, the Irish National Liberation Army is connected to several groups concerned with the people of Ireland and shares no connection with those concerned with terror efforts in the Middle East.

Community detection helps make sense of coalitions and identify potential factions with similar underlying goals.

Local Clustering Coefficient

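The local clustering coefficient describes how likely it is that a group's neighbors are also connected to each other, and is used here to find "good" entry points to a community. Note that the query below streams degree scores (gds.degree.stream) rather than the coefficient itself. It was run on the entire dataset, which explains the skew toward Middle East-based groups.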

CALL gds.degree.stream('militant-projection') YIELD nodeId, score
WITH gds.util.asNode(nodeId) AS group, score
ORDER BY score DESC
WITH collect({name: group.name, score: score}) AS aggregations
RETURN aggregations
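To compute the local clustering coefficient itself rather than degree scores, GDS provides gds.localClusteringCoefficient.stream, which requires an undirected graph. The sketch below assumes a second projection. The name 'militant-projection-undirected' is made up for this example, and collapsing parallel relations with aggregation: 'SINGLE' is also a choice made here, not part of the original analysis.

```cypher
// Hypothetical second projection that treats hasRelation as undirected
// and collapses parallel relations between the same pair of groups.
CALL gds.graph.project(
  'militant-projection-undirected',
  'MilitantGroup',
  {hasRelation: {orientation: 'UNDIRECTED', aggregation: 'SINGLE'}}
);

// Rank groups by how interconnected their neighbors are.
CALL gds.localClusteringCoefficient.stream('militant-projection-undirected')
YIELD nodeId, localClusteringCoefficient
RETURN gds.util.asNode(nodeId).name AS name, localClusteringCoefficient
ORDER BY localClusteringCoefficient DESC
LIMIT 10;
```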

Louvain method

The Louvain method classifies nodes into detected communities.

CALL gds.louvain.stream('militant-projection')
YIELD nodeId, communityId, intermediateCommunityIds
RETURN gds.util.asNode(nodeId).name AS name, communityId
ORDER BY communityId

The image below shows successful detection of militant groups concerned with Ireland.

The image below shows successful detection of militant groups from the Soviet area.
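To inspect communities like these directly in query results, the members can be grouped by community. This is a minimal sketch on the same projection. Community IDs are assigned by the algorithm and may differ between runs.

```cypher
// Group the Louvain results by community and list the members of each.
CALL gds.louvain.stream('militant-projection')
YIELD nodeId, communityId
WITH communityId, collect(gds.util.asNode(nodeId).name) AS members
RETURN communityId, size(members) AS groupCount, members
ORDER BY groupCount DESC
```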

Highest connected groups

MATCH (militantGroup)-[r:hasRelation]->(b)
WITH militantGroup, count(r) AS relationCount
ORDER BY relationCount DESC LIMIT 10
RETURN militantGroup.name, relationCount
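The query above counts only outgoing relations. If the direction in which a relation was recorded should not matter, a variant that matches in either direction may be more appropriate. This is a sketch of that variant, not the query used to produce the results in this README.

```cypher
// Count relations in either direction by omitting the arrow in the pattern.
MATCH (militantGroup:MilitantGroup)-[r:hasRelation]-(:MilitantGroup)
WITH militantGroup, count(r) AS relationCount
ORDER BY relationCount DESC LIMIT 10
RETURN militantGroup.name AS militantGroup, relationCount
```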
