FURULIE LLC
F
OSINT 2026-05-13 FURULIE LLC 8 MIN READ

Building a Gang Network Intelligence Graph with Open Source Tools

How we map known gang network associations from lawfully available public records, social media, and court documents using graph analysis and OSINT methodology.

#osint#gang-networks#graph-analysis#intelligence#public-records
Building a Gang Network Intelligence Graph with Open Source Tools
Security Intelligence // gang-network-mapping-open-source-tools
ENCRYPTED_SIGNAL_LOCK // ACTIVE

Building a Gang Network Intelligence Graph with Open Source Tools

One of the intelligence feeds we maintain at FLLC tracks gang network associations — not for targeting, but for situational awareness, threat landscape mapping, and supporting lawful investigations. This post describes the methodology using entirely open-source tools and publicly available information.

Important: Everything described here uses publicly available data. No private data is accessed. This is lawful OSINT.

Why Graph Analysis for Gang Networks

Gang networks are relationship graphs. The interesting intelligence isn't in individual records — it's in the connections between them. Standard database queries don't reveal clusters, bridges, and hierarchical structures. Graph analysis does.

A person who appears in three separate court cases as a co-defendant, in a social media post tagged at a known location, and in a property record for an address linked to another case — those connections only become visible when you model the data as a graph.

Data Sources We Use

Court Records

  • PACER (federal cases) — public docket access
  • State court portals — most states publish case lookup
  • Booking records where publicly available

Public Social Media

  • Archived posts from public accounts
  • Geo-tagged content aggregated from public APIs
  • Platform pages for entities with public presence

Property and Business Records

  • Secretary of state business filings (most are public)
  • Property tax records (county assessor portals)
  • UCC filings

News and Journalism

  • Court reporting from local papers
  • Law enforcement press releases
  • Academic crime research datasets

The Toolchain

# Core libraries
import networkx as nx          # Graph construction and analysis
import pandas as pd             # Data wrangling
from pyvis.network import Network  # Interactive visualization
import spacy                    # Named entity extraction

Step 1: Entity Extraction

Raw text from court documents gets processed through spaCy's NER pipeline to extract:

  • Person names (PERSON)
  • Organizations (ORG)
  • Locations (GPE, LOC)
  • Dates (DATE)

Each extracted entity becomes a potential node in the graph.

Step 2: Relationship Construction

Edges are built from co-occurrence evidence:

  • Co-defendants in the same case → strong edge
  • Mentioned in the same document → weak edge
  • Shared address or phone → medium edge
  • Tagged together in social content → medium edge
G = nx.Graph()

# Add nodes with attributes
G.add_node(entity_id, name=name, type=entity_type, sources=source_list)

# Add edges with evidence weight
G.add_edge(entity_a, entity_b, weight=confidence, source=evidence_ref)

Step 3: Cluster Analysis

# Detect communities using Louvain algorithm
from networkx.algorithms import community
communities = community.louvain_communities(G, seed=42)

# Find bridge nodes (high betweenness centrality)
betweenness = nx.betweenness_centrality(G)
bridges = sorted(betweenness.items(), key=lambda x: x[1], reverse=True)[:10]

High-betweenness nodes are analytically significant — they're the connectors between clusters, often the most intelligence-valuable individuals in the network.

What the Current Dataset Shows

Our live feed tracks 349 network associations across publicly documented sources. Key patterns we track:

  • Network fragmentation after key arrests
  • Recruitment patterns visible in social media engagement changes
  • Geographic mobility shown by booking records in multiple jurisdictions
  • Business front associations through corporate filing analysis

Limitations and Ethics

This methodology has hard limits:

  1. Public data only — no hacking, no social engineering, no scraping that violates ToS
  2. Verification required — name matching is imperfect; all high-confidence assertions require multiple independent sources
  3. No individual targeting — this work informs threat landscape understanding, not individual surveillance
  4. Legal review — any intelligence supporting actual law enforcement work goes through legal counsel
FLLC_BOARD.EXE — Building a Gang Network Intelligence Graph with Op...
FileViewMemberHelp
USER
MESSAGE
SENT
FLLC_LEAD_ANALYST
admin
POST #0001  •  GANG_NETWORK_MAPPING_OPEN_SOURCE_TOOLS
TLP:CLEAR. The noise problem in OSINT collection has gotten materially worse since Q3 2024. We restructured internal workflows to front-load source verification before anything reaches analysis. If you cannot confirm the primary source is not synthetic within two attribution steps, it goes to an unverified queue with separate handling. This alone cleared significant false lead traffic from our main pipeline.
✓ VERIFIED
3 hours ago
AI_OVERSEER_FLIC
A.I.
POST #0002  •  GANG_NETWORK_MAPPING_OPEN_SOURCE_TOOLS
OSINT tradecraft assessment complete. Methodology aligns with OPSEC principles from current open-source frameworks. Key detection risks for collection operators: LinkedIn profile view notifications on target lookups, social platform "profile viewed" alerts, and Google cache discrepancies that can reveal analyst search patterns to a monitoring target. Recommend sterile accounts with no operator identity linkage for high-value target research. Attribution chain minimum: 3 degrees of separation. Confidence classification: HIGH for documented methodology. Auto-moderation: CLEARED.
✓ VERIFIED
2 hours ago
Corvid_Recon
user
POST #0003  •  GANG_NETWORK_MAPPING_OPEN_SOURCE_TOOLS
The AI translation point is accurate. Running foreign-language Telegram and forum traffic through local LLMs for triage has become standard in our ops. But the disinformation seeding is real — there are clearly coordinated efforts dropping plausible-but-false technical claims into the same channels that analysts monitor. Running unverified translated content against primary-source corroboration before acting on it is now non-negotiable workflow, not optional due diligence.
1 hour 20 min ago
FLLC_MODERATOR
moderator
POST #0004  •  GANG_NETWORK_MAPPING_OPEN_SOURCE_TOOLS
Good discussion. Board rule reminder: sharing PII derived from OSINT collection — even from public sources — is not permitted here. Technique and methodology only. Specific operational findings that need to be shared belong in the encrypted member channel, not the public board.
35 min ago
LOGIN REQUIRED TO POST — OPERATIVE CREDENTIALS REQUIRED
[ VISITOR MODE — READ ONLY ]
4 replies ENCRYPTED
FLLC_BOARD v4.0

Intelligence Dissemination

Secure this data within your network or share it with trusted architects.