HelpAutosomal Tools › Warthen Interactive Cluster

Warthen Interactive Cluster Gold

Overview

The Warthen Interactive Cluster (WIC) is an advanced clustering tool that groups your DNA matches based on their “In Common With” (ICW) relationships and displays the results in a fully interactive heatmap directly inside the DNAGedcom Client. Unlike other clustering tools that produce static HTML files, WIC lets you pan, zoom, and click the heatmap to explore your clusters in real time.

This is a Gold subscription feature and requires a DNAGedcom Gold subscription.

WIC offers two clustering algorithms (Threshold and Hierarchical), identifies superclusters that group related clusters together, and optionally overlays ancestor and chromosome data onto the heatmap. A set of detail tabs below the heatmap lets you drill into cluster members, shared ancestors, chromosome segments, and common ancestors from match family trees.

A*DNA, FTDNA, 23andMe, MyHeritage, and GEDmatch data are supported. WIC operates on data in your database (file import is not supported — use CLM for file-based clustering).

Prerequisites

Before running the Warthen Interactive Cluster, you must first gather data from your DNA testing service using the Gather tools in the Client.

Data Required? Purpose
MatchesRequiredThe list of DNA matches to cluster
ICW (In Common With)RequiredThe shared match relationships that drive the clustering algorithm
Chromosome SegmentsOptionalEnables the Chromosome Browser tab — click any heatmap cell to view shared DNA segments between two matches
Ancestors / TreesOptionalEnables the ancestor overlay (green dots on the heatmap) and the Common Ancestors and All Ancestors tabs

Only kits that have both match and ICW data will appear in the DNA Kit dropdown.

WIC Settings

Setting Purpose
Kit and Filters
Kit FilterPrefilter list of kits in the database
DNA KitSelects the target kit for clustering
cM RangeSelects matches based on total cM shared (default: 20–400)
Surname FilterFilters matches for matching surnames (comma-separated, optional)
Clustering MethodChooses the clustering algorithm: Threshold or Hierarchical
Display Options
Show SuperclustersGroups related clusters with green dashed boundary lines
Notes CharsNumber of characters from match notes to show in heatmap labels (0 = off)
Show AncestorsOverlays green dots where two matches share a common ancestor
Show Chromo BrowserLoads chromosome data for cell-click segment viewing
Exclude UnclusteredHides singleton matches for a cleaner view
Advanced
Inclusion ThresholdConnectivity requirement for Threshold method (Easy/Standard/Strict)
Max Cluster SizeDissolves clusters larger than this (0 = disabled)
Min cM to ClusterTwo-phase extension pipeline for weaker matches (0 = disabled)
Min Cluster SizeMinimum members to keep a cluster (default: 3)
Min ICW Shared cMMinimum cM for an ICW relationship to count (0 = disabled)

Setup Details

Kit Selection

Kit Filter

Typing a name or part of a name in this field will limit the kits listed in the DNA Kit dropdown to only those kits that have a name containing that input. This is especially helpful if you have many kits in your database.

DNA Kit

Select the DNA kit you want to cluster. The dropdown shows only kits that have both match and ICW data in the database. If a kit is not listed, you may need to gather ICW data for it first.

Filters

cM Range

Sets the upper and lower limit for total shared cM of matches to include in clustering. The default value is 20 to 400 cM.

  • Lowering the minimum (e.g., to 10 cM) will include more distant matches but increases processing time and may introduce noise.
  • Raising the maximum (e.g., to 3000 cM) will include closer relatives, which can sometimes dominate the heatmap.
  • Different ranges will yield different results, so running WIC at various ranges may offer more insight.

Surname Filter

Optional. Enter one or more surnames separated by commas. Only matches whose name contains at least one of these surnames will be included. Leave blank to include all matches.

Clustering Method

Choose the algorithm used to group matches into clusters:

Threshold (default) — Uses the Collins-Leeds Method. Iteratively prunes matches that don’t meet the connectivity threshold. Often produces fewer, more cohesive clusters.

Hierarchical — Builds a dendrogram by merging the most similar matches first. Produces detailed cluster boundaries. Also supports the Min cM to Cluster extension pipeline.

Display Options

Show Superclusters

Default: On. Groups related primary clusters that share ICW connections. Shown as green dashed boundary lines on the heatmap. Superclusters often represent broader family branches.

Notes Chars

Default: 0 (off). Sets the number of characters from match notes to display before the match name in heatmap labels. Set to 1–10 to show annotations.

Show Ancestors

Default: On (when ancestor data is available). Overlays green dots on cells where two matches share a common ancestor in their family trees.

Show Chromo Browser

Default: On (when chromosome data is available). Click any heatmap cell to see shared DNA segments. Enable “Show Entire Cluster” to see all segments among cluster members.

Exclude Unclustered Matches

Default: On. Removes singletons from the heatmap. Turn off to see all matches including unclustered ones.

Advanced Settings

Inclusion Threshold (Threshold method only)

Threshold Ratio Description
Easy1/3A match stays if it shares ICW with at least 1/3 of the cluster. Larger clusters.
Standard1/2Must share ICW with at least half the cluster. Balanced default.
Strict2/3Must share ICW with at least 2/3 of the cluster. Smaller, tighter clusters.

Max Cluster Size

Default: 0 (disabled). Clusters larger than this are dissolved into singletons. Useful for endogamous populations.

Min cM to Cluster (Hierarchical only)

Default: 0 (disabled). Enables a two-phase extension pipeline:

Phase 1: Build clusters from strong matches in the cM Range.

Phase 2: Place weaker matches into the nearest existing cluster.

Example: cM Range 20–400 with this set to 10 clusters 20+ cM first, then places 10–20 cM matches.

Min Cluster Size

Default: 3. Clusters with fewer members are dissolved. Increase to eliminate low-confidence clusters.

Min ICW Shared cM

Default: 0 (disabled). Minimum cM for an ICW relationship to count. Higher values produce tighter clusters.

Running the Cluster

  1. Select your DNA Kit from the dropdown.
  2. Adjust the cM Range and other settings.
  3. Click Run Clustering.
  4. Progress bar and status messages show progress.
  5. When complete, the heatmap appears in the main panel.

To cancel, click Cancel.

Reading the Heatmap

The heatmap is a correlation matrix where each row and column represents a DNA match.

Heatmap Colors

  • Blue (diagonal) — Self match
  • Red gradient — Direct ICW relationship. Deeper = stronger
  • Gray gradient — Indirect connections through intermediaries
  • White — No correlation

Cluster Boundaries

  • Black solid lines — Primary cluster boundaries
  • Green dashed lines — Supercluster boundaries

Ancestor Overlay

Green dots appear where two matches share a common ancestor.

Interactive Features

Heatmap Navigation

  • Drag to pan
  • Scroll wheel to zoom
  • Click a cell for details
  • Fit to View resets zoom
  • Pop Out Heatmap opens in separate window

Cluster Navigator

Left panel lists all clusters. Click to zoom to that cluster.

Hover Info

Hover over any cell to see match names and correlation strength.

Detail Panels

Cluster Members Tab

Column Description
ClusterCluster number
NameMatch name
cMTotal shared DNA in centimorgans
SegNumber of shared segments
LongestLongest shared segment in cM
TreeTree type
SizePeople in match’s tree
StarredStarred/favorited
HintA* hint indicator
CorrelatedCorrelated match count
NotesMatch notes

Shared Ancestors Tab

Lists ancestors across match data: surname, given name, clusters, match count.

Chromosome Browser Tab

Click a heatmap cell to view shared DNA segments. Enable “Show Entire Cluster” for all cluster segments. Requires chromosome data.

Common Ancestors Tab

Shows ancestors from match family trees across the selected cluster. Columns: Ancestor, Years, MRCA, Shared By.

All Ancestors Tab

Comprehensive view across all clusters. Columns: Surname, Given, Years, Clusters, Matches, Source.

Export

After clustering:

  • HTML — Static HTML file with heatmap
  • Excel — Excel file with cluster details
  • Open Folder — Opens output folder

Tips and Strategies

  • Start with defaults (Threshold, Standard, 20–400 cM).
  • Try both clustering methods.
  • Narrow the cM range for large match sets.
  • Use surname filters for specific branches.
  • Enable ancestors and chromosome data for richest analysis.
  • Use Cluster Navigator to jump between clusters.
  • Pop out the heatmap for a larger view.
  • Compare with CLM and Shared Clustering.

Important Notes

  • Clusters suggest but do not prove a common ancestor.
  • A match in a cluster doesn’t mean all members share the same ancestor.
  • Very close relatives may connect to many clusters.
  • Superclusters represent broader, less certain relationships.
  • Cluster colors are arbitrary and cycle through a palette.