The Collins Leeds Method (CLM) Tool uses a grid (often called a matrix) to show your matches arrayed in groups (clusters). Membership in clusters is based on matches sharing other matches in common (ICW) with your selected kit. Cluster members share a common line of descent within a few generations, often even the same ancestors. You may optionally show ICW matches that do not meet the criteria you have set for inclusion in clusters.
A*DNA, FTDNA, 23andMe, MyHeritage, and GEDmatch data are supported. CLM can operate on the database or CSV files output by the Client.
| Setting | Purpose |
|---|---|
| Kit and Match Selection | |
| Kit Filter | Prefilter list of kits in the database |
| DNA Kit | Selects the target kit for clustering |
| Match/ICW files | Sets the kit data source to previously output CSV files |
| cM Range | Selects matches based on total cM shared (default: 50–400) |
| Inclusion Threshold | Selects matches based on criteria for cluster membership |
| Surname List | Filters matches for matching surnames |
| Sort Order | |
| Matches | Determines how matches appear within clusters |
| Clusters | Determines the order clusters are arrayed |
| Show | |
| Unclustered matches | Allows other relationships for cluster members to be shown |
| Painted Midline | Creates a diagonal midline to make match pairs easier to locate |
| Include Chromosome | Shows chromosome segment data in cluster output |
| Include Ancestors | Shows ancestor information in cluster output |
| Open HTML | Automatically opens the saved HTML output on completion |
Typing a name or part of a name in this field will limit the kits listed in the DNA Kit dropdown to only those kits that have a name containing that input. For example, to find all kits for John Collins, you may type the full name or just “John”. After you select a kit, there are two methods for clustering: DNA Kit vs Match File/ICW File.
The DNA Kit dropdown lists all the kits in your database previously gathered with the DNAGedcom Client. You can cluster any A*DNA, FTDNA, 23andMe, MyHeritage, or GEDmatch kit in your database. Alternatively, you can cluster a set of Match/ICW CSV output files.
Kits in the database can be distinguished from files:
A file selection form will appear automatically when you select an available Match/ICW File option.
Using a Match and ICW file instead of database has advantages:
Because the Client overwrites output files in its default folder, if you want to cluster different versions of Match/ICW CSV files for the same kit, you will need to store them in a separate directory.
Sets the upper and lower limit for total cM. Default: 50 to 400 cM. Try moving it up or down to target matches in certain relationship ranges. The wider the range the longer clustering takes. Different ranges yield different results, so running at various ranges may offer more insight. Output files include the range values in file names.
Filter matches for matching surname(s) by inserting whole or partial surname strings separated by commas.
Determines how many people within a set of matches a person must match to be included.
| Threshold | Ratio | Description |
|---|---|---|
| Easy | 1/3 | Match stays with 1/3 ICW. Larger clusters. |
| Standard | 1/2 | Must match at least half. Balanced default. |
| Strict | 2/3 | Must match 2/3. Smaller, tighter clusters. |
Default is 1/2. Try 2/3 for tighter clusters.
Order of matches within each cluster:
Two types of clusters: full clusters (solid color square) and superclusters (a set of full clusters sharing cross-cluster matches).
Options:
Default: On. Shows matches that don’t fit any cluster. Helpful because cross-cluster matches are still drawn. Density of unclustered matches can indicate endogamy or pedigree collapse.
Default: On. Self-matches painted black. Helpful when reordering clusters in Excel.
Default: On. Shows chromosome segment data. Click a cluster in HTML to see Chromosome Browser details (excluding A*).
Default: On. Shows ancestor information. Small green leaf icons appear where matches share common ancestors.
Default: On. Auto-opens HTML in default browser.
CLM generates two output files: HTML (primary, auto-displayed in browser) and Excel, both saved in the database folder.
HTML maps are drawn with filled-in squares, each representing a match between two people. Mousing over any colored square shows the kit names.
Three types of color-coded squares (four with diagonal):
Squares with a small green leaf indicate ancestors in common.
Clusters are surrounded by a black line. Membership is based on ICW, indicating shared descent.
Superclusters are surrounded by a dark grey line. They show related clusters sharing the same or closely related lines of descent.
While CLM cluster maps provide valuable clues to shared descent, they are not proof per se of specific ancestors.
Click a cluster (with internet connection) for a popup with cluster members or Chromosome Browser details.
Following the horizontal line to the leftmost columns shows:
The Excel Report has three worksheets: Chart, Data, and Ancestors tabs.
Same clusters as HTML but in Excel. Dark black border around clusters, gray border around Superclusters. Non-cluster matches shown as gray boxes.
Important notes:
List of clustered matches: Cluster number, Name, cM, Match Page link, Tree page link.
List of specific ancestors appearing in trees of matches associated with clusters.