Murphy

Tree Position

Unique Mutations

The mutations unique to this man are summarized in the table below. Those with a '+' or '*' confidence level are considered by FamilyTreeDNA or FullGenomesCorp to be high quality SNPs/INDELs. For completeness, all other mutations of lesser confidence are included as well. These additional mutations may be useful for distinguishing between very closely related men.

Occasionally, some of the mutations listed here will be thought to be shared with other men in which case they might appear in upstream blocks on the tree. When this happens, the 'Blocks' field will indicate what block they appear in. Such a situation might arise with BigY men if the BED data suggests another man may be positive for a SNP, even though it doesn't appear in his VCF data. It might also happen if Chromo2 testing or Sanger sequencing of other men not on the tree show the SNP to be shared.

POS-REF-ALT (hg19)

POS-REF-ALT (hg38)

Blocks

Names

Region

McDonald BED

combBED

STR

FTDNA167501

16364133-CAT-C

14252253-CAT-C

+

15341741-T-C

13229860-T-C

Y

Y

+

17516985-T-G

15405105-T-G

Y

Y

+

17847709-C-T

15735829-C-T

Y

Y

+

18545747-A-G

16433867-A-G

Y

+

14072774-C-G

11952068-C-G

Y

Y

+

22825934-G-T

20664048-G-T

Y

Y

+

6913407-G-T

7045366-G-T

Y

Y

+

9132595-T-C

9294986-T-C

Y

+

14113569-C-A

11992863-C-A

CTS1896

**

22777196-G-A

20615310-G-A

CTS10784 M1779

**

27190007-A-T

25043860-A-T

P1_g3

**

22234335-T-C

20072449-T-C

DYZ19

**

13855640-A-G

11734934-A-G

**

13857537-AATCAG-A,C

11736831-AATCAG-A,C

***

13865852-A-C

11745146-A-C

***

13846299-A-C,G

11725593-A-C,G

***

13847041-G-A

11726335-G-A

***

13847075-C-G

11726369-C-G

***

13847327-T-A

11726621-T-A

***

13870085-C-G

11749379-C-G

***

13866295-C-G

11745589-C-G

***

13848059-C-A

11727353-C-A

***

13857545-G-C

11736839-G-C

***

13850387-G-A

11729681-G-A

***

13854407-G-A

11733701-G-A

***

13864445-C-G

11743739-C-G

***

13863759-GAACTGTGTAC-G

11743053-GAACTGTGTAC-G

***

13860681-C-T

11739975-C-T

***

13855226-C-G

11734520-C-G

***

13859444-GGACACAAAAT-G

11738738-GGACACAAAAT-G

***

13859737-G-A

11739031-G-A

***

13138873-C-T

10628359-C-T

***

13845541-A-AAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGG,G

11724835-A-AAATGGAATGGAATGGAATGGAATGGAATGGAATGGAATGG,G

8×AATGG

***

58972719-C-T

56826572-C-T

***

13867406-A-AG,T

11746700-A-AG,T

***

13817622-T-TGAATG

11696916-T-TGAATG

4×GAATG

***

13652115-A-ATGGAG

11496439-A-ATGGAG

***

28798927-GCGAAGTGGAGTGTATTGGAGTGGAC-G

26652780-GCGAAGTGGAGTGTATTGGAGTGGAC-G

***

19739340-AAAC-A

17627460-AAAC-A

P5_Prx

13×AAC

***

58981527-A-C

56835380-A-C

***

58976325-T-C

56830178-T-C

***

58976092-T-C

56829945-T-C

***

58972495-A-C

56826348-A-C

***

22231361-G-C

20069475-G-C

DYZ19

***

28818242-A-G

26672095-A-G

***

28800254-G-GTGGAA

26654107-G-GTGGAA

***

28787540-G-T

26641393-G-T

***

28787271-ATTCAATGGAG-A

26641124-ATTCAATGGAG-A

***

28786779-A-G

26640632-A-G

***

22510192-C-A

20348306-C-A

DYZ19

***

22289100-G-A

20127214-G-A

DYZ19

***

22284497-G-C

20122611-G-C

DYZ19

***

20802020-A-G

18640134-A-G

P4_Prx

***

13832441-A-G

11711735-A-G

***

13842097-A-G

11721391-A-G

***

13138886-T-C

10628372-T-C

***

13457568-T-A

11301892-T-A

***

13457563-T-A

11301887-T-A

***

13452522-T-A

11296846-T-A

***

13405232-A-T

11249556-A-T

***

13196115-C-T

11040439-C-T

***

13196095-C-T

11040419-C-T

***

13142070-G-C

10631556-G-C

***

13141831-T-C

10631317-T-C

***

13140609-T-A

10630095-T-A

***

13140223-TTGAAC-T,TTCCAT

10629709-TTGAAC-T,TTCCAT

***

13138871-C-T

10628357-C-T

***

13457596-T-C

11301920-T-C

FGC31580

***

13138870-C-G

10628356-C-G

***

13138178-C-T

10627664-C-T

***

9974223-T-TTTC

10136614-T-TTTC

***

9962121-T-C

10124512-T-C

***

13839825-G-A

11719119-G-A

***

13803101-T-G

11682395-T-G

***

22508107-T-G

20346221-T-G

DYZ19

***

13839023-G-A

11718317-G-A

***

58979122-T-A

56832975-T-A

***

28798978-T-A

26652831-T-A

***

13457587-T-C

11301911-T-C

***

13457611-A-C

11301935-A-C

***

13842068-T-G

11721362-T-G

***

13700810-C-A

11545134-C-A

***

13830299-T-G

11709593-T-G

***

13820944-G-A

11700238-G-A

***

13819945-GCATCT-G

11699239-GCATCT-G

***

13818313-C-G

11697607-C-G

***

13818289-C-A

11697583-C-A

***

13810168-CGAAAA-T

11689462-CGAAAA-T

***

13803062-C-G

11682356-C-G

***

13799471-G-T

11678765-G-T

***

13739663-G-C

11583987-G-C

***

13735006-C-G

11579330-C-G

***

13700809-A-G

11545133-A-G

***

13464359-T-G

11308683-T-G

***

13486727-A-T

11331051-A-T

***

13486724-T-G

11331048-T-G

***

13485821-G-T

11330145-G-T

***

13485820-T-A

11330144-T-A

***

13485778-T-G

11330102-T-G

***

13479698-A-C

11324022-A-C

***

13479678-A-AAGCC

11324002-A-AAGCC

***

13476344-A-T

11320668-A-T

***

13469197-CT-C

11313521-CT-C

***

13469031-G-A

11313355-G-A

***

13800987-A-G,T

11680281-A-G,T

***

In the table above, the meaning of the confidence field depends on whether the data comes from an FTDNA kit or an FGC kit. For FTDNA kits, + implies a "PASS" result with just one possible variant, * indicates a "PASS" but with multiple variants, ** indicates "REJECTED" with just a single variant, and *** indicates "REJECTED" with multiple possible variants. 'A*' are heterozygous variants not called by FTDNA, but still pulled from the VCF file. For FGC kits, + indicates over 99% likely genuine (95% for INDELs); * over 95% likely genuine (90% for INDELs); ** about 40% likely genuine; *** about 10% likely genuine. Manual entries read directly from a BAM file will be either + indicating positive, or * indicating that the data show a mixture of possible variants.

For the FTDNA kits, the BED data is encoded in the background color of the cells. Those cells with a white background have coverage, those with a grey background indicate no coverage in the BED file, and those with a pink background indicate the mutation is on the edge of a coverage region. These pink regions often indicate that the individual may be positive for a SNP even if there is no corresponding entry in the vcf file.

The McDonald BED column indicates whether or not the mutation is a SNP and falls in the BED region used by Dr. Iain McDonald in the age analysis he does for R-U106 men.

Uncertain Mutations

NGS tests don't always cover all of the Y chromosome and, even when they do, the results can be inconclusive. The following mutations don't have clear results and have been assumed to be positive or negative. For those mutations that are assumed to be positive, the assumed SNP/INDEL will likely be found on the tree in some upstream block. It is unclear if it should be placed upstream of your results or parallel to them. The situation is reversed for mutations assumed negative. Those SNPs/INDELs will likely be found on a parallel branch on the tree, and it is unclear if they should be positioned upstream. As more results come in, the ambiguity may resolve itself, or it may be necessary to consult the BAM file for your test (available from FamilyTreeDNA or FullGenomes Corp) or by direct SNP testing (Sanger Sequencing - YSEQ or FamilyTreeDNA).

SNPs/INDELs:

Haplotype Progression

The data below reveals the progression of changes in the haplotype. The bottom row is the haplotype for this man and above him are the inferred ancestral haplotypes for various upstream blocks. These inferred haplotypes are very much a work in progress, and any suggested modifications are appreciated. I think we can use our vast collection of haplotype data to refine these haplotypes. Mutations made from each upstream block are shown in sequence. The cell color is determined by the block in which the mutation took place.