GenBank Frequency Information

The current GenBank frequency data in our variant tables is derived from 61,134 human mitochondrial DNA sequences with size greater than 15.4 kbp. The sequences were collected from GenBank on Jan 16, 2024, aligned to rCRS using BLASTn and haplotyped using Haplogrep via the Mitomaster web service. A list of the sequence IDs in the current GenBank set may be downloaded from the Mitobank web page. A spreadsheet of all variants found and their frequencies in the current set of sequences is also available. These GenBank sequences have been pre-loaded into Mitomaster and represent almost all haplogroups known to date. We will continue to update this sequence set on a regular basis.

A set of short control region sequences (81,124 sequences as of Jan 16, 2024) has also been collected from GenBank and is included in the variant frequency counts in Mitomap and Mitomaster where indicated.

Update History

Caveats: We do not presently tally counts or frequencies of reference alleles (those positions identical to rCRS) or those of ambiguous nucleotides (R, Y, M, K, etc.). Indel calls in repetitive regions may not always match those of the original publications due to the different manners of indel reporting over the years (e.g., positioning at the beginning or end of a polytract or repeat, forward or backward reading of inserted or deleted bases). The sequences in this GenBank set have not been individually reviewed by Mitomap. Please also be aware that (1) GenBank sequences may not be of equal quality (Yao et al., 2009); (2) some sequences might be present in GenBank more than once under different IDs; (3) some sequences might be from clones or cell lines; (4) sequence collection is not evenly distributed across the continents; and (5) some of the GenBank sequences are derived from pathology samples or from diseased patients, presenting a somewhat biased sampling of the global mitochondriome.
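The counting rules in the caveats above can be illustrated with a minimal sketch. This is not Mitomap's actual pipeline: it assumes the sequences are already aligned to the reference, have the same length as the reference, and contain no indels. The function names `tally_variants` and `frequencies`, the toy reference, and the toy sequences are all hypothetical.

```python
from collections import Counter

# Unambiguous nucleotides; IUPAC ambiguity codes (R, Y, M, K, N, ...) are skipped.
UNAMBIGUOUS = set("ACGT")


def tally_variants(reference, aligned_sequences):
    """Count substitutions relative to the reference.

    Assumption for illustration only: every sequence is already aligned to
    the reference and has the same length (no indels).
    Returns a Counter keyed by (1-based position, ref_base, alt_base).
    """
    reference = reference.upper()
    counts = Counter()
    for seq in aligned_sequences:
        if len(seq) != len(reference):
            raise ValueError("sequence length differs from reference")
        for pos, (ref_base, base) in enumerate(zip(reference, seq.upper()), start=1):
            if base == ref_base:
                continue  # reference alleles are not tallied
            if base not in UNAMBIGUOUS:
                continue  # ambiguous nucleotides are not tallied
            counts[(pos, ref_base, base)] += 1
    return counts


def frequencies(counts, n_sequences):
    """Convert variant counts to fractions of the sequence set."""
    return {variant: count / n_sequences for variant, count in counts.items()}


# Toy data, not real mtDNA.
reference = "ACGTACGT"
sequences = ["ACGTACGT", "ACATACGT", "ACATACGR", "TCGTACGT"]

counts = tally_variants(reference, sequences)
freqs = frequencies(counts, len(sequences))
print(freqs[(3, "G", "A")])  # prints 0.5
```

In the toy data, the `R` at position 8 of the third sequence is ignored, so only two distinct variants are counted. Duplicate GenBank entries, as noted in caveat (2), would inflate such counts unless removed beforehand.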


Lineage distribution of 61,134 sequences from our current data set:

L Lineages ("African")

hg            #      %
L3        2,357    35%
L0        1,683    25%
L2        1,503    23%
L1          968    15%
L4          111     2%
L5           38     1%
L6           12     0%
Total     6,672   100%
Overall 11% (6,672 / 61,134)

M Lineages ("Asian")

hg            #      %
M         5,718    46%
D         2,762    22%
C         2,162    17%
G           630     5%
E           471     4%
Q           425     3%
Z           293     2%
Total    12,461   100%
Overall 20% (12,461 / 61,134)

N Lineages ("Eurasian")

hg            #      %
H        12,386    29%
U         5,685    14%
B         4,905    12%
J         3,062     7%
T         2,972     7%
K         2,249     5%
F         1,938     5%
A         1,755     4%
R         1,229     3%
HV        1,021     2%
I           942     2%
N           926     2%
V           909     2%
X           684     2%
W           660     2%
P           408     1%
Y           213     1%
S            49     0%
O             8     0%
Total    42,001   100%
Overall 69% (42,001 / 61,134)
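
The "Overall" percentages in the lineage table can be recomputed from the counts. The following is a minimal sketch, assuming the published percentages are rounded to the nearest whole number; the helper name `share` is hypothetical.

```python
TOTAL = 61_134  # sequences in the current full-length set

# Per-macro-lineage totals taken from the table above.
lineage_totals = {"L": 6_672, "M": 12_461, "N": 42_001}


def share(count, total):
    """Percentage of `total`, rounded to the nearest whole number."""
    return round(100 * count / total)


# The three lineage totals should account for every sequence in the set.
assert sum(lineage_totals.values()) == TOTAL

for lineage, count in lineage_totals.items():
    print(f"{lineage}: {share(count, TOTAL)}% ({count:,} / {TOTAL:,})")
# L: 11% (6,672 / 61,134)
# M: 20% (12,461 / 61,134)
# N: 69% (42,001 / 61,134)
```

The same helper reproduces the within-lineage column, for example `share(2_357, 6_672)` gives 35 for haplogroup L3.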
Topic revision: r24 - 01 Feb 2024, UnknownUser
