introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
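The column descriptions suggest two simple consistency checks between fields of a single row. The Python sketch below is a minimal illustration, not part of the database: it assumes start and end are inclusive coordinates (so length = end - start + 1, which agrees with the rows listed for transcript 3555687), and it assumes the scored_motifs string is laid out as upstream exon bases, then the intron motifs, then downstream exon bases, separated by `|`. That layout is inferred from the rows on this page rather than documented. The helper names `span_length` and `terminal_dinucleotides` are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Intron length in base pairs, assuming inclusive start/end coordinates."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return a pair such as 'GT-AG' from a scored_motifs string.

    Assumed layout (inferred from the rows on this page, not documented):
    <upstream exon>|<intron 5' end>...<intron 3' end>|<downstream exon>
    """
    _, intron, _ = scored_motifs.split("|")
    # First two and last two bases of the intron portion.
    return f"{intron[:2]}-{intron[-2:]}"


# Values copied from the row with id 17676638 (the only GC-AG intron listed).
row = {
    "dinucleotide_pair": "GC-AG",
    "length": 2063,
    "start": 2013274,
    "end": 2015336,
    "scored_motifs": "CAG|GCAAGCTTCA...TTTTTCTTGTCA/TCATATTTCATA...TTCAG|TTA",
}

length_ok = span_length(row["start"], row["end"]) == row["length"]
pair_ok = terminal_dinucleotides(row["scored_motifs"]) == row["dinucleotide_pair"]
print(length_ok, pair_ok)
```

If either check fails on other rows, the assumptions about coordinate convention or motif layout do not hold for them and should be revisited.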
37 rows where transcript_id = 3555687
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17676604 | GT-AG | 0 | 2.85567327731786e-05 | 801 | rna-XM_038338010.1 3555687 | 2 | 1939538 | 1940338 | Arvicola amphibius 1047088 | AAA\|GTAAGTGTGC...CTGTCCTTATTA/CATATACTCACA...TGTAG\|CTT | 2 | 1 | 4.494 |
| 17676605 | GT-AG | 0 | 1.000000099473604e-05 | 1100 | rna-XM_038338010.1 3555687 | 3 | 1940452 | 1941551 | Arvicola amphibius 1047088 | TAG\|GTAAATTATC...TATTTCTTTTTT/TAATAATTAATA...TGTAG\|GCC | 1 | 1 | 6.971 |
| 17676606 | GT-AG | 0 | 3.740262397650095e-05 | 6363 | rna-XM_038338010.1 3555687 | 4 | 1941859 | 1948221 | Arvicola amphibius 1047088 | AAG\|GTAACTAATA...GGTATTTTTTTT/TAGTTTGTAAGC...TCCAG\|GTT | 2 | 1 | 13.7 |
| 17676607 | GT-AG | 0 | 1.000000099473604e-05 | 249 | rna-XM_038338010.1 3555687 | 5 | 1948485 | 1948733 | Arvicola amphibius 1047088 | AAG\|GTAATTGCTT...ATATCCATATAA/AGTTGTTTTATG...TTTAG\|GTA | 1 | 1 | 19.465 |
| 17676608 | GT-AG | 1 | 99.99909147180684 | 5629 | rna-XM_038338010.1 3555687 | 6 | 1948782 | 1954410 | Arvicola amphibius 1047088 | TCC\|GTATCCTTTA...AACTCCTTAATG/AAATATTTAACT...CTAAG\|CGG | 1 | 1 | 20.517 |
| 17676609 | GT-AG | 0 | 5.368212403111467e-05 | 426 | rna-XM_038338010.1 3555687 | 7 | 1954482 | 1954907 | Arvicola amphibius 1047088 | GAT\|GTAAGTATTT...AATCTGTTGACT/TATGTATTAATA...CTTAG\|ATT | 0 | 1 | 22.074 |
| 17676610 | GT-AG | 0 | 1.000000099473604e-05 | 1178 | rna-XM_038338010.1 3555687 | 8 | 1955041 | 1956218 | Arvicola amphibius 1047088 | ACA\|GTGAGTATAT...ACAATTTTAAAA/CTGGTGTTCAAA...TTTAG\|TGG | 1 | 1 | 24.989 |
| 17676611 | GT-AG | 0 | 0.0003807505651129 | 688 | rna-XM_038338010.1 3555687 | 9 | 1956371 | 1957058 | Arvicola amphibius 1047088 | CCA\|GTAAGTATTA...AATTCCTTATCA/TAATTCCTTATC...TGTAG\|GAG | 0 | 1 | 28.321 |
| 17676612 | GT-AG | 0 | 1.000000099473604e-05 | 213 | rna-XM_038338010.1 3555687 | 10 | 1957137 | 1957349 | Arvicola amphibius 1047088 | GAG\|GTAGTGAACG...TCCCTCTTGATT/TTTGGTTTTATA...CCTAG\|GTA | 0 | 1 | 30.031 |
| 17676613 | GT-AG | 0 | 1.4077453874825142e-05 | 6764 | rna-XM_038338010.1 3555687 | 11 | 1957506 | 1964269 | Arvicola amphibius 1047088 | GAC\|GTAAGTCACA...GCTTCCTTTTTT/TAGAACGTAATG...TCCAG\|ATC | 0 | 1 | 33.45 |
| 17676614 | GT-AG | 0 | 0.0001923553767356 | 85 | rna-XM_038338010.1 3555687 | 12 | 1964471 | 1964555 | Arvicola amphibius 1047088 | GTG\|GTATGGATTA...CAGTGTTTAACC/TTTAACCTGATT...TCCAG\|TTT | 0 | 1 | 37.856 |
| 17676615 | GT-AG | 0 | 1.000000099473604e-05 | 2338 | rna-XM_038338010.1 3555687 | 13 | 1964645 | 1966982 | Arvicola amphibius 1047088 | CAG\|GTTTGTCCTT...AATTTATTAATT/AATTTATTAATT...AAAAG\|GTT | 2 | 1 | 39.807 |
| 17676616 | GT-AG | 0 | 1.000000099473604e-05 | 621 | rna-XM_038338010.1 3555687 | 14 | 1967027 | 1967647 | Arvicola amphibius 1047088 | AAG\|GTAGGTTTAA...GGTGTGTTCCTG/CAGTGGCTGACA...TGCAG\|ATG | 1 | 1 | 40.772 |
| 17676617 | GT-AG | 0 | 0.0002718570051599 | 355 | rna-XM_038338010.1 3555687 | 15 | 1967750 | 1968104 | Arvicola amphibius 1047088 | TCT\|GTAAGTAAAA...TATCCCTTGATT/TATCCCTTGATT...GATAG\|TTA | 1 | 1 | 43.007 |
| 17676618 | GT-AG | 0 | 1.000000099473604e-05 | 1231 | rna-XM_038338010.1 3555687 | 16 | 1968251 | 1969481 | Arvicola amphibius 1047088 | CAG\|GTGAGGGGTG...TGTGCTTTGGCT/TCTTTGCTGACC...TGCAG\|TTT | 0 | 1 | 46.208 |
| 17676619 | GT-AG | 0 | 1.841046722066921e-05 | 677 | rna-XM_038338010.1 3555687 | 17 | 1969577 | 1970253 | Arvicola amphibius 1047088 | CAG\|GTAACAAAGA...ATGTTTTTGAAA/TTGAAATTCACA...ATTAG\|TTT | 2 | 1 | 48.29 |
| 17676620 | GT-AG | 0 | 0.0004632359371858 | 3418 | rna-XM_038338010.1 3555687 | 18 | 1970388 | 1973805 | Arvicola amphibius 1047088 | ACG\|GTTTGTTACT...TATTTTTTAAAA/TATTTTTTAAAA...TGCAG\|GTT | 1 | 1 | 51.228 |
| 17676621 | GT-AG | 0 | 1.000000099473604e-05 | 1103 | rna-XM_038338010.1 3555687 | 19 | 1973854 | 1974956 | Arvicola amphibius 1047088 | AAG\|GTTAGATATA...TCTTCCTTATTT/GTCTTCCTTATT...AACAG\|AAT | 1 | 1 | 52.28 |
| 17676622 | GT-AG | 0 | 0.0077318785671184 | 1280 | rna-XM_038338010.1 3555687 | 20 | 1975038 | 1976317 | Arvicola amphibius 1047088 | CAG\|GTATTTTTCC...TCTGTCTTTCCT/AACTTGCTCAGT...AACAG\|AGG | 1 | 1 | 54.055 |
| 17676623 | GT-AG | 0 | 0.0015167012179171 | 1291 | rna-XM_038338010.1 3555687 | 21 | 1976410 | 1977700 | Arvicola amphibius 1047088 | TTA\|GTAAGTTTTC...ATTTGCTTAAAC/AATTTGCTTAAA...CATAG\|ATT | 0 | 1 | 56.072 |
| 17676624 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_038338010.1 3555687 | 22 | 1977808 | 1977890 | Arvicola amphibius 1047088 | CAG\|GTACGAAGTT...TTCTATTTAGTT/TGTGTTTTTATT...TGCAG\|ATT | 2 | 1 | 58.417 |
| 17676625 | GT-AG | 0 | 1.000000099473604e-05 | 4866 | rna-XM_038338010.1 3555687 | 23 | 1977939 | 1982804 | Arvicola amphibius 1047088 | TTG\|GTAATGAAAA...TGACCTTTCACA/TGACCTTTCACA...CCCAG\|TCT | 2 | 1 | 59.47 |
| 17676626 | GT-AG | 0 | 0.0217638971702275 | 79 | rna-XM_038338010.1 3555687 | 24 | 1982903 | 1982981 | Arvicola amphibius 1047088 | GAT\|GTATGTTCGA...TTTTCCTCAGTA/ATTTTCCTCAGT...TTTAG\|GGT | 1 | 1 | 61.618 |
| 17676627 | GT-AG | 0 | 1.000000099473604e-05 | 5242 | rna-XM_038338010.1 3555687 | 25 | 1983114 | 1988355 | Arvicola amphibius 1047088 | TCG\|GTAGGTACTC...CACTCTTTGGTT/TTCTATTTCATT...CACAG\|GCA | 1 | 1 | 64.511 |
| 17676628 | GT-AG | 0 | 0.0003325408125829 | 539 | rna-XM_038338010.1 3555687 | 26 | 1988442 | 1988980 | Arvicola amphibius 1047088 | CTG\|GTATGTAGTA...AAAGTTTTAAGT/AAAGTTTTAAGT...TTTAG\|ATT | 0 | 1 | 66.396 |
| 17676629 | GT-AG | 0 | 1.000000099473604e-05 | 126 | rna-XM_038338010.1 3555687 | 27 | 1989065 | 1989190 | Arvicola amphibius 1047088 | CAG\|GTGAATATTC...GTTTTTTTATAC/AGTTTTTTTATA...TGAAG\|ATT | 0 | 1 | 68.238 |
| 17676630 | GT-AG | 0 | 1.000000099473604e-05 | 1407 | rna-XM_038338010.1 3555687 | 28 | 1989349 | 1990755 | Arvicola amphibius 1047088 | TAT\|GTGAGTGATT...TTTACATTACTG/GATTAGTTCATT...TTCAG\|GGA | 2 | 1 | 71.701 |
| 17676631 | GT-AG | 0 | 1.000000099473604e-05 | 408 | rna-XM_038338010.1 3555687 | 29 | 1990860 | 1991267 | Arvicola amphibius 1047088 | ATG\|GTAAGTTAAT...ATGATTTTAATA/ATGATTTTAATA...TCTAG\|TTG | 1 | 1 | 73.981 |
| 17676632 | GT-AG | 0 | 1.000000099473604e-05 | 8840 | rna-XM_038338010.1 3555687 | 30 | 1991415 | 2000254 | Arvicola amphibius 1047088 | AAG\|GTAAGCAGCA...TTTTTCTCACAT/ATTTTTCTCACA...TTTAG\|GAA | 1 | 1 | 77.203 |
| 17676633 | GT-AG | 0 | 1.1705484586233982e-05 | 574 | rna-XM_038338010.1 3555687 | 31 | 2000336 | 2000909 | Arvicola amphibius 1047088 | GCT\|GTGAGTTCTG...CTTACTTTAAAA/CTTACTTTAAAA...AATAG\|GTA | 1 | 1 | 78.979 |
| 17676634 | GT-AG | 0 | 1.000000099473604e-05 | 2407 | rna-XM_038338010.1 3555687 | 32 | 2001026 | 2003432 | Arvicola amphibius 1047088 | AAG\|GTGAGCTCTA...ATTTTTTTAATC/ATTTTTTTAATC...AACAG\|ATA | 0 | 1 | 81.521 |
| 17676635 | GT-AG | 0 | 1.000000099473604e-05 | 2034 | rna-XM_038338010.1 3555687 | 33 | 2003513 | 2005546 | Arvicola amphibius 1047088 | AAG\|GTACAAAGCC...TTTTGCTTATCA/GTTTTGCTTATC...TAAAG\|GTC | 2 | 1 | 83.275 |
| 17676636 | GT-AG | 0 | 3.872290079246615e-05 | 5135 | rna-XM_038338010.1 3555687 | 34 | 2005651 | 2010785 | Arvicola amphibius 1047088 | AAG\|GTAGATTACT...TGTGGTTTAACA/TGTGGTTTAACA...AACAG\|ATT | 1 | 1 | 85.555 |
| 17676637 | GT-AG | 0 | 1.000000099473604e-05 | 2163 | rna-XM_038338010.1 3555687 | 35 | 2011020 | 2013182 | Arvicola amphibius 1047088 | CAG\|GTAAAACCAC...ATTGTCTTTCTA/TGTCAACTCACA...CGTAG\|GAT | 1 | 1 | 90.684 |
| 17676638 | GC-AG | 0 | 1.000000099473604e-05 | 2063 | rna-XM_038338010.1 3555687 | 36 | 2013274 | 2015336 | Arvicola amphibius 1047088 | CAG\|GCAAGCTTCA...TTTTTCTTGTCA/TCATATTTCATA...TTCAG\|TTA | 2 | 1 | 92.679 |
| 17676639 | GT-AG | 0 | 3.417319740367804e-05 | 93 | rna-XM_038338010.1 3555687 | 37 | 2015449 | 2015541 | Arvicola amphibius 1047088 | CTT\|GTAAGTCATG...GAATGCTTAACT/TTGTTTGTTACA...CATAG\|GTC | 0 | 1 | 95.134 |
| 17676640 | GT-AG | 0 | 1.000000099473604e-05 | 1132 | rna-XM_038338010.1 3555687 | 38 | 2015694 | 2016825 | Arvicola amphibius 1047088 | CAG\|GTAAGAACTC...ATCATTTTAACA/ATCATTTTAACA...TCCAG\|TAA | 2 | 1 | 98.466 |
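The listing above corresponds to a filter on transcript_id, and only one of its rows is flagged as a minor intron. As a rough illustration of how such a filter could be expressed, the Python sketch below loads three rows copied from the table into an in-memory SQLite database and selects the minor introns of transcript 3555687. The reduced column set and the in-memory database are assumptions made to keep the example self-contained; the real database file and its location are not given on this page.

```python
import sqlite3

# In-memory stand-in for the real database (assumption: reduced column set).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)

# Three rows copied from the table above (ids 17676607 to 17676609).
sample_rows = [
    (17676607, 0, 1.000000099473604e-05, 249, 3555687, 5),
    (17676608, 1, 99.99909147180684, 5629, 3555687, 6),
    (17676609, 0, 5.368212403111467e-05, 426, 3555687, 7),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample_rows)

# Minor introns of one transcript, in transcript order.
minor = conn.execute(
    """
    SELECT ordinal_index, length, score
    FROM introns
    WHERE transcript_id = ? AND is_minor = 1
    ORDER BY ordinal_index
    """,
    (3555687,),
).fetchall()

print(minor)  # [(6, 5629, 99.99909147180684)]
```

Passing the transcript id as a bound parameter rather than formatting it into the SQL string keeps the query reusable for other transcripts.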
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
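The indexes above exist so that filters on the indexed columns do not need a full table scan. The Python sketch below is one way to check this, under stated assumptions: it recreates the schema and the transcript_id index in an empty in-memory SQLite database and asks SQLite for its query plan. The exact wording of the plan text varies between SQLite versions, so the example only looks for the index name in it. The referenced transcripts and genomes tables are not created here; SQLite accepts the foreign key declarations without them.

```python
import sqlite3

# Schema as listed above, plus one of its indexes.
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Each plan row is (id, parent, notused, detail); the detail text names
# the index SQLite intends to use, if any.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (3555687,),
).fetchall()
details = [row[3] for row in plan]

uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details)
print(uses_index)
```

On a populated database the planner may pick a different index when several conditions are combined, so the plan is worth re-checking for each query shape of interest.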