introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
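The column descriptions above can be mirrored as a typed record when rows are handled in code. The following is a minimal sketch, not part of the database itself: the `Intron` class and its `minor_probability` property are hypothetical names introduced for illustration, and the property assumes that `score` is a percentage in the 0-100 range, as the description states. The example values are copied from the row with id 17676190.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical helper for illustration)."""
    id: int
    dinucleotide_pair: str
    is_minor: int            # 1 = minor intron, 0 = not
    score: float             # percentage, 0-100
    length: int              # base pairs
    transcript_id: int       # references transcripts(id)
    ordinal_index: int       # 1 for the first intron, 2 for the second, ...
    start: int
    end: int
    taxonomy_id: int         # references genomes(taxonomy_id)
    scored_motifs: str
    phase: Optional[int]     # 0, 1, 2, or None outside the coding sequence
    in_cds: int              # 1 = within the coding sequence, 0 = not
    relative_position: float

    @property
    def minor_probability(self) -> float:
        """Score rescaled from a percentage to a 0-1 fraction (assumed scale)."""
        return self.score / 100.0


example = Intron(
    id=17676190, dinucleotide_pair="GT-AG", is_minor=0,
    score=0.0078820102598539, length=2784, transcript_id=3555673,
    ordinal_index=9, start=147510246, end=147513029, taxonomy_id=1047088,
    scored_motifs="AAG|GTACCTAGTG...GGATCCTAACCT/TCCATCCTCACC...CCCAG|ACT",
    phase=0, in_cds=1, relative_position=43.166,
)
print(example.minor_probability)
```

Keeping `phase` as `Optional[int]` preserves the distinction between phase 0 and a null phase for introns outside the coding sequence.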
22 rows where transcript_id = 3555673
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17676182 | GT-AG | 0 | 1.000000099473604e-05 | 28901 | rna-XM_038345728.2 3555673 | 1 | 147554715 | 147583615 | Arvicola amphibius 1047088 | CGG\|GTAAGGTGGG...CCCGTCCTGACC/CCCGTCCTGACC...CACAG\|CCT | 1 | 1 | 1.938 |
| 17676183 | GT-AG | 0 | 0.0003025229753875 | 7984 | rna-XM_038345728.2 3555673 | 2 | 147546334 | 147554317 | Arvicola amphibius 1047088 | CGG\|GTAAACCTCT...CCATCCTTGCCT/CTCTCTTCCATC...ATTAG\|GTA | 2 | 1 | 10.122 |
| 17676184 | GT-AG | 0 | 1.000000099473604e-05 | 2493 | rna-XM_038345728.2 3555673 | 3 | 147543643 | 147546135 | Arvicola amphibius 1047088 | CCG\|GTGAGTGCAC...TTCTTCTGACCA/CTTCTTCTGACC...TGCAG\|GCA | 2 | 1 | 14.203 |
| 17676185 | GT-AG | 0 | 3.5156906100727605e-05 | 1642 | rna-XM_038345728.2 3555673 | 4 | 147541804 | 147543445 | Arvicola amphibius 1047088 | CCT\|GTAAGTGTGA...CTTTCTTTCTCT/GTTTCCTGCACT...GCCAG\|TCC | 1 | 1 | 18.264 |
| 17676186 | GT-AG | 0 | 1.000000099473604e-05 | 18847 | rna-XM_038345728.2 3555673 | 5 | 147522825 | 147541671 | Arvicola amphibius 1047088 | CAG\|GTGAGGTGGA...GCTGCCTCAGCC/TCTGACTTCACT...CACAG\|GGG | 1 | 1 | 20.985 |
| 17676187 | GT-AG | 0 | 1.000000099473604e-05 | 1907 | rna-XM_038345728.2 3555673 | 6 | 147520521 | 147522427 | Arvicola amphibius 1047088 | GGG\|GTAAGGCTGG...ATTTGCTTTTTT/ACCATGGTCATT...TCTAG\|CCT | 2 | 1 | 29.169 |
| 17676188 | GT-AG | 0 | 1.000000099473604e-05 | 5723 | rna-XM_038345728.2 3555673 | 7 | 147514626 | 147520348 | Arvicola amphibius 1047088 | GAG\|GTGAGTGAAG...TCTCCATTTACA/TCTCCATTTACA...AACAG\|GTG | 0 | 1 | 32.715 |
| 17676189 | GT-AG | 0 | 1.000000099473604e-05 | 1089 | rna-XM_038345728.2 3555673 | 8 | 147513320 | 147514408 | Arvicola amphibius 1047088 | TTG\|GTGAGTCCTG...GCATCTGTGAAC/GCCCCTCTCATT...CACAG\|GAA | 1 | 1 | 37.188 |
| 17676190 | GT-AG | 0 | 0.0078820102598539 | 2784 | rna-XM_038345728.2 3555673 | 9 | 147510246 | 147513029 | Arvicola amphibius 1047088 | AAG\|GTACCTAGTG...GGATCCTAACCT/TCCATCCTCACC...CCCAG\|ACT | 0 | 1 | 43.166 |
| 17676191 | GT-AG | 0 | 4.276246892229856e-05 | 1619 | rna-XM_038345728.2 3555673 | 10 | 147508400 | 147510018 | Arvicola amphibius 1047088 | AGG\|GTAAGCATTG...CTCACCTGAGCC/CCACCACTCACC...TCCAG\|CTA | 2 | 1 | 47.846 |
| 17676192 | GT-AG | 0 | 1.000000099473604e-05 | 2017 | rna-XM_038345728.2 3555673 | 11 | 147506198 | 147508214 | Arvicola amphibius 1047088 | TGG\|GTGAGGGCTA...GCTCTCTTGCTG/ATTGGCCTGACA...CACAG\|GTC | 1 | 1 | 51.659 |
| 17676193 | GT-AG | 0 | 1.1045504820679444e-05 | 2123 | rna-XM_038345728.2 3555673 | 12 | 147503751 | 147505873 | Arvicola amphibius 1047088 | GCC\|GTAAGTGCCG...GCGCCCTCAGCT/AGCTTGCTGATG...TCCAG\|CGC | 1 | 1 | 58.338 |
| 17676194 | GC-AG | 0 | 1.000000099473604e-05 | 4438 | rna-XM_038345728.2 3555673 | 13 | 147499113 | 147503550 | Arvicola amphibius 1047088 | CAG\|GCAGGTGCCC...GGTTCCTGTGCA/TTCCTGTGCACG...CGCAG\|CTC | 0 | 1 | 62.461 |
| 17676195 | GT-AG | 0 | 1.000000099473604e-05 | 1189 | rna-XM_038345728.2 3555673 | 14 | 147497715 | 147498903 | Arvicola amphibius 1047088 | AGG\|GTAGGTGCCC...TGCTCTTTATCC/CTGCTCTTTATC...TACAG\|GTA | 2 | 1 | 66.77 |
| 17676196 | GT-AG | 0 | 0.0001224911631945 | 1304 | rna-XM_038345728.2 3555673 | 15 | 147496220 | 147497523 | Arvicola amphibius 1047088 | CAG\|GTATGTGTCT...TGACTCTTCACT/GTAGGTCTGACT...CCTAG\|GGG | 1 | 1 | 70.707 |
| 17676197 | GT-AG | 0 | 0.0015530557309174 | 4515 | rna-XM_038345728.2 3555673 | 16 | 147491495 | 147496009 | Arvicola amphibius 1047088 | TCT\|GTATGTGAGA...AGAACCTGATCT/GAGAACCTGATC...TACAG\|CAG | 1 | 1 | 75.036 |
| 17676198 | GT-AG | 0 | 1.000000099473604e-05 | 2961 | rna-XM_038345728.2 3555673 | 17 | 147488408 | 147491368 | Arvicola amphibius 1047088 | GTG\|GTGAGTGAAC...GTCTGCGTGACT/GTGTGGTTCACT...CCCAG\|AGC | 1 | 1 | 77.633 |
| 17676199 | GT-AG | 0 | 1.000000099473604e-05 | 2308 | rna-XM_038345728.2 3555673 | 18 | 147485863 | 147488170 | Arvicola amphibius 1047088 | ATG\|GTGAGGCTCT...GACCCCTTTTTC/CTGAGCGTGACC...CACAG\|CTG | 1 | 1 | 82.519 |
| 17676200 | GT-AG | 0 | 1.000000099473604e-05 | 2078 | rna-XM_038345728.2 3555673 | 19 | 147483674 | 147485751 | Arvicola amphibius 1047088 | GTG\|GTGAGTGCTT...TTGGGGTTAACG/TTGGGGTTAACG...AACAG\|AAA | 1 | 1 | 84.807 |
| 17676201 | GT-AG | 0 | 1.000000099473604e-05 | 603 | rna-XM_038345728.2 3555673 | 20 | 147482834 | 147483436 | Arvicola amphibius 1047088 | CAG\|GTAAGGAGCC...AGTTCGTTCATT/AGTTCGTTCATT...CACAG\|GCA | 1 | 1 | 89.693 |
| 17676202 | GT-AG | 0 | 1.000000099473604e-05 | 3483 | rna-XM_038345728.2 3555673 | 21 | 147479211 | 147482693 | Arvicola amphibius 1047088 | CCG\|GTGAGTGAGA...GGTGGTTTAGCC/GGGTGGTTTAGC...TTTAG\|ATC | 0 | 1 | 92.579 |
| 17676203 | GT-AG | 0 | 1.000000099473604e-05 | 2887 | rna-XM_038345728.2 3555673 | 22 | 147476226 | 147479112 | Arvicola amphibius 1047088 | CAG\|GTAGGAGTCC...CCTCTCTTTTCC/TATAGGCGCACC...TCTAG\|ACC | 2 | 1 | 94.599 |
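The rows above suggest that `start` and `end` are inclusive coordinates, so that `length = end - start + 1` (for the first row, 147583615 - 147554715 + 1 = 28901). The sketch below checks this on the first three rows with Python's built-in `sqlite3` module. It uses a reduced, in-memory copy of the table with only the columns needed, so it is an illustration under that assumption and not a query against the real database file.

```python
import sqlite3

# Reduced in-memory copy of the introns table (subset of columns only).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "length" INTEGER)'
)

# First three rows of the table above: id, transcript_id, ordinal_index,
# start, end, length.
rows = [
    (17676182, 3555673, 1, 147554715, 147583615, 28901),
    (17676183, 3555673, 2, 147546334, 147554317, 7984),
    (17676184, 3555673, 3, 147543643, 147546135, 2493),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", rows)

# Same filter as the page (transcript_id = 3555673), ordered by intron position.
ordered = conn.execute(
    'SELECT "ordinal_index", "start", "end", "length" FROM introns '
    'WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
    (3555673,),
).fetchall()

# Count rows whose length disagrees with inclusive coordinates.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns '
    'WHERE "transcript_id" = ? AND "length" <> "end" - "start" + 1',
    (3555673,),
).fetchone()[0]

# Do start positions decrease as ordinal_index increases?
starts = [row[1] for row in ordered]
descending = starts == sorted(starts, reverse=True)

print(mismatches, descending)
```

In these rows the start positions decrease as `ordinal_index` increases. That would be consistent with a transcript annotated on the reverse strand, although the table itself has no strand column, so this is an inference.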
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
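The indexes above are what let a filter such as `transcript_id = 3555673` avoid a full table scan. A minimal sketch of how this could be checked with `EXPLAIN QUERY PLAN`, assuming a reduced in-memory schema (three columns and one index, no foreign keys) in place of the real database:

```python
import sqlite3

# Reduced schema: only the columns and index relevant to this check.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        "id" INTEGER PRIMARY KEY,
        "transcript_id" INTEGER,
        "score" REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns ("transcript_id");
    """
)

# Ask SQLite how it would execute the page's filter; the human-readable
# plan description is the fourth column of each returned row.
plan = conn.execute(
    'EXPLAIN QUERY PLAN SELECT * FROM introns WHERE "transcript_id" = ?',
    (3555673,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so it is safer to look for the index name in the output than to match the whole string.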