introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
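
The coordinate columns can be cross-checked against `length`. In the rows listed for this transcript, `length` equals `end - start + 1`, which suggests 1-based inclusive coordinates; that reading is inferred from the data and is not stated by the column descriptions. A minimal sketch, assuming an in-memory SQLite table with a reduced subset of the columns and two rows copied from the listing:

```python
import sqlite3

# In-memory database with a reduced subset of the introns columns
# (an assumption made for illustration; the real table has more columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
    '"start" INTEGER, "end" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (82574281, 2584, 34327795, 34330378),
        (82574282, 578, 34327146, 34327723),
    ],
)

# Collect rows whose stored length disagrees with inclusive coordinates.
mismatches = conn.execute(
    'SELECT "id" FROM introns WHERE "length" != "end" - "start" + 1'
).fetchall()
print(mismatches)  # prints [] for these two rows
```

Note that `end` is an SQL keyword, so quoting the column name as `"end"` keeps the query unambiguous.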
35 rows where transcript_id = 15236020
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82574281 | GT-AG | 0 | 1.000000099473604e-05 | 2584 | rna-XM_041501004.1 15236020 | 2 | 34327795 | 34330378 | Gigantopelta aegis 1735272 | ACG|GTGAGTGGGA...CATATTTTAATT/CATATTTTAATT...TTCAG|CTG | 1 | 1 | 8.996 |
| 82574282 | GT-AG | 0 | 1.294780513918145e-05 | 578 | rna-XM_041501004.1 15236020 | 3 | 34327146 | 34327723 | Gigantopelta aegis 1735272 | AAG|GTACGTCATG...ATGTCATTAAAA/AGTGATGTCATT...TCCAG|TGT | 0 | 1 | 10.157 |
| 82574283 | GT-AG | 0 | 4.686630020046326e-05 | 1369 | rna-XM_041501004.1 15236020 | 4 | 34325524 | 34326892 | Gigantopelta aegis 1735272 | AAG|GTACATCCCT...ATATTTTTATTT/GATATTTTTATT...TGTAG|GAC | 1 | 1 | 14.295 |
| 82574284 | GT-AG | 0 | 1.000000099473604e-05 | 3639 | rna-XM_041501004.1 15236020 | 5 | 34321760 | 34325398 | Gigantopelta aegis 1735272 | TGG|GTGAATATAT...TCAAGTTTGAAA/GTCTATGTGACT...TTCAG|CTG | 0 | 1 | 16.34 |
| 82574285 | GT-AG | 0 | 1.000000099473604e-05 | 2344 | rna-XM_041501004.1 15236020 | 6 | 34319290 | 34321633 | Gigantopelta aegis 1735272 | CAG|GTAAGAAGTA...CAGATTATATCA/GATTATATCACT...TTTAG|GGT | 0 | 1 | 18.4 |
| 82574286 | GT-AG | 0 | 1.000000099473604e-05 | 166 | rna-XM_041501004.1 15236020 | 7 | 34318900 | 34319065 | Gigantopelta aegis 1735272 | AGT|GTAAGACATT...TTTGTTTTAGAC/TCTAACCTAATT...TTCAG|ACA | 2 | 1 | 22.064 |
| 82574287 | GT-AG | 0 | 1.000000099473604e-05 | 1641 | rna-XM_041501004.1 15236020 | 8 | 34317087 | 34318727 | Gigantopelta aegis 1735272 | AAT|GTAAGATCTT...TGCTCCATACTT/TTGCAACTGAAA...TATAG|GCA | 0 | 1 | 24.877 |
| 82574288 | GT-AG | 0 | 1.000000099473604e-05 | 1227 | rna-XM_041501004.1 15236020 | 9 | 34315749 | 34316975 | Gigantopelta aegis 1735272 | TAT|GTAAGTCAAG...AATTATTTATTT/TAATTATTTATT...TTCAG|ACG | 0 | 1 | 26.693 |
| 82574289 | GT-AG | 0 | 0.0032608811993752 | 944 | rna-XM_041501004.1 15236020 | 10 | 34314739 | 34315682 | Gigantopelta aegis 1735272 | TCG|GTACGTTTTA...CATTCCTTTCCT/TTTCTACTTATT...TCCAG|ATC | 0 | 1 | 27.772 |
| 82574290 | GT-AG | 0 | 1.000000099473604e-05 | 232 | rna-XM_041501004.1 15236020 | 11 | 34314279 | 34314510 | Gigantopelta aegis 1735272 | CAG|GTAAGTAGTT...ATGTTATTATAA/AATGTTATTATA...TACAG|GTG | 0 | 1 | 31.501 |
| 82574291 | GT-AG | 0 | 0.0053250201111096 | 1478 | rna-XM_041501004.1 15236020 | 12 | 34312686 | 34314163 | Gigantopelta aegis 1735272 | GAG|GTATTCACGT...TTGTTTTTGTTT/TAAATATTAAAT...CAAAG|CCA | 1 | 1 | 33.382 |
| 82574292 | GT-AG | 0 | 1.24150354200509e-05 | 2000 | rna-XM_041501004.1 15236020 | 13 | 34310538 | 34312537 | Gigantopelta aegis 1735272 | CAG|GTAACAGTCA...GTCACTGTATCA/CTATATGTCACT...CTCAG|ATA | 2 | 1 | 35.803 |
| 82574293 | GT-AG | 0 | 1.000000099473604e-05 | 4094 | rna-XM_041501004.1 15236020 | 14 | 34306335 | 34310428 | Gigantopelta aegis 1735272 | GAT|GTCAGTATAT...TTTTTCTTGTTT/TTTCTGTTAAAT...GACAG|GAT | 0 | 1 | 37.586 |
| 82574294 | GT-AG | 0 | 1.000000099473604e-05 | 1403 | rna-XM_041501004.1 15236020 | 15 | 34304814 | 34306216 | Gigantopelta aegis 1735272 | AAG|GTAGAGAACA...GGAGCTTTAACA/GGAGCTTTAACA...TTTAG|CAT | 1 | 1 | 39.516 |
| 82574295 | GT-AG | 0 | 1.000000099473604e-05 | 2255 | rna-XM_041501004.1 15236020 | 16 | 34302487 | 34304741 | Gigantopelta aegis 1735272 | GAG|GTCAGACTTG...GGCTGCTTGATG/GATGTGCTCACT...TACAG|GGG | 1 | 1 | 40.693 |
| 82574296 | GT-AG | 0 | 0.0001042599892186 | 5915 | rna-XM_041501004.1 15236020 | 17 | 34296436 | 34302350 | Gigantopelta aegis 1735272 | TGG|GTAAATTTAT...TGTCACTTGATG/ACGCATGTCACT...CCCAG|CAT | 2 | 1 | 42.918 |
| 82574297 | GT-AG | 0 | 1.000000099473604e-05 | 2003 | rna-XM_041501004.1 15236020 | 18 | 34294360 | 34296362 | Gigantopelta aegis 1735272 | CAA|GTAAGATACA...TGTTTTTCAATT/ATGTTTTTCAAT...TCTAG|ATG | 0 | 1 | 44.112 |
| 82574298 | GT-AG | 0 | 0.0002260154248182 | 1300 | rna-XM_041501004.1 15236020 | 19 | 34292910 | 34294209 | Gigantopelta aegis 1735272 | GAA|GTACGTTGAC...TTGTGTTTGATG/TTGTGTTTGATG...TCTAG|AAG | 0 | 1 | 46.565 |
| 82574299 | GT-AG | 0 | 2.0328993491228158e-05 | 320 | rna-XM_041501004.1 15236020 | 20 | 34292496 | 34292815 | Gigantopelta aegis 1735272 | CAG|GTAGGCATTT...TGTGTATTATTT/TATTATTTCATT...AACAG|GTG | 1 | 1 | 48.103 |
| 82574300 | GT-AG | 0 | 9.932002915735824e-05 | 2506 | rna-XM_041501004.1 15236020 | 21 | 34289837 | 34292342 | Gigantopelta aegis 1735272 | GGT|GTAAGTGTTA...GTATTGTTAACT/GTTGGTTTCACA...TGCAG|CCA | 1 | 1 | 50.605 |
| 82574301 | GT-AG | 0 | 1.000000099473604e-05 | 448 | rna-XM_041501004.1 15236020 | 22 | 34289228 | 34289675 | Gigantopelta aegis 1735272 | AAG|GTTAACCGTT...ACTTTCTTATTT/TACTTTCTTATT...TTCAG|AAG | 0 | 1 | 53.238 |
| 82574302 | GT-AG | 0 | 1.000000099473604e-05 | 801 | rna-XM_041501004.1 15236020 | 23 | 34288349 | 34289149 | Gigantopelta aegis 1735272 | AGG|GTCAGTTTGT...TTATTATTATTT/ATTATTATTATT...TTCAG|AAC | 0 | 1 | 54.514 |
| 82574303 | GT-AG | 0 | 1.000000099473604e-05 | 945 | rna-XM_041501004.1 15236020 | 24 | 34287198 | 34288142 | Gigantopelta aegis 1735272 | AAG|GTAAGGTTTT...TGTTCCCCAAAA/AAAAGATTCACC...TTCAG|GTT | 2 | 1 | 57.884 |
| 82574304 | GT-AG | 0 | 1.000000099473604e-05 | 2113 | rna-XM_041501004.1 15236020 | 25 | 34284982 | 34287094 | Gigantopelta aegis 1735272 | CAG|GTGAGACCAG...TTTCCTTTGACA/TTTCCTTTGACA...TTCAG|ATG | 0 | 1 | 59.568 |
| 82574305 | GT-AG | 0 | 1.000000099473604e-05 | 419 | rna-XM_041501004.1 15236020 | 26 | 34284466 | 34284884 | Gigantopelta aegis 1735272 | CAG|GTGAGTAAAT...TATCGTTTACCA/ATATCGTTTACC...TTCAG|TTT | 1 | 1 | 61.155 |
| 82574306 | GT-AG | 0 | 1.000000099473604e-05 | 4459 | rna-XM_041501004.1 15236020 | 27 | 34279841 | 34284299 | Gigantopelta aegis 1735272 | ACA|GTAAGTCTCA...GATTCCGTGTTT/ACAGGATTAATA...TTTAG|GGA | 2 | 1 | 63.87 |
| 82574307 | GT-AG | 0 | 0.0016580894924468 | 49160 | rna-XM_041501004.1 15236020 | 28 | 34230559 | 34279718 | Gigantopelta aegis 1735272 | GAG|GTATTTGCTT...TTTTTTTTAAAC/TATATTTTGACA...TGCAG|AAG | 1 | 1 | 65.865 |
| 82574308 | GT-AG | 0 | 1.000000099473604e-05 | 13535 | rna-XM_041501004.1 15236020 | 29 | 34216821 | 34230355 | Gigantopelta aegis 1735272 | CTG|GTAAGTCTTA...ATTTTCTGAGTT/CATTTTCTGAGT...TTCAG|AAT | 0 | 1 | 69.185 |
| 82574309 | GT-AG | 0 | 0.0012195485273921 | 8420 | rna-XM_041501004.1 15236020 | 30 | 34208308 | 34216727 | Gigantopelta aegis 1735272 | CCG|GTATGTCACA...TTTTTCTGAATT/CTGAATTTCATT...TTCAG|ACT | 0 | 1 | 70.707 |
| 82574310 | GT-AG | 0 | 1.000000099473604e-05 | 1794 | rna-XM_041501004.1 15236020 | 31 | 34205370 | 34207163 | Gigantopelta aegis 1735272 | CAG|GTATGTGGAA...TGTATTTTCTCT/ATATATGTAAGT...TGCAG|TTT | 1 | 1 | 89.418 |
| 82574311 | GT-AG | 0 | 1.000000099473604e-05 | 18459 | rna-XM_041501004.1 15236020 | 32 | 34186755 | 34205213 | Gigantopelta aegis 1735272 | AAG|GTGAGAGTTA...TATTTTTTATTT/TTATTTTTTATT...TTCAG|AAA | 1 | 1 | 91.969 |
| 82574312 | GT-AG | 0 | 1.000000099473604e-05 | 20960 | rna-XM_041501004.1 15236020 | 33 | 34165744 | 34186703 | Gigantopelta aegis 1735272 | TTG|GTGAGTTTAC...TATGTTTTATTT/TTATGTTTTATT...TACAG|ATA | 1 | 1 | 92.803 |
| 82574313 | GT-AG | 0 | 2.459520922333903e-05 | 3094 | rna-XM_041501004.1 15236020 | 34 | 34162546 | 34165639 | Gigantopelta aegis 1735272 | GTG|GTAAGCTAAT...TTATGTTTATTT/TGTTTATTTATA...TACAG|AAA | 0 | 1 | 94.504 |
| 82574314 | GT-AG | 0 | 3.281623961446828e-05 | 1198 | rna-XM_041501004.1 15236020 | 35 | 34161258 | 34162455 | Gigantopelta aegis 1735272 | GAA|GTATGTAAAA...GGTCACTAAATC/CACCATTTCACA...TTTAG|GAT | 0 | 1 | 95.976 |
| 82579827 | GT-AG | 0 | 5.129779541373608e-05 | 5471 | rna-XM_041501004.1 15236020 | 1 | 34330648 | 34336118 | Gigantopelta aegis 1735272 | GAG|GTTCGTTTGT...CCTCTCTTATCC/ATTATTGTTATT...TTCAG|ATC | | 0 | 5.25 |
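
In the rows above, the `dinucleotide_pair` value matches the first and last two bases of the sequence enclosed by the outermost `|` characters in `scored_motifs`. The sketch below recovers the pair from that string. It assumes the `|` characters mark the exon/intron boundaries, which is an interpretation of the listed rows rather than a documented format, and `terminal_dinucleotides` is a hypothetical helper name.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the intron's terminal dinucleotides as 'XX-YY'.

    Assumes the layout seen in the listed rows: the first and last '|'
    separate flanking exon bases from the intron sequence.
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    if first == last:
        raise ValueError("expected two '|' boundary markers")
    intron = scored_motifs[first + 1 : last]
    return f"{intron[:2]}-{intron[-2:]}"


example = "ACG|GTGAGTGGGA...CATATTTTAATT/CATATTTTAATT...TTCAG|CTG"
print(terminal_dinucleotides(example))  # GT-AG
```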
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
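
The filter used for this page (`transcript_id = 15236020`) is the kind of lookup that `idx_introns_transcript_id` serves. The following is a sketch of how that could be checked with SQLite's `EXPLAIN QUERY PLAN`, assuming a trimmed in-memory copy of the schema (foreign keys and most columns are omitted, so this is not the real database).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the schema (foreign keys and most columns omitted for
# brevity; this trimmed table is an assumption for illustration).
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

# Each EXPLAIN QUERY PLAN row has four columns; the last is the detail text.
plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT id FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (15236020,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so the useful check is whether the index name appears in the printed text.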