introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
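The column descriptions do not say how `length` relates to `start` and `end`. In the rows listed on this page, `length` equals `end - start + 1` (inclusive coordinates). The sketch below checks that relationship on three of those rows using an in-memory SQLite table. The cut-down table and the helper name `inconsistent_lengths` are assumptions made for illustration, and the relationship is an observation from this listing, not a documented guarantee for the full database.

```python
import sqlite3

# Three rows copied from the listing on this page: (id, length, start, end).
SAMPLE_ROWS = [
    (122606248, 243, 109046769, 109047011),
    (122606249, 206, 109046438, 109046643),
    (122606300, 75, 108974435, 108974509),
]


def inconsistent_lengths(conn):
    """Return ids of introns whose length differs from end - start + 1."""
    cur = conn.execute(
        'SELECT id FROM introns '
        'WHERE length != "end" - "start" + 1 '
        'ORDER BY id'
    )
    return [row[0] for row in cur]


conn = sqlite3.connect(":memory:")
# Cut-down copy of the introns table (illustration only, not the real schema).
# "end" is quoted because END is an SQL keyword.
conn.execute(
    'CREATE TABLE introns ('
    'id INTEGER PRIMARY KEY, length INTEGER, "start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", SAMPLE_ROWS)

bad_ids = inconsistent_lengths(conn)
print(bad_ids)  # prints [] for the three sample rows
```

An empty result means every sampled row is consistent with inclusive start and end coordinates.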
59 rows where transcript_id = 22607851
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122606248 | GT-AG | 0 | 1.000000099473604e-05 | 243 | rna-XM_021209545.2 22607851 | 1 | 109046769 | 109047011 | Mus pahari 10093 | AAG\|GTGGGTCTCA...TGGACTGTGACA/CCTCTGCTGATT...GATAG\|ATG | 1 | 1 | 0.614 |
| 122606249 | GT-AG | 0 | 1.000000099473604e-05 | 206 | rna-XM_021209545.2 22607851 | 2 | 109046438 | 109046643 | Mus pahari 10093 | AAG\|GTGAAGCCCT...CTTTCCTTTTCC/CCTGGCTTCACT...TACAG\|ATG | 0 | 1 | 2.18 |
| 122606250 | GT-AG | 0 | 1.000000099473604e-05 | 476 | rna-XM_021209545.2 22607851 | 3 | 109045830 | 109046305 | Mus pahari 10093 | CTT\|GTAAGACCAC...CTCACCTGAGCA/AGTTGGCTCACC...CTTAG\|GAA | 0 | 1 | 3.833 |
| 122606251 | GT-AG | 0 | 1.000000099473604e-05 | 2780 | rna-XM_021209545.2 22607851 | 4 | 109042985 | 109045764 | Mus pahari 10093 | AAG\|GTAGGTATGC...ACATTCTGACTT/AACATTCTGACT...CCCAG\|GTA | 2 | 1 | 4.647 |
| 122606252 | GT-AG | 0 | 1.000000099473604e-05 | 830 | rna-XM_021209545.2 22607851 | 5 | 109041990 | 109042819 | Mus pahari 10093 | GAT\|GTGAGTGGGG...TGTTTTCTGATG/TGTTTTCTGATG...TCCAG\|AGA | 2 | 1 | 6.714 |
| 122606253 | GT-AG | 0 | 1.000000099473604e-05 | 473 | rna-XM_021209545.2 22607851 | 6 | 109041393 | 109041865 | Mus pahari 10093 | ATG\|GTAAGTGGAC...CCTCCTTTGAAT/CCTCCTTTGAAT...GGCAG\|GTT | 0 | 1 | 8.268 |
| 122606254 | GT-AG | 0 | 1.919281395804117e-05 | 793 | rna-XM_021209545.2 22607851 | 7 | 109040505 | 109041297 | Mus pahari 10093 | AGC\|GTAAGTGGTT...TCATTTTTAACA/TCATTTTTAACA...TCTAG\|GAA | 2 | 1 | 9.458 |
| 122606255 | GT-AG | 0 | 1.000000099473604e-05 | 2401 | rna-XM_021209545.2 22607851 | 8 | 109037993 | 109040393 | Mus pahari 10093 | AAA\|GTAAGTGGGA...TGTTTCTTTCCT/CTTGTGCTGAGC...TCCAG\|AAC | 2 | 1 | 10.848 |
| 122606256 | GT-AG | 0 | 1.000000099473604e-05 | 6428 | rna-XM_021209545.2 22607851 | 9 | 109031465 | 109037892 | Mus pahari 10093 | GAA\|GTGAGTGAGT...CTGTCATTAATG/TTTGTGTTCAAA...TCCAG\|GAT | 0 | 1 | 12.101 |
| 122606257 | GT-AG | 0 | 1.000000099473604e-05 | 579 | rna-XM_021209545.2 22607851 | 10 | 109030787 | 109031365 | Mus pahari 10093 | GAG\|GTAAAGCCTT...CAAGCCTCAGCT/TAGCCATTCAAT...TACAG\|GAG | 0 | 1 | 13.341 |
| 122606258 | GT-AG | 0 | 1.000000099473604e-05 | 4730 | rna-XM_021209545.2 22607851 | 11 | 109025866 | 109030595 | Mus pahari 10093 | TAG\|GTGGGTACCA...GGCTCCCTAGCA/CTCCCTCTCATG...TCTAG\|TCT | 2 | 1 | 15.733 |
| 122606259 | GT-AG | 0 | 1.000000099473604e-05 | 251 | rna-XM_021209545.2 22607851 | 12 | 109025386 | 109025636 | Mus pahari 10093 | CAG\|GTACAGACCT...GGTGCCTCAGAT/TCAGATATCATG...TGTAG\|GCA | 0 | 1 | 18.602 |
| 122606260 | GT-AG | 0 | 1.000000099473604e-05 | 2583 | rna-XM_021209545.2 22607851 | 13 | 109022682 | 109025264 | Mus pahari 10093 | AAG\|GTAAGTGCTG...TATTCTATACCC/CACTTTTTCATT...TTCAG\|TTT | 1 | 1 | 20.118 |
| 122606261 | GT-AG | 0 | 1.000000099473604e-05 | 535 | rna-XM_021209545.2 22607851 | 14 | 109022023 | 109022557 | Mus pahari 10093 | GAG\|GTGGGCCAGC...GTTTCCTTCTCA/TTCCTTCTCACC...AGCAG\|AAT | 2 | 1 | 21.671 |
| 122606262 | GT-AG | 0 | 0.002909041190243 | 646 | rna-XM_021209545.2 22607851 | 15 | 109021246 | 109021891 | Mus pahari 10093 | GAG\|GTACGCATCC...TGTGCTTTGATT/TGTGCTTTGATT...CTCAG\|GGA | 1 | 1 | 23.312 |
| 122606263 | GT-AG | 0 | 1.000000099473604e-05 | 1194 | rna-XM_021209545.2 22607851 | 16 | 109019927 | 109021120 | Mus pahari 10093 | GAG\|GTGCGTCAGG...TCTGCCTTGCTA/CTTGCTATGACT...CACAG\|GCC | 0 | 1 | 24.878 |
| 122606264 | GT-AG | 0 | 1.000000099473604e-05 | 863 | rna-XM_021209545.2 22607851 | 17 | 109018930 | 109019792 | Mus pahari 10093 | CAG\|GTGGGTGGGA...TCGGCGTTGAGC/GTTGAGCTGACC...TTCAG\|AAC | 2 | 1 | 26.556 |
| 122606265 | GT-AG | 0 | 1.000000099473604e-05 | 539 | rna-XM_021209545.2 22607851 | 18 | 109018178 | 109018716 | Mus pahari 10093 | AGG\|GTGGGTGTGG...GCTCTCATGTCA/ACAGCTCTCATG...TCCAG\|AGA | 2 | 1 | 29.225 |
| 122606266 | GT-AG | 0 | 0.000387750953721 | 108 | rna-XM_021209545.2 22607851 | 19 | 109017904 | 109018011 | Mus pahari 10093 | GAG\|GTAACTAAGC...CTCCTCTTAGAT/TCTCCTCTTAGA...TTTAG\|GTC | 0 | 1 | 31.304 |
| 122606267 | GT-AG | 0 | 1.000000099473604e-05 | 747 | rna-XM_021209545.2 22607851 | 20 | 109017007 | 109017753 | Mus pahari 10093 | CAG\|GTGCTGAGGA...CTTTCCTCTGCT/TGGGCACTCAAG...TCCAG\|AGT | 0 | 1 | 33.183 |
| 122606268 | GT-AG | 0 | 1.000000099473604e-05 | 597 | rna-XM_021209545.2 22607851 | 21 | 109016302 | 109016898 | Mus pahari 10093 | CAG\|GTGAGTCTCC...TCTCCCTTGCTT/CCTTGCTTCACA...GTAAG\|GTG | 0 | 1 | 34.536 |
| 122606269 | GT-AG | 0 | 1.000000099473604e-05 | 3592 | rna-XM_021209545.2 22607851 | 22 | 109012543 | 109016134 | Mus pahari 10093 | ACC\|GTAAGTGACT...CTCCCCTCACTT/TCTCCCCTCACT...CCCAG\|GAG | 2 | 1 | 36.628 |
| 122606270 | GT-AG | 0 | 0.0001180180386125 | 533 | rna-XM_021209545.2 22607851 | 23 | 109011878 | 109012410 | Mus pahari 10093 | CAG\|GTATGCAACG...GGTTTTTCAGAC/GGGTTTTTCAGA...TTCAG\|GTT | 2 | 1 | 38.281 |
| 122606271 | GT-AG | 0 | 1.000000099473604e-05 | 1605 | rna-XM_021209545.2 22607851 | 24 | 109010110 | 109011714 | Mus pahari 10093 | CAG\|GTGCCACAGG...TTCCCTCCAACA/GAGGAACACATC...TTAAG\|GAG | 0 | 1 | 40.323 |
| 122606272 | GT-AG | 0 | 1.000000099473604e-05 | 2779 | rna-XM_021209545.2 22607851 | 25 | 109007243 | 109010021 | Mus pahari 10093 | CAG\|GTTGGTAGCC...GCCCCCTTGAGG/TGAGGCCCCATC...CCCAG\|GGG | 1 | 1 | 41.426 |
| 122606273 | GT-AG | 0 | 1.000000099473604e-05 | 399 | rna-XM_021209545.2 22607851 | 26 | 109006705 | 109007103 | Mus pahari 10093 | CCT\|GTGAGTGCCC...TGATCCTTTTCT/CGTTACCTGATC...TTCAG\|CTT | 2 | 1 | 43.167 |
| 122606274 | GT-AG | 0 | 1.000000099473604e-05 | 1886 | rna-XM_021209545.2 22607851 | 27 | 109004635 | 109006520 | Mus pahari 10093 | CAG\|GTCTGTGTGG...TTGACCTGATTA/GTTGACCTGATT...CAAAG\|AAA | 0 | 1 | 45.472 |
| 122606275 | GT-AG | 0 | 1.000000099473604e-05 | 585 | rna-XM_021209545.2 22607851 | 28 | 109003878 | 109004462 | Mus pahari 10093 | AAG\|GTGAGGGTGA...GGAGCCTAATTC/CCCAGACTCACA...TTCAG\|GGC | 1 | 1 | 47.626 |
| 122606276 | GT-AG | 0 | 1.000000099473604e-05 | 480 | rna-XM_021209545.2 22607851 | 29 | 109003192 | 109003671 | Mus pahari 10093 | CAG\|GTAAGGGAAG...CAACTTATAATT/CAACTTATAATT...CCCAG\|ATT | 0 | 1 | 50.207 |
| 122606277 | GT-AG | 0 | 1.000000099473604e-05 | 145 | rna-XM_021209545.2 22607851 | 30 | 109002840 | 109002984 | Mus pahari 10093 | GAG\|GTTTATTCTG...GAGAAGGAGAAG/AGGAGAAGGAGA...AGAAG\|GAG | 0 | 1 | 52.8 |
| 122606278 | GT-AG | 0 | 1.000000099473604e-05 | 260 | rna-XM_021209545.2 22607851 | 31 | 109002379 | 109002638 | Mus pahari 10093 | CAG\|GTGAGTGGGT...GCTGCTTTTTCT/ATCCGTCTCAGG...CGCAG\|GTT | 0 | 1 | 55.318 |
| 122606279 | GT-AG | 0 | 1.000000099473604e-05 | 2312 | rna-XM_021209545.2 22607851 | 32 | 108999917 | 109002228 | Mus pahari 10093 | CCC\|GTGAGTAGAT...CTGCCTGTGACC/TAGTGTGTGACC...CGCAG\|ACG | 0 | 1 | 57.197 |
| 122606280 | GT-AG | 0 | 1.000000099473604e-05 | 703 | rna-XM_021209545.2 22607851 | 33 | 108999065 | 108999767 | Mus pahari 10093 | GCG\|GTTGGTATAC...GCCTCCTCATCT/TGCCTCCTCATC...TTCAG\|ACT | 2 | 1 | 59.063 |
| 122606281 | GT-AG | 0 | 1.000000099473604e-05 | 1610 | rna-XM_021209545.2 22607851 | 34 | 108997356 | 108998965 | Mus pahari 10093 | CAG\|GTGTGTGCAG...CTTCTGTTACAT/TGCTATTTCACT...ACTAG\|CTG | 2 | 1 | 60.303 |
| 122606282 | GT-AG | 0 | 1.000000099473604e-05 | 1033 | rna-XM_021209545.2 22607851 | 35 | 108996190 | 108997222 | Mus pahari 10093 | AAG\|GTAGGCTGGC...GCATCTTTGGGT/CTTTGGGTCATG...CACAG\|ATT | 0 | 1 | 61.969 |
| 122606283 | GT-AG | 0 | 1.000000099473604e-05 | 998 | rna-XM_021209545.2 22607851 | 36 | 108995045 | 108996042 | Mus pahari 10093 | CAG\|GTAAGGCGTC...CATCCTGTACCA/ATGGAGCTAACA...TTTAG\|GAG | 0 | 1 | 63.811 |
| 122606284 | GT-AG | 0 | 1.000000099473604e-05 | 110 | rna-XM_021209545.2 22607851 | 37 | 108994734 | 108994843 | Mus pahari 10093 | ACA\|GTGAGTTGGG...GAGGTCTGACCA/TGAGGTCTGACC...CCCAG\|GTC | 0 | 1 | 66.328 |
| 122606285 | GT-AG | 0 | 1.000000099473604e-05 | 690 | rna-XM_021209545.2 22607851 | 38 | 108993937 | 108994626 | Mus pahari 10093 | CAG\|GTAATGATGC...AATCCCATAGTC/AAAAGCTTCAAT...ACCAG\|GTG | 2 | 1 | 67.669 |
| 122606286 | GT-AG | 0 | 1.000000099473604e-05 | 2204 | rna-XM_021209545.2 22607851 | 39 | 108991556 | 108993759 | Mus pahari 10093 | GAG\|GTAAGGATGT...TTGTTCTAAACC/TTTGTTCTAAAC...TAAAG\|GTT | 2 | 1 | 69.886 |
| 122606287 | GT-AG | 0 | 0.0012075767399487 | 4218 | rna-XM_021209545.2 22607851 | 40 | 108987202 | 108991419 | Mus pahari 10093 | GAC\|GTAAGTTTGA...CTTGCTTTACCT/CCTTGCTTTACC...CTCAG\|TTA | 0 | 1 | 71.59 |
| 122606288 | GT-AG | 0 | 1.000000099473604e-05 | 889 | rna-XM_021209545.2 22607851 | 41 | 108986124 | 108987012 | Mus pahari 10093 | TTG\|GTAAAAGCAG...ACCTTCTTGTCT/TTGGTGGTGACA...CAAAG\|CAA | 0 | 1 | 73.957 |
| 122606289 | GT-AG | 0 | 1.000000099473604e-05 | 1525 | rna-XM_021209545.2 22607851 | 42 | 108984410 | 108985934 | Mus pahari 10093 | TTG\|GTAGGTGTCC...GACACCTTCTAT/CAGGAATCCACA...TGTAG\|GTA | 0 | 1 | 76.325 |
| 122606290 | GT-AG | 0 | 5.121752139949723e-05 | 80 | rna-XM_021209545.2 22607851 | 43 | 108984195 | 108984274 | Mus pahari 10093 | AAG\|GTACGCAGCA...GTGGCTTGAGCT/TGTGGCTTGAGC...CACAG\|AAG | 0 | 1 | 78.016 |
| 122606291 | GT-AG | 0 | 2.357799013507045e-05 | 475 | rna-XM_021209545.2 22607851 | 44 | 108983540 | 108984014 | Mus pahari 10093 | CAG\|GTATAGCCCT...CCACTCTTCTCT/ACAGCACACACC...TTCAG\|AGC | 0 | 1 | 80.271 |
| 122606292 | GT-AG | 0 | 1.000000099473604e-05 | 1246 | rna-XM_021209545.2 22607851 | 45 | 108982123 | 108983368 | Mus pahari 10093 | AGG\|GTAATGGTCC...CCATTCTGAGTG/GCCATTCTGAGT...CACAG\|GCA | 0 | 1 | 82.413 |
| 122606293 | GT-AG | 0 | 1.000000099473604e-05 | 510 | rna-XM_021209545.2 22607851 | 46 | 108981515 | 108982024 | Mus pahari 10093 | CAG\|GTGAGGCTTT...GGGCCTGTGACC/TGGCTGTTTACC...TGCAG\|GTC | 2 | 1 | 83.64 |
| 122606294 | GT-AG | 0 | 0.0002012897616758 | 416 | rna-XM_021209545.2 22607851 | 47 | 108981022 | 108981437 | Mus pahari 10093 | GCG\|GTAAGCTTGT...CTGCTCTGAGCC/TCTGCTCTGAGC...GACAG\|GTT | 1 | 1 | 84.605 |
| 122606295 | GT-AG | 0 | 0.0002079203786412 | 525 | rna-XM_021209545.2 22607851 | 48 | 108980402 | 108980926 | Mus pahari 10093 | AAG\|GTACACAGCC...TATATCTTGCCT/TGTGATTTGAGT...TTAAG\|GAG | 0 | 1 | 85.795 |
| 122606296 | GT-AG | 0 | 0.0019700066683222 | 807 | rna-XM_021209545.2 22607851 | 49 | 108979497 | 108980303 | Mus pahari 10093 | CAG\|GTATCGGCAT...AAAGCCTGAACA/TGTTTCCTAAAG...TTCAG\|AAA | 2 | 1 | 87.022 |
| 122606297 | GT-AG | 0 | 1.000000099473604e-05 | 2110 | rna-XM_021209545.2 22607851 | 50 | 108977307 | 108979416 | Mus pahari 10093 | CAG\|GTAAGTGGCA...TGTGTCTGAGCT/CTGTGTCTGAGC...CACAG\|GGC | 1 | 1 | 88.025 |
| 122606298 | GT-AG | 0 | 1.000000099473604e-05 | 2413 | rna-XM_021209545.2 22607851 | 51 | 108974741 | 108977153 | Mus pahari 10093 | TAG\|GTGAGTAGTA...TTTGCATTATCG/CTTTGCATTATC...CACAG\|AAG | 1 | 1 | 89.941 |
| 122606299 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_021209545.2 22607851 | 52 | 108974571 | 108974669 | Mus pahari 10093 | AGG\|GTAGGTGGGT...GAGGCGTTGATA/GAGGCGTTGATA...TGCAG\|AGC | 0 | 1 | 90.831 |
| 122606300 | GT-AG | 1 | 99.99977756805129 | 75 | rna-XM_021209545.2 22607851 | 53 | 108974435 | 108974509 | Mus pahari 10093 | AGT\|GTATCCTTTG...TTGTCCTTGACT/TTGTCCTTGACT...CCCAG\|TTG | 1 | 1 | 91.595 |
| 122606301 | GT-AG | 0 | 1.000000099473604e-05 | 361 | rna-XM_021209545.2 22607851 | 54 | 108974038 | 108974398 | Mus pahari 10093 | CAG\|GTAAGAAGGT...CATGCCTTGATA/GTCTCTTTCAAT...TGCAG\|ACT | 1 | 1 | 92.046 |
| 122606302 | GT-AG | 0 | 1.000000099473604e-05 | 941 | rna-XM_021209545.2 22607851 | 55 | 108972994 | 108973934 | Mus pahari 10093 | AAG\|GTTGGGTCCT...CTGGTTTCACCC/GCTGGTTTCACC...TGCAG\|CCA | 2 | 1 | 93.336 |
| 122606303 | GT-AG | 0 | 1.000000099473604e-05 | 1515 | rna-XM_021209545.2 22607851 | 56 | 108971366 | 108972880 | Mus pahari 10093 | AAG\|GTGAGGCCCC...GGAGCCTGACTG/AGGAGCCTGACT...TGCAG\|AAT | 1 | 1 | 94.751 |
| 122606304 | GT-AG | 0 | 0.0126820657126226 | 353 | rna-XM_021209545.2 22607851 | 57 | 108970931 | 108971283 | Mus pahari 10093 | GAG\|GTACCTACTG...AGCCCCTTGGTA/AGCACCATCAGC...TACAG\|CAC | 2 | 1 | 95.779 |
| 122606305 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_021209545.2 22607851 | 58 | 108970690 | 108970793 | Mus pahari 10093 | ACA\|GTGAGTGGGC...AGACTTTTCTCT/GGGTAGCTCACC...CCCAG\|ACC | 1 | 1 | 97.495 |
| 122606306 | GT-AG | 0 | 1.000000099473604e-05 | 1923 | rna-XM_021209545.2 22607851 | 59 | 108968689 | 108970611 | Mus pahari 10093 | AAG\|GTAAGGTAGA...TGTGTCTATGCT/CTGGGATTCACA...CACAG\|ACG | 1 | 1 | 98.472 |
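
Combining the filter used for this listing (`transcript_id = 22607851`) with `is_minor = 1` picks out the one intron in these 59 rows that is classified as minor: ordinal index 53, with a score of about 99.9998. A minimal sketch of that query follows. It assumes a reduced in-memory table holding four rows copied from the listing; the function name `minor_introns` is hypothetical and not part of the database or its web interface.

```python
import sqlite3

# (id, is_minor, score, length, transcript_id, ordinal_index), copied from the listing.
SAMPLE_ROWS = [
    (122606299, 0, 1.000000099473604e-05, 99, 22607851, 52),
    (122606300, 1, 99.99977756805129, 75, 22607851, 53),
    (122606301, 0, 1.000000099473604e-05, 361, 22607851, 54),
    (122606304, 0, 0.0126820657126226, 353, 22607851, 57),
]


def minor_introns(conn, transcript_id):
    """Return (id, ordinal_index, score, length) for minor introns of one transcript."""
    cur = conn.execute(
        "SELECT id, ordinal_index, score, length FROM introns "
        "WHERE transcript_id = ? AND is_minor = 1 "
        "ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")
# Reduced copy of the introns table (illustration only).
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, is_minor INTEGER, score REAL, "
    "length INTEGER, transcript_id INTEGER, ordinal_index INTEGER)"
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", SAMPLE_ROWS)

hits = minor_introns(conn, 22607851)
for intron_id, ordinal, score, length in hits:
    print(intron_id, ordinal, round(score, 4), length)
```

Filtering on `is_minor` rather than on a score cutoff keeps the query independent of whatever threshold the classifier used, which this page does not state.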
CREATE TABLE "introns" (
   "id" INTEGER,
   "dinucleotide_pair" TEXT,
   "is_minor" INTEGER,
   "score" REAL,
   "length" INTEGER,
   "transcript_id" INTEGER,
   "ordinal_index" INTEGER,
   "start" INTEGER,
   "end" INTEGER,
   "taxonomy_id" INTEGER,
   "scored_motifs" TEXT,
   "phase" INTEGER,
   "in_cds" INTEGER,
   "relative_position" REAL,
   PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
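
The indexes above are what allow a filter such as `transcript_id = 22607851` to avoid a full table scan. As a rough illustration, the sketch below builds an empty, reduced version of the table in memory and asks SQLite for its query plan. The reduced schema and the helper name `query_plan` are assumptions for illustration; the exact plan wording varies between SQLite versions, so the code only looks for the index name in the plan text.

```python
import sqlite3

# Reduced version of the schema above: a few columns and two of the indexes.
SCHEMA = """
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    is_minor INTEGER,
    score REAL,
    transcript_id INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
CREATE INDEX idx_introns_is_minor ON introns (is_minor);
"""


def query_plan(conn, sql, params=()):
    """Return the human-readable detail strings from EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The detail text is the last column of each plan row.
    return [row[-1] for row in rows]


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

plan = query_plan(
    conn,
    "SELECT id FROM introns WHERE transcript_id = ?",
    (22607851,),
)
uses_index = any("idx_introns_transcript_id" in detail for detail in plan)
print(plan)
print(uses_index)
```

The same check can be run against the real database file to confirm which index a given filter or facet actually uses.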