introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
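
As a rough illustration of how dinucleotide_pair relates to scored_motifs, the sketch below derives the terminal dinucleotides from a scored_motifs string. The layout it assumes (`<upstream flank>|<intron motifs>|<downstream flank>`) is inferred from the sample rows on this page, not from official documentation, and `terminal_dinucleotides` is a hypothetical helper name.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the first and last two bases of the intron part, e.g. 'GT-AG'.

    Assumption: the string has exactly three '|'-separated parts and the
    middle part begins and ends with intron sequence.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs value of intron 68109368 from the table on this page
example = "GCT|GTGAGTACGA...ATTCATTTAAAT/CCGACATTCATT...CACAG|AGA"
pair = terminal_dinucleotides(example)
print(pair)  # GT-AG
```

For this row the derived value matches the stored dinucleotide_pair (GT-AG); whether that holds for every row in the database is not verified here.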
55 rows where transcript_id = 12801835
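
The row filter above corresponds to a simple `WHERE` clause on transcript_id. The following is a minimal sketch using Python's standard `sqlite3` module against an in-memory stand-in table; only a subset of the columns is created, two rows are copied from the table on this page, and the third row is made up to show that other transcripts are excluded. `introns_for_transcript` is a hypothetical helper name.

```python
import sqlite3

# In-memory stand-in for the real database (subset of columns only).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "length" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (68109368, 12801835, 1, 11239772, 11253615, 13844),
        (68109369, 12801835, 2, 11239574, 11239699, 126),
        (1, 99, 1, 100, 199, 100),  # made-up row for a different transcript
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript, ordered by position in the transcript."""
    return conn.execute(
        'SELECT "id", "ordinal_index", "start", "end", "length" '
        'FROM introns WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
        (transcript_id,),
    ).fetchall()


result = introns_for_transcript(conn, 12801835)
print(result)
```

The column names `start` and `end` are quoted because `END` is an SQL keyword; the schema on this page quotes them as well.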
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68109368 | GT-AG | 0 | 1.000000099473604e-05 | 13844 | rna-XM_029498431.1 12801835 | 1 | 11239772 | 11253615 | Echeneis naucrates 173247 | GCT\|GTGAGTACGA...ATTCATTTAAAT/CCGACATTCATT...CACAG\|AGA | 2 | 1 | 2.003 |
| 68109369 | GT-AG | 0 | 1.000000099473604e-05 | 126 | rna-XM_029498431.1 12801835 | 2 | 11239574 | 11239699 | Echeneis naucrates 173247 | GCT\|GTGAGTATGC...CGTACCATATCA/CTGTTAGTTATA...GTCAG\|GTA | 2 | 1 | 2.522 |
| 68109370 | GT-AG | 0 | 1.000000099473604e-05 | 1365 | rna-XM_029498431.1 12801835 | 3 | 11238140 | 11239504 | Echeneis naucrates 173247 | CCT\|GTGAGTACAA...TTTGTTTTAATT/TTTGTTTTAATT...CACAG\|CGA | 2 | 1 | 3.019 |
| 68109371 | GT-AG | 0 | 5.775833655590394e-05 | 1617 | rna-XM_029498431.1 12801835 | 4 | 11236451 | 11238067 | Echeneis naucrates 173247 | AAT\|GTAAGTTATC...CCATCATTATCT/TATCCACTAAAT...TATAG\|CGA | 2 | 1 | 3.538 |
| 68109372 | GT-AG | 0 | 2.9408500745677745e-05 | 150 | rna-XM_029498431.1 12801835 | 5 | 11236137 | 11236286 | Echeneis naucrates 173247 | GTG\|GTATGGGACA...TAGCCTTTAAAT/GATGTTTTGAAC...TTCAG\|GTC | 1 | 1 | 4.72 |
| 68109373 | GT-AG | 0 | 0.0012725599385521 | 426 | rna-XM_029498431.1 12801835 | 6 | 11235463 | 11235888 | Echeneis naucrates 173247 | AAG\|GTATGTAGCA...TGTCCTTTGATT/TGTCCTTTGATT...CACAG\|AAA | 0 | 1 | 6.507 |
| 68109374 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-XM_029498431.1 12801835 | 7 | 11235089 | 11235201 | Echeneis naucrates 173247 | GAG\|GTGACAAGGA...GGGCCTTTAATT/TTTTGGTTCATG...TTAAG\|GTC | 0 | 1 | 8.387 |
| 68109375 | GT-AG | 0 | 1.000000099473604e-05 | 382 | rna-XM_029498431.1 12801835 | 8 | 11234538 | 11234919 | Echeneis naucrates 173247 | AAG\|GTAAGCAACA...TGGGTCTAAAAG/TTGTCTGTCATC...GCCAG\|CCT | 1 | 1 | 9.605 |
| 68109376 | GT-AG | 0 | 1.000000099473604e-05 | 1847 | rna-XM_029498431.1 12801835 | 9 | 11232513 | 11234359 | Echeneis naucrates 173247 | ACA\|GTGAGTGTTT...TGTGTTTTCATT/TGTGTTTTCATT...CACAG\|AGA | 2 | 1 | 10.888 |
| 68109377 | GT-AG | 0 | 4.997455286816586e-05 | 1145 | rna-XM_029498431.1 12801835 | 10 | 11231159 | 11232303 | Echeneis naucrates 173247 | AAG\|GTAAGCTCTG...GTCACCTGGACT/TCTTTTGTCACC...TACAG\|TAC | 1 | 1 | 12.394 |
| 68109378 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_029498431.1 12801835 | 11 | 11230946 | 11231036 | Echeneis naucrates 173247 | GAG\|GTGAGGAGGA...TCCTCCTTAAAC/TTTTTTGTAAAA...CTCAG\|CTG | 0 | 1 | 13.273 |
| 68109379 | GT-AG | 0 | 1.000000099473604e-05 | 1286 | rna-XM_029498431.1 12801835 | 12 | 11229509 | 11230794 | Echeneis naucrates 173247 | CCA\|GTGAGTCAAA...GTACTCATGAAA/AGGGTACTCATG...CTCAG\|GAT | 1 | 1 | 14.361 |
| 68109380 | GT-AG | 0 | 1.000000099473604e-05 | 483 | rna-XM_029498431.1 12801835 | 13 | 11228763 | 11229245 | Echeneis naucrates 173247 | GTG\|GTGAGAGGAT...CTTACTTTGACT/CTTACTTTGACT...TGCAG\|GTA | 0 | 1 | 16.256 |
| 68109381 | GT-AG | 0 | 1.000000099473604e-05 | 127 | rna-XM_029498431.1 12801835 | 14 | 11228306 | 11228432 | Echeneis naucrates 173247 | CTG\|GTAAGAAATC...TGTTTTTTAATT/TGTTTTTTAATT...TTCAG\|GTG | 0 | 1 | 18.634 |
| 68109382 | GT-AG | 0 | 0.0001582642245437 | 98 | rna-XM_029498431.1 12801835 | 15 | 11227734 | 11227831 | Echeneis naucrates 173247 | GTT\|GTAAGTGTTT...TATTCTTTATAT/TTCATATTTACT...TTTAG\|AGC | 0 | 1 | 22.049 |
| 68109383 | GT-AG | 0 | 0.0289131412371598 | 539 | rna-XM_029498431.1 12801835 | 16 | 11227063 | 11227601 | Echeneis naucrates 173247 | ACA\|GTATGTTCTG...CCACTCTCAATT/AGTGGACTCAGT...TTCAG\|GTG | 0 | 1 | 23.0 |
| 68109384 | GT-AG | 0 | 2.00638019486322e-05 | 627 | rna-XM_029498431.1 12801835 | 17 | 11226260 | 11226886 | Echeneis naucrates 173247 | CCG\|GTATGAACAG...GAACCCCTAATA/ATAGTGTTTACG...TGCAG\|ATG | 2 | 1 | 24.269 |
| 68109385 | GT-AG | 0 | 0.0060936617625989 | 165 | rna-XM_029498431.1 12801835 | 18 | 11225964 | 11226128 | Echeneis naucrates 173247 | CAG\|GTAACCCCTC...CTTTTCTTGCTG/GATCATTTCACC...AACAG\|GAA | 1 | 1 | 25.213 |
| 68109386 | GT-AG | 0 | 1.000000099473604e-05 | 610 | rna-XM_029498431.1 12801835 | 19 | 11222736 | 11223345 | Echeneis naucrates 173247 | CAG\|GTAAAGAGAG...CCTCTCTTAATG/CCTCTCTTAATG...TCCAG\|GTG | 0 | 1 | 44.077 |
| 68109387 | GT-AG | 0 | 0.0002191022986511 | 77 | rna-XM_029498431.1 12801835 | 20 | 11222461 | 11222537 | Echeneis naucrates 173247 | GAG\|GTAAGCTCAC...TGTGTCTTAATG/ATGTGTCTTAAT...TCCAG\|CCC | 0 | 1 | 45.504 |
| 68109388 | GT-AG | 0 | 0.0195251495996435 | 136 | rna-XM_029498431.1 12801835 | 21 | 11222019 | 11222154 | Echeneis naucrates 173247 | CAG\|GTATTTTTAT...AGTTTTTTGACT/AGTTTTTTGACT...GGTAG\|GTG | 0 | 1 | 47.709 |
| 68109389 | GT-AG | 0 | 1.000000099473604e-05 | 324 | rna-XM_029498431.1 12801835 | 22 | 11221249 | 11221572 | Echeneis naucrates 173247 | CAG\|GTCAGTACAG...TTTTTTTTTTCC/TAATAATTGACT...GTCAG\|GCT | 2 | 1 | 50.922 |
| 68109390 | GT-AG | 0 | 1.000000099473604e-05 | 584 | rna-XM_029498431.1 12801835 | 23 | 11220514 | 11221097 | Echeneis naucrates 173247 | AAG\|GTAAGAGCAG...TATTATTTATTT/TTATTATTTATT...TACAG\|GGT | 0 | 1 | 52.01 |
| 68109391 | GT-AG | 0 | 9.58251668878805e-05 | 1205 | rna-XM_029498431.1 12801835 | 24 | 11219156 | 11220360 | Echeneis naucrates 173247 | ACC\|GTAAGCGCGC...GTTAGCTTAACG/TGTTAGCTTAAC...GACAG\|GTG | 0 | 1 | 53.113 |
| 68109392 | GT-AG | 0 | 4.604050208052236e-05 | 1495 | rna-XM_029498431.1 12801835 | 25 | 11217520 | 11219014 | Echeneis naucrates 173247 | GGG\|GTAAACACAC...AACATCTTATAT/AAACATCTTATA...CAAAG\|CAC | 0 | 1 | 54.129 |
| 68109393 | GT-AG | 0 | 1.000000099473604e-05 | 175 | rna-XM_029498431.1 12801835 | 26 | 11217023 | 11217197 | Echeneis naucrates 173247 | CAG\|GTAGGAAAAA...CCCTTTTTCACC/CCCTTTTTCACC...TCCAG\|GTT | 1 | 1 | 56.449 |
| 68109394 | GT-AG | 0 | 1.000000099473604e-05 | 1083 | rna-XM_029498431.1 12801835 | 27 | 11215723 | 11216805 | Echeneis naucrates 173247 | CAA\|GTAATGGCCA...GATTCTTTATTA/AGATTCTTTATT...TCCAG\|ATC | 2 | 1 | 58.013 |
| 68109395 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_029498431.1 12801835 | 28 | 11215476 | 11215562 | Echeneis naucrates 173247 | GAG\|GTAAGAGTCA...CATTCCTTTATA/CAATCATTCATT...TACAG\|TAC | 0 | 1 | 59.166 |
| 68109396 | GT-AG | 0 | 1.000000099473604e-05 | 187 | rna-XM_029498431.1 12801835 | 29 | 11215133 | 11215319 | Echeneis naucrates 173247 | ACA\|GTGAGTAAAG...CAAACCTCACAA/ACAAACCTCACA...TCCAG\|GCG | 0 | 1 | 60.29 |
| 68109397 | GT-AG | 0 | 1.000000099473604e-05 | 951 | rna-XM_029498431.1 12801835 | 30 | 11214037 | 11214987 | Echeneis naucrates 173247 | TGG\|GTAGGAAAGA...CAGCTGTTAACC/CAGCTGTTAACC...TCCAG\|GTG | 1 | 1 | 61.334 |
| 68109398 | GT-AG | 0 | 7.217289039477515e-05 | 120 | rna-XM_029498431.1 12801835 | 31 | 11213587 | 11213706 | Echeneis naucrates 173247 | GAG\|GTGTGTACTT...AATCCCTTAATT/TCCTCTCTCAAT...TCCAG\|ATT | 1 | 1 | 63.712 |
| 68109399 | GT-AG | 0 | 4.42672544846635e-05 | 275 | rna-XM_029498431.1 12801835 | 32 | 11213223 | 11213497 | Echeneis naucrates 173247 | CAG\|GTAACATCAG...CTCCTCTTATGC/TCTCCTCTTATG...CCAAG\|GTC | 0 | 1 | 64.354 |
| 68109400 | GT-AG | 0 | 1.000000099473604e-05 | 755 | rna-XM_029498431.1 12801835 | 33 | 11212155 | 11212909 | Echeneis naucrates 173247 | AGG\|GTGAGTGCTC...TTTTCCTTTTCT/TTGATGGTAACT...CCCAG\|ATG | 1 | 1 | 66.609 |
| 68109401 | GT-AG | 0 | 1.000000099473604e-05 | 138 | rna-XM_029498431.1 12801835 | 34 | 11211842 | 11211979 | Echeneis naucrates 173247 | AGA\|GTAAGAGCCC...TTTTTTTTGTCC/TTTTGTCCCATC...CACAG\|GTT | 2 | 1 | 67.87 |
| 68109402 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_029498431.1 12801835 | 35 | 11211490 | 11211579 | Echeneis naucrates 173247 | CCA\|GTCAGTATCA...CACACTTTAACA/CACACTTTAACA...CTTAG\|GAG | 0 | 1 | 69.758 |
| 68109403 | GT-AG | 0 | 7.386740152504203e-05 | 329 | rna-XM_029498431.1 12801835 | 36 | 11210965 | 11211293 | Echeneis naucrates 173247 | CAG\|GTATGTTAGG...TGAAAGTTAACA/TAGAAATTAAAT...ATTAG\|GCA | 1 | 1 | 71.17 |
| 68109404 | GT-AG | 0 | 1.000000099473604e-05 | 97 | rna-XM_029498431.1 12801835 | 37 | 11210697 | 11210793 | Echeneis naucrates 173247 | AAG\|GTATAGACAC...CATTTCATGATC/CATCATTTCATG...TTTAG\|GAC | 1 | 1 | 72.402 |
| 68109405 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_029498431.1 12801835 | 38 | 11210470 | 11210552 | Echeneis naucrates 173247 | CAG\|GTAAGAAACC...TGAGGCTGGATG/TGACGTCTGATG...TTCAG\|ACG | 1 | 1 | 73.44 |
| 68109406 | GT-AG | 0 | 1.000000099473604e-05 | 94 | rna-XM_029498431.1 12801835 | 39 | 11210165 | 11210258 | Echeneis naucrates 173247 | CAG\|GTACAGGAAT...TTGTTCCTACAT/TTGCATCTGATT...TTCAG\|CCC | 2 | 1 | 74.96 |
| 68109407 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_029498431.1 12801835 | 40 | 11209932 | 11210037 | Echeneis naucrates 173247 | AAG\|GTAATCCGAT...GTCACTTGAATA/TTGTGTGTCAAC...ATTAG\|TGT | 0 | 1 | 75.875 |
| 68109408 | GT-AG | 0 | 4.166667544043956e-05 | 237 | rna-XM_029498431.1 12801835 | 41 | 11209587 | 11209823 | Echeneis naucrates 173247 | GAG\|GTACAGTCTG...AGGTCATTAGTA/GTCTATCTAACT...ATCAG\|ACA | 0 | 1 | 76.654 |
| 68109409 | GT-AG | 0 | 1.000000099473604e-05 | 374 | rna-XM_029498431.1 12801835 | 42 | 11209163 | 11209536 | Echeneis naucrates 173247 | AAG\|GTGAATAATT...TTTGCCGAAGTC/TGTAAGATAACA...TGCAG\|TGT | 2 | 1 | 77.014 |
| 68109410 | GT-AG | 0 | 0.0006008012466204 | 101 | rna-XM_029498431.1 12801835 | 43 | 11208847 | 11208947 | Echeneis naucrates 173247 | CAG\|GTAACCAAAT...CTTCTTTTCACT/CTTCTTTTCACT...CTCAG\|ATG | 1 | 1 | 78.563 |
| 68109411 | GT-AG | 0 | 0.0002222057237965 | 103 | rna-XM_029498431.1 12801835 | 44 | 11208665 | 11208767 | Echeneis naucrates 173247 | CCT\|GTAAGTTTGA...TATGTCTAATTG/CTATGTCTAATT...GTCAG\|GTC | 2 | 1 | 79.132 |
| 68109412 | GT-AG | 0 | 1.0608049243005927e-05 | 1283 | rna-XM_029498431.1 12801835 | 45 | 11207275 | 11208557 | Echeneis naucrates 173247 | CAG\|GTGTGTGTCA...CTTGTTTTAATT/CTTGTTTTAATT...TCCAG\|TCG | 1 | 1 | 79.903 |
| 68109413 | GT-AG | 0 | 1.000000099473604e-05 | 591 | rna-XM_029498431.1 12801835 | 46 | 11206481 | 11207071 | Echeneis naucrates 173247 | AAG\|GTAGACGCAT...ATCTTCCTATTT/CCTATTTTCATT...CCCAG\|GTG | 0 | 1 | 81.366 |
| 68109414 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_029498431.1 12801835 | 47 | 11206195 | 11206285 | Echeneis naucrates 173247 | AAG\|GTGAGCCGCA...CATGCCTCATTT/CCATGCCTCATT...TTCAG\|AAT | 0 | 1 | 82.771 |
| 68109415 | GT-AG | 0 | 6.471441531719484e-05 | 419 | rna-XM_029498431.1 12801835 | 48 | 11205636 | 11206054 | Echeneis naucrates 173247 | CAG\|GTACACACAC...TTTTCTTCAGAC/ATTTTCTTCAGA...TAAAG\|CCG | 2 | 1 | 83.78 |
| 68109416 | GT-AG | 0 | 1.000000099473604e-05 | 161 | rna-XM_029498431.1 12801835 | 49 | 11205344 | 11205504 | Echeneis naucrates 173247 | AGG\|GTGAGACTCG...ATCTTGTTGACT/ATTAGTTTAATT...TGGAG\|GTG | 1 | 1 | 84.724 |
| 68109417 | GT-AG | 0 | 1.000000099473604e-05 | 124 | rna-XM_029498431.1 12801835 | 50 | 11205135 | 11205258 | Echeneis naucrates 173247 | CCG\|GTAAGAAAGG...TTTTCTTTAATG/TAATGTCTCATT...CCCAG\|CCA | 2 | 1 | 85.337 |
| 68109418 | GT-AG | 0 | 0.0003540998198868 | 582 | rna-XM_029498431.1 12801835 | 51 | 11204415 | 11204996 | Echeneis naucrates 173247 | CAG\|GTAGGCCTTG...CATGCCTTGATG/AGTATTTTGACT...CCCAG\|GAC | 2 | 1 | 86.331 |
| 68109419 | GT-AG | 0 | 0.0044326649816715 | 111 | rna-XM_029498431.1 12801835 | 52 | 11204132 | 11204242 | Echeneis naucrates 173247 | GTG\|GTAACTCCTG...GAATCTTTGAAA/ACTTCCTTCATT...TCCAG\|ATC | 0 | 1 | 87.57 |
| 68109420 | GT-AG | 0 | 1.000000099473604e-05 | 880 | rna-XM_029498431.1 12801835 | 53 | 11202961 | 11203840 | Echeneis naucrates 173247 | AAG\|GTAAGAGAGA...TCTCTCATATTT/TTTTCTCTCATA...TAAAG\|GCA | 0 | 1 | 89.667 |
| 68109421 | GT-AG | 0 | 1.000000099473604e-05 | 747 | rna-XM_029498431.1 12801835 | 54 | 11202079 | 11202825 | Echeneis naucrates 173247 | CTG\|GTCAGACTTT...TCAAACTTATTT/TTCAAACTTATT...TGCAG\|CTT | 0 | 1 | 90.64 |
| 68109422 | GT-AG | 0 | 1.2619796901346324e-05 | 223 | rna-XM_029498431.1 12801835 | 55 | 11201535 | 11201757 | Echeneis naucrates 173247 | GAG\|GTGCATTTCC...TTTTCTTTTTCA/TTCTTTTTCATG...CCTAG\|TTT | 0 | 1 | 92.953 |
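
The coordinates in the rows above appear to be inclusive on both ends, so that length = end - start + 1. The sketch below checks this for the first three introns and also derives the sizes of the exons between them. It assumes, based only on the observation that start decreases as ordinal_index increases, that the transcript runs in descending genomic order (consistent with a minus-strand gene); that orientation is an inference from these rows, not something the page states.

```python
# (ordinal_index, start, end, length) for the first three introns listed above
introns = [
    (1, 11239772, 11253615, 13844),
    (2, 11239574, 11239699, 126),
    (3, 11238140, 11239504, 1365),
]

# Inclusive coordinates: both the start and the end base belong to the intron.
lengths_ok = all(end - start + 1 == length for _, start, end, length in introns)

# Assumption: descending genomic order, so the exon between two consecutive
# introns lies between the later intron's end and the earlier intron's start.
exon_lengths = [
    prev_start - next_end - 1
    for (_, prev_start, _, _), (_, _, next_end, _) in zip(introns, introns[1:])
]

print(lengths_ok)    # True
print(exon_lengths)  # [72, 69]
```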
CREATE TABLE "introns" (
   "id" INTEGER,
   "dinucleotide_pair" TEXT,
   "is_minor" INTEGER,
   "score" REAL,
   "length" INTEGER,
   "transcript_id" INTEGER,
   "ordinal_index" INTEGER,
   "start" INTEGER,
   "end" INTEGER,
   "taxonomy_id" INTEGER,
   "scored_motifs" TEXT,
   "phase" INTEGER,
   "in_cds" INTEGER,
   "relative_position" REAL,
   PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
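
To see what the indexes above are for, SQLite's `EXPLAIN QUERY PLAN` can report whether a filter on transcript_id is served by `idx_introns_transcript_id`. This is a minimal sketch on an empty in-memory table with a reduced set of columns; the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name in it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced schema: only the columns needed for the demonstration.
conn.executescript(
    """
    CREATE TABLE "introns" (
       "id" INTEGER,
       "transcript_id" INTEGER,
       "ordinal_index" INTEGER,
       PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (12801835,),
).fetchall()

# The human-readable description is the last column of each plan row.
plan_text = " ".join(str(row[-1]) for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
print(uses_index)
```

On the real database the planner may choose differently once table statistics are available, so the plan is worth checking there rather than assumed.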