introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
42 rows where transcript_id = 32688864
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182658599 | GT-AG | 0 | 1.000000099473604e-05 | 194 | rna-XM_022760498.1 32688864 | 2 | 1309439 | 1309632 | Seriola dumerili 41447 | AAG|GTGAGTGTTC...TTATTTTTATCT/ATTATTTTTATC...TTCAG|AGT | 2 | 1 | 7.17 |
| 182658600 | GT-AG | 0 | 1.000000099473604e-05 | 374 | rna-XM_022760498.1 32688864 | 3 | 1309853 | 1310226 | Seriola dumerili 41447 | CAG|GTAGAGTTGA...TTTTCCTTTCCG/AACTGTTTCACA...TCCAG|GAG | 0 | 1 | 10.198 |
| 182658601 | GT-AG | 0 | 1.000000099473604e-05 | 11112 | rna-XM_022760498.1 32688864 | 4 | 1310737 | 1321848 | Seriola dumerili 41447 | GAG|GTAAGATGCT...ACCCCTTTAAAT/TTAAATCTAACT...GCCAG|GCC | 0 | 1 | 17.217 |
| 182658602 | GT-AG | 0 | 1.000000099473604e-05 | 2711 | rna-XM_022760498.1 32688864 | 5 | 1321944 | 1324654 | Seriola dumerili 41447 | AGG|GTGAGTAATT...CTTTTCCTATTT/TTCCTATTTACT...TTTAG|GGC | 2 | 1 | 18.525 |
| 182658603 | GT-AG | 0 | 1.000000099473604e-05 | 1250 | rna-XM_022760498.1 32688864 | 6 | 1324718 | 1325967 | Seriola dumerili 41447 | GAG|GTGAGTAGGG...CTGGATTTAACA/CTGGATTTAACA...TTCAG|GAA | 2 | 1 | 19.392 |
| 182658604 | GT-AG | 0 | 1.000000099473604e-05 | 397 | rna-XM_022760498.1 32688864 | 7 | 1326068 | 1326464 | Seriola dumerili 41447 | CAG|GTACAGTGCT...ATTCTCTCATCT/CATTCTCTCATC...TGTAG|CAA | 0 | 1 | 20.768 |
| 182658605 | GT-AG | 0 | 2.1088927355397648e-05 | 391 | rna-XM_022760498.1 32688864 | 8 | 1326566 | 1326956 | Seriola dumerili 41447 | ACA|GTAAGCCAAT...ATTTTTATGACT/ATTGTTGTCACT...TGCAG|GAT | 2 | 1 | 22.158 |
| 182658606 | GT-AG | 0 | 0.000131998424187 | 81 | rna-XM_022760498.1 32688864 | 9 | 1327087 | 1327167 | Seriola dumerili 41447 | AAG|GTAACATATT...GTGTCTTTGAGT/CAATACTTCACT...CTCAG|GTT | 0 | 1 | 23.947 |
| 182658607 | GT-AG | 0 | 1.000000099473604e-05 | 352 | rna-XM_022760498.1 32688864 | 10 | 1327258 | 1327609 | Seriola dumerili 41447 | GAG|GTAGGCACCA...TGTTCATTGAAG/GTTATGTTCATT...ATCAG|GCT | 0 | 1 | 25.186 |
| 182658608 | GT-AG | 0 | 1.000000099473604e-05 | 156 | rna-XM_022760498.1 32688864 | 11 | 1327727 | 1327882 | Seriola dumerili 41447 | CCG|GTGAGACACA...ATTCCCTTCCTG/CTGCTCCTCACT...TCCAG|TGC | 0 | 1 | 26.796 |
| 182658609 | GT-AG | 0 | 1.000000099473604e-05 | 213 | rna-XM_022760498.1 32688864 | 12 | 1328018 | 1328230 | Seriola dumerili 41447 | CAG|GTGAGCGGTT...ATGACTTCACCA/CATGACTTCACC...TGCAG|GAG | 0 | 1 | 28.654 |
| 182658610 | GT-AG | 0 | 1.000000099473604e-05 | 153 | rna-XM_022760498.1 32688864 | 13 | 1328353 | 1328505 | Seriola dumerili 41447 | CAA|GTAAGACACA...TTTTCCTTGTTT/TCTCTATGCACC...TCCAG|CTT | 2 | 1 | 30.333 |
| 182658611 | GT-AG | 0 | 1.5352348817716964e-05 | 169 | rna-XM_022760498.1 32688864 | 14 | 1328648 | 1328816 | Seriola dumerili 41447 | AAG|GTAAGCAAGA...ATGTTCTTGATA/ATATGTTTGACC...ACCAG|GAT | 0 | 1 | 32.287 |
| 182658612 | GT-AG | 0 | 3.510910866777058e-05 | 421 | rna-XM_022760498.1 32688864 | 15 | 1329013 | 1329433 | Seriola dumerili 41447 | CAG|GTACGTTATC...CACTTGTTAATA/CACTGTTTCACT...TTTAG|GTG | 1 | 1 | 34.985 |
| 182658613 | GT-AG | 0 | 1.000557807909144e-05 | 2084 | rna-XM_022760498.1 32688864 | 16 | 1329510 | 1331593 | Seriola dumerili 41447 | CAA|GTAAGTAATG...CACTCCTTCCCT/CCTCTCTATACT...ATCAG|AGA | 2 | 1 | 36.031 |
| 182658614 | GT-AG | 0 | 1.000000099473604e-05 | 857 | rna-XM_022760498.1 32688864 | 17 | 1331708 | 1332564 | Seriola dumerili 41447 | CCA|GTGAGTCTGC...CAGCTTTTAACA/TTCTAACTCACC...CACAG|GAA | 2 | 1 | 37.6 |
| 182658615 | GT-AG | 0 | 2.9900585988093412e-05 | 162 | rna-XM_022760498.1 32688864 | 18 | 1332605 | 1332766 | Seriola dumerili 41447 | AAG|GTAAACCTGA...TTTGCTGTATTA/AGAAGATTAATT...TGCAG|AGT | 0 | 1 | 38.15 |
| 182658616 | GT-AG | 0 | 1.000000099473604e-05 | 121 | rna-XM_022760498.1 32688864 | 19 | 1332890 | 1333010 | Seriola dumerili 41447 | CAG|GTAAAATCAC...CACTCTTTGTTT/GCAGTCCTGACC...TTCAG|ACT | 0 | 1 | 39.843 |
| 182658617 | GT-AG | 0 | 1.000000099473604e-05 | 319 | rna-XM_022760498.1 32688864 | 20 | 1333092 | 1333410 | Seriola dumerili 41447 | AAG|GTAAGACTTT...GGTATCTGAATG/AATATGCTTACA...AGCAG|AAG | 0 | 1 | 40.958 |
| 182658618 | GT-AG | 0 | 1.000000099473604e-05 | 371 | rna-XM_022760498.1 32688864 | 21 | 1333522 | 1333892 | Seriola dumerili 41447 | CAG|GTACTGCAAA...ATTTCTTTTATG/ATTTCTTTTATG...TGCAG|GAG | 0 | 1 | 42.486 |
| 182658619 | GT-AG | 0 | 0.0034154305892559 | 111 | rna-XM_022760498.1 32688864 | 22 | 1334019 | 1334129 | Seriola dumerili 41447 | AAG|GTACACTGTT...ACAATTTTGAAT/ACAATTTTGAAT...TTCAG|GTG | 0 | 1 | 44.22 |
| 182658620 | GT-AG | 0 | 1.000000099473604e-05 | 136 | rna-XM_022760498.1 32688864 | 23 | 1334280 | 1334415 | Seriola dumerili 41447 | CAG|GTAGATGATG...ACAGTTGTGATA/ACAGTTGTGATA...CTTAG|CGT | 0 | 1 | 46.284 |
| 182658621 | GT-AG | 0 | 1.000000099473604e-05 | 167 | rna-XM_022760498.1 32688864 | 24 | 1334577 | 1334743 | Seriola dumerili 41447 | GAG|GTAACAACAC...TGTTTTGTATTT/GTTATATTTATG...TACAG|GTG | 2 | 1 | 48.5 |
| 182658622 | GT-AG | 0 | 0.0015889279749015 | 631 | rna-XM_022760498.1 32688864 | 25 | 1335926 | 1336556 | Seriola dumerili 41447 | ACT|GTACGTCTTC...CTTTTCATAAAT/TTGTGTCTCATC...TGCAG|GAG | 2 | 1 | 64.767 |
| 182658623 | GT-AG | 0 | 0.0598481627973634 | 122 | rna-XM_022760498.1 32688864 | 26 | 1336633 | 1336754 | Seriola dumerili 41447 | GAG|GTATTTTATA...TCAGCTTTAATT/TCAGCTTTAATT...TGCAG|ATA | 0 | 1 | 65.813 |
| 182658624 | GT-AG | 0 | 1.000000099473604e-05 | 836 | rna-XM_022760498.1 32688864 | 27 | 1336874 | 1337709 | Seriola dumerili 41447 | CAG|GTACTGCATT...TATTCTTTAGAG/AGGTTTTTGAAT...TGCAG|GTC | 2 | 1 | 67.451 |
| 182658625 | GT-AG | 0 | 1.000000099473604e-05 | 565 | rna-XM_022760498.1 32688864 | 28 | 1337829 | 1338393 | Seriola dumerili 41447 | AAG|GTAAATAATA...CAATTTTTACTA/TTTTTCCTCATG...TTCAG|CCC | 1 | 1 | 69.089 |
| 182658626 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_022760498.1 32688864 | 29 | 1338588 | 1338676 | Seriola dumerili 41447 | CAG|GTAAATTAAA...GTCATTTTAACT/TTTTAACTTACC...TGCAG|GTC | 0 | 1 | 71.759 |
| 182658627 | GT-AG | 0 | 3.487234543737143e-05 | 111 | rna-XM_022760498.1 32688864 | 30 | 1338785 | 1338895 | Seriola dumerili 41447 | ACA|GTAAGTCTAT...TCGTCCTCATTT/CTCGTCCTCATT...ACCAG|AAA | 0 | 1 | 73.245 |
| 182658628 | GT-AG | 0 | 1.000000099473604e-05 | 94 | rna-XM_022760498.1 32688864 | 31 | 1339067 | 1339160 | Seriola dumerili 41447 | AAG|GTGAAACACT...TTAGTCCAAATA/AAAGTATTCACA...TTCAG|CCT | 0 | 1 | 75.599 |
| 182658629 | GT-AG | 0 | 0.0001106443380437 | 97 | rna-XM_022760498.1 32688864 | 32 | 1339203 | 1339299 | Seriola dumerili 41447 | AGC|GTAAGTGTGC...GATTTCTTATTG/TTCTTATTGATT...ATTAG|ATG | 0 | 1 | 76.177 |
| 182658630 | GT-AG | 0 | 1.000000099473604e-05 | 502 | rna-XM_022760498.1 32688864 | 33 | 1339412 | 1339913 | Seriola dumerili 41447 | GCT|GTGAGTTAAC...CATTTTTTAAAA/CATTTTTTAAAA...TGCAG|ATT | 1 | 1 | 77.718 |
| 182658631 | GT-AG | 0 | 4.238787887743063e-05 | 458 | rna-XM_022760498.1 32688864 | 34 | 1339988 | 1340445 | Seriola dumerili 41447 | AAG|GTAGGTTTTT...CCTGTCTGAATT/TTCTTGCTGACT...CACAG|AGT | 0 | 1 | 78.737 |
| 182658632 | GT-AG | 0 | 1.000000099473604e-05 | 140 | rna-XM_022760498.1 32688864 | 35 | 1340639 | 1340778 | Seriola dumerili 41447 | CTG|GTTAGACTGC...AAAAACTTGATT/TCATTGTTGATG...CTTAG|ACC | 1 | 1 | 81.393 |
| 182658633 | GT-AG | 0 | 1.000000099473604e-05 | 252 | rna-XM_022760498.1 32688864 | 36 | 1340908 | 1341159 | Seriola dumerili 41447 | TGG|GTAAGGAGGC...TGTGTTTTCTCC/GTGTGATTCATT...TCCAG|AAC | 1 | 1 | 83.168 |
| 182658634 | GT-AG | 0 | 1.000000099473604e-05 | 257 | rna-XM_022760498.1 32688864 | 37 | 1341266 | 1341522 | Seriola dumerili 41447 | CAG|GTAGGGATTC...AATATCTAATCA/TAATATCTAATC...TGCAG|GGT | 2 | 1 | 84.627 |
| 182658635 | GT-AG | 0 | 1.000000099473604e-05 | 187 | rna-XM_022760498.1 32688864 | 38 | 1341655 | 1341841 | Seriola dumerili 41447 | CAC|GTAAGACTGC...GTTCTCTTCTCC/TACATACTCATA...TATAG|GTG | 2 | 1 | 86.444 |
| 182658636 | GT-AG | 0 | 1.000000099473604e-05 | 279 | rna-XM_022760498.1 32688864 | 39 | 1341966 | 1342244 | Seriola dumerili 41447 | ACG|GTGAGGAAGA...TGTTCCTTGTCT/TCTGCAGTTACA...CTTAG|CAC | 0 | 1 | 88.15 |
| 182658637 | GT-AG | 0 | 9.7680115374956e-05 | 849 | rna-XM_022760498.1 32688864 | 40 | 1342296 | 1343144 | Seriola dumerili 41447 | GTG|GTATGTATCA...ATAACTTTCAGG/TTGGGTATCAGT...TGCAG|GTG | 0 | 1 | 88.852 |
| 182658638 | GT-AG | 0 | 1.000000099473604e-05 | 103 | rna-XM_022760498.1 32688864 | 41 | 1343261 | 1343363 | Seriola dumerili 41447 | GAA|GTAAAGAAAC...ACGTCCTTATCA/GACGTCCTTATC...TGCAG|AGA | 2 | 1 | 90.449 |
| 182658639 | GT-AG | 0 | 1.000000099473604e-05 | 493 | rna-XM_022760498.1 32688864 | 42 | 1343504 | 1343996 | Seriola dumerili 41447 | AAG|GTCAGTATTA...TTTTTTTTTTCT/AACATGTTTATT...TCCAG|GAC | 1 | 1 | 92.375 |
| 182662132 | GT-AG | 0 | 1.000000099473604e-05 | 19274 | rna-XM_022760498.1 32688864 | 1 | 1289944 | 1309217 | Seriola dumerili 41447 | AAG|GTACGTTAAA...TTCGTTATAAAA/CATGGATTGATC...TTCAG|GTG | 0 | 4.996 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);