introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
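The relationship between `scored_motifs` and `dinucleotide_pair` can be illustrated with a small sketch. It assumes, based only on the values visible in this table's rows and not on any documented format, that `scored_motifs` has the layout `upstream exon|intron start...intron end|downstream exon`, with `|` marking the exon/intron boundaries. The function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the intron's terminal dinucleotides, e.g. 'GT-AG'.

    Assumed layout (inferred from the rows on this page, not documented):
    <upstream exon>|<intron 5' end>...<intron 3' end>|<downstream exon>
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two bases of the intronic part.
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs value of intron 68110610 from this page
example = "TGG|GTAAGTGAAT...ATCCTGTTAATG/ACGATACTCAAT...TGCAG|ATA"
print(terminal_dinucleotides(example))  # GT-AG
```

For this row the result agrees with the stored `dinucleotide_pair` value; whether that holds for every row in the database is not something this page confirms.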
42 rows where transcript_id = 12801865
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68110610 | GT-AG | 0 | 1.000000099473604e-05 | 709 | rna-XM_029500776.1 12801865 | 2 | 15021510 | 15022218 | Echeneis naucrates 173247 | TGG\|GTAAGTGAAT...ATCCTGTTAATG/ACGATACTCAAT...TGCAG\|ATA | 1 | 1 | 2.547 |
| 68110611 | GT-AG | 0 | 3.13060824793392e-05 | 1168 | rna-XM_029500776.1 12801865 | 3 | 15020175 | 15021342 | Echeneis naucrates 173247 | AAG\|GTACAGTGCA...CTTTCTTTACCC/TATCTTCTCATC...ACCAG\|GGG | 0 | 1 | 5.401 |
| 68110612 | GT-AG | 0 | 9.724940963031624e-05 | 559 | rna-XM_029500776.1 12801865 | 4 | 15019388 | 15019946 | Echeneis naucrates 173247 | CAG\|GTATGTAAAA...CGTCCCTCAATT/TCGTCCCTCAAT...ATCAG\|AGG | 0 | 1 | 9.298 |
| 68110613 | GT-AG | 0 | 3.978648608131951e-05 | 379 | rna-XM_029500776.1 12801865 | 5 | 15018886 | 15019264 | Echeneis naucrates 173247 | CGG\|GTAAACCTGA...TTTTTGTTGAAA/AAGCTTCTCAGT...CCCAG\|GAT | 0 | 1 | 11.4 |
| 68110614 | GT-AG | 0 | 1.000000099473604e-05 | 719 | rna-XM_029500776.1 12801865 | 6 | 15018083 | 15018801 | Echeneis naucrates 173247 | CAG\|GTGAGACATT...TATTTTTTATCA/TCTCTTTTCAAC...CCCAG\|GTT | 0 | 1 | 12.835 |
| 68110615 | GT-AG | 0 | 1.000000099473604e-05 | 131 | rna-XM_029500776.1 12801865 | 7 | 15017844 | 15017974 | Echeneis naucrates 173247 | AAG\|GTGTGTGCGG...AATTCCAAAAAT/CCAAAAATGAGC...TCTAG\|CTC | 0 | 1 | 14.681 |
| 68110616 | GT-AG | 0 | 2.6688239413404465e-05 | 183 | rna-XM_029500776.1 12801865 | 8 | 15017558 | 15017740 | Echeneis naucrates 173247 | CAG\|GTAACAGTAT...TTTTCCTTCAAT/TCAATGCTTATT...CCCAG\|TCT | 1 | 1 | 16.442 |
| 68110617 | GT-AG | 0 | 1.339161174763422e-05 | 450 | rna-XM_029500776.1 12801865 | 9 | 15017016 | 15017465 | Echeneis naucrates 173247 | TCT\|GTGAGTTGCG...CCTGCTTTAATA/CCTGCTTTAATA...CAAAG\|GTG | 0 | 1 | 18.014 |
| 68110618 | GT-AG | 0 | 1.000000099473604e-05 | 238 | rna-XM_029500776.1 12801865 | 10 | 15016564 | 15016801 | Echeneis naucrates 173247 | CAG\|GTGCTTTGCG...CAATCTGTGAAA/TTTTTCTGTATG...TTTAG\|GGA | 1 | 1 | 21.672 |
| 68110619 | GT-AG | 0 | 1.0454223247528255e-05 | 554 | rna-XM_029500776.1 12801865 | 11 | 15015935 | 15016488 | Echeneis naucrates 173247 | AGG\|GTAGGCACCC...GCCTGTTTATCT/TGCCTGTTTATC...TCCAG\|GTG | 1 | 1 | 22.953 |
| 68110620 | GT-AG | 0 | 0.0010297537938017 | 109 | rna-XM_029500776.1 12801865 | 12 | 15015746 | 15015854 | Echeneis naucrates 173247 | ATC\|GTACGTCTGT...TAATCTCTGACT/TAATCTCTGACT...TTAAG\|GTC | 0 | 1 | 24.321 |
| 68110621 | GT-AG | 0 | 1.000000099473604e-05 | 882 | rna-XM_029500776.1 12801865 | 13 | 15014717 | 15015598 | Echeneis naucrates 173247 | GAG\|GTAAATACAA...ACTCTCTTAATG/TGTATATTCATA...TTAAG\|GTG | 0 | 1 | 26.833 |
| 68110622 | GT-AG | 0 | 1.000000099473604e-05 | 177 | rna-XM_029500776.1 12801865 | 14 | 15014357 | 15014533 | Echeneis naucrates 173247 | AAG\|GTGAAAGTAG...ATCTTCTTCTCT/TCTTCTCTCACC...TCCAG\|TTG | 0 | 1 | 29.961 |
| 68110623 | GT-AG | 0 | 1.000000099473604e-05 | 404 | rna-XM_029500776.1 12801865 | 15 | 15013791 | 15014194 | Echeneis naucrates 173247 | CAG\|GTCAGAGACG...TCTTCTATATCC/TCCACTCTGACA...TTTAG\|GAT | 0 | 1 | 32.729 |
| 68110624 | GT-AG | 0 | 0.0009546079236943 | 883 | rna-XM_029500776.1 12801865 | 16 | 15012698 | 15013580 | Echeneis naucrates 173247 | CCA\|GTATGTGAAG...GCCTATTTAACT/GCCTATTTAACT...CTCAG\|CTG | 0 | 1 | 36.319 |
| 68110625 | GT-AG | 0 | 0.0015357692860188 | 1375 | rna-XM_029500776.1 12801865 | 17 | 15011152 | 15012526 | Echeneis naucrates 173247 | ACG\|GTATGTCTGG...ATTATTTTATTC/TATTATTTTATT...CTTAG\|GAA | 0 | 1 | 39.241 |
| 68110626 | GT-AG | 0 | 0.0050955032549896 | 1927 | rna-XM_029500776.1 12801865 | 18 | 15009095 | 15011021 | Echeneis naucrates 173247 | TTG\|GTATGTTCTC...CTCACCTTTGCT/CCTCCACTCACC...GGCAG\|GGA | 1 | 1 | 41.463 |
| 68110627 | GT-AG | 0 | 1.000000099473604e-05 | 3378 | rna-XM_029500776.1 12801865 | 19 | 15005518 | 15008895 | Echeneis naucrates 173247 | GAG\|GTGAGGAGCC...TAATATTTAACA/TAATATTTAACA...CCTAG\|TAT | 2 | 1 | 44.864 |
| 68110628 | GT-AG | 0 | 1.000000099473604e-05 | 1490 | rna-XM_029500776.1 12801865 | 20 | 15003698 | 15005187 | Echeneis naucrates 173247 | AAG\|GTAATGTAAG...TTATTTTTGCTT/TTTCCATTCAAC...TCCAG\|TTG | 2 | 1 | 50.504 |
| 68110629 | GT-AG | 0 | 1.000000099473604e-05 | 589 | rna-XM_029500776.1 12801865 | 21 | 15002911 | 15003499 | Echeneis naucrates 173247 | AAG\|GTTAGTGAAT...GGAATTTCAGCT/AGGAATTTCAGC...TCCAG\|TGT | 2 | 1 | 53.888 |
| 68110630 | GT-AG | 0 | 1.000000099473604e-05 | 926 | rna-XM_029500776.1 12801865 | 22 | 15001807 | 15002732 | Echeneis naucrates 173247 | GAG\|GTGTGTATGT...TGTGTGCTAACT/TGTGTGCTAACT...CCAAG\|GTT | 0 | 1 | 56.93 |
| 68110631 | GT-AG | 0 | 7.790405462372598e-05 | 418 | rna-XM_029500776.1 12801865 | 23 | 15001278 | 15001695 | Echeneis naucrates 173247 | CAG\|GTAGTCTGTT...CTACTCTCCACT/TTACATATAAAT...CGCAG\|TCT | 0 | 1 | 58.828 |
| 68110632 | GT-AG | 0 | 0.0007176705926181 | 444 | rna-XM_029500776.1 12801865 | 24 | 15000768 | 15001211 | Echeneis naucrates 173247 | AAG\|GTAACTCTTC...TTAGTCATGATG/CTTTTAGTCATG...TCCAG\|GAA | 0 | 1 | 59.956 |
| 68110633 | GT-AG | 0 | 1.000000099473604e-05 | 494 | rna-XM_029500776.1 12801865 | 25 | 15000202 | 15000695 | Echeneis naucrates 173247 | AAG\|GTTGGTTCGT...TGTTCAATAATG/TGCCTGTTCAAT...CCAAG\|GTA | 0 | 1 | 61.186 |
| 68110634 | GT-AG | 0 | 1.000000099473604e-05 | 1001 | rna-XM_029500776.1 12801865 | 26 | 14999098 | 15000098 | Echeneis naucrates 173247 | GAA\|GTAAGTCTGC...TGTTTGTTCACT/TGTTTGTTCACT...CTTAG\|AAA | 1 | 1 | 62.947 |
| 68110635 | GT-AG | 0 | 1.000000099473604e-05 | 326 | rna-XM_029500776.1 12801865 | 27 | 14998563 | 14998888 | Echeneis naucrates 173247 | GAG\|GTAAGGAGGA...ATGTTGTTATTT/AATGTTGTTATT...TCCAG\|ATC | 0 | 1 | 66.519 |
| 68110636 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-XM_029500776.1 12801865 | 28 | 14998359 | 14998470 | Echeneis naucrates 173247 | CAG\|GTAATAAGCA...TATATCCCAGCT/GGGATAATGACA...ACTAG\|GAA | 2 | 1 | 68.091 |
| 68110637 | GT-AG | 0 | 1.000000099473604e-05 | 249 | rna-XM_029500776.1 12801865 | 29 | 14997977 | 14998225 | Echeneis naucrates 173247 | CAG\|GTACAATACC...CTTTCTTTGTCC/AAAGACTTAACA...TGCAG\|GAT | 0 | 1 | 70.364 |
| 68110638 | GT-AG | 0 | 1.000000099473604e-05 | 424 | rna-XM_029500776.1 12801865 | 30 | 14997460 | 14997883 | Echeneis naucrates 173247 | AAG\|GTGTGGCATG...ATTTTGTTGAAT/CAAGTTCTCATA...CGTAG\|GAT | 0 | 1 | 71.954 |
| 68110639 | GT-AG | 0 | 1.5156630194427154e-05 | 95 | rna-XM_029500776.1 12801865 | 31 | 14997290 | 14997384 | Echeneis naucrates 173247 | AGG\|GTAAACACAC...CCTGACTTAATG/GTCATCCTGACT...TTCAG\|GTG | 0 | 1 | 73.235 |
| 68110640 | GT-AG | 0 | 0.0001574755552773 | 296 | rna-XM_029500776.1 12801865 | 32 | 14996688 | 14996983 | Echeneis naucrates 173247 | CTG\|GTATGATCGG...GTCTCTATAATT/GTCTCTATAATT...ACCAG\|GGG | 0 | 1 | 78.465 |
| 68110641 | GT-AG | 0 | 0.0001367394429711 | 455 | rna-XM_029500776.1 12801865 | 33 | 14996149 | 14996603 | Echeneis naucrates 173247 | GAG\|GTATTATGGG...CGACTCTGAACT/TGTATTCTCAAA...CCCAG\|GTA | 0 | 1 | 79.901 |
| 68110642 | GT-AG | 0 | 1.000000099473604e-05 | 1174 | rna-XM_029500776.1 12801865 | 34 | 14994781 | 14995954 | Echeneis naucrates 173247 | GAA\|GTACAAACCT...CAGATTTTCACC/CAGATTTTCACC...TACAG\|CAA | 2 | 1 | 83.217 |
| 68110643 | GT-AG | 0 | 1.000000099473604e-05 | 114 | rna-XM_029500776.1 12801865 | 35 | 14994519 | 14994632 | Echeneis naucrates 173247 | CAG\|GTCTGACACA...ATTGTCTTCATG/ATTGTCTTCATG...CTTAG\|AGT | 0 | 1 | 85.746 |
| 68110644 | GT-AG | 0 | 0.0002465788562109 | 212 | rna-XM_029500776.1 12801865 | 36 | 14994235 | 14994446 | Echeneis naucrates 173247 | CAG\|GTAAACTGTT...ACGATTTTAACA/ACGATTTTAACA...CTTAG\|GAG | 0 | 1 | 86.977 |
| 68110645 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_029500776.1 12801865 | 37 | 14994003 | 14994110 | Echeneis naucrates 173247 | GCA\|GTGAGTGGAG...ACCATCTTAAAA/TACCATCTTAAA...CACAG\|CAT | 1 | 1 | 89.096 |
| 68110646 | GT-AG | 0 | 3.009007591449907e-05 | 213 | rna-XM_029500776.1 12801865 | 38 | 14993755 | 14993967 | Echeneis naucrates 173247 | AAG\|GTAGCACATA...GGGTCTCTAATT/TTTTGTCTAACA...TACAG\|CAT | 0 | 1 | 89.694 |
| 68110647 | GT-AG | 0 | 1.2858516139926529e-05 | 420 | rna-XM_029500776.1 12801865 | 39 | 14993212 | 14993631 | Echeneis naucrates 173247 | CAG\|GTGTGTATTA...TGTCTTTTGTCT/AGAAGTCCGAGT...ATTAG\|ACT | 0 | 1 | 91.796 |
| 68110648 | GT-AG | 0 | 1.000000099473604e-05 | 528 | rna-XM_029500776.1 12801865 | 40 | 14992579 | 14993106 | Echeneis naucrates 173247 | AAG\|GTAATGTTGC...CATCTCATAGTT/ATTCATCTCATA...TTTAG\|GTG | 0 | 1 | 93.591 |
| 68110649 | GT-AG | 0 | 1.000000099473604e-05 | 569 | rna-XM_029500776.1 12801865 | 41 | 14991906 | 14992474 | Echeneis naucrates 173247 | CTG\|GTAATAATCA...TCTCCCTTTGCT/TTGCTTTCTATT...CACAG\|TGC | 2 | 1 | 95.368 |
| 68110650 | GT-AG | 0 | 1.000000099473604e-05 | 661 | rna-XM_029500776.1 12801865 | 42 | 14991121 | 14991781 | Echeneis naucrates 173247 | AAG\|GTGACATAAC...CAGTTTATGACT/CAGTTTATGACT...TGCAG\|GGA | 0 | 1 | 97.488 |
| 68120302 | GT-AG | 0 | 1.000000099473604e-05 | 818 | rna-XM_029500776.1 12801865 | 1 | 15022311 | 15023128 | Echeneis naucrates 173247 | AGG\|GTAGGTGGTG...TTTTCTTTTGTT/TTTGTTGTTATT...TGTAG\|GTG | | 0 | 2.324 |
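As a worked check of how `start`, `end`, `length`, and `phase` relate, the sketch below uses introns 2 through 8 from the rows above. It rests on three assumptions that the page does not state explicitly: coordinates are 1-based and inclusive (so `length = end - start + 1`), the transcript lies on the minus strand (coordinates fall as `ordinal_index` rises), and phase is the number of coding bases upstream of the intron modulo 3. All names in the code are illustrative.

```python
# (ordinal_index, start, end, length, phase) copied from the rows above
ROWS = [
    (2, 15021510, 15022218, 709, 1),
    (3, 15020175, 15021342, 1168, 0),
    (4, 15019388, 15019946, 559, 0),
    (5, 15018886, 15019264, 379, 0),
    (6, 15018083, 15018801, 719, 0),
    (7, 15017844, 15017974, 131, 0),
    (8, 15017558, 15017740, 183, 1),
]


def lengths_consistent(rows):
    """True if every length equals end - start + 1 (inclusive coordinates)."""
    return all(end - start + 1 == length for _, start, end, length, _ in rows)


def exon_lengths(rows):
    """Lengths of the exons between consecutive introns.

    Assumes a minus-strand transcript: the next intron sits at lower
    coordinates, so the exon spans (next.end + 1) .. (prev.start - 1).
    """
    return [prev[1] - nxt[2] - 1 for prev, nxt in zip(rows, rows[1:])]


def propagate_phases(rows):
    """Derive each intron's phase from the first one plus exon lengths.

    Assumes the intervening exons are entirely coding (in_cds = 1 for
    all introns used here).
    """
    phases = [rows[0][4]]
    for exon_len in exon_lengths(rows):
        phases.append((phases[-1] + exon_len) % 3)
    return phases


print(lengths_consistent(ROWS))   # True
print(exon_lengths(ROWS))         # [167, 228, 123, 84, 108, 103]
print(propagate_phases(ROWS))     # [1, 0, 0, 0, 0, 0, 1]
```

The propagated phases match the `phase` column for these seven introns, which is consistent with the assumptions above but does not prove them for the database as a whole.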
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
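The schema and the `transcript_id` filter used on this page can be tried locally with a minimal sketch like the one below. It builds an in-memory SQLite database with Python's standard `sqlite3` module, loads two rows copied from the table above, and asks SQLite whether the `idx_introns_transcript_id` index serves the filter. The referenced `transcripts` and `genomes` tables are not created here; SQLite accepts that as long as foreign key enforcement is off, which is the default. Variable names are illustrative.

```python
import sqlite3

# Schema as shown on this page, with only one of the indexes for brevity.
SCHEMA = """
CREATE TABLE "introns" (
   "id" INTEGER,
   "dinucleotide_pair" TEXT,
   "is_minor" INTEGER,
   "score" REAL,
   "length" INTEGER,
   "transcript_id" INTEGER,
   "ordinal_index" INTEGER,
   "start" INTEGER,
   "end" INTEGER,
   "taxonomy_id" INTEGER,
   "scored_motifs" TEXT,
   "phase" INTEGER,
   "in_cds" INTEGER,
   "relative_position" REAL,
   PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

# Two rows copied from the table above (ids 68110610 and 68110611).
SAMPLE_ROWS = [
    (68110610, "GT-AG", 0, 1.000000099473604e-05, 709, 12801865, 2,
     15021510, 15022218, 173247,
     "TGG|GTAAGTGAAT...ATCCTGTTAATG/ACGATACTCAAT...TGCAG|ATA", 1, 1, 2.547),
    (68110611, "GT-AG", 0, 3.13060824793392e-05, 1168, 12801865, 3,
     15020175, 15021342, 173247,
     "AAG|GTACAGTGCA...CTTTCTTTACCC/TATCTTCTCATC...ACCAG|GGG", 0, 1, 5.401),
]

QUERY = (
    "SELECT ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO introns VALUES (?,?,?,?,?,?,?,?,?,?,?,?,?,?)", SAMPLE_ROWS
)

rows = conn.execute(QUERY, (12801865,)).fetchall()
print(rows)  # [(2, 709), (3, 1168)]

# Each plan row is (id, parent, notused, detail); the detail text names
# the index when SQLite chooses it for the WHERE clause.
plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (12801865,)).fetchall()
uses_index = any("idx_introns_transcript_id" in step[3] for step in plan)
print(uses_index)
```

Ordering by `ordinal_index` rather than by `id` places intron 1 first when the full data is loaded; on this page the default `id` sort puts it last.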