introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
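The column descriptions above map naturally onto a typed record. The following is a minimal Python sketch, not part of the database itself: the `Intron` class and its `from_row` helper are hypothetical names, and the sample values are copied from the row with id 182658896. It assumes the 0/1 flag columns should become booleans and that a null `phase` arrives as `None`.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical wrapper type)."""
    id: int
    dinucleotide_pair: str
    is_minor: bool              # stored as INTEGER 1/0
    score: float                # probability of being minor, 0-100
    length: int                 # base pairs
    transcript_id: int          # references transcripts(id)
    ordinal_index: int          # 1 for the first intron, and so on
    start: int
    end: int
    taxonomy_id: int            # references genomes(taxonomy_id)
    scored_motifs: str
    phase: Optional[int]        # 0, 1, 2, or None outside the coding sequence
    in_cds: bool                # stored as INTEGER 1/0
    relative_position: float

    def __post_init__(self) -> None:
        # Reject phases the column description does not allow.
        if self.phase is not None and self.phase not in (0, 1, 2):
            raise ValueError(f"invalid phase: {self.phase!r}")

    @classmethod
    def from_row(cls, row: Mapping[str, object]) -> "Intron":
        """Build an Intron from a column-name -> value mapping."""
        values = dict(row)
        values["is_minor"] = bool(values["is_minor"])
        values["in_cds"] = bool(values["in_cds"])
        return cls(**values)


row = {
    "id": 182658896, "dinucleotide_pair": "GT-AG", "is_minor": 0,
    "score": 1.000000099473604e-05, "length": 762,
    "transcript_id": 32688872, "ordinal_index": 2,
    "start": 1468415, "end": 1469176, "taxonomy_id": 41447,
    "scored_motifs": "CAG|GTGAAGCTGT...CTTCCTTTAAAA/TCCTTTCTGACT...TGCAG|GAT",
    "phase": 0, "in_cds": 1, "relative_position": 2.222,
}
example = Intron.from_row(row)
```

Keeping `phase` optional preserves the distinction between phase 0 and "no phase" that the column description draws.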
30 rows where transcript_id = 32688872
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182658896 | GT-AG | 0 | 1.000000099473604e-05 | 762 | rna-XM_022740121.1 32688872 | 2 | 1468415 | 1469176 | Seriola dumerili 41447 | CAG\|GTGAAGCTGT...CTTCCTTTAAAA/TCCTTTCTGACT...TGCAG\|GAT | 0 | 1 | 2.222 |
| 182658897 | GT-AG | 0 | 1.000000099473604e-05 | 355 | rna-XM_022740121.1 32688872 | 3 | 1469797 | 1470151 | Seriola dumerili 41447 | GAG\|GTGAGATGGA...ACCCCCTTCCTG/AGCTGGTTAACC...GTCAG\|ACT | 2 | 1 | 14.859 |
| 182658898 | GT-AG | 0 | 1.000000099473604e-05 | 758 | rna-XM_022740121.1 32688872 | 4 | 1470309 | 1471066 | Seriola dumerili 41447 | ATG\|GTAATGATGA...GTGTGTTTATTC/TGTGTGTTTATT...TTCAG\|GCC | 0 | 1 | 18.06 |
| 182658899 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_022740121.1 32688872 | 5 | 1471194 | 1471301 | Seriola dumerili 41447 | GAG\|GTGAGGCTGA...CGACCTTTACTT/GCTGTTTTCATC...TGTAG\|CGG | 1 | 1 | 20.648 |
| 182658900 | GT-AG | 0 | 1.000000099473604e-05 | 1187 | rna-XM_022740121.1 32688872 | 6 | 1471357 | 1472543 | Seriola dumerili 41447 | TAT\|GTAAGTCTAC...CTCCATTTAAAT/AATCATCTCATT...CACAG\|AGC | 2 | 1 | 21.769 |
| 182658901 | GT-AG | 0 | 1.000000099473604e-05 | 607 | rna-XM_022740121.1 32688872 | 7 | 1472619 | 1473225 | Seriola dumerili 41447 | TGA\|GTAAGTGCGG...GTTTCCTTATGT/TGTTTCCTTATG...ACCAG\|GAA | 2 | 1 | 23.298 |
| 182658902 | GT-AG | 0 | 1.000000099473604e-05 | 373 | rna-XM_022740121.1 32688872 | 8 | 1473335 | 1473707 | Seriola dumerili 41447 | AAG\|GTAAACCCAA...ATTTTCTTGTTC/CTGTGTTTGATT...TGCAG\|GGT | 0 | 1 | 25.52 |
| 182658903 | GT-AG | 0 | 1.000000099473604e-05 | 8573 | rna-XM_022740121.1 32688872 | 9 | 1473898 | 1482470 | Seriola dumerili 41447 | TGA\|GTGAGTAACG...TGTTCCTGTCCT/GCAACACTGATC...TTCAG\|ATT | 1 | 1 | 29.393 |
| 182658904 | GT-AG | 0 | 0.0001681305401366 | 841 | rna-XM_022740121.1 32688872 | 10 | 1482539 | 1483379 | Seriola dumerili 41447 | AAG\|GTAAACACTC...CTGTCTTTATTT/GCTGTCTTTATT...CACAG\|AGA | 0 | 1 | 30.779 |
| 182658905 | GT-AG | 0 | 1.000000099473604e-05 | 1443 | rna-XM_022740121.1 32688872 | 11 | 1483426 | 1484868 | Seriola dumerili 41447 | TTG\|GTAAGTAGAA...ATTCCCTCAGTT/CCGTCTCTCATC...CCCAG\|CGC | 1 | 1 | 31.716 |
| 182658906 | GT-AG | 0 | 1.000000099473604e-05 | 678 | rna-XM_022740121.1 32688872 | 12 | 1485077 | 1485754 | Seriola dumerili 41447 | AGG\|GTGAGTATCT...CCACCTTGGACC/GTGGTTGTCATC...CTCAG\|ATC | 2 | 1 | 35.956 |
| 182658907 | GT-AG | 0 | 1.000000099473604e-05 | 1097 | rna-XM_022740121.1 32688872 | 13 | 1485862 | 1486958 | Seriola dumerili 41447 | AAG\|GTGAGTTGAC...TGGGTCATAGTT/TAATGGGTCATA...TTCAG\|ATG | 1 | 1 | 38.137 |
| 182658908 | GT-AG | 0 | 1.000000099473604e-05 | 346 | rna-XM_022740121.1 32688872 | 14 | 1487100 | 1487445 | Seriola dumerili 41447 | ATG\|GTGAGCGTGA...TAAAACTTAATG/TTTCTTCTCAAA...TACAG\|AGC | 1 | 1 | 41.011 |
| 182658909 | GT-AG | 0 | 1.000000099473604e-05 | 235 | rna-XM_022740121.1 32688872 | 15 | 1487697 | 1487931 | Seriola dumerili 41447 | CAG\|GTAGGTGATT...GTTTCTTTTGTT/ACAGAGATGACA...CCCAG\|TTC | 0 | 1 | 46.127 |
| 182658910 | GT-AG | 0 | 1.000000099473604e-05 | 128 | rna-XM_022740121.1 32688872 | 16 | 1488058 | 1488185 | Seriola dumerili 41447 | CGG\|GTAAGACACA...TGCTCTTTCTCA/TGCAATCTGATA...CTCAG\|AAA | 0 | 1 | 48.695 |
| 182658911 | GT-AG | 0 | 8.450137922214371e-05 | 125 | rna-XM_022740121.1 32688872 | 17 | 1488304 | 1488428 | Seriola dumerili 41447 | AAG\|GTCTTTCTGC...TTTGACTTAATC/TTAAATTTGACT...TCTAG\|CGG | 1 | 1 | 51.101 |
| 182658912 | GT-AG | 0 | 5.291019466499288e-05 | 252 | rna-XM_022740121.1 32688872 | 18 | 1488684 | 1488935 | Seriola dumerili 41447 | AAG\|GTATGATGAG...TCATCTCTAATT/TCATCTCTAATT...TCCAG\|ATA | 1 | 1 | 56.298 |
| 182658913 | GT-AG | 0 | 1.000000099473604e-05 | 231 | rna-XM_022740121.1 32688872 | 19 | 1489013 | 1489243 | Seriola dumerili 41447 | GTG\|GTAAGTGATT...CTTTTTTTGATC/CTTTTTTTGATC...CTCAG\|GAT | 0 | 1 | 57.868 |
| 182658914 | GT-AG | 0 | 0.0002073429623046 | 237 | rna-XM_022740121.1 32688872 | 20 | 1489423 | 1489659 | Seriola dumerili 41447 | GAG\|GTAGCTGCAG...TTTGTCTTTCTA/TCCTCTCCCACT...CTCAG\|CTG | 2 | 1 | 61.517 |
| 182658915 | GT-AG | 0 | 0.0009766109922585 | 338 | rna-XM_022740121.1 32688872 | 21 | 1489734 | 1490071 | Seriola dumerili 41447 | AAG\|GTAACCAGAT...CTTTTCTTGTTC/TTCCATTTCACG...TGTAG\|AGC | 1 | 1 | 63.025 |
| 182658916 | GT-AG | 0 | 1.000000099473604e-05 | 230 | rna-XM_022740121.1 32688872 | 22 | 1490258 | 1490487 | Seriola dumerili 41447 | AGG\|GTAGGACAAG...TCAACCTTAACT/TTAACTTTTATC...TTCAG\|TGG | 1 | 1 | 66.816 |
| 182658917 | GT-AG | 0 | 1.000000099473604e-05 | 385 | rna-XM_022740121.1 32688872 | 23 | 1490635 | 1491019 | Seriola dumerili 41447 | AGA\|GTGAGTGCAG...TTGGTGTTGATG/GTGTTTGTCACT...GCCAG\|ATG | 1 | 1 | 69.812 |
| 182658918 | GT-AG | 0 | 1.000000099473604e-05 | 580 | rna-XM_022740121.1 32688872 | 24 | 1491136 | 1491715 | Seriola dumerili 41447 | ACG\|GTAAGGGCTC...ATATCATTAGTA/GTGTGTCTGACT...CTCAG\|GTT | 0 | 1 | 72.177 |
| 182658919 | GT-AG | 0 | 0.0004079025562782 | 190 | rna-XM_022740121.1 32688872 | 25 | 1491785 | 1491974 | Seriola dumerili 41447 | GAG\|GTAAACTCTC...GATTATTTGATT/GATTATTTGATT...TCCAG\|TTC | 0 | 1 | 73.583 |
| 182658920 | GT-AG | 0 | 1.000000099473604e-05 | 225 | rna-XM_022740121.1 32688872 | 26 | 1492023 | 1492247 | Seriola dumerili 41447 | CAG\|GTAAAACAGA...TGTTCCCTAATT/TGTTCCCTAATT...CCCAG\|GCC | 0 | 1 | 74.562 |
| 182658921 | GT-AG | 0 | 1.000000099473604e-05 | 129 | rna-XM_022740121.1 32688872 | 27 | 1492530 | 1492658 | Seriola dumerili 41447 | GAG\|GTGAAATGCA...AAATATTTAACA/AAATATTTAACA...TGCAG\|GAA | 0 | 1 | 80.31 |
| 182658922 | GT-AG | 0 | 1.000000099473604e-05 | 8303 | rna-XM_022740121.1 32688872 | 28 | 1492809 | 1501111 | Seriola dumerili 41447 | GAG\|GTGGGGAACC...CATGTCTCAGCA/TCATGTCTCATG...TCCAG\|GTG | 0 | 1 | 83.367 |
| 182658923 | GT-AG | 0 | 1.000000099473604e-05 | 599 | rna-XM_022740121.1 32688872 | 29 | 1501464 | 1502062 | Seriola dumerili 41447 | CAG\|GTAAGGAGTT...TGGTCCTCCTCC/CTGGTCCTCCTC...CTTAG\|CCT | 1 | 1 | 90.542 |
| 182658924 | GT-AG | 0 | 0.0002584511219199 | 500 | rna-XM_022740121.1 32688872 | 30 | 1502183 | 1502682 | Seriola dumerili 41447 | CGG\|GTACTTACTT...TGACCTTTACTT/CTTTACTTTATG...GACAG\|ATC | 1 | 1 | 92.988 |
| 182662133 | GT-AG | 0 | 1.000000099473604e-05 | 5785 | rna-XM_022740121.1 32688872 | 1 | 1462469 | 1468253 | Seriola dumerili 41447 | AAT\|GTAAGTAGAA...TGTCCATTATCT/TTGGGACTCACA...TTCAG\|CAT |  | 0 | 1.794 |
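Two relationships in the rows above can be checked mechanically: `length` matches `end - start + 1` (inclusive coordinates), and `dinucleotide_pair` matches the first and last two bases of the intron portion of `scored_motifs`. The Python sketch below uses hypothetical helper names. Its reading of the raw `scored_motifs` value as "exon flank, intron motifs, exon flank" separated by `|` is an assumption inferred from the displayed values, not from the database documentation.

```python
def inclusive_length(start: int, end: int) -> int:
    """Intron length when both start and end coordinates are inclusive."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a 'GT-AG'-style pair from a raw scored_motifs string.

    Assumes the layout '<upstream exon>|<intron motifs>|<downstream exon>',
    which is inferred from the table rows, not documented.
    """
    upstream_exon, intron_part, downstream_exon = scored_motifs.split("|")
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# Values copied from the row with id 182658896.
motifs = "CAG|GTGAAGCTGT...CTTCCTTTAAAA/TCCTTTCTGACT...TGCAG|GAT"
length_check = inclusive_length(1468415, 1469176)   # table reports 762
pair_check = terminal_dinucleotides(motifs)         # table reports GT-AG
```

Both checks agree with the stored `length` and `dinucleotide_pair` for this row; other rows can be verified the same way.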
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
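To illustrate how the schema and the `transcript_id` index support the filter used on this page, here is a minimal sketch using Python's standard `sqlite3` module with an in-memory database. It loads two rows copied from the table and creates only one of the indexes. Foreign-key enforcement is switched off explicitly because the referenced `transcripts` and `genomes` tables are not created here. The exact wording of the query plan varies between SQLite versions, so the code only collects it.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
    "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
    "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
    "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
    "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
# Parent tables are absent in this sketch, so keep enforcement off.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)

# Two rows copied from the page, inserted out of transcript order.
rows = [
    (182658897, "GT-AG", 0, 1.000000099473604e-05, 355, 32688872, 3,
     1469797, 1470151, 41447,
     "GAG|GTGAGATGGA...ACCCCCTTCCTG/AGCTGGTTAACC...GTCAG|ACT", 2, 1, 14.859),
    (182658896, "GT-AG", 0, 1.000000099473604e-05, 762, 32688872, 2,
     1468415, 1469176, 41447,
     "CAG|GTGAAGCTGT...CTTCCTTTAAAA/TCCTTTCTGACT...TGCAG|GAT", 0, 1, 2.222),
]
conn.executemany(
    "INSERT INTO introns VALUES (?,?,?,?,?,?,?,?,?,?,?,?,?,?)", rows
)

# "start" and "end" are quoted because END is an SQL keyword.
query = (
    'SELECT ordinal_index, "start", "end", phase FROM introns '
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
result = conn.execute(query, (32688872,)).fetchall()

# The fourth column of each EXPLAIN QUERY PLAN row is the plan description.
plan_details = [
    r[3] for r in conn.execute("EXPLAIN QUERY PLAN " + query, (32688872,))
]
```

Inspecting `plan_details` should show the lookup going through `idx_introns_transcript_id` rather than a full table scan, while `result` lists the introns in transcript order.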