introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
24 rows where transcript_id = 21436586
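The row filter above corresponds to a plain WHERE clause on transcript_id. A minimal sketch, assuming a local SQLite copy of the data: an in-memory database with a reduced column set and three rows copied from this page stands in for the real file, and the helper name introns_for_transcript is hypothetical.

```python
import sqlite3

# In-memory stand-in for the real database (assumption: reduced column set).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    'id INTEGER PRIMARY KEY, transcript_id INTEGER, ordinal_index INTEGER, '
    '"start" INTEGER, "end" INTEGER, length INTEGER)'
)

# Three rows copied from the table on this page.
sample_rows = [
    (115635163, 21436586, 1, 68575712, 68610967, 35256),
    (115635164, 21436586, 2, 68564147, 68575604, 11458),
    (115635165, 21436586, 3, 68440050, 68564081, 124032),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample_rows)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length) for one transcript, ordered by id."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 21436586)
```

The parameter placeholder (?) keeps the transcript id out of the SQL string, which is the usual way to pass values with sqlite3.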
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115635163 | GT-AG | 0 | 0.000146319713254 | 35256 | rna-XM_034066332.1 21436586 | 1 | 68575712 | 68610967 | Melopsittacus undulatus 13146 | AAG\|GTACCGCGGG...ACTGCTGTAATC/TGTAATCTAACA...TACAG\|CTT | 0 | 1 | 3.796 |
| 115635164 | GT-AG | 0 | 1.000000099473604e-05 | 11458 | rna-XM_034066332.1 21436586 | 2 | 68564147 | 68575604 | Melopsittacus undulatus 13146 | TGG\|GTAAGTAAAT...ACACTTTTATAT/TACACTTTTATA...TTTAG\|GAG | 2 | 1 | 6.257 |
| 115635165 | GT-AG | 0 | 1.000000099473604e-05 | 124032 | rna-XM_034066332.1 21436586 | 3 | 68440050 | 68564081 | Melopsittacus undulatus 13146 | TTG\|GTAAGTGAAG...TTATGCTTAACT/GCTTAACTTATT...CCCAG\|AAA | 1 | 1 | 7.752 |
| 115635166 | GT-AG | 0 | 1.0444874212915253e-05 | 16096 | rna-XM_034066332.1 21436586 | 4 | 68423886 | 68439981 | Melopsittacus undulatus 13146 | CAG\|GTAAGCAGGT...GTGTCTTTATCT/TGTGTCTTTATC...TACAG\|GAT | 0 | 1 | 9.317 |
| 115635167 | GT-AG | 0 | 1.000000099473604e-05 | 33815 | rna-XM_034066332.1 21436586 | 5 | 68389965 | 68423779 | Melopsittacus undulatus 13146 | GAG\|GTAAAGGAAG...TGCATTTTATCT/CCGGGTCTGATT...AATAG\|CTT | 1 | 1 | 11.755 |
| 115635168 | GT-AG | 0 | 1.000000099473604e-05 | 26097 | rna-XM_034066332.1 21436586 | 6 | 68363773 | 68389869 | Melopsittacus undulatus 13146 | AAG\|GTAAGACACT...TCACTTTTTTCT/ACAAGTCTCACT...TCCAG\|TTC | 0 | 1 | 13.941 |
| 115635169 | GT-AG | 0 | 1.000000099473604e-05 | 7275 | rna-XM_034066332.1 21436586 | 7 | 68356386 | 68363660 | Melopsittacus undulatus 13146 | GAG\|GTAAGTAGCC...TCCATTTTGATC/TCCATTTTGATC...TCTAG\|GAG | 1 | 1 | 16.517 |
| 115635170 | GT-AG | 0 | 0.0008697994212661 | 6460 | rna-XM_034066332.1 21436586 | 8 | 68349831 | 68356290 | Melopsittacus undulatus 13146 | AAG\|GTTTGTTTAC...TTTCCTTTGATT/TTTCCTTTGATT...CTTAG\|ACG | 0 | 1 | 18.703 |
| 115635171 | GT-AG | 0 | 8.831343396480419e-05 | 2329 | rna-XM_034066332.1 21436586 | 9 | 68347358 | 68349686 | Melopsittacus undulatus 13146 | GAG\|GTAACATAAC...TCTTCCTTTCTT/AGAAGCCTCACC...CTCAG\|ATG | 0 | 1 | 22.015 |
| 115635172 | GT-AG | 0 | 1.000000099473604e-05 | 18909 | rna-XM_034066332.1 21436586 | 10 | 68328210 | 68347118 | Melopsittacus undulatus 13146 | GAG\|GTAAGTGCAT...TTTCTCCTACCT/TTCTGTCTCACT...TTCAG\|ATA | 2 | 1 | 27.513 |
| 115635173 | GT-AG | 0 | 1.000000099473604e-05 | 9606 | rna-XM_034066332.1 21436586 | 11 | 68318478 | 68328083 | Melopsittacus undulatus 13146 | CAG\|GTTTGTAAAT...ATGATCTTCCTT/AGTATAATCAAT...TACAG\|GCC | 2 | 1 | 30.412 |
| 115635174 | GT-AG | 0 | 1.000000099473604e-05 | 3327 | rna-XM_034066332.1 21436586 | 12 | 68315016 | 68318342 | Melopsittacus undulatus 13146 | CAA\|GTAGGAACCA...GTTACCTTATTC/TGTTACCTTATT...TCTAG\|CCG | 2 | 1 | 33.517 |
| 115635175 | GT-AG | 0 | 3.582970129825644e-05 | 4645 | rna-XM_034066332.1 21436586 | 13 | 68310320 | 68314964 | Melopsittacus undulatus 13146 | GAG\|GTTTGTATAT...TTTTTCTTTACG/TTTTTCTTTACG...TACAG\|ATA | 2 | 1 | 34.691 |
| 115635176 | GT-AG | 0 | 1.000000099473604e-05 | 6570 | rna-XM_034066332.1 21436586 | 14 | 68303520 | 68310089 | Melopsittacus undulatus 13146 | AAG\|GTAAGTGAAA...ATATCTTTCTCC/CCGTTGTTAAGT...ACCAG\|TGA | 1 | 1 | 39.982 |
| 115635177 | GT-AG | 0 | 1.000000099473604e-05 | 2948 | rna-XM_034066332.1 21436586 | 15 | 68300278 | 68303225 | Melopsittacus undulatus 13146 | CTG\|GTAAAGTGAA...CTCCCTTTGACT/CTCCCTTTGACT...TGCAG\|GTC | 1 | 1 | 46.745 |
| 115635178 | GT-AG | 0 | 1.8016168361139613e-05 | 5818 | rna-XM_034066332.1 21436586 | 16 | 68293570 | 68299387 | Melopsittacus undulatus 13146 | AAG\|GTACAGCTCA...ATATTCTTGTTC/TTCTTCATAAAA...TCAAG\|AAA | 0 | 1 | 67.219 |
| 115635179 | GT-AG | 0 | 1.000000099473604e-05 | 6698 | rna-XM_034066332.1 21436586 | 17 | 68286713 | 68293410 | Melopsittacus undulatus 13146 | GAG\|GTAAGGAATA...CACTTCTTGGCC/CTTCCACTGACA...CACAG\|AAA | 0 | 1 | 70.876 |
| 115635180 | GT-AG | 0 | 1.000000099473604e-05 | 5402 | rna-XM_034066332.1 21436586 | 18 | 68281128 | 68286529 | Melopsittacus undulatus 13146 | AAG\|GTGAGAAAAA...GCAGCCTTCACT/GCAGCCTTCACT...GGCAG\|GAA | 0 | 1 | 75.086 |
| 115635181 | GT-AG | 0 | 1.01231092431245e-05 | 478 | rna-XM_034066332.1 21436586 | 19 | 68280530 | 68281007 | Melopsittacus undulatus 13146 | AAC\|GTAAGTCTGA...TCTTCCTTTTTT/TTCTTTTTCATG...TGCAG\|GCC | 0 | 1 | 77.847 |
| 115635182 | GT-AG | 0 | 0.0015178819675032 | 10508 | rna-XM_034066332.1 21436586 | 20 | 68269860 | 68280367 | Melopsittacus undulatus 13146 | AAG\|GTATGTCTAG...TTTTCTTTATTC/CTTTTCTTTATT...TTCAG\|GTT | 0 | 1 | 81.573 |
| 115635183 | GT-AG | 0 | 1.000000099473604e-05 | 2055 | rna-XM_034066332.1 21436586 | 21 | 68267614 | 68269668 | Melopsittacus undulatus 13146 | CAG\|GTACAGAATT...TTTCCTTTTGCA/TATGAACTCATT...TCCAG\|GTT | 2 | 1 | 85.967 |
| 115635184 | GT-AG | 0 | 1.000000099473604e-05 | 7598 | rna-XM_034066332.1 21436586 | 22 | 68259823 | 68267420 | Melopsittacus undulatus 13146 | GAT\|GTAAGTGGCA...TTTCTGTTAAAA/TTTCTGTTAAAA...CTCAG\|TCC | 0 | 1 | 90.407 |
| 115635185 | GT-AG | 0 | 1.000000099473604e-05 | 2274 | rna-XM_034066332.1 21436586 | 23 | 68257377 | 68259650 | Melopsittacus undulatus 13146 | GAG\|GTAAGTGCCA...TTTCCCTCAATT/TTTTCCCTCAAT...TCCAG\|GCC | 1 | 1 | 94.364 |
| 115635186 | GT-AG | 0 | 1.000000099473604e-05 | 14003 | rna-XM_034066332.1 21436586 | 24 | 68243212 | 68257214 | Melopsittacus undulatus 13146 | CAT\|GTGAGTATAC...TTCTTTTTGCCT/TTGGAACTTATG...TCCAG\|TGA | 1 | 1 | 98.091 |
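In the rows above, length equals end - start + 1, which suggests that start and end are 1-based inclusive coordinates. Start positions also decrease as ordinal_index increases, which would be consistent with a transcript on the minus strand. Both readings are inferences from these rows, not documented guarantees. A small sketch that checks them on four rows copied from the table (the helper name inclusive_length is hypothetical):

```python
# (ordinal_index, start, end, length) copied from rows 1, 2, 3 and 24 above.
rows = [
    (1, 68575712, 68610967, 35256),
    (2, 68564147, 68575604, 11458),
    (3, 68440050, 68564081, 124032),
    (24, 68243212, 68257214, 14003),
]


def inclusive_length(start, end):
    """Length of a closed interval [start, end] in base pairs."""
    return end - start + 1


# Rows whose stored length disagrees with the inclusive-interval formula.
mismatches = [r for r in rows if inclusive_length(r[1], r[2]) != r[3]]

# True when start positions strictly decrease with increasing ordinal_index.
starts = [r[1] for r in sorted(rows)]
starts_descending = all(a > b for a, b in zip(starts, starts[1:]))
```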
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
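The indexes above let SQLite answer equality filters on those columns without scanning the whole table. A minimal sketch of how one might confirm this with EXPLAIN QUERY PLAN, assuming an empty in-memory table with a reduced column set and only the transcript_id index; the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# In-memory stand-in (assumption: reduced schema, one index from the list above).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        is_minor INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# Each plan row is a 4-tuple; the last element is the human-readable detail.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (21436586,),
).fetchall()
plan_details = [row[3] for row in plan]

# True when the planner chose the transcript_id index for this filter.
uses_index = any("idx_introns_transcript_id" in d for d in plan_details)
```

Running the same check against the real database file is the more meaningful test, since the planner's choice can depend on table statistics.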