introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is a minor intron, 0 otherwise
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, 1 if the intron is within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
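The column list above maps naturally onto a typed record. The following is a minimal sketch, not part of the database itself: the `Intron` class and its `from_row` helper are hypothetical names introduced for illustration. It assumes rows arrive as 14-element tuples in the column order listed, and it converts the 1/0 flags to booleans.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical helper, fields follow the column list)."""
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 1/0
    score: float              # percentage in the range 0-100
    length: int               # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # None (NULL) for introns outside the coding sequence
    in_cds: bool              # stored as INTEGER 1/0
    relative_position: float

    @classmethod
    def from_row(cls, row: tuple) -> "Intron":
        """Build an Intron from a tuple in table column order."""
        (id_, pair, is_minor, score, length, transcript_id, ordinal_index,
         start, end, taxonomy_id, motifs, phase, in_cds, rel_pos) = row
        return cls(id_, pair, bool(is_minor), float(score), length,
                   transcript_id, ordinal_index, start, end, taxonomy_id,
                   motifs, phase, bool(in_cds), rel_pos)


# Values copied from the row with id 115634293 (the second intron of the transcript).
example = Intron.from_row((
    115634293, "AT-AC", 1, 99.9999999999858, 1364, 21436553, 2,
    102781300, 102782663, 13146,
    "TTC|ATATCCTTTT...TTTTCCTTAACT/CCTTAACTTATT...TTCAC|ATT",
    2, 1, 7.292,
))
print(example.is_minor, example.phase, example.length)
```

Keeping `phase` as `Optional[int]` preserves the distinction between phase 0 and a NULL phase, which a plain integer default would lose.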
24 rows where transcript_id = 21436553
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115634292 | GT-AG | 0 | 1.000000099473604e-05 | 1499 | rna-XM_034060467.1 21436553 | 1 | 102779682 | 102781180 | Melopsittacus undulatus 13146 | AAG\|GTTAAAACTT...TTTTTCTTTTTT/AAATGTTTGATA...CCTAG\|ACT | 0 | 1 | 5.078 |
| 115634293 | AT-AC | 1 | 99.9999999999858 | 1364 | rna-XM_034060467.1 21436553 | 2 | 102781300 | 102782663 | Melopsittacus undulatus 13146 | TTC\|ATATCCTTTT...TTTTCCTTAACT/CCTTAACTTATT...TTCAC\|ATT | 2 | 1 | 7.292 |
| 115634294 | GT-AG | 0 | 0.0004250215317344 | 1048 | rna-XM_034060467.1 21436553 | 3 | 102782751 | 102783798 | Melopsittacus undulatus 13146 | TGA\|GTAAGTATTT...ATATTTTTATTT/AATATTTTTATT...CTCAG\|ATA | 2 | 1 | 8.91 |
| 115634295 | GT-AG | 0 | 1.000000099473604e-05 | 4225 | rna-XM_034060467.1 21436553 | 4 | 102783928 | 102788152 | Melopsittacus undulatus 13146 | AAC\|GTAAGTAGAA...GGTGATTTAATT/GGTGATTTAATT...TATAG\|GTA | 2 | 1 | 11.31 |
| 115634296 | GT-AG | 0 | 1.000000099473604e-05 | 1595 | rna-XM_034060467.1 21436553 | 5 | 102788233 | 102789827 | Melopsittacus undulatus 13146 | CAG\|GTAAGGAACA...TGAATCTTACTT/CCTCTTCTCATT...TTTAG\|GTC | 1 | 1 | 12.798 |
| 115634297 | GT-AG | 0 | 7.2940651434545 | 592 | rna-XM_034060467.1 21436553 | 6 | 102790071 | 102790662 | Melopsittacus undulatus 13146 | TGT\|GTATTTTGTT...GGTTCCTTGATT/GATTTATTAATT...ATTAG\|ATA | 1 | 1 | 17.318 |
| 115634298 | GT-AG | 0 | 1.000000099473604e-05 | 594 | rna-XM_034060467.1 21436553 | 7 | 102790727 | 102791320 | Melopsittacus undulatus 13146 | TGA\|GTAAGTGAGC...GTTTCTTTTGTT/TACTGTTTAATG...GCTAG\|CAA | 2 | 1 | 18.508 |
| 115634299 | GT-AG | 0 | 1.000000099473604e-05 | 1209 | rna-XM_034060467.1 21436553 | 8 | 102791466 | 102792674 | Melopsittacus undulatus 13146 | CAA\|GTAGGTACTT...TTTATTATAACT/TGTTTATTTATT...CACAG\|ACT | 0 | 1 | 21.205 |
| 115634300 | GT-AG | 0 | 1.000000099473604e-05 | 1928 | rna-XM_034060467.1 21436553 | 9 | 102792858 | 102794785 | Melopsittacus undulatus 13146 | CAG\|GTTTGTAAAT...AGATCTTTTGCT/CTTTTGCTGATT...TACAG\|ATG | 0 | 1 | 24.609 |
| 115634301 | GT-AG | 0 | 0.000145645551702 | 1535 | rna-XM_034060467.1 21436553 | 10 | 102794978 | 102796512 | Melopsittacus undulatus 13146 | CTT\|GTAAGTATTT...CATTTCTTCATA/CATTTCTTCATA...ACCAG\|AGG | 0 | 1 | 28.181 |
| 115634302 | GT-AG | 0 | 1.000000099473604e-05 | 1652 | rna-XM_034060467.1 21436553 | 11 | 102796616 | 102798267 | Melopsittacus undulatus 13146 | AGT\|GTTAAGTCAT...GTCACTTTACCT/CTGTATGTCACT...TTTAG\|AAA | 1 | 1 | 30.097 |
| 115634303 | GT-AG | 0 | 1.000000099473604e-05 | 123 | rna-XM_034060467.1 21436553 | 12 | 102798507 | 102798629 | Melopsittacus undulatus 13146 | TTG\|GTAAGTTAGA...ATTTCTTTTTCC/AACTTTTTCAGT...TCTAG\|GTC | 0 | 1 | 34.542 |
| 115634304 | GT-AG | 0 | 1.000000099473604e-05 | 1085 | rna-XM_034060467.1 21436553 | 13 | 102798783 | 102799867 | Melopsittacus undulatus 13146 | CTG\|GTAATTATTT...GTTCTCTTGTGT/TTGCTACTCACA...TACAG\|CTC | 0 | 1 | 37.388 |
| 115634305 | GT-AG | 0 | 1.000000099473604e-05 | 763 | rna-XM_034060467.1 21436553 | 14 | 102800222 | 102800984 | Melopsittacus undulatus 13146 | GTG\|GTGAGTTTCA...CTTACTTCAACT/CTCCAACTTACT...TTAAG\|GTG | 0 | 1 | 43.973 |
| 115634306 | GT-AG | 0 | 1.000000099473604e-05 | 3591 | rna-XM_034060467.1 21436553 | 15 | 102801438 | 102805028 | Melopsittacus undulatus 13146 | CAG\|GTAGGGAATA...GTAACCTTATTA/CTCTTCTTTACC...TGCAG\|AAA | 0 | 1 | 52.4 |
| 115634307 | GT-AG | 0 | 1.000000099473604e-05 | 1122 | rna-XM_034060467.1 21436553 | 16 | 102805141 | 102806262 | Melopsittacus undulatus 13146 | AAA\|GTAAGAATCC...TCTTCTTTCTCT/TGTCTCTACATT...TTCAG\|GTT | 1 | 1 | 54.483 |
| 115634308 | GT-AG | 0 | 8.550836276535376e-05 | 2198 | rna-XM_034060467.1 21436553 | 17 | 102806418 | 102808615 | Melopsittacus undulatus 13146 | CTG\|GTAAATTTAA...AATGTTTTAAAA/AATGTTTTAAAA...TCCAG\|GCA | 0 | 1 | 57.366 |
| 115634309 | AT-AC | 0 | 0.0254793352867212 | 1474 | rna-XM_034060467.1 21436553 | 18 | 102808824 | 102810297 | Melopsittacus undulatus 13146 | TGA\|ATTTTGATAT...ACAGCTGTCATT/ACAGCTGTCATT...ATCAC\|TTT | 1 | 1 | 61.235 |
| 115634310 | GT-AG | 0 | 1.000000099473604e-05 | 2306 | rna-XM_034060467.1 21436553 | 19 | 102810387 | 102812692 | Melopsittacus undulatus 13146 | AAG\|GTAAGGATAT...TTCATTTTATTT/ATTCTTTTCATT...TTAAG\|GTT | 0 | 1 | 62.891 |
| 115634311 | GT-AG | 0 | 3.372852230475252e-05 | 676 | rna-XM_034060467.1 21436553 | 20 | 102812948 | 102813623 | Melopsittacus undulatus 13146 | GTG\|GTAAGTTCTA...AAAATTTTAATT/AAAATTTTAATT...TCCAG\|GCA | 0 | 1 | 67.634 |
| 115634312 | GT-AG | 0 | 0.0006005449662868 | 917 | rna-XM_034060467.1 21436553 | 21 | 102813678 | 102814594 | Melopsittacus undulatus 13146 | CAG\|GTATGTCACT...TCTGTTTTGATT/TCTGTTTTGATT...AACAG\|ATA | 0 | 1 | 68.638 |
| 115634313 | GT-AG | 0 | 2.935765153537384e-05 | 525 | rna-XM_034060467.1 21436553 | 22 | 102814737 | 102815261 | Melopsittacus undulatus 13146 | TAA\|GTACTACATG...CCTGCTTAAACT/TTAAACTTCAGC...CTTAG\|GTG | 1 | 1 | 71.28 |
| 115634314 | GT-AG | 0 | 1.000000099473604e-05 | 814 | rna-XM_034060467.1 21436553 | 23 | 102815363 | 102816176 | Melopsittacus undulatus 13146 | TTG\|GTAAGACCAT...TGGGTCTTACAA/GCTCTTCTCATT...TGCAG\|AAT | 0 | 1 | 73.158 |
| 115634315 | GT-AG | 0 | 0.0004845656173129 | 1466 | rna-XM_034060467.1 21436553 | 24 | 102816442 | 102817907 | Melopsittacus undulatus 13146 | TTA\|GTAAGTATTA...TTCTTTTTAATT/TTCTTTTTAATT...CAAAG\|GTA | 1 | 1 | 78.088 |
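The filter behind this listing (`transcript_id = 21436553`) can be reproduced with plain SQL. The sketch below is illustrative only: it builds a throwaway in-memory SQLite table with a subset of the columns and loads three of the rows shown above, rather than connecting to the real database. It also checks an observation drawn from these rows, not a documented guarantee: `length` equals `end - start + 1`, which suggests inclusive coordinates.

```python
import sqlite3

# In-memory stand-in for the real database; only the columns needed here.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        "start" INTEGER,
        "end" INTEGER
    )
    """
)

# Three rows copied from the table above (introns 1, 2 and 18).
sample_rows = [
    (115634292, "GT-AG", 0, 1.000000099473604e-05, 1499, 21436553, 1, 102779682, 102781180),
    (115634293, "AT-AC", 1, 99.9999999999858, 1364, 21436553, 2, 102781300, 102782663),
    (115634309, "AT-AC", 0, 0.0254793352867212, 1474, 21436553, 18, 102808824, 102810297),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", sample_rows)

# Minor introns of the transcript, in transcript order.
minor = conn.execute(
    """
    SELECT ordinal_index, dinucleotide_pair, score
    FROM introns
    WHERE transcript_id = ? AND is_minor = 1
    ORDER BY ordinal_index
    """,
    (21436553,),
).fetchall()

# Rows whose stored length disagrees with inclusive start/end coordinates.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns WHERE length != "end" - "start" + 1'
).fetchone()[0]

print(minor)
print(mismatches)
```

In this sample only intron 2 is returned. Intron 18 also has AT-AC termini but carries `is_minor = 0` and a score near zero, so the dinucleotide pair alone does not determine the classification.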
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
  ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
  ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
  ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
  ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
  ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
  ON [introns] ([in_cds]);
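The indexes above are what keep per-transcript lookups cheap. As a rough check, one can ask SQLite for its query plan. The sketch below is an assumption-laden illustration: it recreates a trimmed copy of the schema in memory (three columns, two indexes, foreign keys omitted) and inspects `EXPLAIN QUERY PLAN` for the `transcript_id` filter. The exact wording of the plan varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed copy of the schema: only the columns and indexes relevant here.
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "transcript_id" INTEGER,
        "is_minor" INTEGER,
        PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
    """
)

# Each plan row ends with a human-readable detail string.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (21436553,),
).fetchall()
plan_text = " ".join(str(row[-1]) for row in plan)

uses_transcript_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
print(uses_transcript_index)
```

If the detail string names `idx_introns_transcript_id`, the filter is served by an index search rather than a full table scan.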