introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
16 rows where transcript_id = 28131421
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
156032341 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna130.p1 28131421 | 1 | 303200 | 303248 | Planoprotostelium fungivorum 1890364 | GGC\|GTGAGTCCGT...AGTCCGCTGACT/CGCTGACTCATT...TTCAG\|GTC | 0 | 1 | 0.817 |
156032342 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna130.p1 28131421 | 2 | 303471 | 303534 | Planoprotostelium fungivorum 1890364 | ACG\|GTGATCTAAT...CATCTCTCATCT/TCATCTCTCATC...CGCAG\|GTT | 0 | 1 | 7.539 |
156032343 | GT-AG | 0 | 1.000000099473604e-05 | 44 | rna130.p1 28131421 | 3 | 303847 | 303890 | Planoprotostelium fungivorum 1890364 | GAG\|GTGACGATTA...ACATTCTGAATG/AATGACCTGATT...ATCAG\|ACG | 0 | 1 | 16.985 |
156032344 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna130.p1 28131421 | 4 | 304296 | 304344 | Planoprotostelium fungivorum 1890364 | TCG\|GTGGGTGTCG...CAGCTGATGACG/AGTCAGCTGATG...CGCAG\|GTA | 0 | 1 | 29.246 |
156032345 | GT-AG | 0 | 1.000000099473604e-05 | 40 | rna130.p1 28131421 | 5 | 304501 | 304540 | Planoprotostelium fungivorum 1890364 | AGA\|GTGAGTGGTG...ACACTCATTATC/GGGACACTCATT...ATCAG\|ACC | 0 | 1 | 33.969 |
156032346 | GT-AG | 0 | 1.000000099473604e-05 | 46 | rna130.p1 28131421 | 6 | 304749 | 304794 | Planoprotostelium fungivorum 1890364 | TCG\|GTACGTACCG...TACGTACTGACC/TACGTACTGACC...ACCAG\|CTG | 1 | 1 | 40.266 |
156032347 | GT-AG | 0 | 1.000000099473604e-05 | 42 | rna130.p1 28131421 | 7 | 304968 | 305009 | Planoprotostelium fungivorum 1890364 | CCG\|GTGATGCTGA...TGAATCTAAATA/GTGAATCTAAAT...ATCAG\|GCG | 0 | 1 | 45.504 |
156032348 | GT-AG | 0 | 3.0336660982652176e-05 | 43 | rna130.p1 28131421 | 8 | 305145 | 305187 | Planoprotostelium fungivorum 1890364 | CAG\|GTACCGCACA...TTCGATGTGACG/TTCGATGTGACG...TACAG\|TTG | 0 | 1 | 49.591 |
156032349 | GT-AG | 0 | 1.000000099473604e-05 | 42 | rna130.p1 28131421 | 9 | 305379 | 305420 | Planoprotostelium fungivorum 1890364 | GCC\|GTGAGTCCAT...CATCCGTTGAAA/ACTGGATTGACT...TGTAG\|AGA | 2 | 1 | 55.374 |
156032350 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna130.p1 28131421 | 10 | 305725 | 305774 | Planoprotostelium fungivorum 1890364 | CAG\|GTGATGATGT...ATGATGATGATG/ATGATGATGATG...ATCAG\|AAT | 0 | 1 | 64.578 |
156032351 | GT-AG | 0 | 6.976761431081065e-05 | 97 | rna130.p1 28131421 | 11 | 305940 | 306036 | Planoprotostelium fungivorum 1890364 | GAG\|GTACATCATC...CATTTCTGAATC/TACAGACTCATT...CGCAG\|CAT | 0 | 1 | 69.573 |
156032352 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna130.p1 28131421 | 12 | 306178 | 306225 | Planoprotostelium fungivorum 1890364 | AGA\|GTAAGTACCC...GCTGACGTAATC/CGCACGCTCACT...CGCAG\|GTC | 0 | 1 | 73.842 |
156032353 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna130.p1 28131421 | 13 | 306541 | 306647 | Planoprotostelium fungivorum 1890364 | ATG\|GTGCATAATC...CAGGTGTTGACA/CAGGTGTTGACA...GACAG\|GTC | 0 | 1 | 83.379 |
156032354 | GT-AG | 0 | 1.000000099473604e-05 | 73 | rna130.p1 28131421 | 14 | 306866 | 306938 | Planoprotostelium fungivorum 1890364 | GGA\|GTGAGTTCAT...ATCATCTGAGTG/CATCATCTGAGT...TATAG\|GAT | 2 | 1 | 89.979 |
156032355 | GT-AG | 0 | 1.000000099473604e-05 | 43 | rna130.p1 28131421 | 15 | 307026 | 307068 | Planoprotostelium fungivorum 1890364 | CAG\|GTGATGATGT...ACGATCGTGACG/ACGATCGTGACG...ACAAG\|GTA | 2 | 1 | 92.613 |
156032356 | GT-AG | 0 | 1.000000099473604e-05 | 41 | rna130.p1 28131421 | 16 | 307188 | 307228 | Planoprotostelium fungivorum 1890364 | AGG\|GTGAGATGCC...TGATGATTGAAA/TGATGATTGAAA...AACAG\|ATG | 1 | 1 | 96.216 |
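In the rows listed here, length appears to equal end - start + 1, which would be consistent with 1-based, inclusive genomic coordinates. That convention is inferred from these 16 rows only and is not stated in the column descriptions. A minimal Python sketch of the check, using the ordinal_index, start, end, and length values copied from the table (the names `INTRONS` and `length_matches_coords` are illustrative, not part of the database):

```python
# (ordinal_index, start, end, length) copied from the 16 rows listed above.
INTRONS = [
    (1, 303200, 303248, 49),
    (2, 303471, 303534, 64),
    (3, 303847, 303890, 44),
    (4, 304296, 304344, 49),
    (5, 304501, 304540, 40),
    (6, 304749, 304794, 46),
    (7, 304968, 305009, 42),
    (8, 305145, 305187, 43),
    (9, 305379, 305420, 42),
    (10, 305725, 305774, 50),
    (11, 305940, 306036, 97),
    (12, 306178, 306225, 48),
    (13, 306541, 306647, 107),
    (14, 306866, 306938, 73),
    (15, 307026, 307068, 43),
    (16, 307188, 307228, 41),
]


def length_matches_coords(start: int, end: int, length: int) -> bool:
    """Return True if length equals the inclusive span end - start + 1.

    The inclusive-coordinate convention is an assumption inferred from
    the displayed rows, not a documented property of the database.
    """
    return end - start + 1 == length


# Collect the ordinal positions of any rows that break the assumed convention.
mismatches = [
    ordinal
    for ordinal, start, end, length in INTRONS
    if not length_matches_coords(start, end, length)
]

print("rows checked:", len(INTRONS))
print("mismatching ordinal positions:", mismatches)
```

Running the same check against other transcripts would show whether the convention holds beyond this page.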
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
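The filter shown on this page (transcript_id = 28131421) can be reproduced locally against this schema. The following is a rough sketch using Python's standard `sqlite3` module and an in-memory database. It makes several assumptions: the two foreign keys are left out because the transcripts and genomes tables are not available here, only three of the rows are loaded, scored_motifs is left NULL for brevity, and the helper name `introns_for_transcript` is hypothetical.

```python
import sqlite3

# Schema from the page, with the foreign keys omitted (assumption: the
# referenced transcripts and genomes tables are not needed for this sketch).
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

# Three rows copied from the page; scored_motifs is set to None for brevity.
ROWS = [
    (156032341, "GT-AG", 0, 1.000000099473604e-05, 49, 28131421, 1,
     303200, 303248, 1890364, None, 0, 1, 0.817),
    (156032342, "GT-AG", 0, 1.000000099473604e-05, 64, 28131421, 2,
     303471, 303534, 1890364, None, 0, 1, 7.539),
    (156032343, "GT-AG", 0, 1.000000099473604e-05, 44, 28131421, 3,
     303847, 303890, 1890364, None, 0, 1, 16.985),
]


def introns_for_transcript(conn: sqlite3.Connection, transcript_id: int):
    """Return (id, ordinal_index, start, end, length) rows for one transcript."""
    # "end" is an SQL keyword, so it is quoted as an identifier.
    cur = conn.execute(
        'SELECT id, ordinal_index, start, "end", length '
        "FROM introns WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    ROWS,
)

rows = introns_for_transcript(conn, 28131421)
for row in rows:
    print(row)
```

The equality filter on transcript_id is the kind of lookup that idx_introns_transcript_id is positioned to serve, and the parameterized query keeps the transcript identifier out of the SQL string.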