introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
18 rows where transcript_id = 28131402
This data as json, CSV (advanced)
Suggested facets: score, length, phase, in_cds
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
156031851 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna46.p1 28131402 | 2 | 94867 | 94949 | Planoprotostelium fungivorum 1890364 | GAG|GTGACATTCA...TGATCCATACAA/AATCCATTCATC...AACAG|GTT | 0 | 1 | 8.369 |
156031852 | GT-AG | 0 | 1.000000099473604e-05 | 39 | rna46.p1 28131402 | 3 | 95043 | 95081 | Planoprotostelium fungivorum 1890364 | AAG|GTGAAAATCT...ATCATCTGAACC/CATCATCTGAAC...CCCAG|TCG | 0 | 1 | 10.162 |
156031853 | GT-AG | 0 | 1.000000099473604e-05 | 38 | rna46.p1 28131402 | 4 | 95176 | 95213 | Planoprotostelium fungivorum 1890364 | AAG|GTGAGGCGAT...TATTCCATACAC/TACACACTGACG...TGCAG|CTA | 1 | 1 | 11.975 |
156031854 | GT-AG | 0 | 1.000000099473604e-05 | 43 | rna46.p1 28131402 | 5 | 95444 | 95486 | Planoprotostelium fungivorum 1890364 | CCC|GTGGGTGTCG...ACGTCTCTGATT/ACGTCTCTGATT...ATCAG|CTG | 0 | 1 | 16.41 |
156031855 | GT-AG | 0 | 1.000000099473604e-05 | 44 | rna46.p1 28131402 | 6 | 95788 | 95831 | Planoprotostelium fungivorum 1890364 | CTG|GTGAGTCTGA...AGGGATGTGATG/AGGGATGTGATG...AGCAG|GTA | 1 | 1 | 22.214 |
156031856 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna46.p1 28131402 | 7 | 96237 | 96294 | Planoprotostelium fungivorum 1890364 | AGA|GTCAGTCTCT...TCAGTCATGACG/TGTGATCTCAGT...AACAG|GTG | 1 | 1 | 30.023 |
156031857 | GT-AG | 0 | 1.000000099473604e-05 | 37 | rna46.p1 28131402 | 8 | 96505 | 96541 | Planoprotostelium fungivorum 1890364 | CCA|GTGAGTCAGA...TCTCTCTCAACG/CTCTCTCTCAAC...AACAG|CTG | 1 | 1 | 34.073 |
156031858 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna46.p1 28131402 | 9 | 96597 | 96664 | Planoprotostelium fungivorum 1890364 | CAT|GTGAGTCCCC...TGTTTCTGAATT/TTGTTTCTGAAT...CATAG|GGT | 2 | 1 | 35.133 |
156031859 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna46.p1 28131402 | 10 | 96681 | 96729 | Planoprotostelium fungivorum 1890364 | GTG|GTGTGTATTG...TTTGATGTGATT/TTTGATGTGATT...CATAG|AAT | 0 | 1 | 35.442 |
156031860 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna46.p1 28131402 | 11 | 96829 | 96887 | Planoprotostelium fungivorum 1890364 | AAG|GTGAGACGAG...GAAGCTGAAGAT/CGGAAGCTGAAG...AACAG|AAA | 0 | 1 | 37.351 |
156031861 | GT-AG | 0 | 0.0001455941069702 | 134 | rna46.p1 28131402 | 12 | 97439 | 97572 | Planoprotostelium fungivorum 1890364 | CGA|GTAATCATCT...ATATTCTGAAGG/TATATTCTGAAG...GACAG|GGA | 2 | 1 | 47.975 |
156031862 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna46.p1 28131402 | 13 | 97814 | 97902 | Planoprotostelium fungivorum 1890364 | ACG|GTGCGTACAA...GTCGTTGTAGAT/GATAGACTGACG...AATAG|ATG | 0 | 1 | 52.622 |
156031863 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna46.p1 28131402 | 14 | 98131 | 98191 | Planoprotostelium fungivorum 1890364 | CAG|GTCACACCAT...CACACGTTGATG/TGTGAACTGACG...AATAG|GTT | 0 | 1 | 57.019 |
156031864 | GT-AG | 0 | 1.000000099473604e-05 | 39 | rna46.p1 28131402 | 15 | 98283 | 98321 | Planoprotostelium fungivorum 1890364 | TCG|GTGAATAGGA...GAGTGCGAAACG/CGAATGCTGAGG...GGAAG|GTG | 1 | 1 | 58.774 |
156031865 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna46.p1 28131402 | 16 | 98558 | 98608 | Planoprotostelium fungivorum 1890364 | GAG|GTGACTTGAA...TCCTCTTCCACT/CTTCCACTCATA...TGTAG|GTT | 0 | 1 | 63.324 |
156031866 | GT-AG | 0 | 1.000000099473604e-05 | 39 | rna46.p1 28131402 | 17 | 98696 | 98734 | Planoprotostelium fungivorum 1890364 | ATG|GTGAGACGAG...TCGATCGAGAAA/GAGAAAATGACA...CACAG|GCT | 0 | 1 | 65.002 |
156031867 | GT-AG | 0 | 1.000000099473604e-05 | 41 | rna46.p1 28131402 | 18 | 99049 | 99089 | Planoprotostelium fungivorum 1890364 | GAG|GTGAACCTGC...GAGACGCTGACG/GAGACGCTGACG...ATCAG|AGA | 2 | 1 | 71.057 |
156033680 | GT-AG | 0 | 0.03987975353627 | 121 | rna46.p1 28131402 | 1 | 94281 | 94401 | Planoprotostelium fungivorum 1890364 | AAT|GTATGTATTA...CTAGTTTTGATA/CTAGTTTTGATA...ACTAG|AAC | 0 | 1.485 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);