introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
9 rows where transcript_id = 17285675
This data as json, CSV (advanced)
Suggested facets: score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
92354159 | GT-AG | 0 | 1.000000099473604e-05 | 94 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 1 | 1772810 | 1772903 | Hesseltinella vesiculosa 101127 | CAG|GTCCGTGCAT...TGCTCTTTACCA/CTCTTTTTTATT...TGTAG|ACT | 1 | 1 | 9.312 |
92354160 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 2 | 1775056 | 1775116 | Hesseltinella vesiculosa 101127 | AAA|GTGAGTATCT...ATCTTCTTTGCT/CTCTTTTTTACT...GGTAG|CTG | 2 | 1 | 64.364 |
92354161 | GT-AG | 0 | 7.864397286520422e-05 | 64 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 3 | 1775617 | 1775680 | Hesseltinella vesiculosa 101127 | ATC|GTACGTAGAT...CGTTTTTTTACG/CGTTTTTTTACG...ATTAG|GAT | 1 | 1 | 77.155 |
92354162 | GT-AG | 0 | 0.0001025068258763 | 61 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 4 | 1775873 | 1775933 | Hesseltinella vesiculosa 101127 | ACA|GTGGCAAAGG...TTTTTTTTAACT/TTTTTTTTAACT...TGTAG|ATA | 1 | 1 | 82.067 |
92354163 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 5 | 1776044 | 1776111 | Hesseltinella vesiculosa 101127 | CAG|GTACAGAGGA...GTTTTTTTATTC/CGTTTTTTTATT...GGTAG|GCC | 0 | 1 | 84.881 |
92354164 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 6 | 1776244 | 1776325 | Hesseltinella vesiculosa 101127 | CGG|GTAAGACGCA...TGTAATTTAATT/ATTGACCTCATT...AACAG|ACC | 0 | 1 | 88.258 |
92354165 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 7 | 1776420 | 1776501 | Hesseltinella vesiculosa 101127 | AAG|GTAAGATTGT...ACCTCCTTTACT/TTTACTCTCACT...TTTAG|ATT | 1 | 1 | 90.663 |
92354166 | GT-AG | 0 | 2.0123148255924685e-05 | 80 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 8 | 1776661 | 1776740 | Hesseltinella vesiculosa 101127 | TTG|GTCAGCCATT...TTTTCTTTAATT/TTTTCTTTAATT...CTTAG|ACT | 1 | 1 | 94.73 |
92354167 | GT-AG | 0 | 0.0058639109597354 | 55 | rna-gnl|WGS:MCGT|DM01DRAFT_mRNA1403580 17285675 | 9 | 1776891 | 1776945 | Hesseltinella vesiculosa 101127 | CAG|GTATGCTACT...GACATTTTATTC/TCTCTTCTGACA...GACAG|GCA | 1 | 1 | 98.567 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);