introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
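As a minimal sketch of how these columns relate, the Python snippet below loads three of the rows listed on this page into an in-memory SQLite table and compares `length` with `end - start + 1`. The reduced table (only eight of the columns) and the variable names are assumptions made for illustration; the inclusive-coordinate reading is inferred from the listed values, not stated by the database.

```python
import sqlite3

# Reduced, illustrative copy of the introns table: only the columns used here.
# The rows are copied from the listing for transcript_id = 9114832.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "is_minor" INTEGER, "length" INTEGER, '
    '"transcript_id" INTEGER, "ordinal_index" INTEGER, '
    '"start" INTEGER, "end" INTEGER, "in_cds" INTEGER)'
)
rows = [
    (49389160, 0, 16644, 9114832, 1, 74173, 90816, 1),
    (49389161, 0, 971, 9114832, 2, 90919, 91889, 1),
    (49389162, 0, 1561, 9114832, 3, 91972, 93532, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?)", rows)

# "end" is an SQL keyword, so it is quoted ("start" is quoted for symmetry).
query = (
    'SELECT ordinal_index, length, "end" - "start" + 1 AS span '
    'FROM introns WHERE transcript_id = ? AND in_cds = 1 '
    'ORDER BY ordinal_index'
)
result = conn.execute(query, (9114832,)).fetchall()
for ordinal_index, length, span in result:
    # Print whether the stored length matches the inclusive span.
    print(ordinal_index, length, span, length == span)
```

For these rows the stored `length` matches the inclusive span, which is consistent with `start` and `end` both being part of the intron.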
13 rows where transcript_id = 9114832
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389160 | GT-AG | 0 | 1.000000099473604e-05 | 16644 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 1 | 74173 | 90816 | Columbina picui 115618 | GAG\|GTAGGAGATA...TTTTTCTTTTCT/AGTTATTTTACA...TATAG\|AAA | 0 | 1 | 7.667 |
49389161 | GT-AG | 0 | 0.002577209000602 | 971 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 2 | 90919 | 91889 | Columbina picui 115618 | CAG\|GTATGTTCCT...TGTATTTTGATT/TGTATTTTGATT...TCCAG\|GTT | 0 | 1 | 13.333 |
49389162 | GT-AG | 0 | 1.000000099473604e-05 | 1561 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 3 | 91972 | 93532 | Columbina picui 115618 | AAG\|GTAATAGAAA...TGCTCTTTGTTC/GTATCTTTCAAT...TGTAG\|AAA | 1 | 1 | 17.889 |
49389163 | GT-AG | 0 | 1.57709928231347e-05 | 1720 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 4 | 93666 | 95385 | Columbina picui 115618 | TAA\|GTACATGCAT...GCTGCGATGATG/ATGATGGTAAGG...CAGAG\|TGG | 2 | 1 | 25.278 |
49389164 | GT-AG | 0 | 7.57907005368613e-05 | 2709 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 5 | 95599 | 98307 | Columbina picui 115618 | TGG\|GTAAACATGA...CCTTCATTATTA/ACGTTATTGAAT...TTAAG\|CAT | 2 | 1 | 37.111 |
49389165 | GT-AG | 0 | 1.000000099473604e-05 | 162 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 6 | 98420 | 98581 | Columbina picui 115618 | GAG\|GTGAGTACAG...ATGTTTTTACAT/AATGTTTTTACA...TTTAG\|GAT | 0 | 1 | 43.333 |
49389166 | GT-AG | 0 | 1.000000099473604e-05 | 554 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 7 | 98662 | 99215 | Columbina picui 115618 | AGG\|GTGAGTTTCT...TTATTATTATTG/TTACTATTTATT...TCCAG\|AGA | 2 | 1 | 47.778 |
49389167 | GT-AG | 0 | 1.000000099473604e-05 | 1196 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 8 | 99350 | 100545 | Columbina picui 115618 | CAG\|GTAGATGAAT...TTATCTTTGAAC/TTATCTTTGAAC...CTCAG\|GTC | 1 | 1 | 55.222 |
49389168 | GT-AG | 0 | 1.000000099473604e-05 | 786 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 9 | 100621 | 101406 | Columbina picui 115618 | ATG\|GTAATTGCGT...TGTTCCTTACTA/ATGTTCCTTACT...TATAG\|AGA | 1 | 1 | 59.389 |
49389169 | GT-AG | 0 | 1.000000099473604e-05 | 165 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 10 | 101579 | 101743 | Columbina picui 115618 | GAT\|GTGAGTTTTG...ACGATCTCAATC/CACGATCTCAAT...TTCAG\|GAA | 2 | 1 | 68.944 |
49389170 | GT-AG | 0 | 0.0008663101445216 | 531 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 11 | 101961 | 102491 | Columbina picui 115618 | CTG\|GTAACATCCC...AAGTTTTTAATT/AAGTTTTTAATT...TGCAG\|GCT | 0 | 1 | 81.0 |
49389171 | GT-AG | 0 | 1.000000099473604e-05 | 1035 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 12 | 102557 | 103591 | Columbina picui 115618 | GAG\|GTATGGCTGA...GAAGGCTTACTG/TGAAGGCTTACT...TTTAG\|GTA | 2 | 1 | 84.611 |
49389172 | GT-AG | 0 | 1.000000099473604e-05 | 363 | rna-gnl\|WGS:VYZG\|COLPIC_R06823_mrna 9114832 | 13 | 103711 | 104073 | Columbina picui 115618 | TTG\|GTAAGATATT...GGTTTCTTAATG/TGGTTTCTTAAT...TTTAG\|GTG | 1 | 1 | 91.222 |
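The `start`, `end`, and `phase` values in the table above can be cross-checked against each other. The sketch below is one way to do that. It assumes the rows are listed in transcription order with inclusive coordinates, and that every exon lying between two listed introns is entirely coding; none of this is stated by the database. The names `INTRONS`, `exon_lengths`, and `phases_consistent` are hypothetical helpers introduced for this example only.

```python
# (start, end, phase) for the 13 introns of transcript 9114832,
# copied from the table above in ordinal_index order.
INTRONS = [
    (74173, 90816, 0),
    (90919, 91889, 0),
    (91972, 93532, 1),
    (93666, 95385, 2),
    (95599, 98307, 2),
    (98420, 98581, 0),
    (98662, 99215, 2),
    (99350, 100545, 1),
    (100621, 101406, 1),
    (101579, 101743, 2),
    (101961, 102491, 0),
    (102557, 103591, 2),
    (103711, 104073, 1),
]


def exon_lengths(introns):
    """Lengths of the exons lying between consecutive introns.

    Assumes inclusive coordinates, so the exon between two introns
    spans next_start - prev_end - 1 bases.
    """
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(introns, introns[1:])]


def phases_consistent(introns):
    """Check that each phase follows from the previous one.

    Assumes every base of the intervening exon is coding, so the next
    phase should equal (previous phase + exon length) mod 3.
    """
    lengths = exon_lengths(introns)
    return all(
        (cur[2] + exon_len) % 3 == nxt[2]
        for cur, nxt, exon_len in zip(introns, introns[1:], lengths)
    )


lengths = exon_lengths(INTRONS)
print(lengths)
print(phases_consistent(INTRONS))
```

Under those assumptions, the check passes for all twelve intervening exons of this transcript. A failure on another transcript would not necessarily indicate bad data: it could also mean an exon is only partly coding or the transcript lies on the reverse strand.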
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
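To illustrate how the indexes in this schema support the filter used on this page (`transcript_id = 9114832`), the sketch below recreates the table and two of its indexes in an empty in-memory SQLite database and asks SQLite for its query plan. It is only an illustration: the referenced `transcripts` and `genomes` tables are not created (SQLite does not enforce foreign keys unless asked to), and the plan chosen on the real, populated database may differ.

```python
import sqlite3

# Schema copied from this page; only two of the seven indexes are recreated.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
  "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
  "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
  "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
  "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Each EXPLAIN QUERY PLAN row ends with a human-readable detail string.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (9114832,),
).fetchall()
details = [row[3] for row in plan]
for detail in details:
    print(detail)

uses_transcript_index = any("idx_introns_transcript_id" in d for d in details)
print(uses_transcript_index)
```

The exact wording of the plan text varies between SQLite versions, so the sketch only looks for the index name rather than matching the whole string.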