introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability that the intron is minor, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron within the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
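In the rows listed for this transcript, length equals end - start + 1, which suggests that start and end are inclusive coordinates. This is an observation from the sample rows, not a documented guarantee. The sketch below checks that relationship on the first three rows; the `Intron` tuple and `span_length` helper are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Intron(NamedTuple):
    """Subset of the introns columns needed for the length check."""
    id: int
    start: int
    end: int
    length: int


def span_length(start: int, end: int) -> int:
    """Length in base pairs of a closed interval [start, end]."""
    return end - start + 1


# First three rows of the listing: (id, start, end, length)
rows = [
    Intron(156031903, 358993, 359132, 140),
    Intron(156031904, 359189, 359243, 55),
    Intron(156031905, 359287, 359344, 58),
]

# Collect ids whose stored length disagrees with the inclusive span
mismatches = [r.id for r in rows if span_length(r.start, r.end) != r.length]
print(mismatches)
```

An empty list means every checked row is consistent with inclusive coordinates.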
37 rows where transcript_id = 28131404
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
156031903 | GT-AG | 0 | 1.000000099473604e-05 | 140 | rna155.p1 28131404 | 1 | 358993 | 359132 | Planoprotostelium fungivorum 1890364 | GAG|GTGAGCAACG...ATCTCTTTACAG/GATCTCTTTACA...TATAG|CTA | 1 | 1 | 0.795 |
156031904 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna155.p1 28131404 | 2 | 359189 | 359243 | Planoprotostelium fungivorum 1890364 | AAG|GTGAGAATGC...ATTGTCTCACAC/AATTGTCTCACA...TTCAG|TTC | 0 | 1 | 1.908 |
156031905 | GT-AG | 0 | 1.6300573916537778e-05 | 58 | rna155.p1 28131404 | 3 | 359287 | 359344 | Planoprotostelium fungivorum 1890364 | TAG|GTACTTGACA...GCACTCTTATTG/AAACATCTGATC...TCTAG|GAC | 1 | 1 | 2.763 |
156031906 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna155.p1 28131404 | 4 | 359426 | 359480 | Planoprotostelium fungivorum 1890364 | CCT|GTAAGAAGGG...TCATTTTTCATC/TCATTTTTCATC...CACAG|GTG | 1 | 1 | 4.373 |
156031907 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna155.p1 28131404 | 5 | 359523 | 359574 | Planoprotostelium fungivorum 1890364 | CAG|GTGAACATCC...TTTTCCATAGAT/TAGATAATCACT...CGCAG|GAG | 1 | 1 | 5.208 |
156031908 | GT-AG | 0 | 1.000000099473604e-05 | 45 | rna155.p1 28131404 | 6 | 359743 | 359787 | Planoprotostelium fungivorum 1890364 | CAG|GTGAGACACG...TGGTCGGTGACG/TGGTCGGTGACG...ATCAG|CTC | 1 | 1 | 8.547 |
156031909 | GT-AG | 0 | 1.000000099473604e-05 | 45 | rna155.p1 28131404 | 7 | 359911 | 359955 | Planoprotostelium fungivorum 1890364 | CAC|GTGAGGCATC...CATCCTCTGATG/CATCCTCTGATG...ATCAG|CTC | 1 | 1 | 10.992 |
156031910 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna155.p1 28131404 | 8 | 359998 | 360082 | Planoprotostelium fungivorum 1890364 | CAG|GTACGATGTT...TTCGTCTTACGA/TTTCGTCTTACG...TGCAG|GTC | 1 | 1 | 11.827 |
156031911 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna155.p1 28131404 | 9 | 360125 | 360237 | Planoprotostelium fungivorum 1890364 | CGG|GTAAGACATC...ATCTCATTACAC/ATCCATCTCATT...TGCAG|CAC | 1 | 1 | 12.661 |
156031912 | GT-AG | 0 | 1.000000099473604e-05 | 42 | rna155.p1 28131404 | 10 | 360328 | 360369 | Planoprotostelium fungivorum 1890364 | CAG|GTAATGTATG...GTCACGCTGACA/GTCACGCTGACA...TCCAG|GTC | 1 | 1 | 14.45 |
156031913 | GT-AG | 0 | 1.000000099473604e-05 | 37 | rna155.p1 28131404 | 11 | 360580 | 360616 | Planoprotostelium fungivorum 1890364 | TGA|GTGAGTGAAA...GAGGATCTGACG/GAGGATCTGACG...AGTAG|AAC | 1 | 1 | 18.625 |
156031914 | GT-AG | 0 | 1.000000099473604e-05 | 39 | rna155.p1 28131404 | 12 | 361013 | 361051 | Planoprotostelium fungivorum 1890364 | AGA|GTGAGGTGGA...GAGTACTGAAGA/AGAGTACTGAAG...CGCAG|GGA | 1 | 1 | 26.496 |
156031915 | GT-AG | 0 | 0.0103684354097381 | 113 | rna155.p1 28131404 | 13 | 361183 | 361295 | Planoprotostelium fungivorum 1890364 | GGA|GTATTTCCAG...TCTTTCGTATCT/GTATCTGTTATC...AATAG|GCT | 0 | 1 | 29.1 |
156031916 | GT-AG | 0 | 1.000000099473604e-05 | 43 | rna155.p1 28131404 | 14 | 361420 | 361462 | Planoprotostelium fungivorum 1890364 | CAG|GTGTGCGAAA...CCCTTTTTACCG/ACCCTTTTTACC...CGCAG|TGT | 1 | 1 | 31.564 |
156031917 | GT-AG | 0 | 1.000000099473604e-05 | 44 | rna155.p1 28131404 | 15 | 361622 | 361665 | Planoprotostelium fungivorum 1890364 | GAG|GTTCGTAACA...GGTAATTGAACA/AGGTAATTGAAC...GATAG|GTT | 1 | 1 | 34.725 |
156031918 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna155.p1 28131404 | 16 | 361867 | 361918 | Planoprotostelium fungivorum 1890364 | CAG|GTGAGATCAT...TTCTCCTCCATC/ACTGGAATGACG...AAAAG|GTC | 1 | 1 | 38.72 |
156031919 | GT-AG | 0 | 1.000000099473604e-05 | 42 | rna155.p1 28131404 | 17 | 362041 | 362082 | Planoprotostelium fungivorum 1890364 | TTT|GTGAGTTCGC...TCCGTCTGATGT/GTCCGTCTGATG...ACTAG|GCG | 0 | 1 | 41.145 |
156031920 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna155.p1 28131404 | 18 | 362233 | 362282 | Planoprotostelium fungivorum 1890364 | AGG|GTGAGATGAG...CGCCTCTGATCT/ACGCCTCTGATC...ACAAG|GCC | 0 | 1 | 44.126 |
156031921 | GT-AG | 0 | 1.000000099473604e-05 | 41 | rna155.p1 28131404 | 19 | 362543 | 362583 | Planoprotostelium fungivorum 1890364 | AGG|GTGAGAAATC...CGAAGATTGACA/CGAAGATTGACA...TGTAG|ATC | 2 | 1 | 49.294 |
156031922 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna155.p1 28131404 | 20 | 362743 | 362791 | Planoprotostelium fungivorum 1890364 | GGG|GTGAGTCATC...ATCACATTACCA/CACATCATCACA...ATCAG|AAG | 2 | 1 | 52.455 |
156031923 | GT-AG | 0 | 1.000000099473604e-05 | 40 | rna155.p1 28131404 | 21 | 362981 | 363020 | Planoprotostelium fungivorum 1890364 | ACC|GTGAGTTAGC...AATTCGGTGAGA/TGAGAATTCACG...CGCAG|AGC | 2 | 1 | 56.211 |
156031924 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna155.p1 28131404 | 22 | 363172 | 363223 | Planoprotostelium fungivorum 1890364 | CAC|GTCATCCACA...GATCGCTCGATC/TCGATCTACAAC...CTCAG|ATT | 0 | 1 | 59.213 |
156031925 | GT-AG | 0 | 1.000000099473604e-05 | 40 | rna155.p1 28131404 | 23 | 363266 | 363305 | Planoprotostelium fungivorum 1890364 | CAG|GTGACACGAC...CTGTTCTCATCT/GCTGTTCTCATC...GTCAG|ATC | 0 | 1 | 60.048 |
156031926 | GT-AG | 0 | 1.000000099473604e-05 | 43 | rna155.p1 28131404 | 24 | 363435 | 363477 | Planoprotostelium fungivorum 1890364 | AAG|GTGGGCCGTC...CTCTTCTCATCA/ACTCTTCTCATC...TGCAG|GTC | 0 | 1 | 62.612 |
156031927 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna155.p1 28131404 | 25 | 363631 | 363687 | Planoprotostelium fungivorum 1890364 | AGC|GTCAAGTGAG...TTATCCGTGACA/CTTCGTCTGACT...TGAAG|TCG | 0 | 1 | 65.653 |
156031928 | GT-AG | 0 | 1.000000099473604e-05 | 40 | rna155.p1 28131404 | 26 | 363734 | 363773 | Planoprotostelium fungivorum 1890364 | ACC|GTGAGTTAGC...AAATTCATGATC/TGAAAATTCATG...CGCAG|AAT | 1 | 1 | 66.567 |
156031929 | GT-AG | 0 | 1.000000099473604e-05 | 40 | rna155.p1 28131404 | 27 | 363968 | 364007 | Planoprotostelium fungivorum 1890364 | CAG|GTGACACGAG...CTGTTCTCATCT/GCTGTTCTCATC...GTCAG|ATC | 0 | 1 | 70.423 |
156031930 | GT-AG | 0 | 1.000000099473604e-05 | 42 | rna155.p1 28131404 | 28 | 364137 | 364178 | Planoprotostelium fungivorum 1890364 | AAG|GTGGGCCGTC...CTCTTCTCATCA/ACTCTTCTCATC...CGCAG|GTC | 0 | 1 | 72.987 |
156031931 | GT-AG | 0 | 1.000000099473604e-05 | 39 | rna155.p1 28131404 | 29 | 364337 | 364375 | Planoprotostelium fungivorum 1890364 | CAA|GTGAGTTATC...CATTTCTTCGTC/CTTCGTCTGACT...GACAG|ATA | 2 | 1 | 76.128 |
156031932 | GT-AG | 0 | 1.000000099473604e-05 | 40 | rna155.p1 28131404 | 30 | 364666 | 364705 | Planoprotostelium fungivorum 1890364 | AAT|GTGAGTTACA...ACGGAGGTAACA/ACGGAGGTAACA...ATCAG|GGA | 1 | 1 | 81.892 |
156031933 | GT-AG | 0 | 1.000000099473604e-05 | 39 | rna155.p1 28131404 | 31 | 364880 | 364918 | Planoprotostelium fungivorum 1890364 | AAG|GTGAGAGTCG...TGGATCTGATCT/TTGGATCTGATC...GACAG|AAC | 1 | 1 | 85.351 |
156031934 | GT-AG | 0 | 1.000000099473604e-05 | 46 | rna155.p1 28131404 | 32 | 364957 | 365002 | Planoprotostelium fungivorum 1890364 | CAG|GTGAGAAGGA...AATATGTGAGCT/TGTGAGCTGATG...GACAG|CTG | 0 | 1 | 86.106 |
156031935 | GT-AG | 0 | 1.000000099473604e-05 | 39 | rna155.p1 28131404 | 33 | 365066 | 365104 | Planoprotostelium fungivorum 1890364 | CAG|GTGAGAAGCT...ATCACCTCACAC/CATCACCTCACA...GACAG|AGA | 0 | 1 | 87.358 |
156031936 | GT-AG | 0 | 1.000000099473604e-05 | 45 | rna155.p1 28131404 | 34 | 365251 | 365295 | Planoprotostelium fungivorum 1890364 | AGT|GTGAGTTTCT...AGGGCCGTGACG/CGTCATCTGACG...CACAG|TAT | 2 | 1 | 90.26 |
156031937 | GT-AG | 0 | 1.000000099473604e-05 | 44 | rna155.p1 28131404 | 35 | 365459 | 365502 | Planoprotostelium fungivorum 1890364 | ACG|GTAAGGTCTC...ATCAGATTGACT/ATCAGATTGACT...AGCAG|GCA | 0 | 1 | 93.5 |
156031938 | GT-AG | 0 | 1.000000099473604e-05 | 42 | rna155.p1 28131404 | 36 | 365597 | 365638 | Planoprotostelium fungivorum 1890364 | CAG|GTGCACACCT...TCATTCTCACCA/CTCATTCTCACC...CTCAG|ATG | 1 | 1 | 95.369 |
156031939 | GT-AG | 0 | 1.000000099473604e-05 | 40 | rna155.p1 28131404 | 37 | 365712 | 365751 | Planoprotostelium fungivorum 1890364 | GAG|GTGTGTAGAA...ACGAACGTAACA/CACAAAATGACA...CACAG|TTT | 2 | 1 | 96.82 |
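The filter behind this listing (transcript_id = 28131404, sorted by id) can be reproduced with Python's built-in sqlite3 module. The sketch below is a minimal illustration under stated assumptions: it uses an in-memory database with a reduced set of columns and three rows copied from the listing, rather than the real database file, and `introns_for_transcript` is a hypothetical helper name.

```python
import sqlite3

# In-memory stand-in for the real database (assumption: reduced column set)
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    " id INTEGER PRIMARY KEY, transcript_id INTEGER, ordinal_index INTEGER,"
    " length INTEGER, phase INTEGER, score REAL)"
)

# Three rows copied from the listing above
sample = [
    (156031903, 28131404, 1, 140, 1, 1.000000099473604e-05),
    (156031904, 28131404, 2, 55, 0, 1.000000099473604e-05),
    (156031915, 28131404, 13, 113, 0, 0.0103684354097381),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length, phase, score) rows, sorted by id."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length, phase, score FROM introns"
        " WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    )
    return cur.fetchall()


rows = introns_for_transcript(conn, 28131404)
# Row with the highest minor-intron score among the sample rows
highest = max(rows, key=lambda r: r[4])
print(len(rows), highest[0])
```

Passing the transcript id as a bound parameter, rather than formatting it into the SQL string, keeps the query safe to reuse with other ids.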
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
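The index on transcript_id is what makes a per-transcript filter cheap. As a rough check, the sketch below loads the table definition and that one index into an empty in-memory SQLite database and asks the planner how it would run the filter. It assumes a stock Python sqlite3 build; the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# Table definition and transcript_id index, as given in the schema above
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
    "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
    "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
    "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
    "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would execute the per-transcript filter
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (28131404,),
).fetchall()

# The fourth column of each plan row holds the human-readable detail text
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(uses_index)
```

The referenced transcripts and genomes tables are not created here; SQLite accepts the foreign key declarations anyway, which keeps the sketch self-contained.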