introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
16 rows where transcript_id = 9114882
This data as json, CSV (advanced)
Suggested facets: is_minor, score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389622 | GT-AG | 0 | 1.000000099473604e-05 | 896 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 1 | 883199 | 884094 | Columbina picui 115618 | GAG|GTGAGGCCTT...TCTTTCTTGGTC/AGTATTGTAAAA...GTCAG|GAC | 0 | 1 | 5.903 |
49389623 | GT-AG | 0 | 0.0003019331762879 | 606 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 2 | 882431 | 883036 | Columbina picui 115618 | ATG|GTATGTGTTC...GCTGTTTTGTCA/ATGTTACTCAGT...AACAG|CTG | 0 | 1 | 12.153 |
49389624 | GT-AG | 0 | 1.000000099473604e-05 | 370 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 3 | 881937 | 882306 | Columbina picui 115618 | GAG|GTAAGTAGGA...AGATCCCTAGTG/TATGTTGTGACT...TCTAG|GGT | 1 | 1 | 16.937 |
49389625 | GT-AG | 0 | 3.895500948295244e-05 | 481 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 4 | 881274 | 881754 | Columbina picui 115618 | AAG|GTAAGCTCTG...TTTCTTTCGACT/TTTCTTTCGACT...TCTAG|GTG | 0 | 1 | 23.958 |
49389626 | GT-AG | 0 | 1.000000099473604e-05 | 448 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 5 | 880718 | 881165 | Columbina picui 115618 | GTG|GTAAGGAGGG...TCCTGTTTAACA/TCCTGTTTAACA...CCCAG|GTG | 0 | 1 | 28.125 |
49389627 | GT-AG | 0 | 1.000000099473604e-05 | 308 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 6 | 880203 | 880510 | Columbina picui 115618 | CGG|GTAAGTACCA...ATTTTTTTTTTT/TGAATTGTAATT...CCTAG|TTG | 0 | 1 | 36.111 |
49389628 | GT-AG | 0 | 0.0004842602346773 | 620 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 7 | 879458 | 880077 | Columbina picui 115618 | GTA|GTAAGTTTCA...GTTTCCTTCAAG/AGTTGACTGATT...CCTAG|TAA | 2 | 1 | 40.934 |
49389629 | GT-AG | 0 | 1.000000099473604e-05 | 706 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 8 | 878691 | 879396 | Columbina picui 115618 | CAG|GTGAGCTTCT...GATCCCTTTTCC/AATTTACTGATC...TCTAG|GGC | 0 | 1 | 43.287 |
49389630 | GT-AG | 0 | 1.000000099473604e-05 | 118 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 9 | 878426 | 878543 | Columbina picui 115618 | CGG|GTGAGAGCTG...TGCCTTTTGAAC/TGCCTTTTGAAC...TGTAG|GTG | 0 | 1 | 48.958 |
49389631 | GT-AG | 0 | 1.000000099473604e-05 | 875 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 10 | 877299 | 878173 | Columbina picui 115618 | CAG|GTAATTAACG...TTATCTTTTTTT/AATTGGTTTATC...TGCAG|GTT | 0 | 1 | 58.681 |
49389632 | GT-AG | 0 | 2.574025974345581e-05 | 922 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 11 | 876233 | 877154 | Columbina picui 115618 | GAG|GTATTGAGTT...GGAATTTTATAT/TTTTATATCACT...TACAG|ATG | 0 | 1 | 64.236 |
49389633 | GT-AG | 1 | 98.90479669895328 | 562 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 12 | 875470 | 876031 | Columbina picui 115618 | ATT|GTATCCTTTG...TTTTTCCTAACT/TTTTTCCTAACT...TTTAG|ATC | 0 | 1 | 71.991 |
49389634 | GT-AG | 0 | 1.000000099473604e-05 | 397 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 13 | 874902 | 875298 | Columbina picui 115618 | GTG|GTAAGTGATG...TAACCCTTACCC/GCTGTGCTCAGT...CACAG|AGC | 0 | 1 | 78.588 |
49389635 | GT-AG | 0 | 1.000000099473604e-05 | 854 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 14 | 873828 | 874681 | Columbina picui 115618 | GAG|GTAAAAGATT...TTTTCCTTCCCT/AGATTTCTCACT...TTTAG|AAG | 1 | 1 | 87.076 |
49389636 | GT-AG | 0 | 1.000000099473604e-05 | 614 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 15 | 873061 | 873674 | Columbina picui 115618 | GAG|GTGAGGGTCT...TAATTCTTCACT/CTTGTTCTCATT...TGTAG|ACA | 1 | 1 | 92.978 |
49389637 | GT-AG | 0 | 0.0025239796496416 | 147 | rna-gnl|WGS:VYZG|COLPIC_R14097_mrna 9114882 | 16 | 872798 | 872944 | Columbina picui 115618 | CAA|GTATGTGTTC...AAAATCTCAGCA/CAAAATCTCAGC...GTAAG|TCA | 0 | 1 | 97.454 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);