introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
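The column definitions above imply an invariant that can be checked mechanically. The following is a minimal sketch, assuming 1-based inclusive coordinates, so that `length = end - start + 1` (this holds for the rows listed on this page, but is not stated by the source). The in-memory table, the reduced column set, and the helper names `build_sample_db` and `check_lengths` are illustrative only.

```python
import sqlite3

# Three rows copied from the listing on this page (subset of columns only).
# Order: id, dinucleotide_pair, is_minor, score, length, transcript_id,
#        ordinal_index, start, end, phase, in_cds
SAMPLE_ROWS = [
    (3822162, "GT-AG", 0, 1.000000099473604e-05, 113, 720774, 1, 2072077, 2072189, 2, 1),
    (3822163, "GT-AG", 0, 0.0017541590170316, 54, 720774, 2, 2071735, 2071788, 2, 1),
    (3822165, "GT-AG", 0, 0.0001469918564237, 792, 720774, 4, 2070245, 2071036, 0, 1),
]


def build_sample_db():
    """Create an in-memory table with a reduced, hypothetical column set."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE introns (
            id INTEGER PRIMARY KEY,
            dinucleotide_pair TEXT,
            is_minor INTEGER,
            score REAL,
            "length" INTEGER,
            transcript_id INTEGER,
            ordinal_index INTEGER,
            "start" INTEGER,
            "end" INTEGER,
            phase INTEGER,
            in_cds INTEGER
        )
        """
    )
    conn.executemany(
        "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)", SAMPLE_ROWS
    )
    return conn


def check_lengths(conn):
    """Return ids whose length disagrees with end - start + 1 (assumed convention)."""
    rows = conn.execute(
        'SELECT id FROM introns WHERE "length" != "end" - "start" + 1 ORDER BY id'
    ).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    print(check_lengths(build_sample_db()))
```

An empty result means every loaded row is consistent with the assumed coordinate convention. `end` is an SQL keyword, so it is quoted in the queries.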
24 rows where transcript_id = 720774
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3822162 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-gnl|I4U23|000562-T1 720774 | 1 | 2072077 | 2072189 | Adineta vaga 104782 | TGG|GTAAGTTATT...TAAATATTAATT/TAAATATTAATT...TTTAG|TTT | 2 | 1 | 8.829 |
3822163 | GT-AG | 0 | 0.0017541590170316 | 54 | rna-gnl|I4U23|000562-T1 720774 | 2 | 2071735 | 2071788 | Adineta vaga 104782 | TAA|GTATATAATT...TTTGGCTTTTTT/AAAGATATGAAA...TCTAG|AGA | 2 | 1 | 13.546 |
3822164 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|000562-T1 720774 | 3 | 2071379 | 2071433 | Adineta vaga 104782 | CAA|GTGAGTCAAT...AATTTCTTGACT/AATTTCTTGACT...TCTAG|ATA | 0 | 1 | 18.477 |
3822165 | GT-AG | 0 | 0.0001469918564237 | 792 | rna-gnl|I4U23|000562-T1 720774 | 4 | 2070245 | 2071036 | Adineta vaga 104782 | AAA|GTAAACATTA...AGATCTTTCTCG/TTGAAGTTAAAG...TTTAG|GTC | 0 | 1 | 24.079 |
3822166 | GT-AG | 0 | 1.000000099473604e-05 | 262 | rna-gnl|I4U23|000562-T1 720774 | 5 | 2069056 | 2069317 | Adineta vaga 104782 | TTG|GTAAGAAAGA...TTATTTTTATAA/TTTATTTTTATA...AATAG|GTT | 0 | 1 | 39.263 |
3822167 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-gnl|I4U23|000562-T1 720774 | 6 | 2068897 | 2068981 | Adineta vaga 104782 | CAG|GTAAGTCTTA...TATTCTTTCACT/TATTCTTTCACT...TCCAG|TAT | 2 | 1 | 40.475 |
3822168 | GT-AG | 0 | 1.000000099473604e-05 | 492 | rna-gnl|I4U23|000562-T1 720774 | 7 | 2068272 | 2068763 | Adineta vaga 104782 | CAA|GTAATAAGAT...TTTTTTCTGATA/TTTTTTCTGATA...ATTAG|GTA | 0 | 1 | 42.654 |
3822169 | GT-AG | 0 | 0.0009640563972451 | 57 | rna-gnl|I4U23|000562-T1 720774 | 8 | 2068059 | 2068115 | Adineta vaga 104782 | GAA|GTAAACATTA...GTTTCTATGATT/CATAGTCTAATG...TGTAG|AAA | 0 | 1 | 45.209 |
3822170 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|000562-T1 720774 | 9 | 2067936 | 2067992 | Adineta vaga 104782 | AAA|GTAAGAAAGA...TTACTTTTACTA/CTTTTACTAATG...TTTAG|ACA | 0 | 1 | 46.29 |
3822171 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-gnl|I4U23|000562-T1 720774 | 10 | 2067753 | 2067817 | Adineta vaga 104782 | GAG|GTAAATCAAT...CTTCTTTTATCT/TTAAATCTCACT...TGTAG|AAG | 1 | 1 | 48.223 |
3822172 | GT-AG | 0 | 0.0004452365349956 | 55 | rna-gnl|I4U23|000562-T1 720774 | 11 | 2067003 | 2067057 | Adineta vaga 104782 | GGA|GTAAATTTTT...ATCGTTTTGTTA/AATGAATTCATC...TTTAG|TTC | 0 | 1 | 59.607 |
3822173 | GT-AG | 0 | 9.425051854743616e-05 | 51 | rna-gnl|I4U23|000562-T1 720774 | 12 | 2066851 | 2066901 | Adineta vaga 104782 | TCG|GTAAATTTTA...ATCAACTGAACA/AATCAACTGAAC...TTTAG|ATT | 2 | 1 | 61.261 |
3822174 | GT-AG | 0 | 0.0069093271388874 | 62 | rna-gnl|I4U23|000562-T1 720774 | 13 | 2066665 | 2066726 | Adineta vaga 104782 | ACT|GTATGTATAA...AAATTCTGGATA/ATACAATTGATA...TTTAG|GAA | 0 | 1 | 63.292 |
3822175 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|000562-T1 720774 | 14 | 2066322 | 2066381 | Adineta vaga 104782 | GAG|GTTCGATATT...TTTCTTTTAATC/TTTCTTTTAATC...TGTAG|TTA | 1 | 1 | 67.928 |
3822176 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|000562-T1 720774 | 15 | 2066143 | 2066196 | Adineta vaga 104782 | CAG|GTAATTATGC...TTTTTTTTCATA/TTTTTTTTCATA...TTCAG|CTT | 0 | 1 | 69.975 |
3822177 | GT-AG | 0 | 0.0001715587345527 | 74 | rna-gnl|I4U23|000562-T1 720774 | 16 | 2065940 | 2066013 | Adineta vaga 104782 | CAA|GTAAGTTTTT...TGTTTTTCAATT/TTGTTTTTCAAT...ATTAG|GCA | 0 | 1 | 72.088 |
3822178 | GT-AG | 0 | 6.629932869583625e-05 | 53 | rna-gnl|I4U23|000562-T1 720774 | 17 | 2065452 | 2065504 | Adineta vaga 104782 | GCA|GTAAGTTATG...GACTATTTAGTT/TCATTTTTCAGA...TTTAG|GAA | 0 | 1 | 79.214 |
3822179 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|000562-T1 720774 | 18 | 2065288 | 2065341 | Adineta vaga 104782 | AAA|GTAAAAAAGA...TTTAGTTTGAAA/GAAAATTTCATA...TTTAG|ATG | 2 | 1 | 81.016 |
3822180 | GT-AG | 0 | 9.93626285660634e-05 | 50 | rna-gnl|I4U23|000562-T1 720774 | 19 | 2065104 | 2065153 | Adineta vaga 104782 | GAG|GTATGAATTG...TTCGTTTTGAAA/TGAGATTTAATA...TTTAG|TGC | 1 | 1 | 83.21 |
3822181 | GT-AG | 0 | 0.0024389491133032 | 59 | rna-gnl|I4U23|000562-T1 720774 | 20 | 2064962 | 2065020 | Adineta vaga 104782 | CTG|GTAGATTTTA...TCTCCTTTAATC/TCTCCTTTAATC...ACTAG|GGT | 0 | 1 | 84.57 |
3822182 | GT-AG | 0 | 1.852848015261232e-05 | 59 | rna-gnl|I4U23|000562-T1 720774 | 21 | 2064714 | 2064772 | Adineta vaga 104782 | AAG|GTAAGTTTTC...TCGTTCTCAATT/ATCGTTCTCAAT...TTCAG|GAT | 0 | 1 | 87.666 |
3822183 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|000562-T1 720774 | 22 | 2064507 | 2064566 | Adineta vaga 104782 | GAT|GTAAGTAGAT...TGTTTTTTGTTT/GCTGAGTTGATA...ATTAG|GAA | 0 | 1 | 90.074 |
3822184 | GT-AG | 0 | 2.372761665041977e-05 | 56 | rna-gnl|I4U23|000562-T1 720774 | 23 | 2064328 | 2064383 | Adineta vaga 104782 | GAT|GTAATTAAAT...TTTTTCTTTTCT/TTTCTCTTCAAA...TTAAG|TTT | 0 | 1 | 92.088 |
3822185 | GT-AG | 0 | 9.560861871583435e-05 | 46 | rna-gnl|I4U23|000562-T1 720774 | 24 | 2064173 | 2064218 | Adineta vaga 104782 | AAG|GTCTGTTTGT...ACTTTCTCAATG/TACTTTCTCAAT...TGTAG|TTA | 1 | 1 | 93.874 |
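In the rows above, each `scored_motifs` string appears to consist of three `|`-separated parts: a few flanking bases, the intron motifs (with `...` and `/` separating regions), and a few flanking bases again. That reading is inferred from the listed rows and is not a documented format. Under that assumption, the `dinucleotide_pair` value can be recovered from the first and last two bases of the middle part. The function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a 'XX-YY' pair from a scored_motifs string.

    Assumes the layout '<flank>|<intron motifs>|<flank>' seen in the rows on
    this page; raises ValueError if the string does not have three parts.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    if len(intron) < 4:
        raise ValueError(f"intron part too short: {intron!r}")
    # First two bases are the 5' terminus, last two bases are the 3' terminus.
    return f"{intron[:2]}-{intron[-2:]}"


if __name__ == "__main__":
    # scored_motifs value of intron 3822162 from the listing above.
    example = "TGG|GTAAGTTATT...TAAATATTAATT/TAAATATTAATT...TTTAG|TTT"
    print(terminal_dinucleotides(example))
```

For the example string, the middle part starts with `GT` and ends with `AG`, which matches the `GT-AG` value stored in `dinucleotide_pair` for that row.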
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
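The schema above indexes `transcript_id`, so the filter used on this page (`transcript_id = 720774`) should be answerable without a full table scan. The sketch below recreates the table and that one index in an empty in-memory SQLite database and asks SQLite for its query plan. The `transcripts` and `genomes` tables are not created here (SQLite accepts the foreign-key declarations without them), and the helper name `query_plan` is hypothetical. The exact wording of the plan text varies between SQLite versions, so no output is shown.

```python
import sqlite3

# Table definition and one index, copied from the schema on this page.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
  "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""


def query_plan(conn, sql, params=()):
    """Return the human-readable detail strings from EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The detail text is the last column of each plan row.
    return [row[-1] for row in rows]


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

plan = query_plan(
    conn,
    "SELECT * FROM introns WHERE transcript_id = ?",
    (720774,),
)

if __name__ == "__main__":
    for detail in plan:
        print(detail)
```

If the plan mentions `idx_introns_transcript_id`, the lookup is served by the index rather than by scanning every row.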