introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
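The sample rows for transcript 720755 suggest that start and end are inclusive coordinates, so that length = end - start + 1. This is an inference from the displayed data, not a documented guarantee. A minimal sketch that checks the relationship on three rows copied from the table (the helper name `intron_length` is hypothetical):

```python
def intron_length(start: int, end: int) -> int:
    """Length in bp, assuming start and end are both inclusive (inferred, not documented)."""
    return end - start + 1


# (id, start, end, length) copied from the rows where transcript_id = 720755
rows = [
    (3821802, 8984827, 8984884, 58),
    (3821804, 8985137, 8985272, 136),
    (3821819, 8988830, 8989145, 316),
]

# Collect the ids of any rows whose stored length disagrees with the coordinates.
mismatches = [r[0] for r in rows if intron_length(r[1], r[2]) != r[3]]
print(mismatches)
```

An empty list means the stored length agrees with the coordinates for every row checked.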
24 rows where transcript_id = 720755
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821802 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|002851-T1 720755 | 1 | 8984827 | 8984884 | Adineta vaga 104782 | GAT|GTAATATATT...TAGTCTTTTATG/TGTTTGTTTACA...TCTAG|GTC | 0 | 1 | 1.053 |
3821803 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|002851-T1 720755 | 2 | 8984996 | 8985049 | Adineta vaga 104782 | CGA|GTGAGATCAA...GAATAATTGATT/GAATAATTGATT...TTCAG|AGA | 0 | 1 | 2.612 |
3821804 | GT-AG | 0 | 1.000000099473604e-05 | 136 | rna-gnl|I4U23|002851-T1 720755 | 3 | 8985137 | 8985272 | Adineta vaga 104782 | AAT|GTAAGTAGTA...ATTGTATTGAAA/ATTGTATTGAAA...TTCAG|GTT | 0 | 1 | 3.833 |
3821805 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|002851-T1 720755 | 4 | 8985850 | 8985911 | Adineta vaga 104782 | CAA|GTGAGATGGT...TAGCTATTAGAA/ACACCGCTCAAA...ACCAG|AGA | 1 | 1 | 11.935 |
3821806 | GT-AG | 0 | 0.0001736682207377 | 57 | rna-gnl|I4U23|002851-T1 720755 | 5 | 8985972 | 8986028 | Adineta vaga 104782 | CTT|GTAAGTTTAT...AAATATTTGATT/AAATATTTGATT...TCTAG|CTT | 1 | 1 | 12.777 |
3821807 | GT-AG | 0 | 4.426958868623047e-05 | 54 | rna-gnl|I4U23|002851-T1 720755 | 6 | 8986122 | 8986175 | Adineta vaga 104782 | CAG|GTTTTATTTC...TTTTTCTTTTTT/GATTCTCTAAGT...TGTAG|CAG | 1 | 1 | 14.083 |
3821808 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|002851-T1 720755 | 7 | 8986294 | 8986345 | Adineta vaga 104782 | CAA|GTCAGTATTT...ATAGCCTTATTT/AATAGCCTTATT...TTTAG|TTT | 2 | 1 | 15.74 |
3821809 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002851-T1 720755 | 8 | 8986464 | 8986519 | Adineta vaga 104782 | TCG|GTAAAGTGAA...TTGATTTTATTC/ATTGATTTTATT...ATCAG|GTT | 0 | 1 | 17.397 |
3821810 | GT-AG | 0 | 1.000000099473604e-05 | 63 | rna-gnl|I4U23|002851-T1 720755 | 9 | 8986684 | 8986746 | Adineta vaga 104782 | ACG|GTAAAAAAGA...GCTGTCTTACTG/GTCTTACTGATT...TCTAG|AGC | 2 | 1 | 19.7 |
3821811 | GT-AG | 0 | 0.0006466101703143 | 52 | rna-gnl|I4U23|002851-T1 720755 | 10 | 8986832 | 8986883 | Adineta vaga 104782 | AAA|GTAGACTGTT...CATTTTATAATG/AATGTTATCATT...TCTAG|GAT | 0 | 1 | 20.893 |
3821812 | GT-AG | 0 | 0.0075155988104775 | 63 | rna-gnl|I4U23|002851-T1 720755 | 11 | 8986995 | 8987057 | Adineta vaga 104782 | ACA|GTAAACGTTT...GTTTTCTTATTT/TGTTTTCTTATT...TTTAG|AAA | 0 | 1 | 22.452 |
3821813 | GT-AG | 0 | 0.0716333176618496 | 53 | rna-gnl|I4U23|002851-T1 720755 | 12 | 8987181 | 8987233 | Adineta vaga 104782 | GAA|GTATATTTGA...ATAATCATGATT/TTTATAATCATG...ATTAG|GTT | 0 | 1 | 24.179 |
3821814 | GT-AG | 0 | 2.896992143878842e-05 | 69 | rna-gnl|I4U23|002851-T1 720755 | 13 | 8987312 | 8987380 | Adineta vaga 104782 | CAA|GTAAGTTATT...CTCGTTTTATAT/TCTCGTTTTATA...CTTAG|ATC | 0 | 1 | 25.274 |
3821815 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|002851-T1 720755 | 14 | 8987545 | 8987596 | Adineta vaga 104782 | AAA|GTAAAAATTT...TAGTTTGTATTC/TTAGTTTGTATT...TATAG|ACA | 2 | 1 | 27.577 |
3821816 | GT-AG | 0 | 0.0096870283658649 | 58 | rna-gnl|I4U23|002851-T1 720755 | 15 | 8987829 | 8987886 | Adineta vaga 104782 | TTA|GTATGAATCA...TTTTCCTTATTC/ATTTTCCTTATT...TTTAG|GAA | 0 | 1 | 30.834 |
3821817 | GT-AG | 0 | 7.294276922911536e-05 | 55 | rna-gnl|I4U23|002851-T1 720755 | 16 | 8988478 | 8988532 | Adineta vaga 104782 | GCA|GTAAGTTTAA...GTATACTTATGT/CGTATACTTATG...TCTAG|TTT | 0 | 1 | 39.132 |
3821818 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-gnl|I4U23|002851-T1 720755 | 17 | 8988581 | 8988627 | Adineta vaga 104782 | ATG|GTAAGATGAA...TCCTTCTAAACT/TCTAAACTTATT...TATAG|ACT | 0 | 1 | 39.806 |
3821819 | GT-AG | 0 | 1.000000099473604e-05 | 316 | rna-gnl|I4U23|002851-T1 720755 | 18 | 8988830 | 8989145 | Adineta vaga 104782 | ATG|GTAAAACTCT...TTTTCTTTCGTA/TTCTTTCGTAGT...ATCAG|CAA | 1 | 1 | 42.643 |
3821820 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|002851-T1 720755 | 19 | 8990078 | 8990135 | Adineta vaga 104782 | GTT|GTAAGTGATC...ATGTTTTTCGTT/AAACTACTAATC...TACAG|AAA | 0 | 1 | 55.729 |
3821821 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-gnl|I4U23|002851-T1 720755 | 20 | 8991192 | 8991272 | Adineta vaga 104782 | CAG|GTACAGTAAA...AAATATTTAATT/CTTTTTCTAATT...CATAG|AAC | 0 | 1 | 70.556 |
3821822 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|002851-T1 720755 | 21 | 8991361 | 8991422 | Adineta vaga 104782 | TTG|GTCGGCGACA...TTCGCCTTCGGT/TCGGTGGTAACA...AATAG|CTC | 1 | 1 | 71.792 |
3821823 | GT-AG | 0 | 0.005007143164058 | 55 | rna-gnl|I4U23|002851-T1 720755 | 22 | 8992995 | 8993049 | Adineta vaga 104782 | GAG|GTATTTTCAT...CAACTTTTAATA/TAATATTTCATT...TTAAG|GAC | 1 | 1 | 93.864 |
3821824 | GT-AG | 0 | 0.0015617311761906 | 53 | rna-gnl|I4U23|002851-T1 720755 | 23 | 8993103 | 8993155 | Adineta vaga 104782 | AAA|GTATGTATAA...TTTTCTTTTTCC/AAATTGTTTATC...CATAG|GGA | 0 | 1 | 94.608 |
3821825 | GT-AG | 0 | 0.0003421091814938 | 63 | rna-gnl|I4U23|002851-T1 720755 | 24 | 8993403 | 8993465 | Adineta vaga 104782 | ATG|GTATGTAATT...GTTTTCTTTTTT/TTTTTTCGAATA...TCTAG|GTG | 1 | 1 | 98.076 |
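The filtered view above corresponds to a plain `WHERE transcript_id = 720755` query ordered by id. The sketch below reproduces it with Python's built-in `sqlite3` module against an in-memory database. The trimmed column set and the three inserted rows are assumptions made for illustration (values copied from the table); a real session would connect to a local copy of the database instead.

```python
import sqlite3

# In-memory stand-in for a local copy of the database (assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, is_minor INTEGER, score REAL, "
    "length INTEGER, transcript_id INTEGER, ordinal_index INTEGER, phase INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        (3821802, 0, 1.000000099473604e-05, 58, 720755, 1, 0),
        (3821813, 0, 0.0716333176618496, 53, 720755, 12, 0),
        (3821819, 0, 1.000000099473604e-05, 316, 720755, 18, 1),
    ],
)

# Same filter and ordering as the table view, with a bound parameter.
rows = conn.execute(
    "SELECT id, ordinal_index, length, score FROM introns "
    "WHERE transcript_id = ? ORDER BY id",
    (720755,),
).fetchall()

# The row with the highest minor-intron score among those returned.
top = max(rows, key=lambda r: r[3])
print(len(rows), top[0])
```

Binding the transcript id as a parameter rather than formatting it into the SQL string keeps the query reusable for other transcripts.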
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
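Because the schema indexes transcript_id, a per-transcript lookup should not need a full table scan. The sketch below recreates the table and that one index in an empty in-memory SQLite database and asks the query planner how it would run the lookup. The exact wording of the plan differs between SQLite versions, so the code only checks whether the index name appears in it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Schema copied from the page; the referenced tables need not exist to create it.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
      "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
      "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
      "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
      PRIMARY KEY ([id]),
      FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
      FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    """
)

# The last column of each plan row holds the human-readable description.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (720755,),
).fetchall()
details = [row[-1] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(uses_index)
```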