introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
18 rows where transcript_id = 9129085
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49484665 | GT-AG | 0 | 1.000000099473604e-05 | 33383 | rna-XM_004674010.2 9129085 | 1 | 9682984 | 9716366 | Condylura cristata 143302 | GCG|GTGAGTCCTG...CTGACCTCACCT/ACTGACCTCACC...TGCAG|GTG | 1 | 1 | 2.525 |
49484666 | GC-AG | 0 | 1.000000099473604e-05 | 1424 | rna-XM_004674010.2 9129085 | 2 | 9716674 | 9718097 | Condylura cristata 143302 | GTG|GCCAGCCTCC...CCTGCTCTGACC/CCTGCTCTGACC...TATAG|GCC | 2 | 1 | 12.336 |
49484667 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_004674010.2 9129085 | 3 | 9718289 | 9718381 | Condylura cristata 143302 | CTG|GTGAGCCTGG...CTCTTCTTCCCC/TGCCTGCTCATG...CCCAG|ATG | 1 | 1 | 18.44 |
49484668 | GT-AG | 0 | 1.000000099473604e-05 | 692 | rna-XM_004674010.2 9129085 | 4 | 9718533 | 9719224 | Condylura cristata 143302 | TCG|GTAAGGCTCT...GACGCCTTCCAA/GCCTTCCAAATA...TGCAG|CCC | 2 | 1 | 23.266 |
49484669 | GT-AG | 0 | 1.000000099473604e-05 | 195 | rna-XM_004674010.2 9129085 | 5 | 9719374 | 9719568 | Condylura cristata 143302 | CAG|GTGGGATTCT...TTGGCCTTTTCC/AGTACCCTGAGA...TGCAG|AGA | 1 | 1 | 28.028 |
49484670 | GT-AG | 0 | 1.000000099473604e-05 | 3573 | rna-XM_004674010.2 9129085 | 6 | 9719836 | 9723408 | Condylura cristata 143302 | CCA|GTGAGCGTCT...GTTTCTTTCGTC/AGTTTGCTGAGA...GCTAG|CTG | 1 | 1 | 36.561 |
49484671 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_004674010.2 9129085 | 7 | 9723543 | 9723701 | Condylura cristata 143302 | GAG|GTGAGGATGA...TGGGGTTGAACC/CTGGGGTTGAAC...TGCAG|GAC | 0 | 1 | 40.844 |
49484672 | GT-AG | 0 | 1.000000099473604e-05 | 170 | rna-XM_004674010.2 9129085 | 8 | 9723838 | 9724007 | Condylura cristata 143302 | TGG|GTGAGCGGGC...CCACCCTCATTC/CCCACCCTCATT...AACAG|AAA | 1 | 1 | 45.19 |
49484673 | GT-AG | 0 | 1.000000099473604e-05 | 1222 | rna-XM_004674010.2 9129085 | 9 | 9724128 | 9725349 | Condylura cristata 143302 | CAG|GTGGGTACCG...TGCCCCCTGACT/CCCTTTGTTACT...TGCAG|ATG | 1 | 1 | 49.025 |
49484674 | GT-AG | 0 | 8.79347076686756e-05 | 100 | rna-XM_004674010.2 9129085 | 10 | 9725500 | 9725599 | Condylura cristata 143302 | CAG|GTACGCCTGG...GGGCTCTTCCCC/TCCTCGCTGACT...TCCAG|TGT | 1 | 1 | 53.819 |
49484675 | GT-AG | 0 | 1.000000099473604e-05 | 80 | rna-XM_004674010.2 9129085 | 11 | 9725751 | 9725830 | Condylura cristata 143302 | CAG|GTAGGGCCGT...CATGGCTTCTCT/GAGCCTGTGAGG...GGCAG|GAT | 2 | 1 | 58.645 |
49484676 | GT-AG | 0 | 1.000000099473604e-05 | 1132 | rna-XM_004674010.2 9129085 | 12 | 9725959 | 9727090 | Condylura cristata 143302 | TGG|GTATGGGCCA...TGGCCTCTACCC/CCTCTACCCACA...CTCAG|ACA | 1 | 1 | 62.736 |
49484677 | GT-AG | 0 | 1.000000099473604e-05 | 1013 | rna-XM_004674010.2 9129085 | 13 | 9727295 | 9728307 | Condylura cristata 143302 | ATG|GTGAGGGGTC...TTCTTCTTGCCC/GAGTGGTTAAGA...CCCAG|GCG | 1 | 1 | 69.255 |
49484678 | GT-AG | 0 | 0.0001849006588472 | 1085 | rna-XM_004674010.2 9129085 | 14 | 9728464 | 9729548 | Condylura cristata 143302 | TGG|GTATGTCGCT...CTGACTCTGACC/CACTGACTGACT...CCCAG|GGA | 1 | 1 | 74.241 |
49484679 | GT-AG | 0 | 0.0168534773146395 | 792 | rna-XM_004674010.2 9129085 | 15 | 9729782 | 9730573 | Condylura cristata 143302 | CTG|GTATGCTGTG...CTCTGTTTGATG/CTCTGTTTGATG...CTCAG|GGG | 0 | 1 | 81.687 |
49484680 | GT-AG | 0 | 1.000000099473604e-05 | 7650 | rna-XM_004674010.2 9129085 | 16 | 9730655 | 9738304 | Condylura cristata 143302 | AAG|GTGAGGACAG...TTGTTCTGACCA/GTTGTTCTGACC...TCCAG|GTG | 0 | 1 | 84.276 |
49484681 | GT-AG | 0 | 1.000000099473604e-05 | 1157 | rna-XM_004674010.2 9129085 | 17 | 9738457 | 9739613 | Condylura cristata 143302 | CAG|GTAGAAGGGC...GAGGTCTTACCC/CTTTGTGTGATT...TGCAG|TGA | 2 | 1 | 89.134 |
49484682 | GT-AG | 0 | 4.56964993828104e-05 | 461 | rna-XM_004674010.2 9129085 | 18 | 9739793 | 9740253 | Condylura cristata 143302 | AAG|GTAGGCAGCA...AATGTTTTGACT/AATGTTTTGACT...TGCAG|ACT | 1 | 1 | 94.855 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);