introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
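The column list above maps naturally onto a typed record. The following Python sketch is one possible way to model a row, not part of the database itself: the class name `Intron` and the helper `from_row` are hypothetical, and it assumes rows arrive as 14-element tuples in the column order listed above. The 0/1 flag columns are converted to booleans, and `phase` is optional because it is null for introns outside the coding sequence.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical record type mirroring the columns of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: bool
    score: float                 # probability (0-100%) of being a minor intron
    length: int                  # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]         # 0, 1, 2, or None outside the coding sequence
    in_cds: bool
    relative_position: float

    @classmethod
    def from_row(cls, row: tuple) -> "Intron":
        """Build an Intron from a tuple in table column order."""
        values = list(row)
        values[2] = bool(values[2])    # is_minor stored as 0/1
        values[12] = bool(values[12])  # in_cds stored as 0/1
        return cls(*values)


# First row of the table for transcript_id = 6061951.
intron = Intron.from_row((
    31221191, "AT-AC", 1, 99.9999999996239, 69387, 6061951, 1,
    51540524, 51609910, 9244,
    "CCC|ATATCCTTCC...TTTTCCTTGACT/TTTTCCTTGACT...TTCAC|CTG",
    2, 1, 4.263,
))
print(intron.is_minor, intron.phase, intron.length)
```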
35 rows where transcript_id = 6061951
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31221191 | AT-AC | 1 | 99.9999999996239 | 69387 | rna-XM_030447826.1 6061951 | 1 | 51540524 | 51609910 | Calypte anna 9244 | CCC|ATATCCTTCC...TTTTCCTTGACT/TTTTCCTTGACT...TTCAC|CTG | 2 | 1 | 4.263 |
31221192 | GT-AG | 0 | 1.000000099473604e-05 | 995 | rna-XM_030447826.1 6061951 | 2 | 51539417 | 51540411 | Calypte anna 9244 | CAG|GTAAGCAGAC...TGCTCTTTGGTT/TGGTTTCTGATT...GGCAG|GTA | 0 | 1 | 5.963 |
31221193 | GT-AG | 0 | 1.000000099473604e-05 | 18239 | rna-XM_030447826.1 6061951 | 3 | 51521044 | 51539282 | Calypte anna 9244 | TGG|GTAGGTCAAT...ATTTTCTTTTTT/TTGGTTTAGATT...TGCAG|GAT | 2 | 1 | 7.996 |
31221194 | GT-AG | 0 | 1.1189676915913084e-05 | 17187 | rna-XM_030447826.1 6061951 | 4 | 51503759 | 51520945 | Calypte anna 9244 | CTA|GTAAGTCATA...TTTTCTCTATCT/CTCTATCTCACA...TGCAG|GCA | 1 | 1 | 9.483 |
31221195 | GT-AG | 0 | 1.000000099473604e-05 | 2185 | rna-XM_030447826.1 6061951 | 5 | 51501414 | 51503598 | Calypte anna 9244 | AAT|GTGAGTGTCT...TGGGTTTTATCC/GTGGGTTTTATC...TGCAG|ACA | 2 | 1 | 11.91 |
31221196 | GT-AG | 0 | 1.000000099473604e-05 | 767 | rna-XM_030447826.1 6061951 | 6 | 51500331 | 51501097 | Calypte anna 9244 | CAG|GTAAGACCAT...TTTCTCTTACTG/GTTTCTCTTACT...TGCAG|GTG | 0 | 1 | 16.705 |
31221197 | GT-AG | 0 | 3.0023461049687174e-05 | 5803 | rna-XM_030447826.1 6061951 | 7 | 51494435 | 51500237 | Calypte anna 9244 | ATT|GTAAGTATAA...TTCTCCTTTTTT/CTGACACTGACT...ACTAG|GTG | 0 | 1 | 18.116 |
31221198 | GT-AG | 0 | 1.000000099473604e-05 | 1393 | rna-XM_030447826.1 6061951 | 8 | 51492735 | 51494127 | Calypte anna 9244 | ACA|GTAAGGATTT...TTTCCTTTAAGT/AGTGTGATCATT...CCCAG|GGC | 1 | 1 | 22.773 |
31221199 | GT-AG | 0 | 1.000000099473604e-05 | 8706 | rna-XM_030447826.1 6061951 | 9 | 51483628 | 51492333 | Calypte anna 9244 | CAG|GTAAGGCACC...TTTCTCATAATT/CTATTTCTCATA...TCCAG|CCA | 0 | 1 | 28.858 |
31221200 | GT-AG | 0 | 1.000000099473604e-05 | 621 | rna-XM_030447826.1 6061951 | 10 | 51482855 | 51483475 | Calypte anna 9244 | CAG|GTGACAAGGG...TTGTACTTGATG/TTGTACTTGATG...CACAG|CAT | 2 | 1 | 31.164 |
31221201 | GT-AG | 0 | 1.000000099473604e-05 | 671 | rna-XM_030447826.1 6061951 | 11 | 51481998 | 51482668 | Calypte anna 9244 | CAG|GTGAGAAGGC...TTATATTTGACA/TTATATTTGACA...AACAG|CAT | 2 | 1 | 33.986 |
31221202 | GT-AG | 0 | 1.000000099473604e-05 | 340 | rna-XM_030447826.1 6061951 | 12 | 51481540 | 51481879 | Calypte anna 9244 | CAG|GTGAGTCCAC...TCTTCCTTCACA/TCTTCCTTCACA...TGCAG|ATC | 0 | 1 | 35.776 |
31221203 | GT-AG | 0 | 1.000000099473604e-05 | 445 | rna-XM_030447826.1 6061951 | 13 | 51480939 | 51481383 | Calypte anna 9244 | GAG|GTGAGTTGTA...AACTTCTAAATT/GATCTTTTCAGT...TCCAG|GGT | 0 | 1 | 38.143 |
31221204 | GT-AG | 0 | 1.000000099473604e-05 | 1911 | rna-XM_030447826.1 6061951 | 14 | 51478937 | 51480847 | Calypte anna 9244 | CAG|GTAAAATAGC...TTCTTCTTCTTT/TCTTTCTGCATC...CCCAG|ATC | 1 | 1 | 39.524 |
31221205 | GT-AG | 0 | 1.000000099473604e-05 | 1157 | rna-XM_030447826.1 6061951 | 15 | 51477565 | 51478721 | Calypte anna 9244 | TGG|GTGAGTGGAA...GAGTCCTTATCC/TGAGTCCTTATC...TACAG|TCC | 0 | 1 | 42.786 |
31221206 | GT-AG | 0 | 1.000000099473604e-05 | 992 | rna-XM_030447826.1 6061951 | 16 | 51476156 | 51477147 | Calypte anna 9244 | TAT|GTGAGTACTT...TCTGCTGTATCC/AGGAAGGTAACC...TCCAG|ACC | 0 | 1 | 49.112 |
31221207 | GT-AG | 0 | 1.000000099473604e-05 | 912 | rna-XM_030447826.1 6061951 | 17 | 51475143 | 51476054 | Calypte anna 9244 | CAG|GTTTGTGCTG...TAGCTCCTAACT/TAGCTCCTAACT...TTCAG|GTT | 2 | 1 | 50.645 |
31221208 | GT-AG | 0 | 1.000000099473604e-05 | 766 | rna-XM_030447826.1 6061951 | 18 | 51474253 | 51475018 | Calypte anna 9244 | ATG|GTGAGCATCA...TTTTTTTTTTCT/ATGCATTTAAGG...TCTAG|GAG | 0 | 1 | 52.526 |
31221209 | GT-AG | 0 | 1.000000099473604e-05 | 919 | rna-XM_030447826.1 6061951 | 19 | 51473265 | 51474183 | Calypte anna 9244 | AAG|GTAAGTTGCT...TCTCCCTTCTCT/GAGCCATTTAGC...TATAG|GTT | 0 | 1 | 53.573 |
31221210 | GT-AG | 0 | 1.000000099473604e-05 | 343 | rna-XM_030447826.1 6061951 | 20 | 51472737 | 51473079 | Calypte anna 9244 | CCG|GTCAGAACCC...CCTTCCTTCCCT/TCCTCCGTCACT...TCCAG|AGT | 2 | 1 | 56.38 |
31221211 | GT-AG | 0 | 1.000000099473604e-05 | 1105 | rna-XM_030447826.1 6061951 | 21 | 51471505 | 51472609 | Calypte anna 9244 | CAG|GTAGGGTTGT...GGTCACTCCCCT/AAGGGGCTGATG...TGCAG|CTC | 0 | 1 | 58.307 |
31221212 | GT-AG | 0 | 1.000000099473604e-05 | 467 | rna-XM_030447826.1 6061951 | 22 | 51470912 | 51471378 | Calypte anna 9244 | CAG|GTGAGTGTGT...CGTGCTTTGTAT/CTGTTCCACATC...TGCAG|GCT | 0 | 1 | 60.218 |
31221213 | GT-AG | 0 | 1.000000099473604e-05 | 982 | rna-XM_030447826.1 6061951 | 23 | 51469840 | 51470821 | Calypte anna 9244 | CAG|GTAAAAATAC...CTGTTCTTCCCC/ATCCAACTGATT...TCCAG|CCA | 0 | 1 | 61.584 |
31221214 | GT-AG | 0 | 1.000000099473604e-05 | 1097 | rna-XM_030447826.1 6061951 | 24 | 51468550 | 51469646 | Calypte anna 9244 | GGA|GTAAGGCACA...TTCCTCCTAACT/TTCCTCCTAACT...TGAAG|AGG | 1 | 1 | 64.512 |
31221215 | GT-AG | 0 | 1.000000099473604e-05 | 1604 | rna-XM_030447826.1 6061951 | 25 | 51466794 | 51468397 | Calypte anna 9244 | GTG|GTGAGTGAGG...ATTTTTTTACTC/AATTTTTTTACT...GGCAG|TCC | 0 | 1 | 66.818 |
31221216 | GT-AG | 0 | 1.000000099473604e-05 | 583 | rna-XM_030447826.1 6061951 | 26 | 51466101 | 51466683 | Calypte anna 9244 | CAG|GTGAGCAGAG...CTTCTTTTATTC/TCTTCTTTTATT...TTAAG|GTG | 2 | 1 | 68.487 |
31221217 | GT-AG | 0 | 1.000000099473604e-05 | 742 | rna-XM_030447826.1 6061951 | 27 | 51465225 | 51465966 | Calypte anna 9244 | GGG|GTGAGAGAGA...ACCTCCTGAGCT/CTGAGTCTGAGT...GGCAG|TCC | 1 | 1 | 70.52 |
31221218 | GT-AG | 0 | 1.000000099473604e-05 | 3646 | rna-XM_030447826.1 6061951 | 28 | 51461508 | 51465153 | Calypte anna 9244 | CAG|GTAAAACAAA...ATGCCATTTGCT/GGTAGACTGACC...TTCAG|GTG | 0 | 1 | 71.598 |
31221219 | GT-AG | 0 | 1.000000099473604e-05 | 577 | rna-XM_030447826.1 6061951 | 29 | 51460852 | 51461428 | Calypte anna 9244 | TAG|GTAAGTCTCA...CCTCCCCTGATA/CCTCCCCTGATA...TTCAG|TGT | 1 | 1 | 72.796 |
31221220 | GT-AG | 0 | 0.0003955079490709 | 718 | rna-XM_030447826.1 6061951 | 30 | 51460012 | 51460729 | Calypte anna 9244 | AAG|GTAACTGGGC...TAGTCCTTACTT/CCTTCCATCATT...TCAAG|GAT | 0 | 1 | 74.647 |
31221221 | GT-AG | 0 | 1.000000099473604e-05 | 631 | rna-XM_030447826.1 6061951 | 31 | 51459042 | 51459672 | Calypte anna 9244 | CAG|GTGAGTACAC...ACATTCTTAAAA/CTTAAAATGATC...CCTAG|GAG | 0 | 1 | 79.791 |
31221222 | GT-AG | 0 | 0.021481574350557 | 3309 | rna-XM_030447826.1 6061951 | 32 | 51455586 | 51458894 | Calypte anna 9244 | GAG|GTAACCTGCG...AGTTCCTTTCTT/CCAGGCATGACC...TGCAG|GTC | 0 | 1 | 82.021 |
31221223 | GT-AG | 0 | 1.000000099473604e-05 | 410 | rna-XM_030447826.1 6061951 | 33 | 51455059 | 51455468 | Calypte anna 9244 | AAG|GTTAGCCCCA...TTGTCCTCATTT/TTTGTCCTCATT...TCCAG|CAG | 0 | 1 | 83.796 |
31221224 | GT-AG | 0 | 1.000000099473604e-05 | 720 | rna-XM_030447826.1 6061951 | 34 | 51454170 | 51454889 | Calypte anna 9244 | AAG|GTGAGCATAG...CATTTCTTTTTG/GTTCTCCCCATT...GTCAG|TTC | 1 | 1 | 86.36 |
31221225 | GT-AG | 0 | 1.000000099473604e-05 | 1792 | rna-XM_030447826.1 6061951 | 35 | 51452181 | 51453972 | Calypte anna 9244 | CAG|GTGAGGACCT...GTGCTTTTATTT/TTTATTTTCACA...CCCAG|GCC | 0 | 1 | 89.349 |
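
In the rows above, the coordinates appear to be inclusive: for the sampled rows, `length` equals `end - start + 1`. The Python sketch below checks this for three rows copied from the table; the helper name `inclusive_length` is illustrative. It also checks that `start` decreases as `ordinal_index` increases, which would be consistent with a transcript on the reverse strand. The table has no strand column, so that reading is an inference rather than a documented fact.

```python
# (ordinal_index, start, end, length) copied from the first three table rows.
rows = [
    (1, 51540524, 51609910, 69387),
    (2, 51539417, 51540411, 995),
    (3, 51521044, 51539282, 18239),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


# True if every sampled row satisfies length == end - start + 1.
lengths_match = all(inclusive_length(s, e) == n for _, s, e, n in rows)

# True if start strictly decreases from one intron to the next.
starts_descending = all(a[1] > b[1] for a, b in zip(rows, rows[1:]))

print(lengths_match, starts_descending)  # True True
```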
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
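
As a rough illustration of how this schema can be queried, the Python sketch below builds a reduced in-memory copy of the table with the standard library `sqlite3` module and selects the minor introns of one transcript. It is a simplification under stated assumptions: only seven of the fourteen columns are created, the foreign keys are omitted because the `transcripts` and `genomes` tables are not available here, and the three sample rows are copied from the table for `transcript_id = 6061951`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the introns schema (assumption: foreign keys and
# the remaining columns are left out to keep the sketch self-contained).
conn.execute("""
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
""")
conn.execute(
    "CREATE INDEX idx_introns_transcript_id ON introns (transcript_id)"
)

# Three rows copied from the table for transcript_id = 6061951.
sample = [
    (31221191, "AT-AC", 1, 99.9999999996239, 69387, 6061951, 1),
    (31221192, "GT-AG", 0, 1.000000099473604e-05, 995, 6061951, 2),
    (31221193, "GT-AG", 0, 1.000000099473604e-05, 18239, 6061951, 3),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", sample)

# Minor introns of one transcript, in transcript order.
minor = conn.execute(
    "SELECT id, dinucleotide_pair, ordinal_index FROM introns "
    "WHERE transcript_id = ? AND is_minor = 1 ORDER BY ordinal_index",
    (6061951,),
).fetchall()

print(minor)  # [(31221191, 'AT-AC', 1)]
```

The `WHERE transcript_id = ?` filter is the same one behind the 35-row view shown above, and the index on `transcript_id` in the schema is what keeps that lookup cheap on the full table.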