introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
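In the rows listed on this page, `length` equals `end - start + 1`, which suggests that `start` and `end` are both inclusive. The following is a minimal Python sketch of that check, using values copied from the rows for ordinal_index 1 and 2. The helper name `intron_length` is hypothetical and not part of the database, and the inclusive convention is an inference from these rows rather than a documented guarantee.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, assuming start and end are both inclusive."""
    return end - start + 1

# (start, end, length) copied from the rows for ordinal_index 1 and 2
rows = [
    (65753611, 65754900, 1290),
    (65755294, 65762882, 7589),
]

# Collect any row whose stored length disagrees with the inclusive formula.
mismatches = [r for r in rows if intron_length(r[0], r[1]) != r[2]]
print(mismatches)  # an empty list means the convention holds for these rows
```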
33 rows where transcript_id = 6061949
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31221111 | GT-AG | 0 | 1.000000099473604e-05 | 1290 | rna-XM_008500634.2 6061949 | 1 | 65753611 | 65754900 | Calypte anna 9244 | CAG|GTAAGTGTCT...TTTTCTGGGATT/TCATGTGTAATC...TCCAG|AAG | 1 | 1 | 0.829 |
31221112 | GT-AG | 0 | 1.000000099473604e-05 | 7589 | rna-XM_008500634.2 6061949 | 2 | 65755294 | 65762882 | Calypte anna 9244 | AAG|GTGAGCTACT...TTGATTTTAATT/TTGATTTTAATT...ATTAG|ATC | 1 | 1 | 6.754 |
31221113 | GT-AG | 0 | 1.000000099473604e-05 | 5120 | rna-XM_008500634.2 6061949 | 3 | 65763122 | 65768241 | Calypte anna 9244 | GAG|GTAAGGCAGA...GCTCTCTGAGCT/GGCTCTCTGAGC...CGCAG|GCC | 0 | 1 | 10.357 |
31221114 | GT-AG | 0 | 1.000000099473604e-05 | 7518 | rna-XM_008500634.2 6061949 | 4 | 65768519 | 65776036 | Calypte anna 9244 | CAG|GTACGGGCGG...TTTTTTTTCCCT/TTTGTTTTTAAC...TTCAG|ATC | 1 | 1 | 14.533 |
31221115 | GT-AG | 0 | 1.000000099473604e-05 | 310 | rna-XM_008500634.2 6061949 | 5 | 65776316 | 65776625 | Calypte anna 9244 | CCG|GTAGGACTTC...TGTTTTTTATCC/GTGTTTTTTATC...AACAG|TGC | 1 | 1 | 18.74 |
31221116 | GT-AG | 0 | 0.0011471684323772 | 1518 | rna-XM_008500634.2 6061949 | 6 | 65776884 | 65778401 | Calypte anna 9244 | CAG|GTATGATTTC...TGGTTCTTATAT/TATATTTTCATC...TCAAG|TCC | 1 | 1 | 22.629 |
31221117 | GT-AG | 0 | 2.8718417642461937e-05 | 2344 | rna-XM_008500634.2 6061949 | 7 | 65778666 | 65781009 | Calypte anna 9244 | CAT|GTAAGTCTCC...TTACTCTTGGTT/TTCAGAATTACT...TTCAG|TTC | 1 | 1 | 26.609 |
31221118 | GT-AG | 0 | 1.000000099473604e-05 | 1770 | rna-XM_008500634.2 6061949 | 8 | 65781280 | 65783049 | Calypte anna 9244 | CAG|GTCAGCATCC...TGTTTATTACCA/TGTGTGTTTATT...TACAG|CTC | 1 | 1 | 30.68 |
31221119 | GT-AG | 0 | 0.0966366615115841 | 2300 | rna-XM_008500634.2 6061949 | 9 | 65783311 | 65785610 | Calypte anna 9244 | CAG|GTATCATGCT...TTTGTTTTATTT/TTTTGTTTTATT...CACAG|TGC | 1 | 1 | 34.615 |
31221120 | GT-AG | 0 | 1.000000099473604e-05 | 1941 | rna-XM_008500634.2 6061949 | 10 | 65785878 | 65787818 | Calypte anna 9244 | CAG|GTAAAAAGGA...TTCCTCTTTGTT/TAGGAATTGACT...CATAG|TTC | 1 | 1 | 38.64 |
31221121 | GT-AG | 0 | 1.000000099473604e-05 | 303 | rna-XM_008500634.2 6061949 | 11 | 65788083 | 65788385 | Calypte anna 9244 | CAG|GTAAATAGAG...GAATCTTTAATT/TGGTTTCTGAAT...TTCAG|TCC | 1 | 1 | 42.62 |
31221122 | GT-AG | 0 | 1.000000099473604e-05 | 933 | rna-XM_008500634.2 6061949 | 12 | 65788650 | 65789582 | Calypte anna 9244 | CAA|GTAAGTAAAC...GTTTCCTTTCTT/TAAGTACTGAGA...TTCAG|TGC | 1 | 1 | 46.6 |
31221123 | GT-AG | 0 | 1.000000099473604e-05 | 461 | rna-XM_008500634.2 6061949 | 13 | 65789847 | 65790307 | Calypte anna 9244 | CAA|GTAAGAAATA...TTTGTCTTCTTT/AAATTTGTAAGC...TCCAG|TTC | 1 | 1 | 50.58 |
31221124 | GT-AG | 0 | 0.1345884543410029 | 874 | rna-XM_008500634.2 6061949 | 14 | 65790572 | 65791445 | Calypte anna 9244 | CAA|GTATGTTTCT...TTGCCTTTATTG/CTTGCCTTTATT...TCCAG|TTC | 1 | 1 | 54.561 |
31221125 | GT-AG | 0 | 0.0003058129757116 | 626 | rna-XM_008500634.2 6061949 | 15 | 65791716 | 65792341 | Calypte anna 9244 | CAG|GTAACTGGAA...GATACCTTGATA/TTGATATTTAAA...TTCAG|TTC | 1 | 1 | 58.631 |
31221126 | GT-AG | 0 | 1.000000099473604e-05 | 829 | rna-XM_008500634.2 6061949 | 16 | 65792606 | 65793434 | Calypte anna 9244 | CAG|GTAAGAACAT...TAATCATTGCTT/GCAATAATCATT...TTCAG|TAC | 1 | 1 | 62.611 |
31221127 | GT-AG | 0 | 1.000000099473604e-05 | 383 | rna-XM_008500634.2 6061949 | 17 | 65793699 | 65794081 | Calypte anna 9244 | CAG|GTAAAAGACA...TTTCCCTTTCCC/AATGTGTTAAGA...AGTAG|CAC | 1 | 1 | 66.591 |
31221128 | GT-AG | 0 | 1.0983843209046045e-05 | 1153 | rna-XM_008500634.2 6061949 | 18 | 65794364 | 65795516 | Calypte anna 9244 | CAA|GTAAGTAACT...TGCACTTTAACT/TAACTTCTCAAT...TGTAG|AAC | 1 | 1 | 70.843 |
31221129 | GT-AG | 0 | 6.772130573920617e-05 | 1823 | rna-XM_008500634.2 6061949 | 19 | 65795796 | 65797618 | Calypte anna 9244 | ACC|GTAAGTGGAA...TTTTTCTTAATT/TTTTTCTTAATT...CTCAG|GTC | 1 | 1 | 75.049 |
31221130 | GT-AG | 0 | 1.000000099473604e-05 | 1259 | rna-XM_008500634.2 6061949 | 20 | 65797769 | 65799027 | Calypte anna 9244 | ATG|GTAAGTGTCA...CTGTCTTTCTCT/ATGGCTATGATG...TGTAG|GTA | 1 | 1 | 77.31 |
31221131 | GT-AG | 0 | 1.000000099473604e-05 | 897 | rna-XM_008500634.2 6061949 | 21 | 65799272 | 65800168 | Calypte anna 9244 | CAG|GTTAGATTTA...CAACCTCTGACT/TTTGTTATGACA...TCTAG|GAT | 2 | 1 | 80.989 |
31221132 | GT-AG | 0 | 1.000000099473604e-05 | 518 | rna-XM_008500634.2 6061949 | 22 | 65800276 | 65800793 | Calypte anna 9244 | CAG|GTTAGTTTCA...TTCTTCATATCT/TGATTCTTCATA...TCTAG|AGC | 1 | 1 | 82.602 |
31221133 | GT-AG | 0 | 1.000000099473604e-05 | 297 | rna-XM_008500634.2 6061949 | 23 | 65800894 | 65801190 | Calypte anna 9244 | TAG|GTAAGTTGAG...CCTTCCTTTTTA/GTAAAACTGAAT...TCCAG|CAA | 2 | 1 | 84.11 |
31221134 | GT-AG | 0 | 0.0002320305561403 | 117 | rna-XM_008500634.2 6061949 | 24 | 65801275 | 65801391 | Calypte anna 9244 | AGG|GTAACATCAG...TTTCTTTTGAAA/CATTTGGTCACC...TCTAG|GAA | 2 | 1 | 85.376 |
31221135 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_008500634.2 6061949 | 25 | 65801410 | 65801513 | Calypte anna 9244 | CAG|GTAAAAAGAA...TCCTCCTTGAAT/CTGTGATTCATC...TTTAG|TCC | 2 | 1 | 85.648 |
31221136 | GT-AG | 0 | 1.000000099473604e-05 | 451 | rna-XM_008500634.2 6061949 | 26 | 65801596 | 65802046 | Calypte anna 9244 | GAG|GTGGGATCCT...CATCCTTTAATT/ATTATGTTCACC...TGCAG|GAC | 0 | 1 | 86.884 |
31221137 | GT-AG | 0 | 5.8406228848938615e-05 | 1179 | rna-XM_008500634.2 6061949 | 27 | 65802138 | 65803316 | Calypte anna 9244 | CCT|GTGAGTTTTA...CCTCCTTTAATT/CTTTAATTGATT...TCTAG|ATG | 1 | 1 | 88.256 |
31221138 | GT-AG | 0 | 1.000000099473604e-05 | 811 | rna-XM_008500634.2 6061949 | 28 | 65803394 | 65804204 | Calypte anna 9244 | CCA|GTAAGTGACT...GTTTTCTTTACT/GTTTTCTTTACT...TACAG|GGC | 0 | 1 | 89.417 |
31221139 | GT-AG | 0 | 1.000000099473604e-05 | 1050 | rna-XM_008500634.2 6061949 | 29 | 65804340 | 65805389 | Calypte anna 9244 | CGG|GTAAGCAGTG...GGGGCTTTGTTT/CAGTTACTAAAT...TGCAG|GTA | 0 | 1 | 91.452 |
31221140 | GT-AG | 0 | 0.8499971681295558 | 255 | rna-XM_008500634.2 6061949 | 30 | 65805513 | 65805767 | Calypte anna 9244 | AGT|GTATGTTTGT...ACTTTTTTAACT/ACTTTTTTAACT...AACAG|GAA | 0 | 1 | 93.306 |
31221141 | GT-AG | 0 | 5.050073288571326e-05 | 1256 | rna-XM_008500634.2 6061949 | 31 | 65805932 | 65807187 | Calypte anna 9244 | CAG|GTATGTGCAG...CTCCTTTTAAAT/AATGGTTTAATC...TCCAG|TGC | 2 | 1 | 95.779 |
31221142 | GT-AG | 0 | 0.0006652941981152 | 4007 | rna-XM_008500634.2 6061949 | 32 | 65807324 | 65811330 | Calypte anna 9244 | GAG|GTAAGCTTTG...ATGACTTTAAAA/CCTTGGATGACT...TTCAG|TGT | 0 | 1 | 97.829 |
31221143 | GT-AG | 0 | 1.000000099473604e-05 | 1487 | rna-XM_008500634.2 6061949 | 33 | 65811452 | 65812938 | Calypte anna 9244 | GAG|GTAAATGCAA...TTTTTCTAATCT/CTTTTTCTAATC...TTCAG|ATG | 1 | 1 | 99.653 |
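The rows above use ` | ` (with spaces) between cells, while the `scored_motifs` values contain unspaced `|` characters, and the two foreign-key cells are rendered as a label followed by the numeric id. A rough Python sketch of a parser for one rendered row follows. The names `COLUMNS` and `parse_row` are hypothetical, and the handling of an empty phase cell is an assumption, since no such row appears on this page.

```python
COLUMNS = [
    "id", "dinucleotide_pair", "is_minor", "score", "length",
    "transcript_id", "ordinal_index", "start", "end", "taxonomy_id",
    "scored_motifs", "phase", "in_cds", "relative_position",
]

def parse_row(line: str) -> dict:
    """Split one rendered table row into a dict keyed by column name."""
    # Drop the trailing cell delimiter, then split on the spaced delimiter
    # only, so the unspaced "|" inside scored_motifs is preserved.
    cells = line.strip().rstrip("|").strip().split(" | ")
    if len(cells) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} cells, got {len(cells)}")
    row = dict(zip(COLUMNS, cells))
    # Foreign-key cells are rendered as "<label> <id>"; keep both parts.
    row["transcript_label"], row["transcript_id"] = row["transcript_id"].rsplit(" ", 1)
    row["species"], row["taxonomy_id"] = row["taxonomy_id"].rsplit(" ", 1)
    for key in ("id", "is_minor", "length", "transcript_id", "ordinal_index",
                "start", "end", "taxonomy_id", "in_cds"):
        row[key] = int(row[key])
    # Assumption: a null phase is rendered as an empty cell.
    row["phase"] = int(row["phase"]) if row["phase"] else None
    row["score"] = float(row["score"])
    row["relative_position"] = float(row["relative_position"])
    return row

example = ("31221111 | GT-AG | 0 | 1.000000099473604e-05 | 1290 | "
           "rna-XM_008500634.2 6061949 | 1 | 65753611 | 65754900 | "
           "Calypte anna 9244 | "
           "CAG|GTAAGTGTCT...TTTTCTGGGATT/TCATGTGTAATC...TCCAG|AAG | 1 | 1 | 0.829 |")
row = parse_row(example)
print(row["species"], row["transcript_label"], row["length"])
```

Splitting on the spaced delimiter is a convenience for this rendering only; the JSON or CSV export would be a more robust source for programmatic use.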
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
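As an illustration of how the schema above might be queried, here is a minimal sketch using Python's standard `sqlite3` module against an in-memory database. It recreates the table and the `transcript_id` index, loads two rows copied from this page, and runs the same filter this page applies (`transcript_id = 6061949`). The foreign keys are left out because the `transcripts` and `genomes` tables are not recreated here; the in-memory setup is an assumption for demonstration and not how the database is distributed.

```python
import sqlite3

# Schema copied from the page, minus the foreign keys (assumption: the
# referenced tables are not needed for this standalone demonstration).
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
  "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Two rows copied from the table on this page (scored_motifs omitted).
conn.executemany(
    'INSERT INTO introns (id, dinucleotide_pair, is_minor, score, length, '
    'transcript_id, ordinal_index, "start", "end", taxonomy_id, phase, in_cds, '
    'relative_position) VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)',
    [
        (31221111, "GT-AG", 0, 1.000000099473604e-05, 1290, 6061949, 1,
         65753611, 65754900, 9244, 1, 1, 0.829),
        (31221112, "GT-AG", 0, 1.000000099473604e-05, 7589, 6061949, 2,
         65755294, 65762882, 9244, 1, 1, 6.754),
    ],
)

# "end" is an SQL keyword, so it is quoted; "start" is quoted for symmetry.
result = conn.execute(
    'SELECT ordinal_index, "start", "end", length FROM introns '
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (6061949,),
).fetchall()
print(result)
```

Passing the transcript id as a bound parameter rather than formatting it into the SQL string keeps the query safe to reuse with other ids.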