introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
34 rows where transcript_id = 6061974
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31221955 | GT-AG | 0 | 1.000000099473604e-05 | 143522 | rna-XM_030451268.1 6061974 | 1 | 109714189 | 109857710 | Calypte anna 9244 | CAG|GTAGGATGTA...GTAACCAGGATG/TGAAAGGTAACC...TCTAG|AGC | 1 | 1 | 1.829 |
31221956 | GT-AG | 0 | 0.242615945551362 | 51868 | rna-XM_030451268.1 6061974 | 2 | 109662314 | 109714181 | Calypte anna 9244 | TAG|GTCTCTTTAA...GATATTTTAAAC/GATATTTTAAAC...TCTAG|ATT | 2 | 1 | 1.953 |
31221957 | GT-AG | 0 | 3.6004126853207974e-05 | 5588 | rna-XM_030451268.1 6061974 | 3 | 109656586 | 109662173 | Calypte anna 9244 | CAG|GTGCTCCTTT...AGGATTTTAATA/AGGATTTTAATA...TTCAG|ATC | 1 | 1 | 4.44 |
31221958 | GT-AG | 0 | 7.373118428633911e-05 | 2231 | rna-XM_030451268.1 6061974 | 4 | 109654076 | 109656306 | Calypte anna 9244 | AAC|GTAAGTCCTG...TCTCCCTTATTT/CTTATTTTGATT...TTCAG|AAC | 1 | 1 | 9.394 |
31221959 | GT-AG | 0 | 1.000000099473604e-05 | 5547 | rna-XM_030451268.1 6061974 | 5 | 109648253 | 109653799 | Calypte anna 9244 | AAG|GTCAGTTACT...TTTTCTTTCCCT/ACATTTTTTACA...GGCAG|ATG | 1 | 1 | 14.296 |
31221960 | GT-AG | 0 | 1.000000099473604e-05 | 811 | rna-XM_030451268.1 6061974 | 6 | 109647145 | 109647955 | Calypte anna 9244 | GAG|GTGCTTGTCA...TTCTTTTTACCT/TTTCTTTTTACC...TCTAG|GGC | 1 | 1 | 19.57 |
31221961 | GT-AG | 0 | 1.000000099473604e-05 | 12480 | rna-XM_030451268.1 6061974 | 7 | 109634389 | 109646868 | Calypte anna 9244 | AAG|GTGAGGCCAA...CTTTCCTTTTTT/AGCACTGTAACT...TTCAG|TTC | 1 | 1 | 24.472 |
31221962 | GT-AG | 0 | 1.000000099473604e-05 | 6577 | rna-XM_030451268.1 6061974 | 8 | 109627533 | 109634109 | Calypte anna 9244 | GAG|GTAAGTAGAT...GCTGCTTTTGCT/CGAATTCTCATG...CACAG|TGC | 1 | 1 | 29.426 |
31221963 | GT-AG | 0 | 1.000000099473604e-05 | 3698 | rna-XM_030451268.1 6061974 | 9 | 109623715 | 109627412 | Calypte anna 9244 | AAG|GTAGGATTTC...TCTTTCGTAACT/TCGTAACTAACC...TTTAG|GTG | 1 | 1 | 31.557 |
31221964 | GT-AG | 0 | 1.000000099473604e-05 | 32683 | rna-XM_030451268.1 6061974 | 10 | 109590858 | 109623540 | Calypte anna 9244 | AAA|GTAAGAACTA...TTTGTTTTGTCT/AATCAGCTCACC...TTTAG|TTC | 1 | 1 | 34.647 |
31221965 | GT-AG | 0 | 1.000000099473604e-05 | 3020 | rna-XM_030451268.1 6061974 | 11 | 109587641 | 109590660 | Calypte anna 9244 | CAG|GTAAGAAAAC...CACTTCTTTTCA/CTTCTTTTCAAA...TGTAG|ATT | 0 | 1 | 38.146 |
31221966 | GT-AG | 0 | 1.000000099473604e-05 | 1402 | rna-XM_030451268.1 6061974 | 12 | 109586142 | 109587543 | Calypte anna 9244 | AAG|GTAGGAAGAG...TTTCTCCTGACT/TTTCTCCTGACT...CTCAG|AGC | 1 | 1 | 39.869 |
31221967 | GT-AG | 0 | 1.000000099473604e-05 | 6316 | rna-XM_030451268.1 6061974 | 13 | 109579697 | 109586012 | Calypte anna 9244 | CAG|GTAAGAAGTA...TCTGTTTTATTT/TTTTATTTCAAT...AACAG|ACT | 1 | 1 | 42.159 |
31221968 | GT-AG | 0 | 0.0297705259419692 | 5220 | rna-XM_030451268.1 6061974 | 14 | 109574309 | 109579528 | Calypte anna 9244 | CAG|GTATGCTCTG...CCAGTTTTGATC/CCAGTTTTGATC...TACAG|CTC | 1 | 1 | 45.143 |
31221969 | GT-AG | 0 | 1.000000099473604e-05 | 6198 | rna-XM_030451268.1 6061974 | 15 | 109568040 | 109574237 | Calypte anna 9244 | AAG|GTAAATACAC...GTGTACTTAATG/TAATCTTTCATT...CCTAG|GCC | 0 | 1 | 46.404 |
31221970 | GT-AG | 0 | 1.000000099473604e-05 | 1439 | rna-XM_030451268.1 6061974 | 16 | 109566360 | 109567798 | Calypte anna 9244 | ATG|GTAGGTCTTA...TAAACATTGAAT/ATGGTATTCAAG...TCCAG|TAC | 1 | 1 | 50.684 |
31221971 | GT-AG | 0 | 1.000000099473604e-05 | 1731 | rna-XM_030451268.1 6061974 | 17 | 109564482 | 109566212 | Calypte anna 9244 | GAG|GTGAGAGGAA...ATTTCCTTTTCA/TTCCTTTTCATT...CATAG|AGC | 1 | 1 | 53.294 |
31221972 | GT-AG | 0 | 2.8951710828545698e-05 | 8483 | rna-XM_030451268.1 6061974 | 18 | 109555843 | 109564325 | Calypte anna 9244 | ATG|GTAAGCAGTA...CACATCTTATTT/ACACATCTTATT...TCCAG|TTC | 1 | 1 | 56.065 |
31221973 | GT-AG | 0 | 1.005991425889186e-05 | 2012 | rna-XM_030451268.1 6061974 | 19 | 109553697 | 109555708 | Calypte anna 9244 | ACA|GTAAGTCCCA...AGGTGCTTGAAT/TGAATGCTAACA...TATAG|CCT | 0 | 1 | 58.444 |
31221974 | GT-AG | 0 | 3.6540020138690655e-05 | 14569 | rna-XM_030451268.1 6061974 | 20 | 109539110 | 109553678 | Calypte anna 9244 | GAA|GTTCTTGTAT...CTTATTTTAGTC/TCTTATTTTAGT...GCGAG|TTT | 0 | 1 | 58.764 |
31221975 | GT-AG | 0 | 1.000000099473604e-05 | 5343 | rna-XM_030451268.1 6061974 | 21 | 109533625 | 109538967 | Calypte anna 9244 | AAG|GTGTGGTGAC...TCTTCCTTGCTT/ATTGATTTCACT...CTCAG|CTC | 1 | 1 | 61.286 |
31221976 | GT-AG | 0 | 0.0013564944168067 | 444 | rna-XM_030451268.1 6061974 | 22 | 109533063 | 109533506 | Calypte anna 9244 | TAG|GTAACCAGCA...TGGTCAATAATT/TGGTCAATAATT...GTAAG|CAA | 2 | 1 | 63.381 |
31221977 | GT-AG | 0 | 1.000000099473604e-05 | 370 | rna-XM_030451268.1 6061974 | 23 | 109532678 | 109533047 | Calypte anna 9244 | AGT|GTTATGAAAC...ACTTCTTTAATG/ACTTCTTTAATG...CCCAG|TCT | 2 | 1 | 63.648 |
31221978 | GT-AG | 0 | 1.000000099473604e-05 | 2732 | rna-XM_030451268.1 6061974 | 24 | 109529797 | 109532528 | Calypte anna 9244 | AAG|GTAAGATTAT...CTATTTTTAACT/CTATTTTTAACT...TCTAG|TTC | 1 | 1 | 66.294 |
31221979 | GT-AG | 0 | 1.000000099473604e-05 | 556 | rna-XM_030451268.1 6061974 | 25 | 109529142 | 109529697 | Calypte anna 9244 | GAG|GTAATATAAC...TTATCCTTGCCA/GAATTTGTGATT...TTCAG|GTT | 1 | 1 | 68.052 |
31221980 | GT-AG | 0 | 1.000000099473604e-05 | 2246 | rna-XM_030451268.1 6061974 | 26 | 109526707 | 109528952 | Calypte anna 9244 | AAG|GTAAGTTGAT...TTTCACTTGAAA/AAATGTCTCACT...TCTAG|AGC | 1 | 1 | 71.408 |
31221981 | GT-AG | 0 | 3.356714084342442e-05 | 5480 | rna-XM_030451268.1 6061974 | 27 | 109520936 | 109526415 | Calypte anna 9244 | GCA|GTAAGTATAT...TTTGTCTTCATT/TTTGTCTTCATT...TCCAG|GTA | 1 | 1 | 76.576 |
31221982 | GT-AG | 0 | 1.000000099473604e-05 | 6410 | rna-XM_030451268.1 6061974 | 28 | 109514349 | 109520758 | Calypte anna 9244 | GAG|GTAAGATTAG...TCATTTGTAATG/TAGATTTTCATT...TCTAG|ATG | 1 | 1 | 79.719 |
31221983 | GT-AG | 0 | 1.000000099473604e-05 | 5147 | rna-XM_030451268.1 6061974 | 29 | 109509171 | 109514317 | Calypte anna 9244 | GAG|GTAAGAATAG...CTTCTCTTGTTT/TATCTTTTCATT...TTTAG|CAA | 2 | 1 | 80.27 |
31221984 | GT-AG | 0 | 0.0002894451533414 | 2352 | rna-XM_030451268.1 6061974 | 30 | 109506703 | 109509054 | Calypte anna 9244 | TTG|GTAAACTCTT...GCTGTCTGAAAA/AGCTGTCTGAAA...TGAAG|GGG | 1 | 1 | 82.33 |
31221985 | GT-AG | 0 | 1.000000099473604e-05 | 383 | rna-XM_030451268.1 6061974 | 31 | 109506317 | 109506699 | Calypte anna 9244 | GGG|GTGATGGCAG...TTTTTTTTAACT/TTGTTACTCATT...TCCAG|ATG | 1 | 1 | 82.383 |
31221986 | GT-AG | 0 | 0.0001289232050663 | 3867 | rna-XM_030451268.1 6061974 | 32 | 109502300 | 109506166 | Calypte anna 9244 | CAA|GTAAGCATCT...TCTCTGTTATCT/CTCTCTGTTATC...CACAG|ATC | 1 | 1 | 85.047 |
31221987 | GT-AG | 0 | 1.000000099473604e-05 | 3766 | rna-XM_030451268.1 6061974 | 33 | 109498336 | 109502101 | Calypte anna 9244 | CAG|GTACGGAGCT...TGTTTCCTGATG/TGTTTCCTGATG...TTCAG|ATC | 1 | 1 | 88.563 |
31221988 | GT-AG | 0 | 1.000000099473604e-05 | 18928 | rna-XM_030451268.1 6061974 | 34 | 109479105 | 109498032 | Calypte anna 9244 | CAG|GTGAGACTTT...CTTTTTTTCACT/CTTTTTTTCACT...TTCAG|GTG | 1 | 1 | 93.944 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);