introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
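The column descriptions above imply a few simple value constraints. A minimal sketch of a row check follows; the function name `check_intron_row` and the dict-based row layout are assumptions made for illustration and are not part of the database.

```python
def check_intron_row(row: dict) -> list:
    """Return a list of problems found in one introns row (empty if none).

    Hypothetical helper: it only encodes the constraints stated in the
    column descriptions, nothing more.
    """
    problems = []
    if row["is_minor"] not in (0, 1):
        problems.append("is_minor must be 0 or 1")
    if not 0.0 <= row["score"] <= 100.0:
        problems.append("score must lie between 0 and 100 (percent)")
    if row["ordinal_index"] < 1:
        problems.append("ordinal_index counts introns from 1")
    if row["in_cds"] not in (0, 1):
        problems.append("in_cds must be 0 or 1")
    if row["phase"] not in (0, 1, 2, None):
        problems.append("phase must be 0, 1, 2 or null")
    if row["in_cds"] == 0 and row["phase"] is not None:
        problems.append("phase should be null outside the coding sequence")
    return problems


# Values taken from the row with id 31220344 in the table for this transcript.
example = {
    "id": 31220344,
    "is_minor": 0,
    "score": 1.000000099473604e-05,
    "ordinal_index": 2,
    "phase": 1,
    "in_cds": 1,
}
problems = check_intron_row(example)
```

An empty `problems` list means the row agrees with the descriptions; a row with `in_cds = 0` and a non-null phase would be flagged.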
28 rows where transcript_id = 6061929
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31220344 | GT-AG | 0 | 1.000000099473604e-05 | 4510 | rna-XM_030461389.1 6061929 | 2 | 193645280 | 193649789 | Calypte anna 9244 | CAG|GTGAGTGTGT...TCTGCTATAAAG/AAAGGAGTAACC...CCCAG|GAG | 1 | 1 | 3.46 |
31220345 | GT-AG | 0 | 1.000000099473604e-05 | 133225 | rna-XM_030461389.1 6061929 | 3 | 193650060 | 193783284 | Calypte anna 9244 | CTG|GTAAGTGGGG...TGAACCTTAAAA/ATTTTACTTATC...TTTAG|ACC | 1 | 1 | 6.704 |
31220346 | GT-AG | 0 | 1.000000099473604e-05 | 5102 | rna-XM_030461389.1 6061929 | 4 | 193783538 | 193788639 | Calypte anna 9244 | TAG|GTAGGACTTT...GTTTCCATATGT/GCATGACTCACT...TGTAG|AAA | 2 | 1 | 9.743 |
31220347 | GT-AG | 0 | 8.847865946819797e-05 | 2069 | rna-XM_030461389.1 6061929 | 5 | 193788739 | 193790807 | Calypte anna 9244 | TGG|GTATGTGACA...ATGGCTTCAATC/TATGGCTTCAAT...CTCAG|GCA | 2 | 1 | 10.932 |
31220348 | GT-AG | 0 | 2.1044354901985777e-05 | 20699 | rna-XM_030461389.1 6061929 | 6 | 193791044 | 193811742 | Calypte anna 9244 | TAG|GTAAGCTCAT...CTTTTCTCATTG/ACTTTTCTCATT...CACAG|CCA | 1 | 1 | 13.767 |
31220349 | GT-AG | 0 | 2.954696261378916e-05 | 4793 | rna-XM_030461389.1 6061929 | 7 | 193811914 | 193816706 | Calypte anna 9244 | AAG|GTATGGTGTG...CTACTTTTAAGG/TTAAGGCTAATA...GGCAG|GAA | 1 | 1 | 15.822 |
31220350 | GT-AG | 0 | 1.000000099473604e-05 | 4391 | rna-XM_030461389.1 6061929 | 8 | 193816922 | 193821312 | Calypte anna 9244 | CAG|GTAATGAGAC...TCACTCTTTCTG/GAAATGCTCACA...TTCAG|TTT | 0 | 1 | 18.405 |
31220351 | GT-AG | 0 | 1.000000099473604e-05 | 18878 | rna-XM_030461389.1 6061929 | 9 | 193821524 | 193840401 | Calypte anna 9244 | TTG|GTAAGGATTT...CTTTCCTAAACT/TCTTTCCTAAAC...TCCAG|AAT | 1 | 1 | 20.939 |
31220352 | GT-AG | 0 | 1.000000099473604e-05 | 4525 | rna-XM_030461389.1 6061929 | 10 | 193840504 | 193845028 | Calypte anna 9244 | GAG|GTAAGAATTC...TTTTTCTTATAT/ATTTTTCTTATA...CCCAG|CAT | 1 | 1 | 22.165 |
31220353 | GT-AG | 0 | 9.880088103684456e-05 | 4194 | rna-XM_030461389.1 6061929 | 11 | 193845224 | 193849417 | Calypte anna 9244 | AAG|GTATTTGCAA...TCTTCCTTCTCC/TCCCTTCTGATT...TGTAG|TGG | 1 | 1 | 24.507 |
31220354 | GT-AG | 0 | 2.899804612435185e-05 | 18899 | rna-XM_030461389.1 6061929 | 12 | 193849619 | 193868517 | Calypte anna 9244 | TCG|GTAATGTTAG...TTCCCCTTCTCT/CTTCTCTCCACC...CACAG|AGA | 1 | 1 | 26.922 |
31220355 | GT-AG | 0 | 1.000000099473604e-05 | 13781 | rna-XM_030461389.1 6061929 | 13 | 193868704 | 193882484 | Calypte anna 9244 | TTG|GTAGGGAGAA...TTCCTCTTTGCA/GTGCATTTGAAC...TGCAG|AGG | 1 | 1 | 29.157 |
31220356 | GT-AG | 0 | 1.000000099473604e-05 | 15752 | rna-XM_030461389.1 6061929 | 14 | 193882632 | 193898383 | Calypte anna 9244 | GAG|GTAGGAGATG...TCTTCCTTACCC/ATCTTCCTTACC...TGCAG|ATG | 1 | 1 | 30.923 |
31220357 | GT-AG | 0 | 1.000000099473604e-05 | 11994 | rna-XM_030461389.1 6061929 | 15 | 193898601 | 193910594 | Calypte anna 9244 | AGG|GTAAGTGAAT...TAGTTCTTACCA/ATAGTTCTTACC...TTCAG|ACA | 2 | 1 | 33.53 |
31220358 | GT-AG | 0 | 2.9723236269577748e-05 | 5566 | rna-XM_030461389.1 6061929 | 16 | 193910715 | 193916280 | Calypte anna 9244 | CAG|GTAAGCCATT...AATTTTTTATCC/TAATTTTTTATC...AACAG|CTT | 2 | 1 | 34.971 |
31220359 | GT-AG | 0 | 1.1878846241073329e-05 | 4704 | rna-XM_030461389.1 6061929 | 17 | 193916543 | 193921246 | Calypte anna 9244 | CAG|GTATGAACTC...ACCTCCTGGATT/TGCCTGCTCACT...CACAG|GCT | 0 | 1 | 38.119 |
31220360 | GT-AG | 0 | 1.000000099473604e-05 | 2083 | rna-XM_030461389.1 6061929 | 18 | 193921515 | 193923597 | Calypte anna 9244 | TTG|GTAAGTGGCT...TTGCTCTTCTCT/GGGTTATTAATC...CACAG|TTT | 1 | 1 | 41.338 |
31220361 | GT-AG | 0 | 0.0001807466387183 | 1092 | rna-XM_030461389.1 6061929 | 19 | 193923742 | 193924833 | Calypte anna 9244 | GTG|GTAGGTTGAC...ATTTCCTTATTA/GATTTCCTTATT...TCCAG|GCA | 1 | 1 | 43.068 |
31220362 | GT-AG | 0 | 0.007352551793516 | 9458 | rna-XM_030461389.1 6061929 | 20 | 193925084 | 193934541 | Calypte anna 9244 | AAG|GTACACTGTG...GTACCCTTGTTT/AAAACAATAACG...TTCAG|CCA | 2 | 1 | 46.072 |
31220363 | GT-AG | 0 | 1.000000099473604e-05 | 8562 | rna-XM_030461389.1 6061929 | 21 | 193934775 | 193943336 | Calypte anna 9244 | GGG|GTGAGACTGA...GCTTTTTTACTC/TTCTTTCTGACT...CTCAG|GAA | 1 | 1 | 48.871 |
31220364 | GT-AG | 0 | 1.0137346734974648e-05 | 3990 | rna-XM_030461389.1 6061929 | 22 | 193943492 | 193947481 | Calypte anna 9244 | CAG|GTAAGCTGAT...CTTGCTGTAACG/CCTTGTGTTATT...TTCAG|GTT | 0 | 1 | 50.733 |
31220365 | GT-AG | 0 | 1.000000099473604e-05 | 10233 | rna-XM_030461389.1 6061929 | 23 | 193948360 | 193958592 | Calypte anna 9244 | TGA|GTAAGTATAT...GTGTCCTGCTCT/GCTGATTTCACT...TCCAG|GTA | 2 | 1 | 61.281 |
31220366 | GT-AG | 0 | 1.000000099473604e-05 | 3790 | rna-XM_030461389.1 6061929 | 24 | 193958766 | 193962555 | Calypte anna 9244 | AAG|GTGAGTTGGG...GATTTCTTTTCT/AGCTTAATCATT...TTCAG|ATC | 1 | 1 | 63.359 |
31220367 | GT-AG | 0 | 1.000000099473604e-05 | 6931 | rna-XM_030461389.1 6061929 | 25 | 193962792 | 193969722 | Calypte anna 9244 | AGG|GTAAGTGAGC...AGCTTCCTAATT/ATCTTTGTCACT...TGTAG|GTT | 0 | 1 | 66.194 |
31220368 | GT-AG | 0 | 1.000000099473604e-05 | 1203 | rna-XM_030461389.1 6061929 | 26 | 193970020 | 193971222 | Calypte anna 9244 | AAG|GTAAGAATCC...TTTTCCTTCTCA/TTTTGCCTGAAA...TCCAG|TCC | 0 | 1 | 69.762 |
31220369 | GT-AG | 0 | 1.000000099473604e-05 | 4264 | rna-XM_030461389.1 6061929 | 27 | 193972838 | 193977101 | Calypte anna 9244 | CAG|GTAAGACATT...CTTCTCTTCCCT/ATCAGATTTATG...TCCAG|ATG | 1 | 1 | 89.164 |
31220370 | GT-AG | 0 | 1.000000099473604e-05 | 9712 | rna-XM_030461389.1 6061929 | 28 | 193977245 | 193986956 | Calypte anna 9244 | AAG|GTGATCATTG...TGTTTCTAAGCC/CTGTTTCTAAGC...CTCAG|TCA | 0 | 1 | 90.882 |
31238748 | GT-AG | 0 | 1.000000099473604e-05 | 12218 | rna-XM_030461389.1 6061929 | 1 | 193632772 | 193644989 | Calypte anna 9244 | CAG|GTTTGTCATA...CTTTTCTTGTCT/CGTGAACTCAGA...TTCAG|GTT | | 0 | 0.781 |
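The rows above are sorted by id, so the first intron of the transcript (id 31238748) appears last. The sketch below re-orders a few rows by ordinal_index and checks that length equals end - start + 1, which is what the listed values are consistent with (inclusive coordinates). The tuple layout and the names `length_matches`, `by_position` and `gaps` are illustrative assumptions; treating the gap between consecutive introns as an exon assumes the introns are adjacent in the same transcript, as they are here.

```python
# (id, ordinal_index, start, end, length), copied from four rows of the table.
rows = [
    (31220344, 2, 193645280, 193649789, 4510),
    (31220345, 3, 193650060, 193783284, 133225),
    (31220346, 4, 193783538, 193788639, 5102),
    (31238748, 1, 193632772, 193644989, 12218),
]


def length_matches(start: int, end: int, length: int) -> bool:
    """True if length agrees with inclusive start/end coordinates."""
    return end - start + 1 == length


# Order by position within the transcript rather than by id.
by_position = sorted(rows, key=lambda r: r[1])

# Check every row against the inclusive-coordinate convention.
all_consistent = all(length_matches(s, e, n) for _, _, s, e, n in rows)

# Bases between the end of one intron and the start of the next one,
# i.e. the size of the exon separating them (assumption: same strand order).
gaps = [nxt[2] - prev[3] - 1 for prev, nxt in zip(by_position, by_position[1:])]
```

Sorting by `ordinal_index` rather than `id` is the safer choice whenever the order of introns along the transcript matters.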
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
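As a rough illustration of how the schema and its transcript_id index might be used, the sketch below recreates the table in an in-memory SQLite database with Python's standard sqlite3 module, loads two rows from the listing, and runs the same filter as this page (transcript_id = 6061929) ordered by ordinal_index. Several things here are assumptions: the parent tables transcripts and genomes are not created, so foreign-key enforcement is switched off; only one of the seven indexes is created; and the phase of row 31238748 is taken to be NULL because its in_cds value is 0.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Parent tables are not part of this sketch, so do not enforce foreign keys.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript("""
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER,
  "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
""")

# Two rows copied from the listing; phase of the second is assumed NULL.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    [
        (31220344, "GT-AG", 0, 1.000000099473604e-05, 4510, 6061929, 2,
         193645280, 193649789, 9244,
         "CAG|GTGAGTGTGT...TCTGCTATAAAG/AAAGGAGTAACC...CCCAG|GAG", 1, 1, 3.46),
        (31238748, "GT-AG", 0, 1.000000099473604e-05, 12218, 6061929, 1,
         193632772, 193644989, 9244,
         "CAG|GTTTGTCATA...CTTTTCTTGTCT/CGTGAACTCAGA...TTCAG|GTT", None, 0, 0.781),
    ],
)

# "start" and "end" are quoted here, as they are in the schema.
QUERY = (
    'SELECT id, ordinal_index, "start", "end", phase FROM introns '
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
result = conn.execute(QUERY, (6061929,)).fetchall()

# Ask SQLite how it plans to run the query; the last column is the detail text.
plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (6061929,)).fetchall()
uses_index = any("idx_introns_transcript_id" in step[-1] for step in plan)
```

Inspecting `plan` is a quick way to confirm whether a filter on an indexed column such as transcript_id avoids a full table scan.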