introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
38 rows where transcript_id = 6061940
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31220810 | GT-AG | 0 | 1.898434613538617e-05 | 77624 | rna-XM_030463599.1 6061940 | 1 | 67133450 | 67211073 | Calypte anna 9244 | AAG|GTAAGTTGTA...TCCCCCTTGCCC/TTTTTTCTCTTC...TGCAG|ATT | 0 | 1 | 3.431 |
31220811 | GT-AG | 0 | 1.000000099473604e-05 | 698 | rna-XM_030463599.1 6061940 | 2 | 67211192 | 67211889 | Calypte anna 9244 | TCG|GTAAGACCAG...AAAATTTTATCT/TTTTATCTCATT...TTTAG|CAA | 1 | 1 | 5.097 |
31220812 | GT-AG | 0 | 1.000000099473604e-05 | 23078 | rna-XM_030463599.1 6061940 | 3 | 67211943 | 67235020 | Calypte anna 9244 | ATG|GTAAGAATGT...TTCTTGTTGATT/TTCTTGTTGATT...TGCAG|ATT | 0 | 1 | 5.845 |
31220813 | GT-AG | 0 | 1.000000099473604e-05 | 4123 | rna-XM_030463599.1 6061940 | 4 | 67235094 | 67239216 | Calypte anna 9244 | AAG|GTAAGAGAAA...TCTTTTTTGTCT/TCTTTTTTCTTT...AATAG|AAA | 1 | 1 | 6.876 |
31220814 | GT-AG | 0 | 7.232436223777882e-05 | 12527 | rna-XM_030463599.1 6061940 | 5 | 67239395 | 67251921 | Calypte anna 9244 | TAG|GTAGGTTAAG...AATTCTTTAATT/AATTCTTTAATT...TACAG|TCT | 2 | 1 | 9.389 |
31220815 | GT-AG | 0 | 1.000000099473604e-05 | 4296 | rna-XM_030463599.1 6061940 | 6 | 67251991 | 67256286 | Calypte anna 9244 | TAG|GTCAGTTAAA...CTGCTCATAACA/TTTCTGCTCATA...TCCAG|GCT | 2 | 1 | 10.363 |
31220816 | GT-AG | 0 | 1.000000099473604e-05 | 9608 | rna-XM_030463599.1 6061940 | 7 | 67256427 | 67266034 | Calypte anna 9244 | AAG|GTAAGTGATT...TTGTTTTTTTTT/GGAAGAATGATT...AGCAG|ATT | 1 | 1 | 12.339 |
31220817 | GT-AG | 0 | 1.000000099473604e-05 | 10944 | rna-XM_030463599.1 6061940 | 8 | 67267056 | 67277999 | Calypte anna 9244 | TAG|GTAAGGTCAC...TCAACCTTGTTG/ATAATAATAATC...TTCAG|AAC | 2 | 1 | 26.754 |
31220818 | GT-AG | 0 | 1.000000099473604e-05 | 14404 | rna-XM_030463599.1 6061940 | 9 | 67278116 | 67292519 | Calypte anna 9244 | CTG|GTGAGTAACA...GCTGTCTTTCCT/CTAAAACTCATG...TCCAG|GGG | 1 | 1 | 28.392 |
31220819 | GT-AG | 0 | 0.0002643097444573 | 907 | rna-XM_030463599.1 6061940 | 10 | 67292629 | 67293535 | Calypte anna 9244 | CAG|GTACACTGGT...TCCATTTTGTTT/AGCTCACTGACT...GGCAG|CAC | 2 | 1 | 29.931 |
31220820 | GT-AG | 0 | 1.000000099473604e-05 | 3270 | rna-XM_030463599.1 6061940 | 11 | 67293920 | 67297189 | Calypte anna 9244 | TGG|GTAAGTAAAA...CTTTTCTCACTG/GCTTTTCTCACT...TTCAG|GTA | 2 | 1 | 35.352 |
31220821 | GT-AG | 0 | 1.000000099473604e-05 | 33634 | rna-XM_030463599.1 6061940 | 12 | 67297310 | 67330943 | Calypte anna 9244 | CAG|GTAATAGTTC...TTCTCCTTTCTC/TGCCTATTAAGA...TGCAG|CTG | 2 | 1 | 37.046 |
31220822 | GC-AG | 0 | 1.000000099473604e-05 | 1462 | rna-XM_030463599.1 6061940 | 13 | 67331077 | 67332538 | Calypte anna 9244 | CAT|GCCCAGGTGA...TAATTTTTTGTG/CTGAGATTGATA...CGTAG|CTG | 0 | 1 | 38.924 |
31220823 | GT-AG | 0 | 1.000000099473604e-05 | 2135 | rna-XM_030463599.1 6061940 | 14 | 67332816 | 67334950 | Calypte anna 9244 | CGG|GTACAAAACT...GTGACCTTTTCA/TTACAGCTTACA...TTTAG|GAA | 1 | 1 | 42.835 |
31220824 | GT-AG | 0 | 1.000000099473604e-05 | 1576 | rna-XM_030463599.1 6061940 | 16 | 67335659 | 67337234 | Calypte anna 9244 | AAG|GTTAGTGCTT...TAATGTTTAACG/ATTTGTTTTACA...TTTAG|GTT | 2 | 1 | 52.802 |
31220825 | GT-AG | 0 | 1.000000099473604e-05 | 5696 | rna-XM_030463599.1 6061940 | 17 | 67337724 | 67343419 | Calypte anna 9244 | AAG|GTGAGCGTGC...TCCTCTTTACCG/GTCCTCTTTACC...AACAG|CAT | 2 | 1 | 59.706 |
31220826 | GT-AG | 0 | 0.0057019352142796 | 2248 | rna-XM_030463599.1 6061940 | 18 | 67343462 | 67345709 | Calypte anna 9244 | AAG|GTATATATTT...TGTTTCTTGTTT/CTTGTTTTGAGC...TTCAG|ATA | 2 | 1 | 60.299 |
31220827 | GT-AG | 0 | 1.000000099473604e-05 | 1748 | rna-XM_030463599.1 6061940 | 19 | 67345871 | 67347618 | Calypte anna 9244 | TGG|GTAAATGACA...GCTTCATTAATA/ATGGGCTTCATT...TCCAG|TGA | 1 | 1 | 62.572 |
31220828 | GT-AG | 0 | 1.000000099473604e-05 | 3043 | rna-XM_030463599.1 6061940 | 20 | 67347808 | 67350850 | Calypte anna 9244 | AAG|GTAAGACTTA...TTTTCCTTTTTT/TATGCTTTCATC...TGAAG|TAC | 1 | 1 | 65.241 |
31220829 | GT-AG | 0 | 0.000138654976016 | 3592 | rna-XM_030463599.1 6061940 | 21 | 67350904 | 67354495 | Calypte anna 9244 | ACA|GTAAGTATTG...GTTTTCTAAACC/CGTTTTCTAAAC...TCCAG|GCA | 0 | 1 | 65.989 |
31220830 | GT-AG | 0 | 1.000000099473604e-05 | 2201 | rna-XM_030463599.1 6061940 | 22 | 67354520 | 67356720 | Calypte anna 9244 | GAG|GTAAAAACTT...CTGTCTTTGTTT/TGAACACTGACA...TCTAG|CAA | 0 | 1 | 66.328 |
31220831 | GC-AG | 0 | 1.000000099473604e-05 | 8592 | rna-XM_030463599.1 6061940 | 23 | 67356799 | 67365390 | Calypte anna 9244 | AAT|GCAAGTATTT...GTTGTCTTATGT/TTGAGACTCATT...CACAG|GCT | 0 | 1 | 67.429 |
31220832 | GT-AG | 0 | 0.0969984566880697 | 1563 | rna-XM_030463599.1 6061940 | 24 | 67365475 | 67367037 | Calypte anna 9244 | AAG|GTACCTTTAC...GCTTTCTTTTCT/AATATTTTGAGC...CACAG|GAA | 0 | 1 | 68.615 |
31220833 | GT-AG | 0 | 0.009623651639903 | 1279 | rna-XM_030463599.1 6061940 | 25 | 67367150 | 67368428 | Calypte anna 9244 | AAG|GTATGTTTTT...GTGGCTTTCACA/GTGGCTTTCACA...CTCAG|ACT | 1 | 1 | 70.196 |
31220834 | GT-AG | 0 | 1.000000099473604e-05 | 570 | rna-XM_030463599.1 6061940 | 26 | 67368551 | 67369120 | Calypte anna 9244 | TGG|GTAAGTACTC...ATTCACTTGATT/CCTTCATTCACT...GCAAG|GTG | 0 | 1 | 71.919 |
31220835 | GT-AG | 0 | 1.000000099473604e-05 | 1150 | rna-XM_030463599.1 6061940 | 27 | 67369142 | 67370291 | Calypte anna 9244 | GAG|GTGAATATAA...ATAATCTTGCTC/CAGAAATTGACC...TCCAG|CTA | 0 | 1 | 72.215 |
31220836 | GT-AG | 0 | 4.627743368487391e-05 | 5407 | rna-XM_030463599.1 6061940 | 28 | 67370465 | 67375871 | Calypte anna 9244 | TGC|GTAAGTCTCT...CTTCGTTTAATC/CTTCGTTTAATC...TTCAG|GAT | 2 | 1 | 74.658 |
31220837 | GT-AG | 0 | 1.000000099473604e-05 | 4020 | rna-XM_030463599.1 6061940 | 29 | 67376026 | 67380045 | Calypte anna 9244 | CAG|GTCAGTGCAG...TTTATGTTGATT/TTTATGTTGATT...TGCAG|AAT | 0 | 1 | 76.832 |
31220838 | GT-AG | 0 | 4.964889812919302e-05 | 2752 | rna-XM_030463599.1 6061940 | 30 | 67380221 | 67382972 | Calypte anna 9244 | CAG|GTAAACATAA...TTTCCCTTCTTT/AGGTTCATCACT...TGCAG|ATA | 1 | 1 | 79.303 |
31220839 | GT-AG | 0 | 1.000000099473604e-05 | 649 | rna-XM_030463599.1 6061940 | 31 | 67383071 | 67383719 | Calypte anna 9244 | AAG|GTAAGTTCAT...CCATTTTTAGTC/CCAATTTTCAAA...TGCAG|GAT | 0 | 1 | 80.686 |
31220840 | GT-AG | 0 | 1.000000099473604e-05 | 297 | rna-XM_030463599.1 6061940 | 32 | 67383816 | 67384112 | Calypte anna 9244 | AAG|GTAAGACAGT...TATGTCTTGTCT/TGTCTTTTCATA...TTCAG|GAA | 0 | 1 | 82.042 |
31220841 | GT-AG | 0 | 1.000000099473604e-05 | 3604 | rna-XM_030463599.1 6061940 | 33 | 67384282 | 67387885 | Calypte anna 9244 | AAG|GTAAAACACC...TTTTTCTTTTCT/CTGTACCAGAAA...TCCAG|GAG | 1 | 1 | 84.428 |
31220842 | GT-AG | 0 | 1.000000099473604e-05 | 1716 | rna-XM_030463599.1 6061940 | 34 | 67388122 | 67389837 | Calypte anna 9244 | AAG|GTAAGTTACT...TTTCTGTTAATT/TTTCTGTTAATT...TGAAG|GAT | 0 | 1 | 87.759 |
31220843 | GT-AG | 0 | 1.000000099473604e-05 | 868 | rna-XM_030463599.1 6061940 | 35 | 67389993 | 67390860 | Calypte anna 9244 | ATG|GTAGGTAGCA...CTACTTTTCTCT/CAAGACCAAACT...TTCAG|TCC | 2 | 1 | 89.948 |
31220844 | GT-AG | 0 | 1.000000099473604e-05 | 427 | rna-XM_030463599.1 6061940 | 36 | 67390933 | 67391359 | Calypte anna 9244 | CAG|GTAGATTCAC...TTATCCTTTCTA/TTGCTATTCATT...CACAG|GTG | 2 | 1 | 90.964 |
31220845 | GT-AG | 0 | 1.000000099473604e-05 | 648 | rna-XM_030463599.1 6061940 | 37 | 67391557 | 67392204 | Calypte anna 9244 | TTG|GTGAGTACAT...CTTTTTTTAATG/CTTTTTTTAATG...CTCAG|GTC | 1 | 1 | 93.746 |
31220846 | GT-AG | 0 | 0.0002112702650962 | 2299 | rna-XM_030463599.1 6061940 | 38 | 67392324 | 67394622 | Calypte anna 9244 | CAG|GTATGTCATT...GGATACTTACCT/AGGATACTTACC...TGCAG|ATG | 0 | 1 | 95.426 |
31220847 | GT-AG | 0 | 1.000000099473604e-05 | 2187 | rna-XM_030463599.1 6061940 | 39 | 67394827 | 67397013 | Calypte anna 9244 | TTG|GTAAGATTTA...GCAACATTAAAT/AATTATGTAAAC...TTCAG|ATG | 0 | 1 | 98.306 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);