introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
46 rows where transcript_id = 6061954
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, is_minor, score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31221295 | GT-AG | 0 | 1.000000099473604e-05 | 1080 | rna-XM_030448235.1 6061954 | 1 | 194797485 | 194798564 | Calypte anna 9244 | AAT|GTAGGTACAA...ACTCATTTGATG/ATCCCACTCATT...CCCAG|GAG | 0 | 1 | 1.523 |
31221296 | GT-AG | 0 | 1.000000099473604e-05 | 6895 | rna-XM_030448235.1 6061954 | 2 | 194790437 | 194797331 | Calypte anna 9244 | TAT|GTGAGTGTCC...AGTCTCTCAGCC/CCACCTCTCACT...TCCAG|ACA | 0 | 1 | 3.876 |
31221297 | GT-AG | 0 | 1.000000099473604e-05 | 1938 | rna-XM_030448235.1 6061954 | 3 | 194788314 | 194790251 | Calypte anna 9244 | CAG|GTGAGATGGG...CTGACCTCAATT/GGGTTTCTGACC...TGCAG|TGG | 2 | 1 | 6.722 |
31221298 | GT-AG | 0 | 1.000000099473604e-05 | 3945 | rna-XM_030448235.1 6061954 | 4 | 194784247 | 194788191 | Calypte anna 9244 | AAG|GTGAGGCCCA...CTGCTCTTACCC/TCTGCTCTTACC...TGCAG|CTT | 1 | 1 | 8.599 |
31221299 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_030448235.1 6061954 | 5 | 194784000 | 194784103 | Calypte anna 9244 | CAG|GTGAGGTGAC...TTGCTCTTTTCT/CCGAGAATCAGC...TGCAG|GCC | 0 | 1 | 10.798 |
31221300 | GT-AG | 0 | 1.000000099473604e-05 | 1173 | rna-XM_030448235.1 6061954 | 6 | 194782713 | 194783885 | Calypte anna 9244 | ATG|GTGAGGGTGC...ATGCTCTTCACT/ATGCTCTTCACT...CCCAG|GGT | 0 | 1 | 12.552 |
31221301 | GT-AG | 0 | 1.000000099473604e-05 | 647 | rna-XM_030448235.1 6061954 | 7 | 194781912 | 194782558 | Calypte anna 9244 | AAG|GTAAAAGGAC...CTTTTCTTCTCC/CCCTTGCTGATG...CCCAG|CTC | 1 | 1 | 14.921 |
31221302 | GT-AG | 0 | 1.000000099473604e-05 | 878 | rna-XM_030448235.1 6061954 | 8 | 194780957 | 194781834 | Calypte anna 9244 | GAG|GTCAGTGGCC...GGTGCTGTAAAG/TGTAAAGTCAAT...GCCAG|GTT | 0 | 1 | 16.105 |
31221303 | GT-AG | 0 | 1.000000099473604e-05 | 1534 | rna-XM_030448235.1 6061954 | 9 | 194779303 | 194780836 | Calypte anna 9244 | AAG|GTGAATCTTA...AACTCCTAACCT/AAACTCCTAACC...TGCAG|GGA | 0 | 1 | 17.951 |
31221304 | GT-AG | 0 | 0.0002077811929104 | 453 | rna-XM_030448235.1 6061954 | 10 | 194778707 | 194779159 | Calypte anna 9244 | CAG|GTATGGAGCT...CTTTCCTTGATC/CTTTCCTTGATC...CACAG|TTT | 2 | 1 | 20.151 |
31221305 | GT-AG | 0 | 0.0001895761821049 | 465 | rna-XM_030448235.1 6061954 | 11 | 194778031 | 194778495 | Calypte anna 9244 | AAG|GTGTGTTTTG...ATTATTTTAATG/ATTATTTTAATG...CACAG|GGT | 0 | 1 | 23.396 |
31221306 | GT-AG | 0 | 1.000000099473604e-05 | 359 | rna-XM_030448235.1 6061954 | 12 | 194777536 | 194777894 | Calypte anna 9244 | AAG|GTATGGGGCA...GTCCCATTATTC/CACTGTATCATC...TCTAG|GTT | 1 | 1 | 25.488 |
31221307 | GT-AG | 0 | 1.000000099473604e-05 | 9591 | rna-XM_030448235.1 6061954 | 13 | 194767838 | 194777428 | Calypte anna 9244 | ATG|GTAATGCAGG...ATTTCCTGAGCC/TATCCTTTCATT...TGCAG|GGA | 0 | 1 | 27.134 |
31221308 | GT-AG | 0 | 1.000000099473604e-05 | 3927 | rna-XM_030448235.1 6061954 | 14 | 194763773 | 194767699 | Calypte anna 9244 | ATG|GTAAGTTCTG...TGTATTGTATCC/CCCAGGCTGAGC...CCCAG|CTG | 0 | 1 | 29.257 |
31221309 | GT-AG | 0 | 1.000000099473604e-05 | 995 | rna-XM_030448235.1 6061954 | 15 | 194762619 | 194763613 | Calypte anna 9244 | CAG|GTGAATATCT...TCTTCTATGATT/TCTTCTATGATT...CCCAG|GGT | 0 | 1 | 31.703 |
31221310 | GT-AG | 0 | 1.000000099473604e-05 | 1320 | rna-XM_030448235.1 6061954 | 16 | 194761206 | 194762525 | Calypte anna 9244 | AAG|GTAGGCAGCT...GTGCTTTTCTCC/GAGAGCTTGAGA...CACAG|GAC | 0 | 1 | 33.133 |
31221311 | GT-AG | 0 | 1.000000099473604e-05 | 1000 | rna-XM_030448235.1 6061954 | 17 | 194760111 | 194761110 | Calypte anna 9244 | CAG|GTAGGTGTGA...AAGTTCTTGAAA/AAGTTCTTGAAA...TGCAG|GTC | 2 | 1 | 34.595 |
31221312 | GT-AG | 0 | 1.000000099473604e-05 | 887 | rna-XM_030448235.1 6061954 | 18 | 194759139 | 194760025 | Calypte anna 9244 | GCT|GTGAGTACTG...TTGGTTTCATCC/GTTGGTTTCATC...TGCAG|ATG | 0 | 1 | 35.902 |
31221313 | GT-AG | 0 | 1.000000099473604e-05 | 165 | rna-XM_030448235.1 6061954 | 19 | 194758755 | 194758919 | Calypte anna 9244 | GAA|GTAAGTACCG...TGGTGTTTATTC/CTGGTGTTTATT...CCCAG|TAT | 0 | 1 | 39.271 |
31221314 | GT-AG | 0 | 0.0010385698353717 | 695 | rna-XM_030448235.1 6061954 | 20 | 194757952 | 194758646 | Calypte anna 9244 | CAA|GTATGTAGTA...TGGTTCTTGTCC/AGTTTTGTCATG...TGCAG|GTA | 0 | 1 | 40.932 |
31221315 | GT-AG | 0 | 1.000000099473604e-05 | 525 | rna-XM_030448235.1 6061954 | 21 | 194757217 | 194757741 | Calypte anna 9244 | GAG|GTACTGGAGA...ATGTTCTTCCCC/GCCATGCTGAAT...ACCAG|GAT | 0 | 1 | 44.162 |
31221316 | GT-AG | 0 | 1.000000099473604e-05 | 1187 | rna-XM_030448235.1 6061954 | 22 | 194755826 | 194757012 | Calypte anna 9244 | TTG|GTGAGACAAG...TGGAATTTAACT/TGGAATTTAACT...CCTAG|GCA | 0 | 1 | 47.3 |
31221317 | GT-AG | 0 | 0.0022754375090929 | 1501 | rna-XM_030448235.1 6061954 | 23 | 194754148 | 194755648 | Calypte anna 9244 | GAG|GTAACCACTG...CACCTCTTGCTT/CCATGTCTCACA...TGCAG|AGT | 0 | 1 | 50.023 |
31221318 | GT-AG | 0 | 1.000000099473604e-05 | 1264 | rna-XM_030448235.1 6061954 | 24 | 194752794 | 194754057 | Calypte anna 9244 | GAG|GTGAGAGGCG...CTGGCTGTACTG/GCTGTACTGAAA...GGCAG|GTG | 0 | 1 | 51.407 |
31221319 | GT-AG | 0 | 1.000000099473604e-05 | 1405 | rna-XM_030448235.1 6061954 | 25 | 194751261 | 194752665 | Calypte anna 9244 | AAG|GTTAGAAACC...TTTTTCTTTTCA/TTTCTTTTCAAT...TGTAG|GGA | 2 | 1 | 53.376 |
31221320 | GT-AG | 0 | 1.000000099473604e-05 | 3070 | rna-XM_030448235.1 6061954 | 26 | 194748064 | 194751133 | Calypte anna 9244 | AAG|GTAAATTGTA...ATGGTCTAACTG/GATGGTCTAACT...ACCAG|TAT | 0 | 1 | 55.33 |
31221321 | GT-AG | 0 | 1.6864911624861208e-05 | 698 | rna-XM_030448235.1 6061954 | 27 | 194747246 | 194747943 | Calypte anna 9244 | CAG|GTATGAGCCC...CATGCCTTGATA/TTTGGCCTCATT...GTTAG|GCA | 0 | 1 | 57.176 |
31221322 | GT-AG | 0 | 3.750220349578064e-05 | 1209 | rna-XM_030448235.1 6061954 | 28 | 194745863 | 194747071 | Calypte anna 9244 | AAG|GTACATCTTC...TTCTCTCTGCCC/GTGAATGTCACA...AGCAG|GTC | 0 | 1 | 59.852 |
31221323 | GT-AG | 0 | 1.000000099473604e-05 | 890 | rna-XM_030448235.1 6061954 | 29 | 194744745 | 194745634 | Calypte anna 9244 | AAG|GTGAGGGATC...ACAGCATTAACT/ACAGCATTAACT...CACAG|GAA | 0 | 1 | 63.359 |
31221324 | GT-AG | 0 | 1.000000099473604e-05 | 1360 | rna-XM_030448235.1 6061954 | 30 | 194743214 | 194744573 | Calypte anna 9244 | AAG|GTGGGGACAA...TTGTTCTTCATG/TTGTTCTTCATG...TGTAG|GGA | 0 | 1 | 65.99 |
31221325 | GT-AG | 0 | 1.000000099473604e-05 | 270 | rna-XM_030448235.1 6061954 | 31 | 194742826 | 194743095 | Calypte anna 9244 | CAG|GTGAGTCAGT...AACTGCTTATCT/AGCATCCTAATA...CCCAG|GGC | 1 | 1 | 67.805 |
31221326 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_030448235.1 6061954 | 32 | 194742593 | 194742698 | Calypte anna 9244 | CAG|GTAAGGAGAA...AAGCCATTATCT/GTCCTGTTTATG...CCCAG|AGG | 2 | 1 | 69.758 |
31221327 | GT-AG | 0 | 1.000000099473604e-05 | 120 | rna-XM_030448235.1 6061954 | 33 | 194742303 | 194742422 | Calypte anna 9244 | CTG|GTAAGTAGGG...GGATGCTTTTCT/AACAAAATAATG...TGCAG|TGG | 1 | 1 | 72.373 |
31221328 | GT-AG | 0 | 1.000000099473604e-05 | 545 | rna-XM_030448235.1 6061954 | 34 | 194741567 | 194742111 | Calypte anna 9244 | GTG|GTAAGATGTC...TGAATTATATCA/ACTATATTGAGT...CACAG|GCT | 0 | 1 | 75.311 |
31221329 | GT-AG | 0 | 1.000000099473604e-05 | 474 | rna-XM_030448235.1 6061954 | 35 | 194740968 | 194741441 | Calypte anna 9244 | TAG|GTGATTTATT...CTTGCTTTGCCT/ATCCTTCTAACC...ACCAG|GCC | 2 | 1 | 77.234 |
31221330 | AT-AC | 1 | 99.99999998785135 | 996 | rna-XM_030448235.1 6061954 | 36 | 194739814 | 194740809 | Calypte anna 9244 | TTG|ATATCCTTCC...TCATCCTTGACT/GGTTAGCTCATC...CCCAC|CCG | 1 | 1 | 79.665 |
31221331 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_030448235.1 6061954 | 37 | 194739544 | 194739659 | Calypte anna 9244 | CAA|GTAAGAGTTT...GTGATCTTATTC/AGTGATCTTATT...TGTAG|GTA | 2 | 1 | 82.034 |
31221332 | GT-AG | 0 | 3.1729742675756696e-05 | 415 | rna-XM_030448235.1 6061954 | 38 | 194738973 | 194739387 | Calypte anna 9244 | GAG|GTATGGCCCC...CAGGCTTTGATG/CAGGCTTTGATG...TGTAG|GAA | 2 | 1 | 84.433 |
31221333 | GT-AG | 0 | 1.000000099473604e-05 | 649 | rna-XM_030448235.1 6061954 | 39 | 194738218 | 194738866 | Calypte anna 9244 | GAG|GTAAGAAGCA...TTAACTTTGAGA/TGGAGATTAACT...TGTAG|GCA | 0 | 1 | 86.064 |
31221334 | GT-AG | 0 | 1.000000099473604e-05 | 1638 | rna-XM_030448235.1 6061954 | 40 | 194736466 | 194738103 | Calypte anna 9244 | AAG|GTGAGTGACC...GGCCTCTTACCA/CACTTGCTCAGC...CCCAG|GTC | 0 | 1 | 87.817 |
31221335 | GT-AG | 0 | 1.000000099473604e-05 | 572 | rna-XM_030448235.1 6061954 | 41 | 194735806 | 194736377 | Calypte anna 9244 | ATG|GTAACAGGGG...TTGTTGTTACAT/CTGGGGCTCACA...TTCAG|GTA | 1 | 1 | 89.171 |
31221336 | GT-AG | 0 | 0.0001292208508211 | 847 | rna-XM_030448235.1 6061954 | 42 | 194734852 | 194735698 | Calypte anna 9244 | CAG|GTACTCCAGG...TTTTTCTTTCCT/CTGTCTCTGATT...TGCAG|GAG | 0 | 1 | 90.817 |
31221337 | GT-AG | 0 | 1.000000099473604e-05 | 870 | rna-XM_030448235.1 6061954 | 43 | 194733796 | 194734665 | Calypte anna 9244 | AGG|GTAAAACAGC...CAAACCTTATAG/ACAAACCTTATA...TGCAG|TCC | 0 | 1 | 93.678 |
31221338 | GT-AG | 0 | 0.0297480039641371 | 1050 | rna-XM_030448235.1 6061954 | 44 | 194732629 | 194733678 | Calypte anna 9244 | AAG|GTACCTGCCC...TTCTCCTTACCA/GTTCTTCTCATC...AACAG|CAA | 0 | 1 | 95.478 |
31221339 | GT-AG | 0 | 1.000000099473604e-05 | 531 | rna-XM_030448235.1 6061954 | 45 | 194732014 | 194732544 | Calypte anna 9244 | AAG|GTAAGCAAAA...TCAGTCTTGTGT/CGTGTGTTGAGT...ACCAG|GAT | 0 | 1 | 96.77 |
31221340 | GT-AG | 0 | 1.000000099473604e-05 | 772 | rna-XM_030448235.1 6061954 | 46 | 194731122 | 194731893 | Calypte anna 9244 | TTG|GTAAGGAGCA...ACCATCTTCTCT/TGTGCTCTAAAG...CTCAG|GGG | 0 | 1 | 98.616 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);