introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
43 rows where transcript_id = 6061990
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31222333 | GT-AG | 0 | 3.711645899359577e-05 | 2252 | rna-XM_030450947.1 6061990 | 2 | 145421498 | 145423749 | Calypte anna 9244 | CCA|GTAGGTATCT...TTTTTATGAACA/CTTTTTATGAAC...TCCAG|TGG | 0 | 1 | 3.495 |
31222334 | GT-AG | 0 | 1.000000099473604e-05 | 16142 | rna-XM_030450947.1 6061990 | 3 | 145423933 | 145440074 | Calypte anna 9244 | AAG|GTAAGCACTT...TTAACCTCAGCT/ATTAACCTCAGC...TGTAG|GGT | 0 | 1 | 6.951 |
31222335 | GT-AG | 0 | 1.000000099473604e-05 | 1549 | rna-XM_030450947.1 6061990 | 4 | 145440159 | 145441707 | Calypte anna 9244 | CAG|GTAATTATTT...ATACACTTGATT/ATACACTTGATT...TGTAG|GTG | 0 | 1 | 8.538 |
31222336 | GT-AG | 0 | 1.000000099473604e-05 | 818 | rna-XM_030450947.1 6061990 | 5 | 145441848 | 145442665 | Calypte anna 9244 | GAA|GTAAGTAAGG...TTTATCTTTTTT/TGTGTACTCATT...GCCAG|GCG | 2 | 1 | 11.182 |
31222337 | GT-AG | 0 | 1.000000099473604e-05 | 16097 | rna-XM_030450947.1 6061990 | 6 | 145442795 | 145458891 | Calypte anna 9244 | AGG|GTGAGTTAGC...AACACCTTAATT/TTAATTTTTATT...CACAG|AAA | 2 | 1 | 13.619 |
31222338 | GT-AG | 0 | 1.000000099473604e-05 | 27893 | rna-XM_030450947.1 6061990 | 7 | 145459047 | 145486939 | Calypte anna 9244 | TAG|GTCTGTCCTA...GGAGGCTTAATG/AACATTTTCATT...AACAG|GCA | 1 | 1 | 16.547 |
31222339 | GT-AG | 0 | 1.000000099473604e-05 | 124 | rna-XM_030450947.1 6061990 | 8 | 145487083 | 145487206 | Calypte anna 9244 | AAG|GTACTGAAAA...TCTTTCTTGTTT/TGTTTGCTGAGC...TTCAG|AAT | 0 | 1 | 19.248 |
31222340 | GT-AG | 0 | 1.000000099473604e-05 | 2765 | rna-XM_030450947.1 6061990 | 9 | 145487312 | 145490076 | Calypte anna 9244 | CAG|GTGAACCAAC...ATATTTCTGACA/ATATTTCTGACA...TCAAG|ATG | 0 | 1 | 21.232 |
31222341 | GT-AG | 0 | 3.783970976209044e-05 | 29446 | rna-XM_030450947.1 6061990 | 10 | 145490164 | 145519609 | Calypte anna 9244 | CAG|GTATGACATT...CTATTTTTGATT/CTATTTTTGATT...TGCAG|AAA | 0 | 1 | 22.875 |
31222342 | GT-AG | 0 | 1.000000099473604e-05 | 8251 | rna-XM_030450947.1 6061990 | 11 | 145519742 | 145527992 | Calypte anna 9244 | GAG|GTAAGAATGA...AGTGTTTTAAAT/TGTGATTTAACT...TCTAG|GTT | 0 | 1 | 25.368 |
31222343 | GT-AG | 0 | 1.000000099473604e-05 | 6323 | rna-XM_030450947.1 6061990 | 12 | 145528161 | 145534483 | Calypte anna 9244 | CAG|GTATGACCAG...ATTGTCTTTTCT/TACTGATTAATG...TACAG|GTT | 0 | 1 | 28.542 |
31222344 | GC-AG | 0 | 1.000000099473604e-05 | 7187 | rna-XM_030450947.1 6061990 | 13 | 145534676 | 145541862 | Calypte anna 9244 | CGG|GCAAGTATAA...TGTACTTTGTTT/ATTGTTATTATA...TTCAG|GCT | 0 | 1 | 32.168 |
31222345 | GT-AG | 0 | 1.000000099473604e-05 | 3058 | rna-XM_030450947.1 6061990 | 14 | 145542001 | 145545058 | Calypte anna 9244 | CTG|GTAAGTCTTC...CAAGCTCTAACA/CAAGCTCTAACA...TTTAG|ATT | 0 | 1 | 34.775 |
31222346 | GT-AG | 0 | 1.000000099473604e-05 | 30101 | rna-XM_030450947.1 6061990 | 15 | 145545134 | 145575234 | Calypte anna 9244 | CAA|GTAAGATAAT...ATTTTTTTCATT/ATTTTTTTCATT...TTCAG|CTA | 0 | 1 | 36.192 |
31222347 | GT-AG | 0 | 0.0036180278778374 | 1691 | rna-XM_030450947.1 6061990 | 16 | 145575372 | 145577062 | Calypte anna 9244 | AAG|GTATCGAAAT...TCCCTTTTAAAT/CTTCTGTTTATT...ACAAG|GGA | 2 | 1 | 38.78 |
31222348 | GT-AG | 0 | 1.000000099473604e-05 | 15623 | rna-XM_030450947.1 6061990 | 17 | 145577205 | 145592827 | Calypte anna 9244 | AAG|GTAAATCAAG...TTTTTCTTAATT/TTTTTCTTAATT...TATAG|CTC | 0 | 1 | 41.462 |
31222349 | GT-AG | 0 | 1.000000099473604e-05 | 13520 | rna-XM_030450947.1 6061990 | 18 | 145592902 | 145606421 | Calypte anna 9244 | AAG|GTAAGAGTCA...TCATCATTATAT/TTTATACTGAAA...TGTAG|GGC | 2 | 1 | 42.86 |
31222350 | GT-AG | 0 | 1.000000099473604e-05 | 432 | rna-XM_030450947.1 6061990 | 19 | 145606524 | 145606955 | Calypte anna 9244 | CAG|GTAAATGTGG...GTATTGTTAAAT/CAAAATTTCATT...TTCAG|GTC | 2 | 1 | 44.787 |
31222351 | GT-AG | 0 | 1.923814132844301e-05 | 1269 | rna-XM_030450947.1 6061990 | 20 | 145607026 | 145608294 | Calypte anna 9244 | GAT|GTAAGTCGTC...CCTCTTTTAAAA/CCTCTTTTAAAA...TCTAG|CAC | 0 | 1 | 46.109 |
31222352 | GT-AG | 0 | 0.0003052882618525 | 159 | rna-XM_030450947.1 6061990 | 21 | 145608387 | 145608545 | Calypte anna 9244 | AAG|GTATTTAAAA...ATATTTTTACCA/GATATTTTTACC...AAAAG|GAA | 2 | 1 | 47.847 |
31222353 | GT-AG | 0 | 4.058943359124681e-05 | 2256 | rna-XM_030450947.1 6061990 | 22 | 145608669 | 145610924 | Calypte anna 9244 | TGC|GTAAGTCTAT...CTTTTCTTCTTT/TGATAACTAATA...TATAG|TTC | 2 | 1 | 50.17 |
31222354 | GT-AG | 0 | 0.0004691642229986 | 383 | rna-XM_030450947.1 6061990 | 23 | 145610982 | 145611364 | Calypte anna 9244 | TTA|GTAAGCACTG...TTATCCTTATGT/TTTATCCTTATG...CACAG|TGA | 2 | 1 | 51.247 |
31222355 | GT-AG | 0 | 5.2927365425771474e-05 | 100 | rna-XM_030450947.1 6061990 | 24 | 145611486 | 145611585 | Calypte anna 9244 | CAG|GTACTGTCAC...TCTGCTTTGATT/TAAATTCTGACT...TTCAG|ATT | 0 | 1 | 53.532 |
31222356 | GT-AG | 0 | 0.0010499209612788 | 1767 | rna-XM_030450947.1 6061990 | 25 | 145611718 | 145613484 | Calypte anna 9244 | CTT|GTAAGTTTTT...TTTTCCTTCCCT/TTAAGTCTAATC...TCGAG|GTG | 0 | 1 | 56.026 |
31222357 | GT-AG | 0 | 1.000000099473604e-05 | 1219 | rna-XM_030450947.1 6061990 | 26 | 145613653 | 145614871 | Calypte anna 9244 | TTG|GTAAGTGCCA...CTTACTTTATCT/ACAAGTCTTACT...TGCAG|GTC | 0 | 1 | 59.199 |
31222358 | GT-AG | 0 | 1.000000099473604e-05 | 741 | rna-XM_030450947.1 6061990 | 27 | 145614977 | 145615717 | Calypte anna 9244 | AGG|GTAAGAACTG...CATTATTTAATC/CATTATTTAATC...TGTAG|GAA | 0 | 1 | 61.182 |
31222359 | GT-AG | 0 | 1.000000099473604e-05 | 1834 | rna-XM_030450947.1 6061990 | 28 | 145615825 | 145617658 | Calypte anna 9244 | CTG|GTGAGTTGCT...TGTTTCTTCACT/TGTTTCTTCACT...TTCAG|GGC | 2 | 1 | 63.204 |
31222360 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_030450947.1 6061990 | 29 | 145617780 | 145617865 | Calypte anna 9244 | CCG|GTAAGGAAAC...TTTCTTTTAATG/CATCTTTTTATC...TTCAG|ATC | 0 | 1 | 65.489 |
31222361 | GT-AG | 0 | 1.000000099473604e-05 | 5029 | rna-XM_030450947.1 6061990 | 30 | 145617965 | 145622993 | Calypte anna 9244 | AAG|GTAGTGAGTG...GAAGCTTTATAT/CTTTATATGACT...CTAAG|GGG | 0 | 1 | 67.359 |
31222362 | GT-AG | 0 | 0.0394001233266796 | 535 | rna-XM_030450947.1 6061990 | 31 | 145623088 | 145623622 | Calypte anna 9244 | CTG|GTATCAAAGA...TGTATTTTAACT/TGTATTTTAACT...TCCAG|ATA | 1 | 1 | 69.135 |
31222363 | GT-AG | 0 | 2.369936415888573e-05 | 394 | rna-XM_030450947.1 6061990 | 32 | 145623730 | 145624123 | Calypte anna 9244 | AAG|GTAATTGTGT...TTTTCTTTAACA/TTTTCTTTAACA...TTCAG|TGG | 0 | 1 | 71.156 |
31222364 | GT-AG | 0 | 1.000000099473604e-05 | 3243 | rna-XM_030450947.1 6061990 | 33 | 145624199 | 145627441 | Calypte anna 9244 | GAG|GTAAAGTAAT...ATTGTTTTGGTT/TAACCATTTACC...CATAG|GTC | 0 | 1 | 72.573 |
31222365 | GT-AG | 0 | 1.000000099473604e-05 | 1020 | rna-XM_030450947.1 6061990 | 34 | 145627562 | 145628581 | Calypte anna 9244 | CTG|GTAAGATTTA...AAGTACTAAGTC/AAAGTACTAAGT...TCCAG|AAT | 0 | 1 | 74.839 |
31222366 | GT-AG | 0 | 1.000000099473604e-05 | 1773 | rna-XM_030450947.1 6061990 | 35 | 145628651 | 145630423 | Calypte anna 9244 | CAC|GTAAGTATAA...ATTACTTTTGCT/TCATGGATTACT...TCCAG|GTG | 0 | 1 | 76.143 |
31222367 | GT-AG | 0 | 1.5877721915837098e-05 | 691 | rna-XM_030450947.1 6061990 | 36 | 145630573 | 145631263 | Calypte anna 9244 | CAG|GTAGAGTATG...AGTGTTTTAGTT/TCTGCACTGAGA...AACAG|ACA | 2 | 1 | 78.957 |
31222368 | GT-AG | 0 | 1.000000099473604e-05 | 785 | rna-XM_030450947.1 6061990 | 37 | 145631358 | 145632142 | Calypte anna 9244 | ATG|GTAAATATAA...TTTTCTTTCATT/TTTTCTTTCATT...TGAAG|GTT | 0 | 1 | 80.733 |
31222369 | GT-AG | 0 | 1.000000099473604e-05 | 556 | rna-XM_030450947.1 6061990 | 38 | 145632276 | 145632831 | Calypte anna 9244 | TTG|GTAAGTGAAA...CATTTTTTGGTT/CAGGCTTTCATT...TTCAG|CTA | 1 | 1 | 83.245 |
31222370 | GT-AG | 0 | 1.000000099473604e-05 | 3060 | rna-XM_030450947.1 6061990 | 39 | 145632948 | 145636007 | Calypte anna 9244 | GAG|GTAAGAAACT...AATTATTTACCA/GAATTATTTACC...CCCAG|GGA | 0 | 1 | 85.436 |
31222371 | GT-AG | 0 | 0.0001725889916918 | 1435 | rna-XM_030450947.1 6061990 | 40 | 145636166 | 145637600 | Calypte anna 9244 | CAG|GTACTCTGAA...CAACCCCAAACA/AAAAAACAAACC...TCCAG|CAT | 2 | 1 | 88.421 |
31222372 | GT-AG | 0 | 1.5623343683829347e-05 | 2104 | rna-XM_030450947.1 6061990 | 41 | 145637752 | 145639855 | Calypte anna 9244 | GCA|GTAAGTCAAG...TTTTTCTTGTCA/TTTCTTGTCACA...GATAG|AAA | 0 | 1 | 91.273 |
31222373 | GT-AG | 0 | 0.0043640068516985 | 661 | rna-XM_030450947.1 6061990 | 42 | 145640006 | 145640666 | Calypte anna 9244 | GAG|GTATGCTAAC...GGAACTGTGACT/CTGTGACTGAGA...TTCAG|ACA | 0 | 1 | 94.107 |
31222374 | GT-AG | 0 | 1.000000099473604e-05 | 2192 | rna-XM_030450947.1 6061990 | 43 | 145640785 | 145642976 | Calypte anna 9244 | CAG|GTATGTGAAA...TATATTCTAATA/TTCTGTTTCATT...CACAG|CCC | 1 | 1 | 96.335 |
31238774 | GT-AG | 0 | 1.000000099473604e-05 | 10044 | rna-XM_030450947.1 6061990 | 1 | 145411307 | 145421350 | Calypte anna 9244 | GCG|GTAGGTCGGA...CTCTTTTTGTCT/AGTGAACTAATG...CCAAG|GAG | | 0 | 1.454 |
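The column definitions can be sanity-checked against the rows. The sketch below is illustrative only: it uses three rows copied from the table, and it assumes (this is not stated in the source) that `start` and `end` are inclusive coordinates and that the text between the first and last `|` in `scored_motifs` is the intronic portion. The names `IntronRow`, `span_length`, and `terminal_dinucleotides` are hypothetical helpers, not part of the database.

```python
from dataclasses import dataclass


@dataclass
class IntronRow:
    """Subset of the introns columns needed for the checks below."""
    id: int
    dinucleotide_pair: str
    length: int
    start: int
    end: int
    scored_motifs: str


# Three rows copied from the table (ordinal_index 2, 13, and 29).
ROWS = [
    IntronRow(31222333, "GT-AG", 2252, 145421498, 145423749,
              "CCA|GTAGGTATCT...TTTTTATGAACA/CTTTTTATGAAC...TCCAG|TGG"),
    IntronRow(31222344, "GC-AG", 7187, 145534676, 145541862,
              "CGG|GCAAGTATAA...TGTACTTTGTTT/ATTGTTATTATA...TTCAG|GCT"),
    IntronRow(31222360, "GT-AG", 86, 145617780, 145617865,
              "CCG|GTAAGGAAAC...TTTCTTTTAATG/CATCTTTTTATC...TTCAG|ATC"),
]


def span_length(row: IntronRow) -> int:
    """Length implied by the coordinates, assuming both ends are inclusive."""
    return row.end - row.start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """First and last two bases of the text between the outermost '|' marks.

    Assumption: the outermost '|' characters mark the exon/intron boundaries.
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    intronic = scored_motifs[first + 1:last]
    return f"{intronic[:2]}-{intronic[-2:]}"


for row in ROWS:
    print(row.id,
          span_length(row) == row.length,
          terminal_dinucleotides(row.scored_motifs) == row.dinucleotide_pair)
```

For these three rows both checks hold, which is consistent with the two assumptions but does not prove them for the whole database.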
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
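As a rough illustration of how the "rows where transcript_id = 6061990" view could be reproduced locally, the following sketch builds a trimmed copy of the schema in an in-memory SQLite database with Python's standard `sqlite3` module. It is a minimal sketch, not the site's implementation: the foreign keys are omitted because the `transcripts` and `genomes` tables are not shown here, only one of the indexes is created, and just three rows from the table are loaded.

```python
import sqlite3

# Trimmed copy of the schema: foreign keys and most indexes are left out
# because the referenced tables are not part of this page.
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
    "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
    "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
    "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
    "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

# Three rows copied from the table, deliberately inserted out of order.
SAMPLE_ROWS = [
    (31222334, "GT-AG", 0, 1.000000099473604e-05, 16142, 6061990, 3,
     145423933, 145440074, 9244,
     "AAG|GTAAGCACTT...TTAACCTCAGCT/ATTAACCTCAGC...TGTAG|GGT", 0, 1, 6.951),
    (31222333, "GT-AG", 0, 3.711645899359577e-05, 2252, 6061990, 2,
     145421498, 145423749, 9244,
     "CCA|GTAGGTATCT...TTTTTATGAACA/CTTTTTATGAAC...TCCAG|TGG", 0, 1, 3.495),
    (31222335, "GT-AG", 0, 1.000000099473604e-05, 1549, 6061990, 4,
     145440159, 145441707, 9244,
     "CAG|GTAATTATTT...ATACACTTGATT/ATACACTTGATT...TGTAG|GTG", 0, 1, 8.538),
]

QUERY = """
SELECT id, ordinal_index, length
FROM introns
WHERE transcript_id = ?
ORDER BY ordinal_index
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    SAMPLE_ROWS,
)

# Introns of one transcript, in transcript order.
result = conn.execute(QUERY, (6061990,)).fetchall()
for row in result:
    print(row)

# Show how SQLite plans the lookup; the last field of each plan row is the
# human-readable detail, whose wording depends on the SQLite version.
for step in conn.execute("EXPLAIN QUERY PLAN " + QUERY, (6061990,)):
    print(step[-1])
```

Ordering by `ordinal_index` rather than `id` matters for this transcript, since the first intron (id 31238774) has a higher `id` than the rest and sorts last in the default view.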