introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
33 rows where transcript_id = 6061981
This data as json, CSV (advanced)
Suggested facets: is_minor, score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31222131 | GT-AG | 0 | 1.000000099473604e-05 | 6975 | rna-XM_030450642.1 6061981 | 1 | 82064664 | 82071638 | Calypte anna 9244 | CAG|GTAGGGGTTG...AGTATTTTGAAT/AGTATTTTGAAT...TGCAG|AAG | 1 | 1 | 0.293 |
31222132 | GT-AG | 0 | 1.000000099473604e-05 | 812 | rna-XM_030450642.1 6061981 | 2 | 82071732 | 82072543 | Calypte anna 9244 | CTG|GTGAGTGCTG...TTTACTGTGACT/CTAAAATTTACT...CTGAG|ATG | 1 | 1 | 1.995 |
31222133 | GT-AG | 1 | 99.99847856112748 | 1018 | rna-XM_030450642.1 6061981 | 3 | 82072722 | 82073739 | Calypte anna 9244 | TCA|GTATCCTTTC...TTTTCCTTACTG/CTTTTCCTTACT...CTCAG|ATA | 2 | 1 | 5.254 |
31222134 | GT-AG | 0 | 0.0005915253611904 | 1025 | rna-XM_030450642.1 6061981 | 4 | 82073903 | 82074927 | Calypte anna 9244 | ACA|GTATGGCTTT...GTGTCATTACAG/AAAGGTGTCATT...TCAAG|GCA | 0 | 1 | 8.237 |
31222135 | GT-AG | 0 | 1.000000099473604e-05 | 320 | rna-XM_030450642.1 6061981 | 5 | 82075031 | 82075350 | Calypte anna 9244 | GAG|GTAATGTAAA...TGTTTCTTCCTC/GGTGTGTGTACT...TTCAG|GTG | 1 | 1 | 10.123 |
31222136 | GT-AG | 0 | 1.000000099473604e-05 | 1359 | rna-XM_030450642.1 6061981 | 6 | 82075568 | 82076926 | Calypte anna 9244 | CAG|GTAAGTTACC...AATTTTGTAACG/AATTTTGTAACG...TTCAG|GTT | 2 | 1 | 14.095 |
31222137 | GT-AG | 0 | 1.000000099473604e-05 | 507 | rna-XM_030450642.1 6061981 | 7 | 82077042 | 82077548 | Calypte anna 9244 | AAG|GTAATTAAAA...AAGTCCTTATGT/CTACTCTTCATT...TGAAG|GTT | 0 | 1 | 16.2 |
31222138 | GT-AG | 0 | 1.000000099473604e-05 | 980 | rna-XM_030450642.1 6061981 | 8 | 82077714 | 82078693 | Calypte anna 9244 | CGG|GTGAGTTTTA...AGTATTTTATTT/CAGTATTTTATT...TGAAG|GTG | 0 | 1 | 19.22 |
31222139 | GT-AG | 0 | 1.000000099473604e-05 | 643 | rna-XM_030450642.1 6061981 | 9 | 82078850 | 82079492 | Calypte anna 9244 | CAG|GTAAGAGACA...TGTTTTTTACTT/TTGTTTTTTACT...TGCAG|GAT | 0 | 1 | 22.076 |
31222140 | GT-AG | 0 | 0.0002160800593076 | 484 | rna-XM_030450642.1 6061981 | 10 | 82079541 | 82080024 | Calypte anna 9244 | ATG|GTTTGTTTTC...TCTGTTTTGCAC/CAGCTCCACAAT...TGCAG|ACT | 0 | 1 | 22.954 |
31222141 | GT-AG | 0 | 1.702126366475585e-05 | 1162 | rna-XM_030450642.1 6061981 | 11 | 82080125 | 82081286 | Calypte anna 9244 | ACC|GTAAGTGTAT...AAATTCTTGTTT/CAAGAAATAAAT...TACAG|GGT | 1 | 1 | 24.785 |
31222142 | GT-AG | 0 | 8.679724532194592e-05 | 1324 | rna-XM_030450642.1 6061981 | 12 | 82081382 | 82082705 | Calypte anna 9244 | GTT|GTGAGTTTTT...TTTTTTTTAATT/TTTTTTTTAATT...CAAAG|GTA | 0 | 1 | 26.524 |
31222143 | GT-AG | 0 | 0.001465298347037 | 919 | rna-XM_030450642.1 6061981 | 13 | 82082916 | 82083834 | Calypte anna 9244 | GGG|GTATGTATCG...TTTTTTTTTTTT/GTCATAATCATC...CTCAG|AGT | 0 | 1 | 30.368 |
31222144 | GT-AG | 0 | 1.000000099473604e-05 | 705 | rna-XM_030450642.1 6061981 | 14 | 82083946 | 82084650 | Calypte anna 9244 | CAT|GTGAGTAATA...TCAGTTTTGGCT/TGGCTATACATC...TACAG|GTG | 0 | 1 | 32.4 |
31222145 | GT-AG | 0 | 1.000000099473604e-05 | 1513 | rna-XM_030450642.1 6061981 | 15 | 82084804 | 82086316 | Calypte anna 9244 | AAG|GTAAAGAACT...CATGCCTTGGTT/CATAGTTTCATG...AACAG|CTG | 0 | 1 | 35.2 |
31222146 | GT-AG | 0 | 1.000000099473604e-05 | 488 | rna-XM_030450642.1 6061981 | 16 | 82086536 | 82087023 | Calypte anna 9244 | TTG|GTAAATCAAT...CATTTCTTTATG/CATTTCTTTATG...TTCAG|GGT | 0 | 1 | 39.209 |
31222147 | GT-AG | 0 | 1.000000099473604e-05 | 744 | rna-XM_030450642.1 6061981 | 17 | 82087170 | 82087913 | Calypte anna 9244 | CTG|GTAAGACATT...TATTTCTTATTG/CTATTTCTTATT...GGCAG|TAC | 2 | 1 | 41.882 |
31222148 | GT-AG | 0 | 1.000000099473604e-05 | 1924 | rna-XM_030450642.1 6061981 | 18 | 82088185 | 82090108 | Calypte anna 9244 | AGG|GTAAGTTGCT...GCTTGTTTAAAG/TGCTTGTTTAAA...TCCAG|AGT | 0 | 1 | 46.842 |
31222149 | GT-AG | 0 | 1.000000099473604e-05 | 380 | rna-XM_030450642.1 6061981 | 19 | 82090165 | 82090544 | Calypte anna 9244 | CAA|GTAAGAGATG...GATACATTAACT/GATACATTAACT...TTCAG|CAT | 2 | 1 | 47.867 |
31222150 | GT-AG | 0 | 0.0001026437358185 | 337 | rna-XM_030450642.1 6061981 | 20 | 82090708 | 82091044 | Calypte anna 9244 | ACG|GTACATATGG...GGCTTTCTATCT/CAAGATGTCATG...TGCAG|GAA | 0 | 1 | 50.851 |
31222151 | GT-AG | 0 | 1.000000099473604e-05 | 956 | rna-XM_030450642.1 6061981 | 21 | 82091176 | 82092131 | Calypte anna 9244 | TCA|GTAAGTGCCA...TGCCTATTGACA/TTTGGTCTAATT...TTCAG|GTT | 2 | 1 | 53.249 |
31222152 | GT-AG | 0 | 1.000000099473604e-05 | 410 | rna-XM_030450642.1 6061981 | 22 | 82092323 | 82092732 | Calypte anna 9244 | CAA|GTTATGACTG...ATGGTATTGACA/ATGGTATTGACA...TCCAG|ATG | 1 | 1 | 56.745 |
31222153 | GT-AG | 0 | 1.2022075847953163e-05 | 678 | rna-XM_030450642.1 6061981 | 23 | 82092899 | 82093576 | Calypte anna 9244 | ACT|GTGAGTTTTG...TCAGCTTTGTTT/TTGTTTTGTACC...TCCAG|ATA | 2 | 1 | 59.784 |
31222154 | GT-AG | 0 | 5.221573488911438e-05 | 3263 | rna-XM_030450642.1 6061981 | 24 | 82093755 | 82097017 | Calypte anna 9244 | CTG|GTATGTAAAT...TTGTTTTTCCTT/TACATTCACATA...TGCAG|ATC | 0 | 1 | 63.042 |
31222155 | GT-AG | 0 | 1.000000099473604e-05 | 1418 | rna-XM_030450642.1 6061981 | 25 | 82097507 | 82098924 | Calypte anna 9244 | AAG|GTAGGAAATC...GTCTCCCTGATC/CTGATCCTCATT...TGCAG|ATC | 0 | 1 | 71.993 |
31222156 | GT-AG | 0 | 8.13307805986796e-05 | 405 | rna-XM_030450642.1 6061981 | 26 | 82099133 | 82099537 | Calypte anna 9244 | TAA|GTCGGCCATG...TATCTTTTAACC/TATCTTTTAACC...GTCAG|AGT | 1 | 1 | 75.801 |
31222157 | GT-AG | 0 | 4.233713253838177e-05 | 963 | rna-XM_030450642.1 6061981 | 27 | 82099730 | 82100692 | Calypte anna 9244 | CAG|GTTCTTTTTG...CTATCTTTTTCT/GGTGCTCTGAAA...TCCAG|ATG | 1 | 1 | 79.315 |
31222158 | GT-AG | 0 | 0.0375184868073164 | 1489 | rna-XM_030450642.1 6061981 | 28 | 82100804 | 82102292 | Calypte anna 9244 | AGA|GTATGTAACC...TATTCTTTGACT/TATTCTTTGACT...TTCAG|ACT | 1 | 1 | 81.347 |
31222159 | GT-AG | 0 | 5.6303482324732e-05 | 1115 | rna-XM_030450642.1 6061981 | 29 | 82102424 | 82103538 | Calypte anna 9244 | AAG|GTATTGTAGT...ATGGTTTTAAGT/ATGGTTTTAAGT...TACAG|GCA | 0 | 1 | 83.745 |
31222160 | GT-AG | 0 | 1.000000099473604e-05 | 903 | rna-XM_030450642.1 6061981 | 30 | 82103656 | 82104558 | Calypte anna 9244 | CAG|GTAATTTAAG...TCAGCCTTGCTT/GCCTTGCTTACG...CGCAG|GTG | 0 | 1 | 85.887 |
31222161 | GT-AG | 0 | 1.000000099473604e-05 | 636 | rna-XM_030450642.1 6061981 | 31 | 82104761 | 82105396 | Calypte anna 9244 | GAC|GTGAGTAACA...ATATTCTGATTA/GATATTCTGATT...GACAG|TGC | 1 | 1 | 89.584 |
31222162 | GT-AG | 0 | 1.000000099473604e-05 | 1133 | rna-XM_030450642.1 6061981 | 32 | 82105591 | 82106723 | Calypte anna 9244 | AAG|GTAAGGTGCT...CCTTTTTTATCT/TTTGTCCTCATC...TCTAG|GGA | 0 | 1 | 93.136 |
31222163 | GT-AG | 0 | 1.4821544146096708e-05 | 545 | rna-XM_030450642.1 6061981 | 33 | 82106859 | 82107403 | Calypte anna 9244 | CTG|GTAAGCCTTT...TTATTTTTTCCT/CCACCATTCACT...TACAG|GGA | 0 | 1 | 95.607 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);