introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
20 rows where transcript_id = 6061980
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31222111 | GT-AG | 0 | 1.000000099473604e-05 | 160 | rna-XM_030450569.1 6061980 | 1 | 30304395 | 30304554 | Calypte anna 9244 | AGG|GTGAGGAAAA...TTGTGTGTATCT/GCAGAGCTGACA...GGTAG|GTC | 2 | 1 | 1.673 |
31222112 | GT-AG | 0 | 1.000000099473604e-05 | 1170 | rna-XM_030450569.1 6061980 | 2 | 30304649 | 30305818 | Calypte anna 9244 | AAG|GTGAGTGGAA...TATGTGTTGTTT/GACATGTTTATG...GACAG|GTA | 0 | 1 | 3.382 |
31222113 | GT-AG | 0 | 1.000000099473604e-05 | 55127 | rna-XM_030450569.1 6061980 | 3 | 30305917 | 30361043 | Calypte anna 9244 | GCG|GTGAGTGCTA...AAAGATTTAGTT/TATATATTCAAA...TCCAG|TTA | 2 | 1 | 5.165 |
31222114 | GT-AG | 0 | 0.0003943303485233 | 843 | rna-XM_030450569.1 6061980 | 4 | 30361178 | 30362020 | Calypte anna 9244 | CAG|GTAAATTTTT...ATTATCTTAGCT/TGAATATTAATT...TTTAG|ATT | 1 | 1 | 7.601 |
31222115 | GT-AG | 0 | 5.293247571004791e-05 | 2653 | rna-XM_030450569.1 6061980 | 5 | 30362240 | 30364892 | Calypte anna 9244 | ACA|GTAAGTATAT...TTTGTTTTACTA/TTTTGTTTTACT...TAAAG|CTT | 1 | 1 | 11.584 |
31222116 | GT-AG | 0 | 1.000000099473604e-05 | 6907 | rna-XM_030450569.1 6061980 | 6 | 30364961 | 30371867 | Calypte anna 9244 | AAG|GTAATTAAAA...GTATTTTTAATT/GTATTTTTAATT...TTCAG|TTT | 0 | 1 | 12.821 |
31222117 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_030450569.1 6061980 | 7 | 30371935 | 30372018 | Calypte anna 9244 | AAG|GTGAGACATC...ATTCACTTACTT/AACTTATTCACT...TGTAG|AAA | 1 | 1 | 14.039 |
31222118 | GT-AG | 0 | 1.000000099473604e-05 | 1139 | rna-XM_030450569.1 6061980 | 8 | 30372270 | 30373408 | Calypte anna 9244 | GAG|GTAATGTGCT...TTTTTCTTTTCT/GTGTTACTCATA...ATCAG|CTT | 0 | 1 | 18.603 |
31222119 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_030450569.1 6061980 | 9 | 30373506 | 30373584 | Calypte anna 9244 | GAG|GTAAGATCTT...TATTTCTTATTT/CTATTTCTTATT...TTCAG|GCA | 1 | 1 | 20.367 |
31222120 | GT-AG | 0 | 1.000000099473604e-05 | 232 | rna-XM_030450569.1 6061980 | 10 | 30373795 | 30374026 | Calypte anna 9244 | TAG|GTAAGGGAGA...TTTGTCTTATCC/ATTTGTCTTATC...TTTAG|ATA | 1 | 1 | 24.186 |
31222121 | GT-AG | 0 | 1.000000099473604e-05 | 960 | rna-XM_030450569.1 6061980 | 11 | 30374195 | 30375154 | Calypte anna 9244 | CAG|GTTAGTAGCA...CTTTTCTTCATT/CTTTTCTTCATT...CACAG|CCT | 1 | 1 | 27.241 |
31222122 | GT-AG | 0 | 1.000000099473604e-05 | 2555 | rna-XM_030450569.1 6061980 | 12 | 30375231 | 30377785 | Calypte anna 9244 | GTG|GTAAGTGTAT...TAGGTTTTACTT/TTTTACTTCATA...CTCAG|GTT | 2 | 1 | 28.623 |
31222123 | GT-AG | 0 | 1.000000099473604e-05 | 462 | rna-XM_030450569.1 6061980 | 13 | 30377921 | 30378382 | Calypte anna 9244 | AAG|GTAAGATAAA...TGTTTATTAACC/TGTTTATTAACC...TTTAG|AAC | 2 | 1 | 31.078 |
31222124 | GT-AG | 0 | 1.7025970123804257e-05 | 372 | rna-XM_030450569.1 6061980 | 14 | 30378580 | 30378951 | Calypte anna 9244 | CAG|GTTTTTAAAT...GTGTTTTTAAAT/ACTGTGTTTACT...GACAG|GGA | 1 | 1 | 34.661 |
31222125 | GT-AG | 0 | 0.0001950631963918 | 7569 | rna-XM_030450569.1 6061980 | 15 | 30381810 | 30389378 | Calypte anna 9244 | CAG|GTATGTGGCA...TGTGTTTTAACT/TGTGTTTTAACT...TACAG|AGC | 0 | 1 | 86.634 |
31222126 | GT-AG | 0 | 1.4368562073024848e-05 | 12744 | rna-XM_030450569.1 6061980 | 16 | 30389528 | 30402271 | Calypte anna 9244 | AAA|GTAAATGGCT...CCAACCTTAATT/TTTCTCCTCATT...TTCAG|GTG | 2 | 1 | 89.344 |
31222127 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_030450569.1 6061980 | 17 | 30402411 | 30402509 | Calypte anna 9244 | CAG|GTGCAGATTT...CCTTTCTTACTG/ACCTTTCTTACT...AACAG|GAT | 0 | 1 | 91.871 |
31222128 | GT-AG | 0 | 1.000000099473604e-05 | 4181 | rna-XM_030450569.1 6061980 | 18 | 30402596 | 30406776 | Calypte anna 9244 | CAA|GTAAGAGAGC...AACTCTTTATTT/TTTGTATTAACT...CATAG|GCA | 2 | 1 | 93.435 |
31222129 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_030450569.1 6061980 | 19 | 30406901 | 30406988 | Calypte anna 9244 | ACA|GTTAGTATTG...GGGCTTTTAATT/TTTCTCTTCACT...TGCAG|GAT | 0 | 1 | 95.69 |
31222130 | GT-AG | 0 | 1.0572311444931842e-05 | 2251 | rna-XM_030450569.1 6061980 | 20 | 30407081 | 30409331 | Calypte anna 9244 | CAG|GTAAGCATGG...GCTTCTTTGGTT/TCTTTGGTTATT...TCCAG|ATT | 2 | 1 | 97.363 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);