introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
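The column descriptions above map naturally onto a typed record. The following is a minimal sketch in Python, assuming each row arrives as a plain 14-value tuple in the column order listed. The `Intron` class and its `from_row` helper are hypothetical names introduced for illustration; they are not part of the database or its tooling.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Intron:
    """Hypothetical typed view of one `introns` row (documented column order)."""
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 1/0
    score: float              # percent, 0-100
    length: int               # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # 0, 1, 2, or None outside the coding sequence
    in_cds: bool              # stored as INTEGER 1/0
    relative_position: float  # percent of coding length

    @classmethod
    def from_row(cls, row: Sequence) -> "Intron":
        """Build an Intron from a 14-column tuple, converting 1/0 flags to bool."""
        if len(row) != 14:
            raise ValueError(f"expected 14 columns, got {len(row)}")
        values = list(row)
        values[2] = bool(values[2])    # is_minor
        values[12] = bool(values[12])  # in_cds
        return cls(*values)


# Values copied from the first row listed for transcript 6061986.
example = Intron.from_row((
    31222258, "GT-AG", 0, 1.000000099473604e-05, 1687, 6061986, 1,
    104911171, 104912857, 9244,
    "CGG|GTGAGCGGGC...CTGTCCCTGTTT/AATGAAATGACT...GCCAG|CCT", 0, 1, 0.791,
))
```

Converting the two flag columns to `bool` is a design choice of this sketch; keeping them as integers would mirror the stored schema more literally.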
29 rows where transcript_id = 6061986
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31222258 | GT-AG | 0 | 1.000000099473604e-05 | 1687 | rna-XM_030455510.1 6061986 | 1 | 104911171 | 104912857 | Calypte anna 9244 | CGG|GTGAGCGGGC...CTGTCCCTGTTT/AATGAAATGACT...GCCAG|CCT | 0 | 1 | 0.791 |
31222259 | GT-AG | 0 | 1.000000099473604e-05 | 346 | rna-XM_030455510.1 6061986 | 2 | 104910621 | 104910966 | Calypte anna 9244 | AAA|GTAAGTGTAT...ATGCTCTCAATA/CATGCTCTCAAT...TTTAG|GCC | 0 | 1 | 4.635 |
31222260 | GT-AG | 0 | 1.4311513046702656e-05 | 1134 | rna-XM_030455510.1 6061986 | 3 | 104909388 | 104910521 | Calypte anna 9244 | CTA|GTAAGTATTG...CAATTCTCAAAC/GCAATTCTCAAA...ATCAG|GAT | 0 | 1 | 6.501 |
31222261 | GT-AG | 0 | 1.000000099473604e-05 | 472 | rna-XM_030455510.1 6061986 | 4 | 104908685 | 104909156 | Calypte anna 9244 | AAT|GTGAGTTTAC...TTCTTTTTGCCC/TAAATTGTCATG...TCTAG|GTT | 0 | 1 | 10.854 |
31222262 | GT-AG | 0 | 0.0617129309861831 | 481 | rna-XM_030455510.1 6061986 | 5 | 104908151 | 104908631 | Calypte anna 9244 | CCA|GTATGTTAGA...TTCACTTTAAAT/ATTTTCTTCACT...TTGAG|AAC | 2 | 1 | 11.852 |
31222263 | GT-AG | 0 | 0.0052031792658471 | 648 | rna-XM_030455510.1 6061986 | 6 | 104907322 | 104907969 | Calypte anna 9244 | CAG|GTAACTTTTG...TTTGTTTTGCTG/AGTGTTGTCAAA...GTCAG|GTT | 0 | 1 | 15.263 |
31222264 | GT-AG | 0 | 1.000000099473604e-05 | 982 | rna-XM_030455510.1 6061986 | 7 | 104906166 | 104907147 | Calypte anna 9244 | GAG|GTAAGGTGGG...TATCCTTTAATT/TTTCTTCTTATC...CGTAG|GAC | 0 | 1 | 18.542 |
31222265 | GT-AG | 0 | 1.753677474353993e-05 | 1222 | rna-XM_030455510.1 6061986 | 8 | 104904753 | 104905974 | Calypte anna 9244 | AGG|GTATGAACCA...TGTGCCTTTCCT/TTTCCTCTGATG...AACAG|GTT | 2 | 1 | 22.141 |
31222266 | GT-AG | 0 | 0.066416964611876 | 388 | rna-XM_030455510.1 6061986 | 9 | 104904220 | 104904607 | Calypte anna 9244 | CAG|GTATTCACTA...TGTTCTTTAACA/TGTTCTTTAACA...TATAG|CTG | 0 | 1 | 24.873 |
31222267 | GT-AG | 0 | 3.871590570475808e-05 | 488 | rna-XM_030455510.1 6061986 | 10 | 104902946 | 104903433 | Calypte anna 9244 | AAG|GTATGACTGG...AATTCATTAAAT/AAAGAATTCATT...TTCAG|ATG | 0 | 1 | 39.683 |
31222268 | GT-AG | 0 | 1.000000099473604e-05 | 967 | rna-XM_030455510.1 6061986 | 11 | 104901937 | 104902903 | Calypte anna 9244 | AAG|GTACTTAAGA...AGCTTATTAAAA/CAGAAGCTTATT...CTCAG|GCA | 0 | 1 | 40.475 |
31222269 | GT-AG | 0 | 2.4942309932485068e-05 | 1245 | rna-XM_030455510.1 6061986 | 12 | 104900499 | 104901743 | Calypte anna 9244 | ATG|GTAACTGCAT...ATGTTTATGAAG/ATGATGTTTATG...AATAG|AAT | 1 | 1 | 44.112 |
31222270 | GT-AG | 0 | 1.1390360941772187e-05 | 902 | rna-XM_030455510.1 6061986 | 13 | 104899357 | 104900258 | Calypte anna 9244 | CAG|GTAATTTGGA...TATTCTTTTTTT/TGTTTGTCTATT...AATAG|ATT | 1 | 1 | 48.634 |
31222271 | GT-AG | 0 | 1.000000099473604e-05 | 270 | rna-XM_030455510.1 6061986 | 14 | 104898933 | 104899202 | Calypte anna 9244 | AAG|GTAATTGTGA...AGCACCTTGATT/CTTGATTTTACT...CTTAG|CCT | 2 | 1 | 51.536 |
31222272 | GT-AG | 0 | 1.000000099473604e-05 | 707 | rna-XM_030455510.1 6061986 | 15 | 104898075 | 104898781 | Calypte anna 9244 | GAG|GTATGGAAAT...ACTCTTTTTATG/TAATTGTTCACA...TACAG|TGG | 0 | 1 | 54.381 |
31222273 | GT-AG | 0 | 2.915590362107744e-05 | 892 | rna-XM_030455510.1 6061986 | 16 | 104896984 | 104897875 | Calypte anna 9244 | TAA|GTAAGTTACA...CTGTATTTAGCA/ACTGTATTTAGC...TTTAG|TTG | 1 | 1 | 58.131 |
31222274 | GT-AG | 0 | 0.0007544351703885 | 160 | rna-XM_030455510.1 6061986 | 17 | 104896664 | 104896823 | Calypte anna 9244 | TAG|GTATGCAAGT...TGTATGTTAATC/TGTATGTTAATC...TGAAG|ATC | 2 | 1 | 61.146 |
31222275 | GT-AG | 0 | 0.027613454593424 | 256 | rna-XM_030455510.1 6061986 | 18 | 104896303 | 104896558 | Calypte anna 9244 | AGG|GTACCTTAAT...TCATTCTTCATG/GTTTCTCTCATT...AATAG|GTT | 2 | 1 | 63.124 |
31222276 | GT-AG | 0 | 1.000000099473604e-05 | 814 | rna-XM_030455510.1 6061986 | 19 | 104895346 | 104896159 | Calypte anna 9244 | ATG|GTGAGTATGG...TTTATCTTGATG/TTTTTTCTGATG...TACAG|GAG | 1 | 1 | 65.819 |
31222277 | GT-AG | 0 | 1.000000099473604e-05 | 891 | rna-XM_030455510.1 6061986 | 20 | 104894319 | 104895209 | Calypte anna 9244 | CTG|GTGGGTGCAA...TAAGCTGTAGTT/TGTAGTTGTATT...TTCAG|CAA | 2 | 1 | 68.381 |
31222278 | GT-AG | 0 | 1.000000099473604e-05 | 1788 | rna-XM_030455510.1 6061986 | 21 | 104892386 | 104894173 | Calypte anna 9244 | GAG|GTAATATGAA...ATTTCTTTTGCT/AACAAATTTATA...TCTAG|ACA | 0 | 1 | 71.114 |
31222279 | GC-AG | 0 | 1.000000099473604e-05 | 276 | rna-XM_030455510.1 6061986 | 22 | 104891896 | 104892171 | Calypte anna 9244 | CAG|GCAAGTTTAT...CATTCCTTATCA/ACATTCCTTATC...AATAG|GAG | 1 | 1 | 75.146 |
31222280 | GT-AG | 0 | 1.000000099473604e-05 | 826 | rna-XM_030455510.1 6061986 | 23 | 104890832 | 104891657 | Calypte anna 9244 | TAA|GTAAGAATCA...CATATCTTGATA/TTAAATCTGATT...TATAG|ATT | 2 | 1 | 79.631 |
31222281 | GT-AG | 0 | 0.0001954899808454 | 683 | rna-XM_030455510.1 6061986 | 24 | 104890074 | 104890756 | Calypte anna 9244 | ATT|GTGTGTATTT...GTTACTATGACA/ATTTGAGTTACT...TATAG|ATC | 2 | 1 | 81.044 |
31222282 | GT-AG | 0 | 0.0182790488159103 | 430 | rna-XM_030455510.1 6061986 | 25 | 104889457 | 104889886 | Calypte anna 9244 | CAG|GTATATATTC...TGTTTTTTAACA/TGTTTTTTAACA...TTCAG|CTG | 0 | 1 | 84.568 |
31222283 | GT-AG | 0 | 1.000000099473604e-05 | 696 | rna-XM_030455510.1 6061986 | 26 | 104888589 | 104889284 | Calypte anna 9244 | AAG|GTTAGTAAAA...TATCCTTTATTT/TTGTGTCTAATG...CACAG|GGA | 1 | 1 | 87.809 |
31222284 | GT-AG | 0 | 1.000000099473604e-05 | 470 | rna-XM_030455510.1 6061986 | 27 | 104887898 | 104888367 | Calypte anna 9244 | ACG|GTGAGTAAGT...TGTTTTTTAATA/TGTTTTTTAATA...CATAG|GTA | 0 | 1 | 91.973 |
31222285 | GT-AG | 0 | 1.000000099473604e-05 | 625 | rna-XM_030455510.1 6061986 | 28 | 104887078 | 104887702 | Calypte anna 9244 | CAG|GTGGGTTTAG...CATTCTATGATA/ATGATATTAAAA...GTTAG|AAT | 0 | 1 | 95.647 |
31222286 | GT-AG | 0 | 1.000000099473604e-05 | 778 | rna-XM_030455510.1 6061986 | 29 | 104886132 | 104886909 | Calypte anna 9244 | TTG|GTAAGTACCT...TGATATTTAGCT/CTGATATTTAGC...TTCAG|TAC | 0 | 1 | 98.813 |
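The listing above is the result of filtering `introns` on `transcript_id = 6061986`. As a rough sketch of the same filter, the Python snippet below loads three rows copied from the listing into an in-memory SQLite table and runs the equivalent query. The reduced column set, the in-memory connection, and the extra row for a different transcript are assumptions made for illustration. The snippet also checks that `length` equals `end - start + 1` for the selected rows, which holds for the values shown here but is an observation, not a documented guarantee.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced, illustrative version of the table; "start" and "end" are quoted
# because END is an SQL keyword.
conn.execute(
    'CREATE TABLE introns ('
    ' id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, length INTEGER,'
    ' transcript_id INTEGER, ordinal_index INTEGER,'
    ' "start" INTEGER, "end" INTEGER)'
)

rows = [
    # Copied from the listing (ordinal_index 1, 2 and 22).
    (31222258, "GT-AG", 1687, 6061986, 1, 104911171, 104912857),
    (31222259, "GT-AG", 346, 6061986, 2, 104910621, 104910966),
    (31222279, "GC-AG", 276, 6061986, 22, 104891896, 104892171),
    # Made-up row for a different transcript, only to show the filter working.
    (1, "GT-AG", 100, 1, 1, 1000, 1099),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", rows)

# Same filter as the page: all introns of one transcript, ordered by id.
selected = conn.execute(
    'SELECT id, length, "start", "end" FROM introns'
    ' WHERE transcript_id = ? ORDER BY id',
    (6061986,),
).fetchall()

# Check whether the coordinates are inclusive on both ends for these rows.
spans_match = all(
    length == end - start + 1 for _, length, start, end in selected
)
```

Passing the transcript id as a bound parameter instead of formatting it into the SQL string keeps the query safe to reuse with other ids.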
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
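The schema above indexes `transcript_id`, so a per-transcript lookup should not need a full table scan. The sketch below, which assumes an empty in-memory SQLite database and recreates only the table plus two of the listed indexes, asks SQLite for its query plan and checks whether `idx_introns_transcript_id` appears in it. The exact wording of the plan text varies between SQLite versions, so the code looks only for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Table definition as in the schema; the remaining indexes are omitted
# for brevity. Foreign keys are declared but not enforced by default.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "dinucleotide_pair" TEXT,
      "is_minor" INTEGER,
      "score" REAL,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      "start" INTEGER,
      "end" INTEGER,
      "taxonomy_id" INTEGER,
      "scored_motifs" TEXT,
      "phase" INTEGER,
      "in_cds" INTEGER,
      "relative_position" REAL,
      PRIMARY KEY ("id"),
      FOREIGN KEY ("transcript_id") REFERENCES "transcripts" ("id"),
      FOREIGN KEY ("taxonomy_id") REFERENCES "genomes" ("taxonomy_id")
    );
    CREATE INDEX "idx_introns_transcript_id" ON "introns" ("transcript_id");
    CREATE INDEX "idx_introns_phase" ON "introns" ("phase");
    """
)

# Ask the planner how it would run the per-transcript lookup.
plan = conn.execute(
    'EXPLAIN QUERY PLAN SELECT * FROM "introns" WHERE "transcript_id" = ?',
    (6061986,),
).fetchall()

# The last column of each plan row is the human-readable detail string.
uses_transcript_index = any(
    "idx_introns_transcript_id" in row[-1] for row in plan
)
```

On a populated production database the planner may weigh statistics differently, so this is a quick sanity check and not a performance measurement.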