introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
15 rows where transcript_id = 6061994
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31222447 | GT-AG | 0 | 1.000000099473604e-05 | 43186 | rna-XM_030452815.1 6061994 | 1 | 125008455 | 125051640 | Calypte anna 9244 | CAA|GTAAGTGGGC...CATGTTTTAATG/CATGTTTTAATG...CACAG|TCA | 2 | 1 | 0.734 |
31222448 | GT-AG | 0 | 0.022373565651992 | 1905 | rna-XM_030452815.1 6061994 | 2 | 125006389 | 125008293 | Calypte anna 9244 | CAG|GTATGTTTTC...TTTTTTTTATTT/TTTTTTTTTATT...TCCAG|GTG | 1 | 1 | 3.845 |
31222449 | GT-AG | 0 | 1.000000099473604e-05 | 35331 | rna-XM_030452815.1 6061994 | 3 | 124970955 | 125006285 | Calypte anna 9244 | CAG|GTAGGTGATG...CTTTTCTTGATT/CTTTTCTTGATT...CATAG|AAG | 2 | 1 | 5.836 |
31222450 | GT-AG | 0 | 0.0001086234202981 | 9198 | rna-XM_030452815.1 6061994 | 4 | 124961711 | 124970908 | Calypte anna 9244 | CCT|GTAAGTATTC...TTTGGCTTGAAT/CTTGTCATCATT...TCCAG|TCT | 0 | 1 | 6.725 |
31222451 | GT-AG | 0 | 1.000000099473604e-05 | 1206 | rna-XM_030452815.1 6061994 | 5 | 124960400 | 124961605 | Calypte anna 9244 | TCG|GTGAGTATTT...GTAATCTGATTC/TGTAATCTGATT...GACAG|GAA | 0 | 1 | 8.754 |
31222452 | GT-AG | 0 | 2.5366980045379155e-05 | 1875 | rna-XM_030452815.1 6061994 | 6 | 124958417 | 124960291 | Calypte anna 9244 | AAG|GTAGGTTACA...TATTTTTTAAAT/TATTTTTTAAAT...ATTAG|GAT | 0 | 1 | 10.841 |
31222453 | GT-AG | 0 | 1.000000099473604e-05 | 2110 | rna-XM_030452815.1 6061994 | 7 | 124956175 | 124958284 | Calypte anna 9244 | CAG|GTTTGTGTCA...CAAGCTCTAATA/ACATTGTTCAAC...CCTAG|GTG | 0 | 1 | 13.391 |
31222454 | GT-AG | 0 | 1.000000099473604e-05 | 763 | rna-XM_030452815.1 6061994 | 8 | 124955292 | 124956054 | Calypte anna 9244 | CAG|GTAAGTGAAA...TTATTTTTAGTC/AGATTTTTGAAC...GGCAG|AGC | 0 | 1 | 15.71 |
31222455 | GT-AG | 0 | 1.6954081470993015e-05 | 1204 | rna-XM_030452815.1 6061994 | 9 | 124953951 | 124955154 | Calypte anna 9244 | AGA|GTAAGTTGAG...TATGCCTTTTAT/TAAGTAATCATC...TTCAG|AAA | 2 | 1 | 18.357 |
31222456 | GT-AG | 0 | 1.1108661747704035e-05 | 2318 | rna-XM_030452815.1 6061994 | 10 | 124951506 | 124953823 | Calypte anna 9244 | AAG|GTACTACATT...TGCCTCTTACCC/AATGTGTTTATT...CTCAG|CTG | 0 | 1 | 20.812 |
31222457 | GT-AG | 0 | 2.800467591600585e-05 | 577 | rna-XM_030452815.1 6061994 | 11 | 124950839 | 124951415 | Calypte anna 9244 | GTG|GTAATATTTC...ACTTTTTTATTT/ATGTTTTTCACT...TAAAG|CAG | 0 | 1 | 22.551 |
31222458 | GT-AG | 0 | 0.0012000336170322 | 3422 | rna-XM_030452815.1 6061994 | 12 | 124947234 | 124950655 | Calypte anna 9244 | AAA|GTAAGTTTGA...ATAACCTTAACC/CAAGACTTAATT...CTTAG|CCT | 0 | 1 | 26.087 |
31222459 | GT-AG | 0 | 0.0722497927626651 | 591 | rna-XM_030452815.1 6061994 | 13 | 124946504 | 124947094 | Calypte anna 9244 | CAG|GTATTCATTG...TGCTCTTTGATT/TTTTATTTCACC...TATAG|GAA | 1 | 1 | 28.773 |
31222460 | GT-AG | 0 | 1.000000099473604e-05 | 332 | rna-XM_030452815.1 6061994 | 14 | 124945110 | 124945441 | Calypte anna 9244 | CAG|GTTCTTTCAC...TGACTCTTATTT/TTGACTCTTATT...TGTAG|GTG | 1 | 1 | 49.295 |
31222461 | GT-AG | 0 | 1.000000099473604e-05 | 1178 | rna-XM_030452815.1 6061994 | 15 | 124942660 | 124943837 | Calypte anna 9244 | CAG|GTACAGCAGT...TGATGCTTAATT/TGATGCTTAATT...TGTAG|CTT | 1 | 1 | 73.874 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);