introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
16 rows where transcript_id = 9114829
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389132 | GT-AG | 0 | 1.000000099473604e-05 | 1859 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 1 | 1297465 | 1299323 | Columbina picui 115618 | AAG|GTGAGTCCGC...GAGACTGTAGCC/CCATGGTTTATG...TTCAG|GAA | 0 | 1 | 6.97 |
49389133 | GT-AG | 0 | 1.000000099473604e-05 | 1582 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 2 | 1295771 | 1297352 | Columbina picui 115618 | CAG|GTAAGTATTC...AACTTTTTTTTT/AATGTACTAAAC...CTTAG|GAA | 1 | 1 | 12.626 |
49389134 | GT-AG | 0 | 1.000000099473604e-05 | 200 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 3 | 1295422 | 1295621 | Columbina picui 115618 | CCA|GTAAGTTAAG...AAACATTTATTT/TAAACATTTATT...CACAG|GGC | 0 | 1 | 20.152 |
49389135 | GT-AG | 0 | 1.000000099473604e-05 | 1216 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 4 | 1294125 | 1295340 | Columbina picui 115618 | GAG|GTGGAGTTCT...AATTTTTTAATG/AATTTTTTAATG...TCCAG|GGT | 0 | 1 | 24.242 |
49389136 | GT-AG | 0 | 1.000000099473604e-05 | 241 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 5 | 1293806 | 1294046 | Columbina picui 115618 | AAG|GTAAATAAAG...TTGCCTGTATAT/TGTATATTTATG...TTCAG|ATT | 0 | 1 | 28.182 |
49389137 | GT-AG | 0 | 1.000000099473604e-05 | 1261 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 6 | 1292461 | 1293721 | Columbina picui 115618 | AGG|GTAAGATCAC...CACATCTTATGA/GAAATTGTCATT...TTTAG|ATT | 0 | 1 | 32.424 |
49389138 | GT-AG | 0 | 0.0029234685097757 | 1006 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 7 | 1291390 | 1292395 | Columbina picui 115618 | GCC|GTAAGTTTGT...ATTGTTTTAAAT/ATTGTTTTAAAT...GACAG|ACC | 2 | 1 | 35.707 |
49389139 | GT-AG | 0 | 1.000000099473604e-05 | 1549 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 8 | 1289637 | 1291185 | Columbina picui 115618 | GCT|GTAAGTAAAA...GAAATATTGATA/GAAATATTGATA...TTTAG|ATA | 2 | 1 | 46.01 |
49389140 | GT-AG | 0 | 1.000000099473604e-05 | 1270 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 9 | 1288227 | 1289496 | Columbina picui 115618 | CAG|GTGAGGAGTA...TCATTTTTATCT/TTTTATCTAATA...GACAG|GAA | 1 | 1 | 53.081 |
49389141 | GT-AG | 0 | 1.000000099473604e-05 | 359 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 10 | 1287744 | 1288102 | Columbina picui 115618 | GAG|GTGAGACTTT...ATACTTTTACCA/AATACTTTTACC...CCCAG|GTT | 2 | 1 | 59.343 |
49389142 | GT-AG | 0 | 1.0847121511877478e-05 | 1504 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 11 | 1286107 | 1287610 | Columbina picui 115618 | AAG|GTACATAAAA...TCTGTTTTATTT/TATTTGTTTATT...TTCAG|GAG | 0 | 1 | 66.061 |
49389143 | GT-AG | 0 | 1.000000099473604e-05 | 1052 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 12 | 1284911 | 1285962 | Columbina picui 115618 | CAG|GTTGGTATAT...CAATTTTTATTT/ACAATTTTTATT...TCTAG|GAT | 0 | 1 | 73.333 |
49389144 | GT-AG | 0 | 1.000000099473604e-05 | 2965 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 13 | 1281812 | 1284776 | Columbina picui 115618 | AAG|GTAAGTACCT...TAACTTTTAAAC/ATAATTTTTACA...TTTAG|TTT | 2 | 1 | 80.101 |
49389145 | GT-AG | 0 | 1.957113189289962e-05 | 1954 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 14 | 1279756 | 1281709 | Columbina picui 115618 | CAA|GTAAGTTGAC...TCTTCCTGAAAA/GTCTTCCTGAAA...TTCAG|CGA | 2 | 1 | 85.253 |
49389146 | GT-AG | 0 | 1.000000099473604e-05 | 911 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 15 | 1278725 | 1279635 | Columbina picui 115618 | AAC|GTAAGTGAAG...GTTATTTTATTT/TGTTATTTTATT...TACAG|GCT | 2 | 1 | 91.313 |
49389147 | GT-AG | 0 | 4.005608673854596e-05 | 1010 | rna-gnl|WGS:VYZG|COLPIC_R05511_mrna 9114829 | 16 | 1277678 | 1278687 | Columbina picui 115618 | TGT|GTAAGTATTT...TTTACTGTAGTT/CCCTTTCTCATT...CATAG|TAT | 0 | 1 | 93.182 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);