introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
16 rows where transcript_id = 23944443
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 130192382 | GT-AG | 0 | 0.0019054017399224 | 1711 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 1 | 17042462 | 17044172 | Nothocercus julius 2585813 | TCT|GTAGGTTGAA...CAGACTTTAACT/TTTTTTCTTATG...AACAG|CCT | 0 | 1 | 38.131 |
| 130192383 | GT-AG | 0 | 1.000000099473604e-05 | 15752 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 2 | 17044291 | 17060042 | Nothocercus julius 2585813 | CAG|GTAAAACATA...CTTTTCTTACTT/ACTTTTCTTACT...TACAG|AGT | 1 | 1 | 41.384 |
| 130192384 | GT-AG | 0 | 0.6685572438484818 | 17377 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 3 | 17060105 | 17077481 | Nothocercus julius 2585813 | AAG|GTATACTTTT...GTCACTTTAGCA/AATACATTTATA...TGCAG|TGT | 0 | 1 | 43.093 |
| 130192385 | GT-AG | 0 | 1.000000099473604e-05 | 2217 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 4 | 17078276 | 17080492 | Nothocercus julius 2585813 | CAG|GTAAAATGCA...ATTTTCTTACTG/CATTTTCTTACT...TATAG|GAA | 2 | 1 | 64.985 |
| 130192386 | GT-AG | 0 | 0.0020826477726058 | 9382 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 5 | 17080644 | 17090025 | Nothocercus julius 2585813 | AAG|GTATTATTTC...TTTGCATTATCT/GGTTATTTGATC...TGTAG|ATT | 0 | 1 | 69.148 |
| 130192387 | GT-AG | 0 | 1.000000099473604e-05 | 8883 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 6 | 17090114 | 17098996 | Nothocercus julius 2585813 | AAG|GTAGGTCTTT...TATTTTCTAGCG/TAGCGTCTAATT...TACAG|CCA | 1 | 1 | 71.574 |
| 130192388 | GT-AG | 0 | 8.679385994630699e-05 | 805 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 7 | 17099059 | 17099863 | Nothocercus julius 2585813 | AAT|GTAAGTATGT...CATTTTTTACCT/TTGTTATTAATT...CTCAG|AGA | 0 | 1 | 73.284 |
| 130192389 | GT-AG | 0 | 9.248724818908126e-05 | 4185 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 8 | 17099953 | 17104137 | Nothocercus julius 2585813 | ACA|GTAAGTTAAT...TTTTTCTTTTCC/ATCTTGTTTATA...CACAG|ATT | 2 | 1 | 75.738 |
| 130192390 | GT-AG | 0 | 7.319432621671002e-05 | 1481 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 9 | 17104268 | 17105748 | Nothocercus julius 2585813 | AAG|GTATGTGCTT...ATATTTTTATAA/CTCATATTCATT...TCCAG|GCT | 0 | 1 | 79.322 |
| 130192391 | GT-AG | 0 | 1.2480931047646095e-05 | 2816 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 10 | 17105896 | 17108711 | Nothocercus julius 2585813 | AGA|GTAAGTACTC...TAACTTTTACCC/TACTTTTTCAGT...CCCAG|GAT | 0 | 1 | 83.375 |
| 130192392 | GT-AG | 0 | 1.000000099473604e-05 | 2271 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 11 | 17108772 | 17111042 | Nothocercus julius 2585813 | AAG|GTAAGAGATT...ATTTTCATAGCG/TTAATTTTCATA...TTTAG|GAA | 0 | 1 | 85.029 |
| 130192393 | GT-AG | 0 | 3.227656822199338e-05 | 13164 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 12 | 17111164 | 17124327 | Nothocercus julius 2585813 | AAG|GTAACTGGAA...TGTGTCTCAACT/CTATTTCTGATA...TGTAG|TCT | 1 | 1 | 88.365 |
| 130192394 | GT-AG | 0 | 0.002873223627488 | 20705 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 13 | 17124421 | 17145125 | Nothocercus julius 2585813 | AAG|GTAACCATAT...ATGTTTTTCATT/ATGTTTTTCATT...GGCAG|CTA | 1 | 1 | 90.929 |
| 130192395 | GT-AG | 0 | 5.309298161115263e-05 | 2611 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 14 | 17145178 | 17147788 | Nothocercus julius 2585813 | ATA|GTAAGTATTT...GTGGCCTTATGC/TATGCGCTGACA...TGTAG|CTT | 2 | 1 | 92.363 |
| 130192396 | GT-AG | 0 | 1.000000099473604e-05 | 11115 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 15 | 17147939 | 17159053 | Nothocercus julius 2585813 | AAG|GTAAGGTTCT...CTTTCTTTGAAT/TTGAATCTAATA...TTCAG|AAT | 2 | 1 | 96.498 |
| 130192397 | GT-AG | 0 | 0.0033218832219054 | 861 | rna-gnl|WGS:VZSV|NOTJUL_R09903_mrna 23944443 | 16 | 17159139 | 17159999 | Nothocercus julius 2585813 | CTG|GTATGATGTA...TTTTTCTTAAAC/CTTTTTCTTAAA...AACAG|AAA | 0 | 1 | 98.842 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);