introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
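As a rough illustration of how these columns combine in a query, the sketch below lists the coding-sequence introns of one transcript in order. It assumes a cut-down in-memory SQLite table rather than the real database file, and the helper name `coding_introns` is hypothetical. The three sample rows are copied from the rows shown on this page.

```python
import sqlite3

# In-memory stand-in for the real database (assumption): only a subset
# of the documented columns is created here.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# Sample rows copied from this page (first three introns of the transcript).
rows = [
    (182504337, "GT-AG", 0, 1.000000099473604e-05, 100367, 32672022, 1, 1, 1),
    (182504338, "GT-AG", 0, 1.000000099473604e-05, 33789, 32672022, 2, 1, 1),
    (182504339, "GT-AG", 0, 0.0004726368166521, 11550, 32672022, 3, 0, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", rows)


def coding_introns(conn, transcript_id):
    """Return (ordinal_index, phase, score) for in-CDS introns of one transcript."""
    return conn.execute(
        "SELECT ordinal_index, phase, score FROM introns "
        "WHERE transcript_id = ? AND in_cds = 1 "
        "ORDER BY ordinal_index",
        (transcript_id,),
    ).fetchall()


result = coding_introns(conn, 32672022)
for ordinal_index, phase, score in result:
    print(ordinal_index, phase, score)
```

Filtering on `in_cds = 1` keeps only introns for which `phase` is expected to be non-null, per the column descriptions above.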
34 rows where transcript_id = 32672022
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182504337 | GT-AG | 0 | 1.000000099473604e-05 | 100367 | rna-XM_018908954.2 32672022 | 1 | 101852392 | 101952758 | Serinus canaria 9135 | CAG\|GTAAGGAGTC...CCTGCCTTAATT/TTAATTTTTACC...AACAG\|GGT | 1 | 1 | 1.623 |
| 182504338 | GT-AG | 0 | 1.000000099473604e-05 | 33789 | rna-XM_018908954.2 32672022 | 2 | 101818480 | 101852268 | Serinus canaria 9135 | CGG\|GTTCGTCTTT...GTTACTTTGTTT/AAGAAATTGAAT...TGCAG\|GTT | 1 | 1 | 4.358 |
| 182504339 | GT-AG | 0 | 0.0004726368166521 | 11550 | rna-XM_018908954.2 32672022 | 3 | 101806664 | 101818213 | Serinus canaria 9135 | CAG\|GTATGTCTTA...GTTTTCTTTTTG/GAGATGCTAACT...CTCAG\|GTT | 0 | 1 | 10.274 |
| 182504340 | GT-AG | 0 | 0.0001111465621763 | 3018 | rna-XM_018908954.2 32672022 | 4 | 101803567 | 101806584 | Serinus canaria 9135 | GTA\|GTAAGTACCT...ATAACTTTAACA/ATAACTTTAACA...AACAG\|CAA | 1 | 1 | 12.03 |
| 182504341 | GT-AG | 0 | 1.2630690334698974e-05 | 18346 | rna-XM_018908954.2 32672022 | 5 | 101785105 | 101803450 | Serinus canaria 9135 | CAG\|GTAATTCCTT...TTTTTTTTAATA/TTTTTTTTAATA...CCTAG\|GGT | 0 | 1 | 14.61 |
| 182504342 | GT-AG | 0 | 7.925560851830943e-05 | 7946 | rna-XM_018908954.2 32672022 | 6 | 101776984 | 101784929 | Serinus canaria 9135 | AAG\|GTACTTCACC...ATAGCCTTACTG/CATAGCCTTACT...TTCAG\|AGC | 1 | 1 | 18.501 |
| 182504343 | GT-AG | 0 | 1.000000099473604e-05 | 54664 | rna-XM_018908954.2 32672022 | 7 | 101722026 | 101776689 | Serinus canaria 9135 | CTG\|GTGAGTACCT...CTATTCTAATAT/ACTATTCTAATA...TGCAG\|ATC | 1 | 1 | 25.039 |
| 182504344 | GT-AG | 0 | 1.000000099473604e-05 | 3187 | rna-XM_018908954.2 32672022 | 8 | 101718530 | 101721716 | Serinus canaria 9135 | ATG\|GTGAGTGCTC...TTTTTCTTAAAA/TTTTTTTTAATC...GTTAG\|TCC | 1 | 1 | 31.91 |
| 182504345 | GT-AG | 0 | 0.1967819391311285 | 2294 | rna-XM_018908954.2 32672022 | 9 | 101716126 | 101718419 | Serinus canaria 9135 | GAG\|GTATCTAGAA...ACTTTCTTATTC/AACTTTCTTATT...TGCAG\|ATC | 0 | 1 | 34.356 |
| 182504346 | GT-AG | 0 | 3.538330279528037e-05 | 3711 | rna-XM_018908954.2 32672022 | 10 | 101712213 | 101715923 | Serinus canaria 9135 | CAG\|GTACTTCATC...ATGATTTTATTT/AATGATTTTATT...TCTAG\|CAC | 1 | 1 | 38.848 |
| 182504347 | GT-AG | 0 | 1.000000099473604e-05 | 26389 | rna-XM_018908954.2 32672022 | 11 | 101685721 | 101712109 | Serinus canaria 9135 | CAG\|GTACTGGGGG...CAAGTTTTAAAA/CAAGTTTTAAAA...CTTAG\|TGT | 2 | 1 | 41.139 |
| 182504348 | GT-AG | 0 | 4.4641356011737534e-05 | 1133 | rna-XM_018908954.2 32672022 | 12 | 101684314 | 101685446 | Serinus canaria 9135 | GGG\|GTAGGTCTTG...CTTTCTTTGTCT/CAAGATGTGACC...CACAG\|GAA | 0 | 1 | 47.231 |
| 182504349 | GT-AG | 0 | 0.0038243692338631 | 4973 | rna-XM_018908954.2 32672022 | 13 | 101679304 | 101684276 | Serinus canaria 9135 | AAG\|GTATGTTTGG...TTTTCTTTCCCT/CTTTCCCTCATG...CTTAG\|CTG | 1 | 1 | 48.054 |
| 182504350 | GT-AG | 0 | 1.000000099473604e-05 | 18020 | rna-XM_018908954.2 32672022 | 14 | 101661209 | 101679228 | Serinus canaria 9135 | CAG\|GTCAGATGTT...TTTTCCTCATTC/CTTTTCCTCATT...CCTAG\|GGG | 1 | 1 | 49.722 |
| 182504351 | GT-AG | 0 | 1.000000099473604e-05 | 6900 | rna-XM_018908954.2 32672022 | 15 | 101654176 | 101661075 | Serinus canaria 9135 | AAG\|GTGAGCAATT...ACTGCCTAATCC/TTCCATTTCATC...GGCAG\|AAG | 2 | 1 | 52.68 |
| 182504352 | GT-AG | 0 | 2.8912428028783854e-05 | 50467 | rna-XM_018908954.2 32672022 | 16 | 101603679 | 101654145 | Serinus canaria 9135 | CCT\|GTAAGTAACT...TTCTCCTTTTTT/TCATCATTCATT...TTAAG\|AAA | 2 | 1 | 53.347 |
| 182504353 | GT-AG | 0 | 0.0005477096535907 | 2531 | rna-XM_018908954.2 32672022 | 17 | 101600996 | 101603526 | Serinus canaria 9135 | GAT\|GTAAGTTTTT...CTGTCTCTAACC/CTGTCTCTAACC...TGCAG\|CTG | 1 | 1 | 56.727 |
| 182504354 | GT-AG | 0 | 0.0080473682173336 | 226 | rna-XM_018908954.2 32672022 | 18 | 101600695 | 101600920 | Serinus canaria 9135 | CAG\|GTAACCAGTT...CTCTTTTTACCT/TCTCTTTTTACC...GACAG\|ATC | 1 | 1 | 58.394 |
| 182504355 | GT-AG | 0 | 1.000000099473604e-05 | 3726 | rna-XM_018908954.2 32672022 | 19 | 101596942 | 101600667 | Serinus canaria 9135 | TAG\|GTGAGAACAT...GTTTCCTTTTTC/TTTCTCCTGATA...AAAAG\|TGC | 1 | 1 | 58.995 |
| 182504356 | GT-AG | 0 | 1.000000099473604e-05 | 763 | rna-XM_018908954.2 32672022 | 20 | 101596167 | 101596929 | Serinus canaria 9135 | ATG\|GTAAGTTGCC...TTTCTTCTAATA/TTTCTTCTAATA...TCTAG\|ATG | 1 | 1 | 59.262 |
| 182504357 | GT-AG | 0 | 1.9334439757070912e-05 | 24865 | rna-XM_018908954.2 32672022 | 21 | 101571114 | 101595978 | Serinus canaria 9135 | GAA\|GTAAGTGGCA...CTTTTCTTAATG/CTTTTCTTAATG...TTTAG\|AGT | 0 | 1 | 63.442 |
| 182504358 | GT-AG | 0 | 1.000000099473604e-05 | 5980 | rna-XM_018908954.2 32672022 | 22 | 101565046 | 101571025 | Serinus canaria 9135 | CAT\|GTGAGTGTTG...GTGTCTTTAATT/GTGTCTTTAATT...TTCAG\|ATG | 1 | 1 | 65.399 |
| 182504359 | GT-AG | 0 | 1.000000099473604e-05 | 2755 | rna-XM_018908954.2 32672022 | 23 | 101562214 | 101564968 | Serinus canaria 9135 | GAT\|GTGCGTATCC...TTTTTTTTTTCC/AAATTGCTGATT...TCTAG\|GGT | 0 | 1 | 67.111 |
| 182504360 | GT-AG | 0 | 1.000000099473604e-05 | 18060 | rna-XM_018908954.2 32672022 | 24 | 101544117 | 101562176 | Serinus canaria 9135 | AAG\|GTAAGTTTCC...TTTCATTTGATT/ATTTGATTTATT...TTCAG\|GAC | 1 | 1 | 67.934 |
| 182504361 | GT-AG | 0 | 1.000000099473604e-05 | 11815 | rna-XM_018908954.2 32672022 | 25 | 101532204 | 101544018 | Serinus canaria 9135 | AGG\|GTAAGTGGTT...TGAACCTTAATT/TGAACCTTAATT...TAAAG\|GTC | 0 | 1 | 70.113 |
| 182504362 | GT-AG | 0 | 3.5445046515850264e-05 | 4301 | rna-XM_018908954.2 32672022 | 26 | 101527786 | 101532086 | Serinus canaria 9135 | AAG\|GTAAGTTTCC...CAACCCTTGTCT/GAAGATTTCAAC...AACAG\|AGA | 0 | 1 | 72.715 |
| 182504363 | GT-AG | 0 | 1.000000099473604e-05 | 968 | rna-XM_018908954.2 32672022 | 27 | 101526663 | 101527630 | Serinus canaria 9135 | CAG\|GTAGGTCAAA...TGCACATTAACT/TGCACATTAACT...TTCAG\|TGC | 2 | 1 | 76.162 |
| 182504364 | GT-AG | 0 | 0.0009600955837681 | 576 | rna-XM_018908954.2 32672022 | 28 | 101525951 | 101526526 | Serinus canaria 9135 | GAG\|GTATTGCTCA...TCTATTTTAACT/TCTATTTTAACT...ACCAG\|GAA | 0 | 1 | 79.186 |
| 182504365 | GT-AG | 0 | 1.000000099473604e-05 | 461 | rna-XM_018908954.2 32672022 | 29 | 101525340 | 101525800 | Serinus canaria 9135 | AGG\|GTAAGTAGTG...GTCGCTTTCACT/GCTGTGTTAATT...GAAAG\|ACT | 0 | 1 | 82.522 |
| 182504366 | GT-AG | 0 | 3.131901443184204e-05 | 2740 | rna-XM_018908954.2 32672022 | 30 | 101522426 | 101525165 | Serinus canaria 9135 | GAT\|GTAAGTCTCC...TTCTTCTCATCA/ATTCTTCTCATC...TGCAG\|AGC | 0 | 1 | 86.391 |
| 182504367 | GT-AG | 0 | 1.000000099473604e-05 | 3030 | rna-XM_018908954.2 32672022 | 31 | 101519264 | 101522293 | Serinus canaria 9135 | CAA\|GTAAGTCAAG...TTGTCCATGATC/AGGTGATTAACT...TTTAG\|CTC | 0 | 1 | 89.326 |
| 182504368 | GT-AG | 0 | 1.000000099473604e-05 | 1786 | rna-XM_018908954.2 32672022 | 32 | 101517352 | 101519137 | Serinus canaria 9135 | AGG\|GTAAGGATCT...CTGCTTTTGTCC/AATGCTGTTACT...TACAG\|CCC | 0 | 1 | 92.128 |
| 182504369 | GT-AG | 0 | 1.2456992856908656e-05 | 2069 | rna-XM_018908954.2 32672022 | 33 | 101515119 | 101517187 | Serinus canaria 9135 | CTT\|GTAAGTATGG...TCCATTGTAACT/CGTTTCATCACT...TGCAG\|GAA | 2 | 1 | 95.775 |
| 182504370 | GT-AG | 0 | 1.000000099473604e-05 | 4267 | rna-XM_018908954.2 32672022 | 34 | 101510716 | 101514982 | Serinus canaria 9135 | CTG\|GTAGGAGCTT...TTCCTTTTGATT/TTCCTTTTGATT...TTCAG\|GAT | 0 | 1 | 98.799 |
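In the rows above, `length` appears to equal `end - start + 1`, which would mean `start` and `end` are inclusive coordinates. That is an observation from the displayed values, not something the page states. The sketch below checks the relationship on three rows copied from the table; the helper name `inclusive_length` is hypothetical.

```python
# (ordinal_index, start, end, length) copied from the first three rows above.
rows = [
    (1, 101852392, 101952758, 100367),
    (2, 101818480, 101852268, 33789),
    (3, 101806664, 101818213, 11550),
]


def inclusive_length(start, end):
    """Length of a closed interval [start, end] (assumed coordinate convention)."""
    return end - start + 1


# Collect any row whose stored length disagrees with the computed one.
mismatches = [
    row for row in rows if inclusive_length(row[1], row[2]) != row[3]
]
print("mismatching rows:", mismatches)
```

Note also that `start` decreases as `ordinal_index` increases for this transcript, which is consistent with a reverse-strand annotation, although strand is not a column of this table.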
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
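To see how an index such as `idx_introns_transcript_id` can serve a filter like `transcript_id = 32672022`, the sketch below asks SQLite for its query plan. It assumes an empty in-memory table with only three of the columns, so the plan on the real database may differ in detail.

```python
import sqlite3

# Reduced stand-in schema (assumption): three columns and two of the indexes.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_score ON introns (score);
    """
)

# EXPLAIN QUERY PLAN returns rows whose fourth field is a readable description.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32672022,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so the example prints it rather than relying on a fixed string.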