introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
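The rows on this page suggest that `start`, `end`, and `length` are related by `length = end - start + 1`, i.e. 1-based coordinates inclusive at both ends. This is an inference from the listed values, not a documented guarantee of the database. A minimal sketch that checks it on three rows copied from this page (the helper name `inclusive_length` is made up for illustration):

```python
# Rows copied from this page: (id, start, end, length).
rows = [
    (17676333, 180079358, 180083549, 4192),
    (17676336, 180065122, 180065246, 125),
    (17676343, 180032461, 180032567, 107),
]


def inclusive_length(start: int, end: int) -> int:
    """Length in base pairs of the closed interval [start, end]."""
    return end - start + 1


for intron_id, start, end, length in rows:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    assert inclusive_length(start, end) == length, intron_id
```

If the relation holds across the table, `length` is redundant with the coordinates and can serve as a sanity check after import.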
30 rows where transcript_id = 3555679
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17676333 | GT-AG | 0 | 1.000000099473604e-05 | 4192 | rna-XM_038334239.1 3555679 | 2 | 180079358 | 180083549 | Arvicola amphibius 1047088 | CTG\|GTAAGTCATT...TCCATCTTCTTT/GGCATGGTCACC...TTCAG\|AAT | 1 | 1 | 4.626 |
| 17676334 | GT-AG | 0 | 1.000000099473604e-05 | 5146 | rna-XM_038334239.1 3555679 | 3 | 180074143 | 180079288 | Arvicola amphibius 1047088 | AAG\|GTAAGGCTGT...GGATCTTTATTT/TGGATCTTTATT...CTCAG\|ACA | 1 | 1 | 6.02 |
| 17676335 | GT-AG | 0 | 6.390406835124856e-05 | 8740 | rna-XM_038334239.1 3555679 | 4 | 180065274 | 180074013 | Arvicola amphibius 1047088 | CAG\|GTGTGCTACC...TTTCCCTTTTTG/TTTGTTTTCATC...CACAG\|GCC | 1 | 1 | 8.626 |
| 17676336 | GT-AG | 0 | 6.664257561673722e-05 | 125 | rna-XM_038334239.1 3555679 | 5 | 180065122 | 180065246 | Arvicola amphibius 1047088 | CAG\|GTATGGCTTT...TGTTTTTTGTCT/GCGGGTGTTACC...CACAG\|AGG | 1 | 1 | 9.172 |
| 17676337 | GT-AG | 0 | 1.000000099473604e-05 | 1666 | rna-XM_038334239.1 3555679 | 6 | 180063343 | 180065008 | Arvicola amphibius 1047088 | CAG\|GTAGGTGTGT...TTGTCCTGCACA/TGCCATTTGAAC...TCAAG\|GAT | 0 | 1 | 11.455 |
| 17676338 | GT-AG | 0 | 4.788323344682551e-05 | 7356 | rna-XM_038334239.1 3555679 | 7 | 180055984 | 180063339 | Arvicola amphibius 1047088 | GAT\|GTCTAGTACT...TAATTCTAGATA/CTTGTGTTCACT...TTCAG\|GGC | 0 | 1 | 11.515 |
| 17676339 | GT-AG | 0 | 1.000000099473604e-05 | 3698 | rna-XM_038334239.1 3555679 | 8 | 180052144 | 180055841 | Arvicola amphibius 1047088 | GAG\|GTAAGGCAGT...GTGTTCTTTTCT/ATATCACTCACA...CACAG\|AAC | 1 | 1 | 14.384 |
| 17676340 | GT-AG | 0 | 1.000000099473604e-05 | 5495 | rna-XM_038334239.1 3555679 | 9 | 180046619 | 180052113 | Arvicola amphibius 1047088 | CAG\|GTAGGTGAGC...TTGCTTTTCATT/TTGCTTTTCATT...TCCAG\|ATC | 1 | 1 | 14.99 |
| 17676341 | GT-AG | 0 | 1.000000099473604e-05 | 2097 | rna-XM_038334239.1 3555679 | 10 | 180044346 | 180046442 | Arvicola amphibius 1047088 | CGG\|GTACGAATGT...ATTGTTTTACCC/CATTGTTTTACC...TCCAG\|AAT | 0 | 1 | 18.545 |
| 17676342 | GT-AG | 0 | 0.0105510530064565 | 11666 | rna-XM_038334239.1 3555679 | 11 | 180032616 | 180044281 | Arvicola amphibius 1047088 | GAG\|GTACCAATTT...TCTCCCTTGAGT/TCCCCCCTGACT...ACTAG\|ATG | 1 | 1 | 19.838 |
| 17676343 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_038334239.1 3555679 | 12 | 180032461 | 180032567 | Arvicola amphibius 1047088 | ACA\|GTAAGTGAAG...TCGGTTTTAGTT/TTAGTTTTCATT...TCCAG\|CAA | 1 | 1 | 20.808 |
| 17676344 | GT-AG | 0 | 1.000000099473604e-05 | 2306 | rna-XM_038334239.1 3555679 | 13 | 180030025 | 180032330 | Arvicola amphibius 1047088 | GAG\|GTAAGTGGTC...GCTTTCATGAAA/GCCGCTTTCATG...TACAG\|AAA | 2 | 1 | 23.434 |
| 17676345 | GT-AG | 0 | 1.000000099473604e-05 | 5461 | rna-XM_038334239.1 3555679 | 14 | 180024449 | 180029909 | Arvicola amphibius 1047088 | ATG\|GTAATGTGCA...TTATCCTTCTCT/CATAATGTTATC...TATAG\|AGT | 0 | 1 | 25.758 |
| 17676346 | GT-AG | 0 | 1.000000099473604e-05 | 3774 | rna-XM_038334239.1 3555679 | 15 | 180020591 | 180024364 | Arvicola amphibius 1047088 | CCG\|GTAAGTGCGA...CATTTCTTCATT/TTTATGCTCATT...GTCAG\|CCT | 0 | 1 | 27.455 |
| 17676347 | GT-AG | 0 | 3.446211546938539e-05 | 524 | rna-XM_038334239.1 3555679 | 16 | 180020001 | 180020524 | Arvicola amphibius 1047088 | AAG\|GTAATCCCGT...TTTCTCTAAACT/TTCTTTCTCATC...ACCAG\|ACT | 0 | 1 | 28.788 |
| 17676348 | GT-AG | 0 | 0.0013000136606426 | 4092 | rna-XM_038334239.1 3555679 | 17 | 180015807 | 180019898 | Arvicola amphibius 1047088 | CAG\|GTATTTCCCA...TCTTTCTTACAT/CTCTTTCTTACA...TACAG\|AGC | 0 | 1 | 30.848 |
| 17676349 | GT-AG | 0 | 1.000000099473604e-05 | 9667 | rna-XM_038334239.1 3555679 | 18 | 180006090 | 180015756 | Arvicola amphibius 1047088 | ATA\|GTAAGTAGCC...TCTCTTTTAAAA/TCTCTTTTAAAA...TGTAG\|TAT | 2 | 1 | 31.859 |
| 17676350 | GT-AG | 0 | 2.631996451856428e-05 | 512 | rna-XM_038334239.1 3555679 | 19 | 180005405 | 180005916 | Arvicola amphibius 1047088 | AAG\|GTAACGGACA...TCAGCTTTAACC/GTTTATTTTATT...ACCAG\|TGG | 1 | 1 | 35.354 |
| 17676351 | GT-AG | 0 | 0.0001273038982697 | 629 | rna-XM_038334239.1 3555679 | 20 | 180003306 | 180003934 | Arvicola amphibius 1047088 | GAG\|GTGGCATTTT...CCCGTTTTAACT/CCCGTTTTAACT...TGAAG\|ATT | 1 | 1 | 65.051 |
| 17676352 | GT-AG | 0 | 1.000000099473604e-05 | 3308 | rna-XM_038334239.1 3555679 | 21 | 179999912 | 180003219 | Arvicola amphibius 1047088 | GAG\|GTAATTCTCC...CCCATTTTATAC/AGATAATTTACT...GGCAG\|AAA | 0 | 1 | 66.788 |
| 17676353 | GT-AG | 0 | 0.0016356621163849 | 3443 | rna-XM_038334239.1 3555679 | 22 | 179996301 | 179999743 | Arvicola amphibius 1047088 | GAG\|GTCTGTTTTG...CGACTTTTAACG/CGACTTTTAACG...TTTAG\|ATG | 0 | 1 | 70.182 |
| 17676354 | GT-AG | 0 | 1.6672595863662416e-05 | 3906 | rna-XM_038334239.1 3555679 | 23 | 179992348 | 179996253 | Arvicola amphibius 1047088 | GAA\|GTAAGTCCTG...CACGCTTTGAAA/ACCAAGCTCACG...AACAG\|GGA | 2 | 1 | 71.131 |
| 17676355 | GT-AG | 0 | 0.0001002142774074 | 135 | rna-XM_038334239.1 3555679 | 24 | 179992098 | 179992232 | Arvicola amphibius 1047088 | GAG\|GTACTGTAGC...CTCTCCTTGCCT/TCTTTTTTGCTT...CTCAG\|CTT | 0 | 1 | 73.455 |
| 17676356 | GT-AG | 0 | 1.7927618817144414e-05 | 2638 | rna-XM_038334239.1 3555679 | 25 | 179989334 | 179991971 | Arvicola amphibius 1047088 | AAG\|GTAAGCTGTA...GGACTCTTGCCT/TGGTGGTTCAAG...TACAG\|GGT | 0 | 1 | 76.0 |
| 17676357 | GT-AG | 0 | 0.0781191698027429 | 1362 | rna-XM_038334239.1 3555679 | 26 | 179987777 | 179989138 | Arvicola amphibius 1047088 | CAG\|GTATCTAAGC...TTTACCTTTGCT/GACACTTTTACC...TCAAG\|TGT | 0 | 1 | 79.939 |
| 17676358 | GT-AG | 0 | 0.0003366479516063 | 17839 | rna-XM_038334239.1 3555679 | 27 | 179969185 | 179987023 | Arvicola amphibius 1047088 | CAG\|GTAGGCCTGA...TCATCTTTAATC/TCATCTTTAATC...CTCAG\|CAA | 0 | 1 | 95.152 |
| 17676359 | GT-AG | 0 | 1.000000099473604e-05 | 1973 | rna-XM_038334239.1 3555679 | 28 | 179967153 | 179969125 | Arvicola amphibius 1047088 | TTT\|GTAAGTACAG...ATGGCATTAGAC/TTGGGACTCACA...ACAAG\|ATT | 2 | 1 | 96.343 |
| 17676360 | GT-AG | 0 | 1.000000099473604e-05 | 950 | rna-XM_038334239.1 3555679 | 29 | 179966197 | 179967146 | Arvicola amphibius 1047088 | CTG\|GTGCTACTGG...TCTTTCTGATTT/CTCTTTCTGATT...TGTAG\|CTA | 2 | 1 | 96.465 |
| 17676361 | GT-AG | 0 | 1.000000099473604e-05 | 1863 | rna-XM_038334239.1 3555679 | 30 | 179964227 | 179966089 | Arvicola amphibius 1047088 | TTG\|GTAAGATCCC...GTTTCCTTGTTA/TGTCTTCTCATT...TGCAG\|GCA | 1 | 1 | 98.626 |
| 17692047 | GT-AG | 0 | 1.000000099473604e-05 | 15433 | rna-XM_038334239.1 3555679 | 1 | 180083604 | 180099036 | Arvicola amphibius 1047088 | AAA\|GTAAGTAAAT...GGGGATTTATCA/CTGCCATTCATT...TGCAG\|AGC | | 0 | 4.424 |
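The table above is sorted by `id`, so the first intron of the transcript (id 17692047, `ordinal_index` 1) is listed last. A minimal sketch of the same filter with the rows returned in transcript order, using Python's built-in `sqlite3` and an in-memory table reduced to four columns. The reduced table and the three sample rows are for illustration only; the real database file and its location are not given on this page.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced, illustrative version of the introns table (four of its columns).
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        length INTEGER
    )
    """
)

# Three rows copied from the table on this page.
conn.executemany(
    "INSERT INTO introns (id, transcript_id, ordinal_index, length) "
    "VALUES (?, ?, ?, ?)",
    [
        (17676333, 3555679, 2, 4192),
        (17676334, 3555679, 3, 5146),
        (17692047, 3555679, 1, 15433),
    ],
)

# Same filter as the page, but ordered by position within the transcript.
ordered = conn.execute(
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (3555679,),
).fetchall()

for row in ordered:
    print(row)
```

Ordering by `ordinal_index` rather than `id` keeps the result independent of the order in which rows were inserted into the database.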
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
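The indexes above are what should make a filter such as `transcript_id = 3555679` cheap. A minimal sketch, assuming Python's built-in `sqlite3` and an empty in-memory copy of the schema, that asks SQLite for its query plan. Only two of the indexes are created here for brevity, and the referenced `transcripts` and `genomes` tables are deliberately not created; SQLite accepts the foreign key declarations without them. The exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Schema as listed on this page, with two of its indexes.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "dinucleotide_pair" TEXT,
      "is_minor" INTEGER,
      "score" REAL,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      "start" INTEGER,
      "end" INTEGER,
      "taxonomy_id" INTEGER,
      "scored_motifs" TEXT,
      "phase" INTEGER,
      "in_cds" INTEGER,
      "relative_position" REAL,
      PRIMARY KEY ([id]),
      FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
      FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    CREATE INDEX [idx_introns_is_minor]
        ON [introns] ([is_minor]);
    """
)

# Each plan row is a 4-tuple; the human-readable detail is the last field.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (3555679,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

If the printed plan names `idx_introns_transcript_id`, the lookup is served from the index instead of a full table scan.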