introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
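The column descriptions imply a few consistency checks that can be applied to any row. The sketch below uses a hypothetical helper, `check_intron`, that is not part of the database. It assumes inclusive coordinates, so that `length` equals `end - start + 1`; this holds for the rows listed on this page but is an observation, not a documented guarantee.

```python
from typing import List, Optional


def check_intron(start: int, end: int, length: int, score: float,
                 phase: Optional[int], in_cds: int) -> List[str]:
    """Return a list of problems found; an empty list means the row looks consistent."""
    problems = []
    # Assumption: coordinates are inclusive, so length = end - start + 1.
    if end - start + 1 != length:
        problems.append("length does not match end - start + 1")
    # score is described as a percentage between 0 and 100.
    if not 0.0 <= score <= 100.0:
        problems.append("score outside 0-100")
    # phase is 0, 1, or 2, or NULL (None) outside the coding sequence.
    if phase not in (0, 1, 2, None):
        problems.append("unexpected phase")
    # in_cds is a 0/1 flag.
    if in_cds not in (0, 1):
        problems.append("in_cds must be 0 or 1")
    return problems


# Values copied from the row with id 140016231 on this page.
print(check_intron(73998401, 74015093, 16693, 9.26682084600611e-05, 0, 1))  # prints []
```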
28 rows where transcript_id = 25387422
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140016231 | GT-AG | 0 | 9.26682084600611e-05 | 16693 | rna-XM_040243955.1 25387422 | 2 | 73998401 | 74015093 | Oryx dammah 59534 | CAG\|GTAAGTTTTT...AAATCTTTAACA/TTGTATTTGATT...TACAG\|GCA | 0 | 1 | 5.936 |
| 140016232 | GT-AG | 0 | 0.0202007772073349 | 2290 | rna-XM_040243955.1 25387422 | 3 | 73996033 | 73998322 | Oryx dammah 59534 | CAG\|GTATGTTTCT...ATTTTCTTATAA/CATTTTCTTATA...AACAG\|CTT | 0 | 1 | 8.151 |
| 140016233 | GT-AG | 0 | 0.0009317657253975 | 3790 | rna-XM_040243955.1 25387422 | 4 | 73992132 | 73995921 | Oryx dammah 59534 | ACT\|GTAAGTATTA...ATGCTCTTATTT/AATGCTCTTATT...GGTAG\|ACT | 0 | 1 | 11.304 |
| 140016234 | GT-AG | 0 | 7.117848248990487e-05 | 1532 | rna-XM_040243955.1 25387422 | 5 | 73990446 | 73991977 | Oryx dammah 59534 | AGT\|GTAAGTACAG...TTGTCTTTAGTA/CTTGTCTTTAGT...TTTAG\|TTA | 1 | 1 | 15.677 |
| 140016235 | GT-AG | 0 | 1.000000099473604e-05 | 7964 | rna-XM_040243955.1 25387422 | 6 | 73982404 | 73990367 | Oryx dammah 59534 | GAG\|GTAAGAGAAT...CTATTTTTATAC/ACTATTTTTATA...TAAAG\|GAT | 1 | 1 | 17.893 |
| 140016236 | GT-AG | 0 | 2.417751454061378e-05 | 1804 | rna-XM_040243955.1 25387422 | 7 | 73980545 | 73982348 | Oryx dammah 59534 | GGG\|GTAAGTCTTA...TATTTTTTATTT/TTATTTTTTATT...CCAAG\|GAC | 2 | 1 | 19.455 |
| 140016237 | GT-AG | 0 | 1.000000099473604e-05 | 5065 | rna-XM_040243955.1 25387422 | 8 | 73975376 | 73980440 | Oryx dammah 59534 | CTG\|GTAAGTGTCT...AAACCCTTACAT/TTTACATTGATC...CACAG\|AAC | 1 | 1 | 22.408 |
| 140016238 | GT-AG | 0 | 0.0001256765814573 | 3182 | rna-XM_040243955.1 25387422 | 9 | 73972128 | 73975309 | Oryx dammah 59534 | AAG\|GTATGGATTT...CAGACCTTATTT/CCTTATTTGACT...AACAG\|GGG | 1 | 1 | 24.283 |
| 140016239 | GT-AG | 0 | 9.926218668936496e-05 | 3401 | rna-XM_040243955.1 25387422 | 10 | 73968674 | 73972074 | Oryx dammah 59534 | GAT\|GTAAGTGTTT...ATGTTTTTAATC/ATGTTTTTAATC...TACAG\|CTT | 0 | 1 | 25.788 |
| 140016240 | GT-AG | 0 | 1.000000099473604e-05 | 2178 | rna-XM_040243955.1 25387422 | 11 | 73966428 | 73968605 | Oryx dammah 59534 | AAG\|GTAAGTGCTT...CATTCTTTCATT/CATTCTTTCATT...CTTAG\|TGT | 2 | 1 | 27.719 |
| 140016241 | GT-AG | 0 | 0.0016064150174864 | 2857 | rna-XM_040243955.1 25387422 | 12 | 73963381 | 73966237 | Oryx dammah 59534 | TTG\|GTAACTCACT...TTTTTCTTTCCC/GTATTGCTGAGG...TAAAG\|TCA | 0 | 1 | 33.116 |
| 140016242 | GT-AG | 0 | 1.000000099473604e-05 | 7229 | rna-XM_040243955.1 25387422 | 13 | 73955938 | 73963166 | Oryx dammah 59534 | TTG\|GTAAGTGTGC...TGTGCCTAAATG/AAATGACTGATT...CTTAG\|ACT | 1 | 1 | 39.193 |
| 140016243 | GT-AG | 0 | 1.000000099473604e-05 | 608 | rna-XM_040243955.1 25387422 | 14 | 73955049 | 73955656 | Oryx dammah 59534 | AAG\|GTGGGTCTTT...AATCTCTCACTT/CTGTTGCTAACA...TACAG\|ATC | 0 | 1 | 47.174 |
| 140016244 | GT-AG | 0 | 0.01115702060215 | 1987 | rna-XM_040243955.1 25387422 | 15 | 73952892 | 73954878 | Oryx dammah 59534 | AAG\|GTATACAGTT...TGTCTTTTATTC/TTATGTCTCATG...TGTAG\|TGA | 2 | 1 | 52.002 |
| 140016245 | GC-AG | 0 | 1.000000099473604e-05 | 4165 | rna-XM_040243955.1 25387422 | 16 | 73948513 | 73952677 | Oryx dammah 59534 | ACG\|GCAAGTGGTC...CTTACTTTATTT/GGTTTACTTACT...TATAG\|AAT | 0 | 1 | 58.08 |
| 140016246 | GT-AG | 0 | 1.000000099473604e-05 | 2104 | rna-XM_040243955.1 25387422 | 17 | 73946369 | 73948472 | Oryx dammah 59534 | ACA\|GTAAGTATAT...TGGAGCTAAACT/CCGATACTCACA...TACAG\|CTA | 1 | 1 | 59.216 |
| 140016247 | GT-AG | 0 | 1.000000099473604e-05 | 886 | rna-XM_040243955.1 25387422 | 18 | 73945299 | 73946184 | Oryx dammah 59534 | CAG\|GTATGGGGAC...GCCGCCTTGTCT/CCGTGGTGGACA...TCCAG\|CTC | 2 | 1 | 64.442 |
| 140016248 | GT-AG | 0 | 4.642397673185995e-05 | 4259 | rna-XM_040243955.1 25387422 | 19 | 73940970 | 73945228 | Oryx dammah 59534 | ACA\|GTAAGTGTTC...TGCCCCTAGATC/CCTAGATCCATG...GCCAG\|CAT | 0 | 1 | 66.43 |
| 140016249 | GT-AG | 0 | 1.000000099473604e-05 | 564 | rna-XM_040243955.1 25387422 | 20 | 73940210 | 73940773 | Oryx dammah 59534 | CAG\|GTGTGTCCCC...CTCCATTTGATT/CTCCATTTGATT...TGCAG\|GCA | 1 | 1 | 71.997 |
| 140016250 | GT-AG | 0 | 1.000000099473604e-05 | 262 | rna-XM_040243955.1 25387422 | 21 | 73939856 | 73940117 | Oryx dammah 59534 | CCG\|GTGAGCGCTC...GGCCTCTTACAG/TGGCCTCTTACA...CGCAG\|ATT | 0 | 1 | 74.609 |
| 140016251 | GT-AG | 0 | 1.000000099473604e-05 | 2325 | rna-XM_040243955.1 25387422 | 22 | 73937483 | 73939807 | Oryx dammah 59534 | ATG\|GTGAGTGGGG...ATTCCGCTGACC/ATTCCGCTGACC...CCTAG\|GAT | 0 | 1 | 75.973 |
| 140016252 | GT-AG | 0 | 1.000000099473604e-05 | 219 | rna-XM_040243955.1 25387422 | 23 | 73937185 | 73937403 | Oryx dammah 59534 | CAG\|GTTAGTCTTC...GTGCCCCTGTTC/GACTGCATCAGT...CCTAG\|GTG | 1 | 1 | 78.216 |
| 140016253 | GT-AG | 0 | 1.000000099473604e-05 | 261 | rna-XM_040243955.1 25387422 | 24 | 73936718 | 73936978 | Oryx dammah 59534 | TTT\|GTGAGTATCG...CTCCCCTCACTC/GCTCCCCTCACT...CCCAG\|GTC | 0 | 1 | 84.067 |
| 140016254 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_040243955.1 25387422 | 25 | 73936535 | 73936617 | Oryx dammah 59534 | CAG\|GTGAGGGGCG...CCCCTCTTGTCC/GACTGGCTCACC...GCTAG\|GTT | 1 | 1 | 86.907 |
| 140016255 | GT-AG | 0 | 4.488630051810529e-05 | 158 | rna-XM_040243955.1 25387422 | 26 | 73936269 | 73936426 | Oryx dammah 59534 | AAG\|GTAGCATCAC...ACCGCTTTTCCT/GAACCACCCACC...TCCAG\|GAG | 1 | 1 | 89.974 |
| 140016256 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_040243955.1 25387422 | 27 | 73936122 | 73936203 | Oryx dammah 59534 | CAG\|GTGAGCGGGG...CGGCCCTCACTG/CCGGCCCTCACT...TGCAG\|ACG | 0 | 1 | 91.821 |
| 140016257 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_040243955.1 25387422 | 28 | 73935832 | 73935929 | Oryx dammah 59534 | CAG\|GTGAGCTGTC...GCAGCCTTGACC/GCAGCCTTGACC...CTCAG\|AGC | 0 | 1 | 97.274 |
| 140019970 | GT-AG | 0 | 1.000000099473604e-05 | 31625 | rna-XM_040243955.1 25387422 | 1 | 74015234 | 74046858 | Oryx dammah 59534 | GCG\|GTAATTGGCA...ATTTGCTTACCA/AATTTGCTTACC...TGCAG\|ATT | | 0 | 3.124 |
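In the rows above, the `dinucleotide_pair` value matches the first two bases after the first `|` and the last two bases before the final `|` in `scored_motifs`. The sketch below recovers the pair on that basis. It assumes the two pipes mark the exon/intron boundaries, which is inferred from the listed rows rather than stated by the database, and `terminal_dinucleotides` is a hypothetical helper name.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Recover the donor-acceptor dinucleotide pair from a scored_motifs string."""
    # Assumption: the first and last "|" separate exon flank from intron sequence.
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    donor = scored_motifs[first + 1:first + 3]   # first two intronic bases
    acceptor = scored_motifs[last - 2:last]      # last two intronic bases
    return f"{donor}-{acceptor}"


# (dinucleotide_pair, scored_motifs) copied from ids 140016231 and 140016245.
rows = {
    140016231: ("GT-AG", "CAG|GTAAGTTTTT...AAATCTTTAACA/TTGTATTTGATT...TACAG|GCA"),
    140016245: ("GC-AG", "ACG|GCAAGTGGTC...CTTACTTTATTT/GGTTTACTTACT...TATAG|AAT"),
}
for intron_id, (pair, motifs) in rows.items():
    # Prints True when the parsed pair agrees with the stored column.
    print(intron_id, terminal_dinucleotides(motifs) == pair)
```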
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY ([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY ([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
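The table definition declares two foreign keys, but SQLite only enforces them when `PRAGMA foreign_keys = ON` is set on the connection. The following sketch builds a trimmed copy of the table in memory to show the `transcript_id = 25387422` filter and the foreign key check. The single-column `transcripts` and `genomes` tables are stand-ins, since their real definitions are not shown on this page, and only a subset of the `introns` columns is included.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite leaves foreign key enforcement off unless this pragma is set.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
-- Assumption: minimal stand-ins for the parent tables.
CREATE TABLE transcripts (id INTEGER PRIMARY KEY);
CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);
-- Trimmed version of the introns table (subset of columns).
CREATE TABLE introns (
    "id" INTEGER,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    PRIMARY KEY ("id"),
    FOREIGN KEY ("transcript_id") REFERENCES transcripts ("id"),
    FOREIGN KEY ("taxonomy_id") REFERENCES genomes ("taxonomy_id")
);
INSERT INTO transcripts VALUES (25387422);
INSERT INTO genomes VALUES (59534);
""")

# Two rows copied from this page, inserted out of order on purpose.
sample = [
    (140016232, 2290, 25387422, 3, 73996033, 73998322, 59534),
    (140016231, 16693, 25387422, 2, 73998401, 74015093, 59534),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", sample)

# Same filter as the page, ordered by position within the transcript.
rows = conn.execute(
    'SELECT "id", "ordinal_index", "length" FROM introns '
    'WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
    (25387422,),
).fetchall()
print(rows)

# A row pointing at a transcript that does not exist is rejected.
try:
    conn.execute("INSERT INTO introns VALUES (1, 10, 999, 1, 1, 10, 59534)")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True
print(fk_rejected)
```

Note that `"end"` must stay quoted in queries because `END` is an SQL keyword.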
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
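One way to see what these indexes buy is `EXPLAIN QUERY PLAN`. The sketch below recreates a cut-down `introns` table with only the `transcript_id` index and compares a filter on an indexed column with a filter on `length`, which has no index in the schema above. The exact wording of the plan text varies between SQLite versions, so the example only looks for the index name and the word `SCAN`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Cut-down table: only the columns needed for the two queries.
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    length INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")


def plan_details(sql: str, params: tuple) -> list:
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    plan = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in plan]


# Filter on an indexed column: the plan should mention the index.
indexed = plan_details(
    "SELECT * FROM introns WHERE transcript_id = ?", (25387422,)
)
# Filter on an unindexed column: the plan should be a full table scan.
unindexed = plan_details("SELECT * FROM introns WHERE length > ?", (10000,))

print(indexed)
print(unindexed)
```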