introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
32 rows where transcript_id = 32191447
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179734305 | GT-AG | 0 | 1.5654279425259355e-05 | 38140 | rna-XM_047145044.1 32191447 | 1 | 193028332 | 193066471 | Schistocerca americana 7009 | AGC|GTAAGTAAGA...TTTATTTTATTT/ATTTTATTTACT...TCCAG|GCA | 0 | 1 | 1.807 |
| 179734306 | GT-AG | 0 | 0.0001132383790712 | 1119 | rna-XM_047145044.1 32191447 | 2 | 193027110 | 193028228 | Schistocerca americana 7009 | GAC|GTAAGTTGCT...TTTGTCATAACA/ACTTTTGTCATA...TTCAG|TCA | 1 | 1 | 3.876 |
| 179734307 | GT-AG | 0 | 1.000000099473604e-05 | 4987 | rna-XM_047145044.1 32191447 | 3 | 193022041 | 193027027 | Schistocerca americana 7009 | AAG|GTAAAATAGA...TTTATTTTGAAT/TTTATTTTGAAT...TTCAG|ATT | 2 | 1 | 5.522 |
| 179734308 | GT-AG | 0 | 1.000000099473604e-05 | 1277 | rna-XM_047145044.1 32191447 | 4 | 193020574 | 193021850 | Schistocerca americana 7009 | CAA|GTTAAATAGA...TCCCTCTTCACA/AACATTTTCAAA...GGAAG|CTG | 0 | 1 | 9.337 |
| 179734309 | GT-AG | 0 | 2.6858414001243463e-05 | 6046 | rna-XM_047145044.1 32191447 | 5 | 193014326 | 193020371 | Schistocerca americana 7009 | TTG|GTATGGAATC...TTTGCCTCAGTC/CTTTGCCTCAGT...TACAG|GAA | 1 | 1 | 13.394 |
| 179734310 | GT-AG | 0 | 1.000000099473604e-05 | 635 | rna-XM_047145044.1 32191447 | 6 | 193013554 | 193014188 | Schistocerca americana 7009 | AAA|GTAAGATGAC...TTGGTCTTGACA/TTGGTCTTGACA...AATAG|CAC | 0 | 1 | 16.145 |
| 179734311 | GT-AG | 0 | 0.0013241949466828 | 981 | rna-XM_047145044.1 32191447 | 7 | 193012437 | 193013417 | Schistocerca americana 7009 | TCT|GTTTGACTTA...CATGTCTTAACT/CATGTCTTAACT...CATAG|TGC | 1 | 1 | 18.876 |
| 179734312 | GT-AG | 0 | 0.0005754357585293 | 160 | rna-XM_047145044.1 32191447 | 8 | 193012199 | 193012358 | Schistocerca americana 7009 | CAA|GTGCACTTAA...GTTTTTTTGAGT/TATGTATTAATA...ACCAG|TTG | 1 | 1 | 20.442 |
| 179734313 | GT-AG | 0 | 0.0029033095309835 | 58 | rna-XM_047145044.1 32191447 | 9 | 193012025 | 193012082 | Schistocerca americana 7009 | ATC|GTATAGCACC...ACAACTTTAAAA/ATATTGCACATT...TCAAG|TCA | 0 | 1 | 22.771 |
| 179734314 | GT-AG | 0 | 2.232715652372718e-05 | 62 | rna-XM_047145044.1 32191447 | 10 | 193011848 | 193011909 | Schistocerca americana 7009 | ATG|GTAACAGCAT...TACTCATTAATT/CTGATACTCATT...ATTAG|TCA | 1 | 1 | 25.08 |
| 179734315 | GT-AG | 0 | 1.1418054262397e-05 | 3291 | rna-XM_047145044.1 32191447 | 11 | 193008375 | 193011665 | Schistocerca americana 7009 | TGC|GTAAATGAAC...GATATCTTGGCT/GCTTGTGTGACA...GCCAG|CCC | 0 | 1 | 28.735 |
| 179734316 | GT-AG | 0 | 1.000000099473604e-05 | 1528 | rna-XM_047145044.1 32191447 | 12 | 193006681 | 193008208 | Schistocerca americana 7009 | GCT|GTGAGTGATG...CATGCTGTAATT/CTATTGTTCAAT...GTAAG|TGC | 1 | 1 | 32.068 |
| 179734317 | GT-AG | 0 | 0.0001446540933727 | 9839 | rna-XM_047145044.1 32191447 | 13 | 192996689 | 193006527 | Schistocerca americana 7009 | GAT|GTTGGCTCCA...TGTTCCCTAATG/CTAATGTTTACA...TACAG|TGG | 1 | 1 | 35.141 |
| 179734318 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_047145044.1 32191447 | 14 | 192996562 | 192996662 | Schistocerca americana 7009 | CAT|GTAAGTGTAT...GATGGGTTATCT/GGATGGGTTATC...CACAG|ATA | 0 | 1 | 35.663 |
| 179734319 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_047145044.1 32191447 | 15 | 192996309 | 192996416 | Schistocerca americana 7009 | AAG|GTAACAACCG...TGACACTGAACC/TGTTAAGTGACA...ACAAG|CTC | 1 | 1 | 38.574 |
| 179734320 | GT-AG | 0 | 0.5217433445942612 | 440 | rna-XM_047145044.1 32191447 | 16 | 192995793 | 192996232 | Schistocerca americana 7009 | GCA|GTAACCTCAA...ACTGCTTTGCTG/CTTTGCTGCAAT...TGAAG|ACT | 2 | 1 | 40.1 |
| 179734321 | GT-AG | 0 | 1.000000099473604e-05 | 405 | rna-XM_047145044.1 32191447 | 17 | 192995216 | 192995620 | Schistocerca americana 7009 | AAG|GTCAAAATGC...GCCACTTTGACA/TGTTAACTGACA...TTCAG|AAT | 0 | 1 | 43.554 |
| 179734322 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_047145044.1 32191447 | 18 | 192995086 | 192995170 | Schistocerca americana 7009 | ACT|GTAAAGGATC...TCTGTTTTGAAC/TCTGTTTTGAAC...TACAG|TCA | 0 | 1 | 44.458 |
| 179734323 | GC-AG | 0 | 1.000000099473604e-05 | 67 | rna-XM_047145044.1 32191447 | 19 | 192994839 | 192994905 | Schistocerca americana 7009 | GGG|GCATGGCCAC...CCAACCTTAGTA/GACATACTCATG...CATAG|GTC | 0 | 1 | 48.072 |
| 179734324 | GT-AG | 0 | 28.71828071607257 | 480 | rna-XM_047145044.1 32191447 | 20 | 192994203 | 192994682 | Schistocerca americana 7009 | TCT|GTCTCCTACC...GGTTTGTTAATA/GGTTTGTTAATA...CTAAG|GTC | 0 | 1 | 51.205 |
| 179734325 | GT-AG | 0 | 0.0015478507746329 | 1752 | rna-XM_047145044.1 32191447 | 21 | 192992338 | 192994089 | Schistocerca americana 7009 | CTT|GTACAGTTTT...TAGTCTGTATTG/ATATAACTAATT...TGAAG|CTT | 2 | 1 | 53.474 |
| 179734326 | GT-AG | 0 | 1.000000099473604e-05 | 3640 | rna-XM_047145044.1 32191447 | 22 | 192988571 | 192992210 | Schistocerca americana 7009 | TCA|GTAATAAAAA...ATTATCTTATTT/CATTATCTTATT...GACAG|GTA | 0 | 1 | 56.024 |
| 179734327 | GT-AG | 0 | 1.000000099473604e-05 | 8495 | rna-XM_047145044.1 32191447 | 23 | 192979942 | 192988436 | Schistocerca americana 7009 | AAA|GTAAGCCACA...CATTTCTGAAGA/ACATTTCTGAAG...TTTAG|GTA | 2 | 1 | 58.715 |
| 179734328 | GT-AG | 0 | 1.000000099473604e-05 | 115 | rna-XM_047145044.1 32191447 | 24 | 192979607 | 192979721 | Schistocerca americana 7009 | CAG|GTAATTACAA...TATGTCTTACAT/ATATGTCTTACA...TTCAG|GTA | 0 | 1 | 63.133 |
| 179734329 | GT-AG | 0 | 1.000000099473604e-05 | 140 | rna-XM_047145044.1 32191447 | 25 | 192979267 | 192979406 | Schistocerca americana 7009 | GAA|GTAAGTAATG...TGGGCCTTTGTT/TTTGTTATTATG...TACAG|TAA | 2 | 1 | 67.149 |
| 179734330 | GT-AG | 0 | 1.000000099473604e-05 | 26150 | rna-XM_047145044.1 32191447 | 26 | 192952926 | 192979075 | Schistocerca americana 7009 | ATG|GTAAGTAACA...CTATCATTAATG/ATGTATTTAATT...TGCAG|GTA | 1 | 1 | 70.984 |
| 179734331 | GT-AG | 0 | 1.000000099473604e-05 | 77 | rna-XM_047145044.1 32191447 | 27 | 192952643 | 192952719 | Schistocerca americana 7009 | AAT|GTAAGGTTTT...ATCATTTTGGCT/TTTTGGCTGAAA...TGTAG|TAC | 0 | 1 | 75.12 |
| 179734332 | GT-AG | 0 | 1.000000099473604e-05 | 7719 | rna-XM_047145044.1 32191447 | 28 | 192944707 | 192952425 | Schistocerca americana 7009 | AAG|GTTAGCTCTA...TTATATTTAATT/TTATATTTAATT...AACAG|GTG | 1 | 1 | 79.478 |
| 179734333 | GT-AG | 0 | 0.0041500385215413 | 83 | rna-XM_047145044.1 32191447 | 29 | 192944506 | 192944588 | Schistocerca americana 7009 | CGA|GTATGTATTG...AATTTATTATAT/ATATAATTCATC...TACAG|GGA | 2 | 1 | 81.847 |
| 179734334 | GT-AG | 0 | 1.000000099473604e-05 | 7482 | rna-XM_047145044.1 32191447 | 30 | 192936747 | 192944228 | Schistocerca americana 7009 | ATG|GTAAGATGCA...ATAACTTTATCT/TGCATTTTGATT...CTTAG|ATC | 0 | 1 | 87.41 |
| 179734335 | GT-AG | 0 | 0.0003222687543757 | 334 | rna-XM_047145044.1 32191447 | 31 | 192936161 | 192936494 | Schistocerca americana 7009 | CAG|GTTTGTATGC...TAATTCTTAACT/TAATTCTTAACT...TCTAG|TTT | 0 | 1 | 92.47 |
| 179734336 | GT-AG | 0 | 1.000000099473604e-05 | 5766 | rna-XM_047145044.1 32191447 | 32 | 192930245 | 192936010 | Schistocerca americana 7009 | GAG|GTTAGAAACT...ATTTCTTCAACA/TTGAGACTCATA...TTCAG|GCT | 0 | 1 | 95.482 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);