introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
31 rows where transcript_id = 32191454
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179734462 | GT-AG | 0 | 0.0005394688602169 | 173 | rna-XM_047141357.1 32191454 | 1 | 813023167 | 813023339 | Schistocerca americana 7009 | CAG|GTATGCGTCA...CATTGTTTATCG/TCATTGTTTATC...TTCAG|GTG | 0 | 1 | 3.736 |
| 179734463 | GT-AG | 0 | 3.1319907662336e-05 | 4956 | rna-XM_047141357.1 32191454 | 2 | 813023461 | 813028416 | Schistocerca americana 7009 | TGC|GTAAGATCTC...CAAATCTTAAAA/CTCTGTTTAATA...GCAAG|GCC | 1 | 1 | 6.247 |
| 179734464 | GT-AG | 0 | 1.000000099473604e-05 | 198 | rna-XM_047141357.1 32191454 | 3 | 813028539 | 813028736 | Schistocerca americana 7009 | GAG|GTAATATAAA...AAAGACATGATT/CATGATTGCACT...TTCAG|TTG | 0 | 1 | 8.78 |
| 179734465 | GT-AG | 0 | 1.000000099473604e-05 | 92 | rna-XM_047141357.1 32191454 | 4 | 813028932 | 813029023 | Schistocerca americana 7009 | GAA|GTAATGTACT...TGTTTCATATTT/TTATATCTAATA...TTAAG|GAA | 0 | 1 | 12.827 |
| 179734466 | GT-AG | 0 | 0.001394760487017 | 15692 | rna-XM_047141357.1 32191454 | 5 | 813029160 | 813044851 | Schistocerca americana 7009 | CAG|GTATGTTGGC...ACTTTTTTATTT/TAATTATTCATC...TTTAG|AGA | 1 | 1 | 15.65 |
| 179734467 | GT-AG | 0 | 1.000000099473604e-05 | 800 | rna-XM_047141357.1 32191454 | 6 | 813044989 | 813045788 | Schistocerca americana 7009 | AAG|GTAAAACTCA...TATTTTTTGGTG/ATGAAATTAACA...CACAG|AAA | 0 | 1 | 18.493 |
| 179734468 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_047141357.1 32191454 | 7 | 813045912 | 813046000 | Schistocerca americana 7009 | ATG|GTAAGGAAAC...TGACTTTTATCT/ACATTTTTTACT...AACAG|ACA | 0 | 1 | 21.046 |
| 179734469 | GT-AG | 0 | 1.000000099473604e-05 | 23933 | rna-XM_047141357.1 32191454 | 8 | 813046117 | 813070049 | Schistocerca americana 7009 | AAA|GTGAGAAATA...AGTATATTAATA/AGTATATTAATA...TTCAG|GGA | 2 | 1 | 23.454 |
| 179734470 | GT-AG | 0 | 1.0498026428906544e-05 | 6145 | rna-XM_047141357.1 32191454 | 9 | 813070186 | 813076330 | Schistocerca americana 7009 | GAG|GTCTGGTACT...GTGTGCTTAACA/TACTATTTAATA...TTCAG|ACA | 0 | 1 | 26.276 |
| 179734471 | GT-AG | 0 | 1.000000099473604e-05 | 10636 | rna-XM_047141357.1 32191454 | 10 | 813076522 | 813087157 | Schistocerca americana 7009 | GCG|GTTAGTAGAT...TTTACATTATAT/ATAATATTTACA...TGCAG|GGA | 2 | 1 | 30.241 |
| 179734472 | GT-AG | 0 | 0.0001165160619107 | 3391 | rna-XM_047141357.1 32191454 | 11 | 813087300 | 813090690 | Schistocerca americana 7009 | GAG|GTAACAAATG...ACTCTTTTAATT/ACTCTTTTAATT...TACAG|AAT | 0 | 1 | 33.188 |
| 179734473 | GT-AG | 0 | 1.0415539183496466e-05 | 14577 | rna-XM_047141357.1 32191454 | 12 | 813090821 | 813105397 | Schistocerca americana 7009 | CAA|GTAAGATTAT...GAAATTTTAACT/GAAATTTTAACT...CCCAG|GGA | 1 | 1 | 35.886 |
| 179734474 | GT-AG | 0 | 1.3152493514453273e-05 | 6375 | rna-XM_047141357.1 32191454 | 13 | 813105584 | 813111958 | Schistocerca americana 7009 | ATG|GTAATTATGG...TATTTCTGAATT/ATATTTCTGAAT...TAAAG|GCA | 1 | 1 | 39.747 |
| 179734475 | GT-AG | 0 | 1.000000099473604e-05 | 6613 | rna-XM_047141357.1 32191454 | 14 | 813112125 | 813118737 | Schistocerca americana 7009 | AAG|GTAGTATGGA...TTGGCCATAGTA/AGTAAAATAAAT...CACAG|GCG | 2 | 1 | 43.192 |
| 179734476 | GT-AG | 0 | 1.000000099473604e-05 | 12734 | rna-XM_047141357.1 32191454 | 15 | 813118937 | 813131670 | Schistocerca americana 7009 | AAT|GTGAGTGTTT...AATTGTTTAGCA/ATCTCACTAACT...TTTAG|GAG | 0 | 1 | 47.323 |
| 179734477 | GT-AG | 0 | 0.0080684686226277 | 5509 | rna-XM_047141357.1 32191454 | 16 | 813131914 | 813137422 | Schistocerca americana 7009 | CAG|GTACACTATG...CAGGTCTTGATG/CAGGTCTTGATG...AACAG|GAA | 0 | 1 | 52.366 |
| 179734478 | GT-AG | 0 | 1.367426481403915e-05 | 18024 | rna-XM_047141357.1 32191454 | 17 | 813137549 | 813155572 | Schistocerca americana 7009 | GGA|GTAAGTAACT...ACTTTCTTAGAC/CTTATATTTATG...GACAG|GAA | 0 | 1 | 54.981 |
| 179734479 | GT-AG | 0 | 1.000000099473604e-05 | 27017 | rna-XM_047141357.1 32191454 | 18 | 813155652 | 813182668 | Schistocerca americana 7009 | AAA|GTAAGTCCCC...TGTTTTTTCGCG/AAGAAGTTGACT...CACAG|GTG | 1 | 1 | 56.621 |
| 179734480 | GT-AG | 0 | 1.000000099473604e-05 | 7635 | rna-XM_047141357.1 32191454 | 19 | 813182809 | 813190443 | Schistocerca americana 7009 | ATA|GTAAGTAATT...TAATGTTTAATC/TTTAATCTGACA...TTCAG|GTA | 0 | 1 | 59.527 |
| 179734481 | GT-AG | 0 | 1.000000099473604e-05 | 4031 | rna-XM_047141357.1 32191454 | 20 | 813190610 | 813194640 | Schistocerca americana 7009 | TTG|GTGAGTATAA...TTTGTTTTACCC/ATTTGTTTTACC...TCCAG|CAA | 1 | 1 | 62.972 |
| 179734482 | GT-AG | 0 | 1.000000099473604e-05 | 976 | rna-XM_047141357.1 32191454 | 21 | 813194822 | 813195797 | Schistocerca americana 7009 | CAG|GTAATGTTTT...ATATTTTTATTA/CATATTTTTATT...TTTAG|GGT | 2 | 1 | 66.729 |
| 179734483 | GT-AG | 0 | 0.0001697046371003 | 620 | rna-XM_047141357.1 32191454 | 22 | 813195952 | 813196571 | Schistocerca americana 7009 | CAG|GTATTGCATT...TAATTTTTGATA/CTTTTGTTAATT...TCCAG|ACA | 0 | 1 | 69.925 |
| 179734484 | GT-AG | 0 | 1.000000099473604e-05 | 2035 | rna-XM_047141357.1 32191454 | 23 | 813196719 | 813198753 | Schistocerca americana 7009 | GAG|GTACGATAAT...TATCTTTCAACT/TTTCAACTGATT...CACAG|GAG | 0 | 1 | 72.976 |
| 179734485 | GT-AG | 0 | 0.0001097503123072 | 3687 | rna-XM_047141357.1 32191454 | 24 | 813198931 | 813202617 | Schistocerca americana 7009 | GAG|GTATGTAATT...ACAGTTTTAGTG/CAATAATTTACG...TTTAG|GAA | 0 | 1 | 76.65 |
| 179734486 | GT-AG | 0 | 1.000000099473604e-05 | 92 | rna-XM_047141357.1 32191454 | 25 | 813202753 | 813202844 | Schistocerca americana 7009 | CAG|GTAAGCAGAT...TTCTGTTTGACT/TTCTGTTTGACT...GGTAG|GAA | 0 | 1 | 79.452 |
| 179734487 | GT-AG | 0 | 0.0001779022222177 | 4306 | rna-XM_047141357.1 32191454 | 26 | 813203018 | 813207323 | Schistocerca americana 7009 | TCG|GTATGGTGAA...ATCCCCCTATTT/CCCCTATTTATA...TTCAG|ATT | 2 | 1 | 83.043 |
| 179734488 | GT-AG | 0 | 1.000000099473604e-05 | 8313 | rna-XM_047141357.1 32191454 | 27 | 813207474 | 813215786 | Schistocerca americana 7009 | AAG|GTAAGTGTAA...GACACTTTGATA/ATATTGCTAATT...TGCAG|TGT | 2 | 1 | 86.156 |
| 179734489 | GT-AG | 0 | 1.000000099473604e-05 | 12283 | rna-XM_047141357.1 32191454 | 28 | 813215986 | 813228268 | Schistocerca americana 7009 | GAG|GTAAAGACAC...GTGTCTTTTACT/CTTTTACTGACA...TTCAG|AGT | 0 | 1 | 90.286 |
| 179734490 | GT-AG | 0 | 0.0344735038704509 | 2263 | rna-XM_047141357.1 32191454 | 29 | 813228370 | 813230632 | Schistocerca americana 7009 | AAG|GTATCATTCT...ATGACTATATTT/TCAGTATTCATT...TGCAG|AGC | 2 | 1 | 92.383 |
| 179734491 | GT-AG | 0 | 1.000000099473604e-05 | 6836 | rna-XM_047141357.1 32191454 | 30 | 813230784 | 813237619 | Schistocerca americana 7009 | AAG|GTTGGTTGCT...ATTTTCATGAAA/ACTATTTTCATG...TTCAG|GCA | 0 | 1 | 95.517 |
| 179734492 | GT-AG | 0 | 1.000000099473604e-05 | 4181 | rna-XM_047141357.1 32191454 | 31 | 813237800 | 813241980 | Schistocerca americana 7009 | AAG|GTAATTTAGC...GTCCTGTTATTT/TATGCATTTACC...TTCAG|GAG | 0 | 1 | 99.253 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);