introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
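The column descriptions imply a few simple constraints (flag columns are 0 or 1, phase is 0, 1, 2 or null, ordinal positions start at 1). A minimal sketch of how a client might mirror one row and check those constraints is given here; the `IntronRow` class and `check_row` helper are hypothetical names introduced for illustration and are not part of the database.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class IntronRow:
    """Hypothetical in-memory mirror of one row of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: int
    score: float
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]  # None stands in for SQL NULL
    in_cds: int
    relative_position: float


def check_row(row: IntronRow) -> List[str]:
    """Return the violated constraints (empty list if the row looks consistent)."""
    problems = []
    if row.is_minor not in (0, 1):
        problems.append("is_minor must be 0 or 1")
    if not 0.0 <= row.score <= 100.0:
        problems.append("score must lie in 0-100")
    if row.phase not in (None, 0, 1, 2):
        problems.append("phase must be 0, 1, 2 or None")
    if row.in_cds not in (0, 1):
        problems.append("in_cds must be 0 or 1")
    if row.ordinal_index < 1:
        problems.append("ordinal_index is 1-based")
    return problems


# Values copied from the row with id 179733895 (transcript 32191431).
example = IntronRow(
    id=179733895, dinucleotide_pair="GT-AG", is_minor=0,
    score=0.0007444759336748, length=84, transcript_id=32191431,
    ordinal_index=2, start=690842384, end=690842467, taxonomy_id=7009,
    scored_motifs="GAG|GTATGTTGTA...GGTGCCTCATTC/GGGTGCCTCATT...TTTAG|GTC",
    phase=1, in_cds=1, relative_position=2.911,
)
print(check_row(example))
```

The checks only encode what the column descriptions state; any stricter rule (for example, tying a null phase to in_cds = 0) would be an additional assumption.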
31 rows where transcript_id = 32191431
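The filter above can be reproduced against a local SQLite copy of the database. The sketch below is an illustration only: it builds a small in-memory stand-in with three rows copied from this page (a real copy would hold all 31), and the helper name `introns_in_transcript_order` is hypothetical. It sorts by ordinal_index rather than id, so the first intron of the transcript comes first.

```python
import sqlite3

# In-memory stand-in for a local copy of the database (assumption:
# only four of the fourteen columns are needed for this illustration).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (179733895, 32191431, 2, 84),
        (179733896, 32191431, 3, 31545),
        (179750126, 32191431, 1, 5914),
    ],
)


def introns_in_transcript_order(conn, transcript_id):
    """Return (id, ordinal_index, length) tuples sorted by position in the transcript."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


rows = introns_in_transcript_order(conn, 32191431)
for row in rows:
    print(row)
```

Passing the transcript id as a bound parameter rather than formatting it into the SQL string keeps the query safe to reuse with untrusted input.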
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179733895 | GT-AG | 0 | 0.0007444759336748 | 84 | rna-XM_047140056.1 32191431 | 2 | 690842384 | 690842467 | Schistocerca americana 7009 | GAG\|GTATGTTGTA...GGTGCCTCATTC/GGGTGCCTCATT...TTTAG\|GTC | 1 | 1 | 2.911 |
| 179733896 | GT-AG | 0 | 1.000000099473604e-05 | 31545 | rna-XM_047140056.1 32191431 | 3 | 690842599 | 690874143 | Schistocerca americana 7009 | AAG\|GTGAATCCTT...TGTATTTTCATT/TGTATTTTCATT...TTTAG\|GCA | 0 | 1 | 5.309 |
| 179733897 | GT-AG | 0 | 2.449816704581049e-05 | 82 | rna-XM_047140056.1 32191431 | 4 | 690874280 | 690874361 | Schistocerca americana 7009 | ATA\|GTAAGTCTGA...GTTATTTTATTC/ATTTTATTCATG...TACAG\|GCT | 1 | 1 | 7.799 |
| 179733898 | GT-AG | 0 | 0.0005198427662534 | 16412 | rna-XM_047140056.1 32191431 | 5 | 690874409 | 690890820 | Schistocerca americana 7009 | GAT\|GTAAGTTTTT...TTTTGCTTACTC/TTTTTGCTTACT...TACAG\|GTA | 0 | 1 | 8.66 |
| 179733899 | GC-AG | 0 | 1.000000099473604e-05 | 285 | rna-XM_047140056.1 32191431 | 6 | 690891025 | 690891309 | Schistocerca americana 7009 | AAA\|GCAAGTAGTT...TAATTCATAACT/GAGTAATTCATA...CTTAG\|ATT | 0 | 1 | 12.395 |
| 179733900 | GC-AG | 0 | 1.000000099473604e-05 | 155 | rna-XM_047140056.1 32191431 | 7 | 690891514 | 690891668 | Schistocerca americana 7009 | GAA\|GCAAGTATAT...TAATTTATGATT/AAAATACTAATA...TACAG\|CAC | 0 | 1 | 16.13 |
| 179733901 | GT-AG | 0 | 1.000000099473604e-05 | 6447 | rna-XM_047140056.1 32191431 | 8 | 690891783 | 690898229 | Schistocerca americana 7009 | CAG\|GTCAGATCAT...AAAGTCATGACT/CTGAAATTAATG...TACAG\|GTT | 0 | 1 | 18.217 |
| 179733902 | GT-AG | 0 | 0.000558598218303 | 1706 | rna-XM_047140056.1 32191431 | 9 | 690898418 | 690900123 | Schistocerca americana 7009 | TCG\|GTATGTGATG...AACTTTTTAGTG/TATCAATTTATT...TTCAG\|GTA | 2 | 1 | 21.659 |
| 179733903 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_047140056.1 32191431 | 10 | 690900251 | 690900358 | Schistocerca americana 7009 | CAG\|GTGTGGCTTT...GTATTGTTATTT/GGTATTGTTATT...TTTAG\|GAA | 0 | 1 | 23.984 |
| 179733904 | GT-AG | 0 | 1.000000099473604e-05 | 42105 | rna-XM_047140056.1 32191431 | 11 | 690900481 | 690942585 | Schistocerca americana 7009 | CAG\|GTAAATATGA...TATTACTTGATC/CCGTCTTTCATT...TCCAG\|CGA | 2 | 1 | 26.218 |
| 179733905 | GT-AG | 0 | 1.000000099473604e-05 | 6139 | rna-XM_047140056.1 32191431 | 12 | 690942710 | 690948848 | Schistocerca americana 7009 | AGG\|GTTTGTACAT...TTTTTTGTAAAT/TTTTTTGTAAAT...CATAG\|GGT | 0 | 1 | 28.488 |
| 179733906 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_047140056.1 32191431 | 13 | 690949016 | 690949094 | Schistocerca americana 7009 | CAG\|GTTTGTATAT...TGTTTTCTAATT/TGTTTTCTAATT...TTCAG\|AAA | 2 | 1 | 31.545 |
| 179733907 | GT-AG | 0 | 1.3494096684635744e-05 | 2979 | rna-XM_047140056.1 32191431 | 14 | 690949135 | 690952113 | Schistocerca americana 7009 | GAG\|GTAAAATTTG...ATTTTCTAAACT/CATTTTCTAAAC...TTTAG\|CCA | 0 | 1 | 32.278 |
| 179733908 | GT-AG | 0 | 0.0001564938710365 | 7496 | rna-XM_047140056.1 32191431 | 15 | 690952194 | 690959689 | Schistocerca americana 7009 | AGT\|GTAAGTATTA...CCTCTCTTATAT/CCAAAATTTATT...TTTAG\|TTC | 2 | 1 | 33.742 |
| 179733909 | GT-AG | 0 | 0.0001187355735552 | 18047 | rna-XM_047140056.1 32191431 | 16 | 690959775 | 690977821 | Schistocerca americana 7009 | TTT\|GTAAGTATAT...CCATTTTTAATA/TGTATTTTCATT...TACAG\|TGG | 0 | 1 | 35.298 |
| 179733910 | GT-AG | 0 | 1.4246004285120306e-05 | 38284 | rna-XM_047140056.1 32191431 | 17 | 690977941 | 691016224 | Schistocerca americana 7009 | CAC\|GTAAGTGATG...TTCTTCTTACCT/ATTTTATTAATT...GTCAG\|GTA | 2 | 1 | 37.477 |
| 179733911 | GT-AG | 0 | 1.000000099473604e-05 | 184 | rna-XM_047140056.1 32191431 | 18 | 691016338 | 691016521 | Schistocerca americana 7009 | ATG\|GTGAGTCCTA...CTTTTTTTATTT/ACTTTTTTTATT...TCCAG\|GTA | 1 | 1 | 39.546 |
| 179733912 | GT-AG | 0 | 1.000000099473604e-05 | 17482 | rna-XM_047140056.1 32191431 | 19 | 691017569 | 691035050 | Schistocerca americana 7009 | AAG\|GTAAGAAAAG...TTTCTTGTGATT/TTTCTTGTGATT...GCCAG\|CTA | 1 | 1 | 58.715 |
| 179733913 | GT-AG | 0 | 1.000000099473604e-05 | 14233 | rna-XM_047140056.1 32191431 | 20 | 691035212 | 691049444 | Schistocerca americana 7009 | AAG\|GTAACAAAAC...AAATACTTACCA/CAAATACTTACC...ATCAG\|ACT | 0 | 1 | 61.662 |
| 179733914 | GT-AG | 0 | 5.241313357099283e-05 | 10908 | rna-XM_047140056.1 32191431 | 21 | 691049673 | 691060580 | Schistocerca americana 7009 | CAG\|GTAAACATTG...TTGTATTTAACT/TTGTATTTAACT...TTTAG\|GTG | 0 | 1 | 65.837 |
| 179733915 | GT-AG | 0 | 1.000000099473604e-05 | 23608 | rna-XM_047140056.1 32191431 | 22 | 691060724 | 691084331 | Schistocerca americana 7009 | AAG\|GTGTGTGGGA...AAATTCTTGATT/AAATTCTTGATT...TCCAG\|GAA | 2 | 1 | 68.455 |
| 179733916 | GT-AG | 0 | 1.031791263417206e-05 | 5121 | rna-XM_047140056.1 32191431 | 23 | 691084413 | 691089533 | Schistocerca americana 7009 | GAT\|GTAAGTAATA...GAATCATTATTT/TTCTATCTCATC...TACAG\|TTA | 2 | 1 | 69.938 |
| 179733917 | GT-AG | 0 | 1.000000099473604e-05 | 16311 | rna-XM_047140056.1 32191431 | 24 | 691089688 | 691105998 | Schistocerca americana 7009 | AAG\|GTCAGTAACA...TATTATTTATTT/ATATTATTTATT...TTTAG\|GAG | 0 | 1 | 72.757 |
| 179733918 | GT-AG | 0 | 1.000000099473604e-05 | 11418 | rna-XM_047140056.1 32191431 | 25 | 691106300 | 691117717 | Schistocerca americana 7009 | ACA\|GTAAGTAAAA...AATTGATTGATC/AATTGATTGATC...TTCAG\|GTG | 1 | 1 | 78.268 |
| 179733919 | GT-AG | 0 | 0.0015181928539981 | 9755 | rna-XM_047140056.1 32191431 | 26 | 691117805 | 691127559 | Schistocerca americana 7009 | AGT\|GTAAGTTTAC...GTACTCTTAATA/TAATAACTGATC...TATAG\|TTC | 1 | 1 | 79.861 |
| 179733920 | GT-AG | 0 | 1.000000099473604e-05 | 4244 | rna-XM_047140056.1 32191431 | 27 | 691127664 | 691131907 | Schistocerca americana 7009 | AAG\|GTAAGACATC...TTTTGTTTAGCT/GTTTTGTTTAGC...TACAG\|TTG | 0 | 1 | 81.765 |
| 179733921 | GT-AG | 0 | 1.000000099473604e-05 | 9687 | rna-XM_047140056.1 32191431 | 28 | 691132438 | 691142124 | Schistocerca americana 7009 | AAG\|GTAAGAGTAT...TAATTTTTAATC/TAATTTTTAATC...TTTAG\|GGC | 2 | 1 | 91.468 |
| 179733922 | GT-AG | 0 | 1.000000099473604e-05 | 6675 | rna-XM_047140056.1 32191431 | 29 | 691142206 | 691148880 | Schistocerca americana 7009 | ACG\|GTAAGTATTT...TGTTTATTATTT/TTATTATTTATG...AACAG\|AGA | 2 | 1 | 92.951 |
| 179733923 | GT-AG | 0 | 1.000000099473604e-05 | 5558 | rna-XM_047140056.1 32191431 | 30 | 691149009 | 691154566 | Schistocerca americana 7009 | CAG\|GTAAATGAAA...GTACCTTTACTT/TTTGTTCTCAAG...CACAG\|GAC | 1 | 1 | 95.295 |
| 179733924 | GT-AG | 0 | 0.0001295612854277 | 11997 | rna-XM_047140056.1 32191431 | 31 | 691154734 | 691166730 | Schistocerca americana 7009 | AAG\|GTATGTCATT...TATTGTTTGATA/TATTGTTTGATA...TTCAG\|TGT | 0 | 1 | 98.352 |
| 179750126 | GT-AG | 0 | 0.023227733449098 | 5914 | rna-XM_047140056.1 32191431 | 1 | 690836370 | 690842283 | Schistocerca americana 7009 | GAG\|GTATATTCCT...TAATTCTTAAAC/TTAATTCTTAAA...TTCAG\|GTG | | 0 | 1.3 |
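The coordinates in the rows above appear to be inclusive, so that length = end - start + 1. The sketch below checks this on the first three introns of the transcript, using values copied from the table. The `gap_between` helper additionally computes the number of bases between consecutive introns; reading that gap as the length of the intervening exon is an interpretation, not something this page states.

```python
# (start, end, length) keyed by ordinal_index, copied from the rows above.
introns = {
    1: (690836370, 690842283, 5914),
    2: (690842384, 690842467, 84),
    3: (690842599, 690874143, 31545),
}


def inclusive_length(start, end):
    """Length of an interval whose start and end are both included."""
    return end - start + 1


def gap_between(prev_end, next_start):
    """Number of bases strictly between two consecutive introns."""
    return next_start - prev_end - 1


# Every listed length should match the inclusive-coordinate formula.
for ordinal, (start, end, length) in sorted(introns.items()):
    assert inclusive_length(start, end) == length, ordinal

# Gaps between introns 1-2 and 2-3.
ordinals = sorted(introns)
gaps = [
    gap_between(introns[a][1], introns[b][0])
    for a, b in zip(ordinals, ordinals[1:])
]
print(gaps)  # [100, 131]
```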
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
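The indexes above suggest that lookups by transcript_id should not need a full table scan. A minimal way to confirm this on a local SQLite copy is `EXPLAIN QUERY PLAN`. The sketch below uses a trimmed version of the schema (three columns, no foreign keys) in an empty in-memory database, so it only illustrates the check; the helper name `plan_details` is hypothetical, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

# Trimmed copy of the schema: enough columns and indexes for the illustration.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "transcript_id" INTEGER,
  "is_minor" INTEGER,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)


def plan_details(conn, sql, params=()):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The detail text is the last column of each plan row.
    return [row[-1] for row in rows]


details = plan_details(
    conn,
    "SELECT id FROM introns WHERE transcript_id = ?",
    (32191431,),
)
for line in details:
    print(line)
```

If the printed plan mentions `idx_introns_transcript_id`, the filter is served by the index; a plan that starts with `SCAN` instead would indicate a full table scan.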