introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
52 rows where transcript_id = 32191383
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179732713 | GT-AG | 0 | 0.0001066020884423 | 20125 | rna-XM_047136614.1 32191383 | 1 | 514873633 | 514893757 | Schistocerca americana 7009 | CAG|GTATAGTAAA...GTTTTCTTGTTT/TTTTTGTTGAAT...TTCAG|TTG | 0 | 1 | 0.715 |
| 179732714 | GT-AG | 0 | 1.000000099473604e-05 | 8518 | rna-XM_047136614.1 32191383 | 2 | 514893902 | 514902419 | Schistocerca americana 7009 | ATG|GTAAGTTCAA...GGTGTTTTACAT/CATATATTTACA...TTCAG|TTT | 0 | 1 | 2.429 |
| 179732715 | GT-AG | 0 | 3.113033675155172e-05 | 6401 | rna-XM_047136614.1 32191383 | 3 | 514902601 | 514909001 | Schistocerca americana 7009 | TAG|GTATGATCCC...CATCTCATATAT/CTGAAACTCATT...TTCAG|ATG | 1 | 1 | 4.585 |
| 179732716 | GT-AG | 0 | 0.0001727458385915 | 24144 | rna-XM_047136614.1 32191383 | 4 | 514909114 | 514933257 | Schistocerca americana 7009 | AAG|GTATGTATAA...CTTGATTTAATT/CTTGATTTAATT...CCCAG|GTT | 2 | 1 | 5.919 |
| 179732717 | GT-AG | 0 | 1.000000099473604e-05 | 6184 | rna-XM_047136614.1 32191383 | 5 | 514933365 | 514939548 | Schistocerca americana 7009 | ATG|GTAAGTTAGT...ATGACTTTATTG/AATGACTTTATT...TACAG|AAA | 1 | 1 | 7.193 |
| 179732718 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_047136614.1 32191383 | 6 | 514939668 | 514939775 | Schistocerca americana 7009 | CAG|GTAAGTTCGT...TGTTCATTAGTG/GTTGTTTTCATA...CACAG|AAA | 0 | 1 | 8.61 |
| 179732719 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-XM_047136614.1 32191383 | 7 | 514939905 | 514940017 | Schistocerca americana 7009 | CCA|GTCAGTATTT...GAATTTCTAGCT/CTGTGTGTAATT...AGCAG|AAA | 0 | 1 | 10.146 |
| 179732720 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_047136614.1 32191383 | 8 | 514940261 | 514940343 | Schistocerca americana 7009 | AAG|GTGAGGATAA...GATCTCTTATTG/GGATCTCTTATT...TTCAG|GCA | 0 | 1 | 13.04 |
| 179732721 | GT-AG | 0 | 1.000000099473604e-05 | 34807 | rna-XM_047136614.1 32191383 | 9 | 514940535 | 514975341 | Schistocerca americana 7009 | CAG|GTGAGAATAT...CTTTTTTTAGAA/GATATTTTAAAT...TACAG|GAT | 2 | 1 | 15.315 |
| 179732722 | GT-AG | 0 | 0.0091622577625306 | 12246 | rna-XM_047136614.1 32191383 | 10 | 514975487 | 514987732 | Schistocerca americana 7009 | ACG|GTATGTTGTA...TATTCATTGATT/ATTGTATTCATT...TGCAG|AGC | 0 | 1 | 17.042 |
| 179732723 | GT-AG | 0 | 1.000000099473604e-05 | 4359 | rna-XM_047136614.1 32191383 | 11 | 514987862 | 514992220 | Schistocerca americana 7009 | AAT|GTAAGTGTAA...ATTTACATATCT/GATCCATTTACA...AACAG|AAT | 0 | 1 | 18.578 |
| 179732724 | GT-AG | 0 | 1.000000099473604e-05 | 6950 | rna-XM_047136614.1 32191383 | 12 | 514992415 | 514999364 | Schistocerca americana 7009 | GAG|GTGAGTAGAA...AGTATTTTAATC/AGTATTTTAATC...TTCAG|TTC | 2 | 1 | 20.888 |
| 179732725 | GT-AG | 0 | 1.000000099473604e-05 | 9244 | rna-XM_047136614.1 32191383 | 13 | 514999549 | 515008792 | Schistocerca americana 7009 | GAG|GTACAGTACA...ATTCTCTTTCCA/CATTTCATCATT...TTTAG|GTT | 0 | 1 | 23.08 |
| 179732726 | GT-AG | 0 | 1.000000099473604e-05 | 3600 | rna-XM_047136614.1 32191383 | 14 | 515008980 | 515012579 | Schistocerca americana 7009 | CAG|GTTTGTGTCA...TACTTCTTGCCT/ATATTTTTCAAT...CACAG|CCA | 1 | 1 | 25.307 |
| 179732727 | GT-AG | 0 | 1.000000099473604e-05 | 2590 | rna-XM_047136614.1 32191383 | 15 | 515012622 | 515015211 | Schistocerca americana 7009 | AGG|GTAAGATTGT...CTATTTTTGAAA/CTATTTTTGAAA...TACAG|GAC | 1 | 1 | 25.807 |
| 179732728 | GT-AG | 0 | 0.0004021100454735 | 2354 | rna-XM_047136614.1 32191383 | 16 | 515015286 | 515017639 | Schistocerca americana 7009 | CCA|GTAAGTGTCT...GTTTCCTTAGTT/ATGTATTTTATT...CTAAG|GCA | 0 | 1 | 26.688 |
| 179732729 | GT-AG | 0 | 2.0521472680284625e-05 | 269 | rna-XM_047136614.1 32191383 | 17 | 515017793 | 515018061 | Schistocerca americana 7009 | GAG|GTAATTTTAC...AATGCATTAATG/TAATGTTTGATT...TGCAG|GAC | 0 | 1 | 28.51 |
| 179732730 | GT-AG | 0 | 1.2192085872988244e-05 | 3925 | rna-XM_047136614.1 32191383 | 18 | 515018244 | 515022168 | Schistocerca americana 7009 | ACA|GTAAGTCATT...GTAACATTGACA/CAATGATTCAAT...TGCAG|GTT | 2 | 1 | 30.678 |
| 179732731 | GT-AG | 0 | 1.000000099473604e-05 | 1415 | rna-XM_047136614.1 32191383 | 19 | 515022341 | 515023755 | Schistocerca americana 7009 | CAG|GTCAGTGAAA...GGATTTTTACAG/GGGATTTTTACA...TTCAG|GTT | 0 | 1 | 32.726 |
| 179732732 | GT-AG | 0 | 1.000000099473604e-05 | 5153 | rna-XM_047136614.1 32191383 | 20 | 515023896 | 515029048 | Schistocerca americana 7009 | AAG|GTAGAGTATA...AAGTGCTTATTA/GAAGTGCTTATT...TACAG|GTA | 2 | 1 | 34.393 |
| 179732733 | GT-AG | 0 | 1.000000099473604e-05 | 570 | rna-XM_047136614.1 32191383 | 21 | 515029265 | 515029834 | Schistocerca americana 7009 | GAG|GTAATAACAT...TTATCCTTCTTT/TATTTATTTATT...GATAG|AGA | 2 | 1 | 36.966 |
| 179732734 | GT-AG | 0 | 1.000000099473604e-05 | 3962 | rna-XM_047136614.1 32191383 | 22 | 515029952 | 515033913 | Schistocerca americana 7009 | TAA|GTAAGTAAAT...CATCTTTTAATT/CATCTTTTAATT...TCCAG|GTA | 2 | 1 | 38.359 |
| 179732735 | GT-AG | 0 | 0.0002699893221128 | 8040 | rna-XM_047136614.1 32191383 | 23 | 515034093 | 515042132 | Schistocerca americana 7009 | TTG|GTAATCCATT...GTGTTTTTATTT/TTTTTACTTATT...CATAG|ATC | 1 | 1 | 40.491 |
| 179732736 | GC-AG | 0 | 1.000000099473604e-05 | 9164 | rna-XM_047136614.1 32191383 | 24 | 515042345 | 515051508 | Schistocerca americana 7009 | ATG|GCAAGTATAA...ATTTTGTTATAT/AATTTTGTTATA...TTCAG|GAC | 0 | 1 | 43.015 |
| 179732737 | GT-AG | 0 | 1.000000099473604e-05 | 2114 | rna-XM_047136614.1 32191383 | 25 | 515051766 | 515053879 | Schistocerca americana 7009 | AAG|GTAAGTTGCT...CATTTGTTAAAC/ATGGCTCTCATT...CACAG|ATT | 2 | 1 | 46.076 |
| 179732738 | GT-AG | 0 | 3.952927874394477e-05 | 13276 | rna-XM_047136614.1 32191383 | 26 | 515053973 | 515067248 | Schistocerca americana 7009 | TAA|GTAAGTTCCA...TGAGGCTTAAAA/ACTATAATCATA...TTCAG|ATT | 2 | 1 | 47.184 |
| 179732739 | GT-AG | 0 | 1.000000099473604e-05 | 223 | rna-XM_047136614.1 32191383 | 27 | 515067383 | 515067605 | Schistocerca americana 7009 | TTG|GTGAGTTCGT...ATAGTTTTCACA/ATAGTTTTCACA...CTCAG|TTT | 1 | 1 | 48.779 |
| 179732740 | GT-AG | 0 | 0.0130890213440314 | 77 | rna-XM_047136614.1 32191383 | 28 | 515067781 | 515067857 | Schistocerca americana 7009 | AAG|GTATATATTT...ATTACTTTATTT/TATTACTTTATT...TTCAG|ATT | 2 | 1 | 50.863 |
| 179732741 | GT-AG | 0 | 0.0043116197874235 | 99 | rna-XM_047136614.1 32191383 | 29 | 515068098 | 515068196 | Schistocerca americana 7009 | ACA|GTAAGTTTAT...TTTTTCTTAAAT/ATTTTTCTTAAA...CGTAG|TTT | 2 | 1 | 53.722 |
| 179732742 | GT-AG | 0 | 1.000000099473604e-05 | 2354 | rna-XM_047136614.1 32191383 | 30 | 515068389 | 515070742 | Schistocerca americana 7009 | CAA|GTGAAGTATT...TCAGTTTTGACT/TCAGTTTTGACT...TACAG|GAT | 2 | 1 | 56.008 |
| 179732743 | GT-AG | 0 | 1.000000099473604e-05 | 137 | rna-XM_047136614.1 32191383 | 31 | 515070866 | 515071002 | Schistocerca americana 7009 | TAG|GTAGGTTAAT...AGTTCGCTAATG/AGTTCGCTAATG...TGCAG|GAC | 2 | 1 | 57.473 |
| 179732744 | GT-AG | 0 | 1.000000099473604e-05 | 20822 | rna-XM_047136614.1 32191383 | 32 | 515071142 | 515091963 | Schistocerca americana 7009 | AAG|GTATGACCTA...GTGTACTAAACA/ATGGTTATTATT...TTCAG|GGA | 0 | 1 | 59.128 |
| 179732745 | GT-AG | 0 | 1.000000099473604e-05 | 15641 | rna-XM_047136614.1 32191383 | 33 | 515092129 | 515107769 | Schistocerca americana 7009 | AAG|GTAAGATACT...GACATTTTAAAT/GACATTTTAAAT...TACAG|GTT | 0 | 1 | 61.093 |
| 179732746 | GT-AG | 0 | 1.000000099473604e-05 | 13524 | rna-XM_047136614.1 32191383 | 34 | 515108010 | 515121533 | Schistocerca americana 7009 | CCA|GTAAGTGTTT...GTTATTTTGTTT/CAAAAGTTGACA...TCTAG|GAT | 0 | 1 | 63.951 |
| 179732747 | GT-AG | 0 | 1.000000099473604e-05 | 5356 | rna-XM_047136614.1 32191383 | 35 | 515121715 | 515127070 | Schistocerca americana 7009 | AAG|GTAAATTATT...TGAAAATTAATG/ATGTATATCATT...TGTAG|GTC | 1 | 1 | 66.107 |
| 179732748 | GT-AG | 0 | 1.000000099473604e-05 | 185 | rna-XM_047136614.1 32191383 | 36 | 515127255 | 515127439 | Schistocerca americana 7009 | GAG|GTAAAATTAC...CTCTGTTTGATT/TTGATTTTCATT...TTCAG|GTT | 2 | 1 | 68.298 |
| 179732749 | GT-AG | 0 | 1.000000099473604e-05 | 12961 | rna-XM_047136614.1 32191383 | 37 | 515127576 | 515140536 | Schistocerca americana 7009 | CAA|GTGAGTGTAA...ATTATCGTGATC/ATTATCGTGATC...TTCAG|GTA | 0 | 1 | 69.918 |
| 179732750 | GT-AG | 0 | 1.0216184120986432e-05 | 419 | rna-XM_047136614.1 32191383 | 38 | 515140699 | 515141117 | Schistocerca americana 7009 | CGG|GTAAATATCA...TATTCCTAGATG/ATGTGATTGATT...TCTAG|GTG | 0 | 1 | 71.847 |
| 179732751 | GT-AG | 0 | 1.000000099473604e-05 | 12862 | rna-XM_047136614.1 32191383 | 39 | 515141377 | 515154238 | Schistocerca americana 7009 | CAG|GTAAGTGCAA...TCAGTTTCAGTA/ATCAGTTTCAGT...TTCAG|AAG | 1 | 1 | 74.932 |
| 179732752 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_047136614.1 32191383 | 40 | 515154378 | 515154493 | Schistocerca americana 7009 | ATG|GTAAGTATGT...TCTGCCTGTATG/ATAAAAATTACA...TCTAG|GTA | 2 | 1 | 76.587 |
| 179732753 | GT-AG | 0 | 1.000000099473604e-05 | 8628 | rna-XM_047136614.1 32191383 | 41 | 515154597 | 515163224 | Schistocerca americana 7009 | GAG|GTAAAGCCAA...GTATTTTTTCCA/ATTTTTTCCACT...TGCAG|GCT | 0 | 1 | 77.814 |
| 179732754 | GT-AG | 0 | 1.000000099473604e-05 | 9170 | rna-XM_047136614.1 32191383 | 42 | 515163349 | 515172518 | Schistocerca americana 7009 | TAA|GTGAGTGTTA...TTATGTTTAATA/TTTATTGTTATT...AACAG|GTA | 1 | 1 | 79.29 |
| 179732755 | GT-AG | 0 | 5.026592251110367e-05 | 5360 | rna-XM_047136614.1 32191383 | 43 | 515172653 | 515178012 | Schistocerca americana 7009 | TCT|GTAAGTAAAG...GAATTTTTGACT/GAATTTTTGACT...TTCAG|GAA | 0 | 1 | 80.886 |
| 179732756 | GT-AG | 0 | 0.0019773850442337 | 18485 | rna-XM_047136614.1 32191383 | 44 | 515178154 | 515196638 | Schistocerca americana 7009 | AAG|GTAAACTTTC...TTCTTTTTGAAA/TTCTTTTTGAAA...TTCAG|TCT | 0 | 1 | 82.565 |
| 179732757 | GT-AG | 0 | 0.0014432621788161 | 12316 | rna-XM_047136614.1 32191383 | 45 | 515196727 | 515209042 | Schistocerca americana 7009 | AAG|GTATTTTGCA...TTTTTCTGATAT/ATTTTTCTGATA...TACAG|CAA | 1 | 1 | 83.613 |
| 179732758 | GT-AG | 0 | 5.613505656186955e-05 | 14393 | rna-XM_047136614.1 32191383 | 46 | 515209214 | 515223606 | Schistocerca americana 7009 | CAG|GTAAATTTCT...ATGGCTTTAGAA/TAGATTCTGATT...CTCAG|CAC | 1 | 1 | 85.65 |
| 179732759 | GT-AG | 0 | 1.000000099473604e-05 | 2490 | rna-XM_047136614.1 32191383 | 47 | 515223735 | 515226224 | Schistocerca americana 7009 | CAA|GTGAGTAACA...GTAACTTTAAAT/CTTTAAATAATT...TTCAG|CCA | 0 | 1 | 87.174 |
| 179732760 | GT-AG | 0 | 1.000000099473604e-05 | 44005 | rna-XM_047136614.1 32191383 | 48 | 515226417 | 515270421 | Schistocerca americana 7009 | CAG|GTAATTATTA...AACCTTTTAAGG/AGTGAATTGACA...TGTAG|GTT | 0 | 1 | 89.461 |
| 179732761 | GT-AG | 0 | 1.000000099473604e-05 | 14987 | rna-XM_047136614.1 32191383 | 49 | 515270593 | 515285579 | Schistocerca americana 7009 | AAA|GTAAGGGGCT...CATTTTATAATA/CCATTTTTTATG...TCCAG|AAT | 0 | 1 | 91.497 |
| 179732762 | GT-AG | 0 | 1.000000099473604e-05 | 13962 | rna-XM_047136614.1 32191383 | 50 | 515285783 | 515299744 | Schistocerca americana 7009 | TAT|GTAAGTACCT...CCAGTCATAATT/ATTTTTGTCATC...TGCAG|TCA | 2 | 1 | 93.914 |
| 179732763 | GT-AG | 0 | 1.1941481430817735e-05 | 93 | rna-XM_047136614.1 32191383 | 51 | 515299932 | 515300024 | Schistocerca americana 7009 | AAG|GTATGAAGGA...GTTCTTTTATTC/CGTTCTTTTATT...AACAG|AGC | 0 | 1 | 96.141 |
| 179732764 | GT-AG | 0 | 1.000000099473604e-05 | 31625 | rna-XM_047136614.1 32191383 | 52 | 515300212 | 515331836 | Schistocerca americana 7009 | AAG|GTTAGTTAAC...AATTTTTTATGT/ATTTAATTAATT...TACAG|AAC | 1 | 1 | 98.368 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);