introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
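In the rows listed on this page, `length` matches `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. The sketch below checks that relationship on three rows copied from the listing. The in-memory table is a cut-down stand-in for the real database, and the inclusive-coordinate reading is an inference from the data rather than a documented guarantee.

```python
import sqlite3

# In-memory stand-in for the real database; only the columns needed here.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, length INTEGER, "
    '"start" INTEGER, "end" INTEGER)'
)

# Three rows copied from the listing on this page: (id, length, start, end).
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (179732328, 2241, 1129810336, 1129812576),
        (179732329, 113, 1129809977, 1129810089),
        (179732331, 21019, 1129787127, 1129808145),
    ],
)

# Rows whose length disagrees with inclusive coordinates.
# "end" is an SQL keyword, so it is quoted as in the table definition.
mismatches = conn.execute(
    'SELECT id FROM introns WHERE length <> "end" - "start" + 1'
).fetchall()
print(mismatches)
```

Run against the full table, the same `WHERE` clause would list any introns that break the assumed relationship.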
54 rows where transcript_id = 32191372
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179732328 | GT-AG | 0 | 0.0006613179409475 | 2241 | rna-XM_047145406.1 32191372 | 2 | 1129810336 | 1129812576 | Schistocerca americana 7009 | AAG|GTACGTTATC...TTTACTTTAATG/AAGGTGTTTACT...TACAG|GTA | 0 | 1 | 3.692 |
| 179732329 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-XM_047145406.1 32191372 | 3 | 1129809977 | 1129810089 | Schistocerca americana 7009 | CAG|GTAATGAACA...CTCTCATTAATT/TTATCTCTCATT...TTCAG|GTT | 0 | 1 | 5.923 |
| 179732330 | GT-AG | 0 | 0.0001891062991471 | 1485 | rna-XM_047145406.1 32191372 | 4 | 1129808339 | 1129809823 | Schistocerca americana 7009 | CAG|GTATTTACAT...AGATCCGTAGTT/ATTGTTCAGATC...TGCAG|GTA | 0 | 1 | 7.311 |
| 179732331 | GT-AG | 0 | 1.000000099473604e-05 | 21019 | rna-XM_047145406.1 32191372 | 5 | 1129787127 | 1129808145 | Schistocerca americana 7009 | ACA|GTCAGTAGTT...GCAGTGTTAATG/TAACATTTTATG...TTCAG|ATA | 1 | 1 | 9.062 |
| 179732332 | GT-AG | 0 | 1.000000099473604e-05 | 1561 | rna-XM_047145406.1 32191372 | 6 | 1129785379 | 1129786939 | Schistocerca americana 7009 | GAA|GTAAGTCATT...CTCTTTTGGACT/TTTGGACTGATA...TTTAG|GTT | 2 | 1 | 10.758 |
| 179732333 | GT-AG | 0 | 1.000000099473604e-05 | 773 | rna-XM_047145406.1 32191372 | 7 | 1129784439 | 1129785211 | Schistocerca americana 7009 | GAG|GTAAGGGAAA...TATGCTTTAATT/TAATTATTCATT...TACAG|GAG | 1 | 1 | 12.273 |
| 179732334 | GT-AG | 0 | 0.0003129555591887 | 3904 | rna-XM_047145406.1 32191372 | 8 | 1129780332 | 1129784235 | Schistocerca americana 7009 | CAG|GTAAACTTCA...TTATCCTTTGTT/ATGGCACTTATC...TTCAG|GAG | 0 | 1 | 14.115 |
| 179732335 | GT-AG | 0 | 1.000000099473604e-05 | 8900 | rna-XM_047145406.1 32191372 | 9 | 1129771283 | 1129780182 | Schistocerca americana 7009 | AAG|GTAATACTTC...ATGCCTTTGATG/CCTTTGATGATT...TTCAG|TTC | 2 | 1 | 15.466 |
| 179732336 | GT-AG | 0 | 1.000000099473604e-05 | 6068 | rna-XM_047145406.1 32191372 | 10 | 1129765087 | 1129771154 | Schistocerca americana 7009 | AAG|GTTAGTAGAG...GATATCTTATTT/AGATATCTTATT...TGCAG|ATT | 1 | 1 | 16.627 |
| 179732337 | GT-AG | 0 | 0.0014784051160002 | 7424 | rna-XM_047145406.1 32191372 | 11 | 1129757495 | 1129764918 | Schistocerca americana 7009 | TAG|GTATGTATTA...TTTTCTCTAATT/TTTTCTCTAATT...TTCAG|GAA | 1 | 1 | 18.151 |
| 179732338 | GT-AG | 0 | 1.000000099473604e-05 | 9150 | rna-XM_047145406.1 32191372 | 12 | 1129748229 | 1129757378 | Schistocerca americana 7009 | AAG|GTCAGTTATC...CATTTCTTACTA/ACATTTCTTACT...TATAG|TCA | 0 | 1 | 19.204 |
| 179732339 | GT-AG | 0 | 0.0003141093382126 | 3854 | rna-XM_047145406.1 32191372 | 13 | 1129744135 | 1129747988 | Schistocerca americana 7009 | GAG|GTATGTAATT...TATGCATTAAAG/AAAATTATGATA...TCCAG|ATA | 0 | 1 | 21.381 |
| 179732340 | GT-AG | 0 | 1.000000099473604e-05 | 165 | rna-XM_047145406.1 32191372 | 14 | 1129743824 | 1129743988 | Schistocerca americana 7009 | AAG|GTAATAATTA...TTTTGCTTAATA/CTTTTGCTTAAT...TTAAG|CAT | 2 | 1 | 22.705 |
| 179732341 | GT-AG | 0 | 1.000000099473604e-05 | 7683 | rna-XM_047145406.1 32191372 | 15 | 1129735993 | 1129743675 | Schistocerca americana 7009 | GAG|GTACAGAAGA...CATGTTTAGATC/GTGGATCTAATA...TGCAG|TCG | 0 | 1 | 24.048 |
| 179732342 | GT-AG | 0 | 1.2975619530521125e-05 | 14849 | rna-XM_047145406.1 32191372 | 16 | 1129720952 | 1129735800 | Schistocerca americana 7009 | GAG|GTAAGCATCT...TTTGTGTTATTT/ATTTGTGTTATT...GCTAG|GTA | 0 | 1 | 25.789 |
| 179732343 | GT-AG | 0 | 1.000000099473604e-05 | 3622 | rna-XM_047145406.1 32191372 | 17 | 1129717148 | 1129720769 | Schistocerca americana 7009 | CAG|GTTAGAATCT...ATGTTCTTGTGT/GGACTACTGATT...TGCAG|AGA | 2 | 1 | 27.44 |
| 179732344 | GT-AG | 0 | 1.000000099473604e-05 | 3869 | rna-XM_047145406.1 32191372 | 18 | 1129712944 | 1129716812 | Schistocerca americana 7009 | CAG|GTGAATCCTG...TACATTTTAATT/TTTAATTTTATT...TTTAG|GTG | 1 | 1 | 30.479 |
| 179732345 | GT-AG | 0 | 2.915424465825332e-05 | 14348 | rna-XM_047145406.1 32191372 | 19 | 1129698380 | 1129712727 | Schistocerca americana 7009 | GTA|GTAAGTAATT...TCTGTTTTATCC/CACATTTTTATT...TTCAG|ATG | 1 | 1 | 32.438 |
| 179732346 | GT-AG | 0 | 0.0005427618005568 | 4457 | rna-XM_047145406.1 32191372 | 20 | 1129693750 | 1129698206 | Schistocerca americana 7009 | AAG|GTACTTTCTT...TGATTATTAATA/TGATTATTAATA...TCTAG|CTG | 0 | 1 | 34.008 |
| 179732347 | GT-AG | 0 | 0.0035068799261121 | 3619 | rna-XM_047145406.1 32191372 | 21 | 1129689902 | 1129693520 | Schistocerca americana 7009 | ATG|GTATATTGCA...TGTGTTTTACAG/TTGTGTTTTACA...GTTAG|GGA | 1 | 1 | 36.085 |
| 179732348 | GT-AG | 0 | 0.001014403632767 | 343 | rna-XM_047145406.1 32191372 | 22 | 1129689422 | 1129689764 | Schistocerca americana 7009 | CAG|GTATGTATGA...AAAGTTTTGACA/AAAGTTTTGACA...TTCAG|CTT | 0 | 1 | 37.328 |
| 179732349 | GT-AG | 0 | 1.000000099473604e-05 | 135 | rna-XM_047145406.1 32191372 | 23 | 1129689103 | 1129689237 | Schistocerca americana 7009 | TAG|GTATGAAACT...CTTGTTTTCTCG/TATGAGCTAATT...TACAG|GGT | 1 | 1 | 38.997 |
| 179732350 | GT-AG | 0 | 1.0200461050483726e-05 | 5942 | rna-XM_047145406.1 32191372 | 24 | 1129682889 | 1129688830 | Schistocerca americana 7009 | GAG|GTACAGACAT...TTATCCTTATTT/TTTATCCTTATT...TTTAG|GTT | 0 | 1 | 41.464 |
| 179732351 | GT-AG | 0 | 1.000000099473604e-05 | 5230 | rna-XM_047145406.1 32191372 | 25 | 1129677426 | 1129682655 | Schistocerca americana 7009 | AGG|GTCAGTGGTA...GATTGTTTAAAA/AAGGATTTAATA...CATAG|GTT | 2 | 1 | 43.578 |
| 179732352 | GT-AG | 0 | 1.000000099473604e-05 | 4375 | rna-XM_047145406.1 32191372 | 26 | 1129672805 | 1129677179 | Schistocerca americana 7009 | AAC|GTGAGTATGA...ATTTCTATAATT/TTGTGCTTCATT...AACAG|CTT | 2 | 1 | 45.809 |
| 179732353 | GT-AG | 0 | 0.0011368076670403 | 7397 | rna-XM_047145406.1 32191372 | 27 | 1129665187 | 1129672583 | Schistocerca americana 7009 | TGG|GTATGTTGAT...ACAATTTTATTT/ATCTTCCTCATA...ATTAG|GTT | 1 | 1 | 47.814 |
| 179732354 | GT-AG | 0 | 1.000000099473604e-05 | 463 | rna-XM_047145406.1 32191372 | 28 | 1129664533 | 1129664995 | Schistocerca americana 7009 | AAG|GTTAGTACTG...CAGACTTTAAAG/AACTGTTTTACT...TTCAG|CTT | 0 | 1 | 49.546 |
| 179732355 | GT-AG | 0 | 0.0001086087197032 | 2960 | rna-XM_047145406.1 32191372 | 29 | 1129661424 | 1129664383 | Schistocerca americana 7009 | AAG|GTATGTGGTA...TGTGTTTTATTC/TTGTGTTTTATT...AACAG|CCC | 2 | 1 | 50.898 |
| 179732356 | GT-AG | 0 | 1.000000099473604e-05 | 3977 | rna-XM_047145406.1 32191372 | 30 | 1129657245 | 1129661221 | Schistocerca americana 7009 | AAG|GTCAGTGGTC...GTAATCTTGAAA/ATGATATTTATC...CTCAG|ACT | 0 | 1 | 52.73 |
| 179732357 | GT-AG | 0 | 0.0001650853915374 | 1339 | rna-XM_047145406.1 32191372 | 31 | 1129655732 | 1129657070 | Schistocerca americana 7009 | GAG|GTATGTAGAT...AATTTTTTAGAA/TTGAGTGTCATT...TGCAG|GCA | 0 | 1 | 54.309 |
| 179732358 | GT-AG | 0 | 1.000000099473604e-05 | 4707 | rna-XM_047145406.1 32191372 | 32 | 1129650823 | 1129655529 | Schistocerca americana 7009 | AGG|GTGAGTCACG...AATGCTTTAAAA/ACCATACTGATA...TGCAG|GTA | 1 | 1 | 56.141 |
| 179732359 | GT-AG | 0 | 1.000000099473604e-05 | 5561 | rna-XM_047145406.1 32191372 | 33 | 1129645057 | 1129650617 | Schistocerca americana 7009 | TAG|GTAAGTTGAG...TTTAATTTAATT/TTTAATTTAATT...TTCAG|AGG | 2 | 1 | 58.001 |
| 179732360 | GT-AG | 0 | 1.000000099473604e-05 | 2686 | rna-XM_047145406.1 32191372 | 34 | 1129642278 | 1129644963 | Schistocerca americana 7009 | TAT|GTGAGTATAT...ATTTTTTTCATT/ATTTTTTTCATT...TCCAG|GTT | 2 | 1 | 58.844 |
| 179732361 | GT-AG | 0 | 9.30852489930044e-05 | 11076 | rna-XM_047145406.1 32191372 | 35 | 1129631045 | 1129642120 | Schistocerca americana 7009 | GAG|GTACTGTTTA...CGTTCATTATCA/TTCATTATCATG...TATAG|GTG | 0 | 1 | 60.269 |
| 179732362 | GT-AG | 0 | 6.420174647060416e-05 | 110 | rna-XM_047145406.1 32191372 | 36 | 1129630770 | 1129630879 | Schistocerca americana 7009 | TCT|GTAAGTGTAC...GATCTTTTATCA/ATTTATTTCAAC...TGCAG|AAA | 0 | 1 | 61.765 |
| 179732363 | GT-AG | 0 | 2.652349703926318e-05 | 1354 | rna-XM_047145406.1 32191372 | 37 | 1129629135 | 1129630488 | Schistocerca americana 7009 | TCG|GTAAGTTGGC...GTTTTTTTATTT/TATTTACTTACT...TCCAG|GTT | 2 | 1 | 64.314 |
| 179732364 | GT-AG | 0 | 1.000000099473604e-05 | 5806 | rna-XM_047145406.1 32191372 | 38 | 1129623208 | 1129629013 | Schistocerca americana 7009 | AAT|GTAAGTAAAG...TATGATTTAATG/TTATGATTTAAT...TATAG|AGA | 0 | 1 | 65.412 |
| 179732365 | GT-AG | 0 | 1.000000099473604e-05 | 7672 | rna-XM_047145406.1 32191372 | 39 | 1129615358 | 1129623029 | Schistocerca americana 7009 | TAG|GTAAGTGTGA...CATTATTTATTT/GCATTATTTATT...TCTAG|GTC | 1 | 1 | 67.026 |
| 179732366 | GT-AG | 0 | 1.000000099473604e-05 | 1124 | rna-XM_047145406.1 32191372 | 40 | 1129614067 | 1129615190 | Schistocerca americana 7009 | AAG|GTAATGTGCT...ATAAGGTTGATT/CTTTTAATCATA...TGCAG|ATC | 0 | 1 | 68.541 |
| 179732367 | GT-AG | 0 | 1.000000099473604e-05 | 8442 | rna-XM_047145406.1 32191372 | 41 | 1129605386 | 1129613827 | Schistocerca americana 7009 | AAG|GTGAGAAGTA...ATTTCATTAAAT/GAGCATTTCATT...TTCAG|GGA | 2 | 1 | 70.709 |
| 179732368 | GT-AG | 0 | 1.000000099473604e-05 | 1100 | rna-XM_047145406.1 32191372 | 42 | 1129604169 | 1129605268 | Schistocerca americana 7009 | GAG|GTAATATCAT...TTTTCTGTAAAA/AAAAAAATAACT...TGCAG|GTT | 2 | 1 | 71.771 |
| 179732369 | GT-AG | 0 | 7.247407243230349e-05 | 153 | rna-XM_047145406.1 32191372 | 43 | 1129603877 | 1129604029 | Schistocerca americana 7009 | GTG|GTAATTTGCT...TTTTCCTTCTCA/TCAAATTTGATC...TCCAG|CGT | 0 | 1 | 73.032 |
| 179732370 | GT-AG | 0 | 6.289105937796616e-05 | 123 | rna-XM_047145406.1 32191372 | 44 | 1129603542 | 1129603664 | Schistocerca americana 7009 | AAG|GTAAATTTTT...GACATTTTACTT/ATAAAGCTGATT...CCTAG|GTA | 2 | 1 | 74.955 |
| 179732371 | GT-AG | 0 | 1.3597311404946563e-05 | 515 | rna-XM_047145406.1 32191372 | 45 | 1129602894 | 1129603408 | Schistocerca americana 7009 | CAG|GTGAGCTTCA...ATTTTCTTGACA/GTATTACTTACT...CACAG|GGA | 0 | 1 | 76.161 |
| 179732372 | GT-AG | 0 | 1.000000099473604e-05 | 429 | rna-XM_047145406.1 32191372 | 46 | 1129602311 | 1129602739 | Schistocerca americana 7009 | TAG|GTGAGAATGA...AATATTTTACTG/GAATATTTTACT...TACAG|GTT | 1 | 1 | 77.558 |
| 179732373 | GT-AG | 0 | 1.0037469608239425e-05 | 8253 | rna-XM_047145406.1 32191372 | 47 | 1129593823 | 1129602075 | Schistocerca americana 7009 | CAA|GTCAGTTTTG...GTCACATTAACT/TATTACTTCAAT...TTCAG|TAT | 2 | 1 | 79.69 |
| 179732374 | GT-AG | 0 | 1.000000099473604e-05 | 3358 | rna-XM_047145406.1 32191372 | 48 | 1129590241 | 1129593598 | Schistocerca americana 7009 | AAG|GTAGGAATTA...TTATGTTTAATA/TTATGTTTAATA...TGCAG|AAC | 1 | 1 | 81.722 |
| 179732375 | GT-AG | 0 | 1.000000099473604e-05 | 19294 | rna-XM_047145406.1 32191372 | 49 | 1129570693 | 1129589986 | Schistocerca americana 7009 | ACG|GTAAGTTATA...TTCAGATTAACT/GATTAACTCACT...CTTAG|GTT | 0 | 1 | 84.026 |
| 179732376 | GT-AG | 0 | 1.000000099473604e-05 | 3164 | rna-XM_047145406.1 32191372 | 50 | 1129567301 | 1129570464 | Schistocerca americana 7009 | ACT|GTAAGTGACG...TAATTTTGGACT/ATGTTTGTAATT...TGCAG|GGT | 0 | 1 | 86.094 |
| 179732377 | GT-AG | 0 | 1.000000099473604e-05 | 6971 | rna-XM_047145406.1 32191372 | 51 | 1129560138 | 1129567108 | Schistocerca americana 7009 | AGA|GTAAGTAAAT...CAATTTTTGAAT/ATCTTTCTGATG...TTCAG|ATG | 0 | 1 | 87.836 |
| 179732378 | GT-AG | 0 | 3.833903350290436e-05 | 10101 | rna-XM_047145406.1 32191372 | 52 | 1129549829 | 1129559929 | Schistocerca americana 7009 | AAG|GTATGAATAA...CTGACTTTAAAA/GTGTTCCTGACT...AACAG|GTT | 1 | 1 | 89.722 |
| 179732379 | GT-AG | 0 | 1.000000099473604e-05 | 4159 | rna-XM_047145406.1 32191372 | 53 | 1129544965 | 1129549123 | Schistocerca americana 7009 | AAG|GTAAGTAAAA...TTGTTGTTATAT/GTTGTTGTTATA...CATAG|GTT | 1 | 1 | 96.118 |
| 179732380 | GT-AG | 0 | 0.0859170056294201 | 11258 | rna-XM_047145406.1 32191372 | 54 | 1129533397 | 1129544654 | Schistocerca americana 7009 | CAG|GTATCATTGT...TAATTCTTTGTT/TTGTAATTTATT...TTCAG|GTG | 2 | 1 | 98.93 |
| 179750097 | GT-AG | 0 | 1.000000099473604e-05 | 17045 | rna-XM_047145406.1 32191372 | 1 | 1129812784 | 1129829828 | Schistocerca americana 7009 | TTG|GTGAGTTGTG...AAAACTTGGACT/ATTGTTGTTACA...TGCAG|GAC |  | 0 | 1.896 |
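The listing above is sorted by `id`, so the first intron of the transcript (`ordinal_index` 1, id 179750097) appears last. A minimal sketch of the same filter ordered by position in the transcript follows. It uses an in-memory table holding three rows copied from the listing; the cut-down column set is an assumption made to keep the example short.

```python
import sqlite3

# In-memory stand-in for the real database; only the columns needed here.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    'ordinal_index INTEGER, "start" INTEGER)'
)

# Three rows copied from the listing: (id, transcript_id, ordinal_index, start).
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (179732328, 32191372, 2, 1129810336),
        (179732329, 32191372, 3, 1129809977),
        (179750097, 32191372, 1, 1129812784),
    ],
)

# Same filter as the page, ordered by position within the transcript.
rows = conn.execute(
    'SELECT ordinal_index, id, "start" FROM introns '
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (32191372,),
).fetchall()
for row in rows:
    print(row)
```

In these rows `start` decreases as `ordinal_index` increases, which is consistent with a transcript annotated on the reverse strand, although the table itself does not record strand.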
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
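The indexes above cover the columns most likely to be filtered on. As a rough check that a filter such as `transcript_id = 32191372` can use `idx_introns_transcript_id`, the sketch below rebuilds a reduced version of the table in memory and asks SQLite for its query plan. The reduced column set is an assumption for brevity, and the exact plan wording varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# In-memory stand-in with a reduced column set and one of the indexes above.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "score" REAL,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would run the page's filter; the fourth column of
# each plan row holds the human-readable detail text.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id, score FROM introns WHERE transcript_id = ?",
    (32191372,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

If the index name is missing from the printed plan, the query would fall back to a full table scan.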