introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
52 rows where transcript_id = 32191376
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179732483 | GT-AG | 0 | 1.000000099473604e-05 | 634 | rna-XM_047136016.1 32191376 | 2 | 320373947 | 320374580 | Schistocerca americana 7009 | AAG|GTAAAGAGCA...TACACTTTAATT/GGAATACTTACT...TGCAG|GCC | 1 | 1 | 1.19 |
| 179732484 | GT-AG | 0 | 1.000000099473604e-05 | 1715 | rna-XM_047136016.1 32191376 | 3 | 320374629 | 320376343 | Schistocerca americana 7009 | CAG|GTAAGTCTAA...TATTTTTTATTT/TTATTTTTTATT...TGTAG|AGG | 1 | 1 | 1.687 |
| 179732485 | GT-AG | 0 | 1.000000099473604e-05 | 961 | rna-XM_047136016.1 32191376 | 4 | 320376707 | 320377667 | Schistocerca americana 7009 | CAG|GTAAGAAAAA...AAACTTTTAAGA/AAACTTTTAAGA...TTCAG|GAG | 1 | 1 | 5.445 |
| 179732486 | GT-AG | 0 | 0.0002451549584151 | 16883 | rna-XM_047136016.1 32191376 | 5 | 320377937 | 320394819 | Schistocerca americana 7009 | CCG|GTATGAATTA...GTCTTTTTAGAT/TTTTAGATTACT...CACAG|GAG | 0 | 1 | 8.23 |
| 179732487 | GT-AG | 0 | 1.000000099473604e-05 | 3700 | rna-XM_047136016.1 32191376 | 6 | 320395008 | 320398707 | Schistocerca americana 7009 | AAG|GTAAATTCTC...AAAACATTAAAT/CTAGATTTCAAT...TGCAG|CAC | 2 | 1 | 10.176 |
| 179732488 | GT-AG | 0 | 1.000000099473604e-05 | 15967 | rna-XM_047136016.1 32191376 | 7 | 320398891 | 320414857 | Schistocerca americana 7009 | AGG|GTAAGTAAAT...TGTATATTAATT/TGTATATTAATT...TTCAG|TAT | 2 | 1 | 12.07 |
| 179732489 | GT-AG | 0 | 0.0431922904712253 | 193 | rna-XM_047136016.1 32191376 | 8 | 320415066 | 320415258 | Schistocerca americana 7009 | AAG|GTATCGCATT...TTTCTTTTAAAA/TTTCTTTTAAAA...ATTAG|ATG | 0 | 1 | 14.224 |
| 179732490 | GT-AG | 0 | 0.0004410893653691 | 4103 | rna-XM_047136016.1 32191376 | 9 | 320415397 | 320419499 | Schistocerca americana 7009 | GCA|GTAAGTATCT...ATTGTTTTAATT/ATTGTTTTAATT...TTCAG|ACA | 0 | 1 | 15.652 |
| 179732491 | GT-AG | 0 | 1.000000099473604e-05 | 1910 | rna-XM_047136016.1 32191376 | 10 | 320419735 | 320421644 | Schistocerca americana 7009 | TTG|GTAAGTTTAC...CTAACAATGAAG/GTAAAACTAACA...TTCAG|AAT | 1 | 1 | 18.085 |
| 179732492 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_047136016.1 32191376 | 11 | 320421824 | 320421916 | Schistocerca americana 7009 | CAG|GTAAAAAGTA...ATGTTTTTATAC/TATGTTTTTATA...TTAAG|GAA | 0 | 1 | 19.938 |
| 179732493 | GT-AG | 0 | 1.000000099473604e-05 | 204 | rna-XM_047136016.1 32191376 | 12 | 320422142 | 320422345 | Schistocerca americana 7009 | AAG|GTAATGTGCC...CATACTTTGATG/ATGATAGTAATT...TACAG|ATG | 0 | 1 | 22.267 |
| 179732494 | GT-AG | 0 | 1.000000099473604e-05 | 2601 | rna-XM_047136016.1 32191376 | 13 | 320422490 | 320425090 | Schistocerca americana 7009 | AAG|GTAAATGTAT...TGGGCTGTAAAT/TATTACATCAAT...TACAG|GAA | 0 | 1 | 23.758 |
| 179732495 | GT-AG | 0 | 0.0008496172918995 | 6881 | rna-XM_047136016.1 32191376 | 14 | 320425320 | 320432200 | Schistocerca americana 7009 | ACG|GTCTGTTATT...TTACCTTTAAAA/GGTTTTTTGAGT...CTCAG|GTT | 1 | 1 | 26.128 |
| 179732496 | GT-AG | 0 | 1.000000099473604e-05 | 1553 | rna-XM_047136016.1 32191376 | 15 | 320432422 | 320433974 | Schistocerca americana 7009 | GGG|GTAAGGAGAA...TAACCCTTTGTG/TTGTGTATAATT...CACAG|AAG | 0 | 1 | 28.416 |
| 179732497 | GT-AG | 0 | 1.000000099473604e-05 | 2090 | rna-XM_047136016.1 32191376 | 16 | 320434282 | 320436371 | Schistocerca americana 7009 | CAG|GTTAGTGCTA...ATAATTTTGATA/ATAATTTTGATA...AACAG|GTT | 1 | 1 | 31.594 |
| 179732498 | GT-AG | 0 | 1.000000099473604e-05 | 6952 | rna-XM_047136016.1 32191376 | 17 | 320436559 | 320443510 | Schistocerca americana 7009 | AAC|GTAAGTACGG...TCTGTGTTGATG/CTAATTATGATT...TATAG|GCC | 2 | 1 | 33.53 |
| 179732499 | GT-AG | 0 | 1.000000099473604e-05 | 18656 | rna-XM_047136016.1 32191376 | 18 | 320443688 | 320462343 | Schistocerca americana 7009 | GAG|GTAAGTAAAA...TTGTTCTTTGTT/ATATTTCTCAGT...TTTAG|CTC | 2 | 1 | 35.362 |
| 179732500 | GT-AG | 0 | 4.266203009029062e-05 | 4854 | rna-XM_047136016.1 32191376 | 19 | 320462515 | 320467368 | Schistocerca americana 7009 | GAA|GTAAGTTAAT...ATTATCTTATTT/AATTATCTTATT...TGTAG|GGA | 2 | 1 | 37.133 |
| 179732501 | GT-AG | 0 | 0.0005765131622715 | 83 | rna-XM_047136016.1 32191376 | 20 | 320467519 | 320467601 | Schistocerca americana 7009 | CTG|GTATGTGCAC...CTAATCTTAACA/CTAATCTTAACA...TTCAG|TCA | 2 | 1 | 38.685 |
| 179732502 | GT-AG | 0 | 3.500768850211341e-05 | 8509 | rna-XM_047136016.1 32191376 | 21 | 320467772 | 320476280 | Schistocerca americana 7009 | CAT|GTAAGTATTA...TAATCTTTGGCT/TGTATCCTAAGT...TGTAG|TGG | 1 | 1 | 40.445 |
| 179732503 | GT-AG | 0 | 0.0002381801912025 | 123316 | rna-XM_047136016.1 32191376 | 22 | 320476580 | 320599895 | Schistocerca americana 7009 | AAG|GTATGTTGCA...TGTTTCTGCGTT/TTCTGCGTTATG...TGCAG|ATC | 0 | 1 | 43.54 |
| 179732504 | GT-AG | 0 | 1.000000099473604e-05 | 246005 | rna-XM_047136016.1 32191376 | 23 | 320600051 | 320846055 | Schistocerca americana 7009 | CAG|GTCAGTGGCC...CATTTGTTAATG/CATTTGTTAATG...TACAG|GGA | 2 | 1 | 45.145 |
| 179732505 | GT-AG | 0 | 1.000000099473604e-05 | 139593 | rna-XM_047136016.1 32191376 | 24 | 320846223 | 320985815 | Schistocerca americana 7009 | AGG|GTGAGTACCC...GAATCTTTGAGT/GTTCTATTCACA...TGCAG|GTG | 1 | 1 | 46.874 |
| 179732506 | GT-AG | 0 | 1.000000099473604e-05 | 108227 | rna-XM_047136016.1 32191376 | 25 | 320985945 | 321094171 | Schistocerca americana 7009 | CAG|GTTAGCACCG...TGCATCTGAGCT/TAGGCGTTCACC...TTCAG|AGA | 1 | 1 | 48.209 |
| 179732507 | GT-AG | 0 | 1.000000099473604e-05 | 65972 | rna-XM_047136016.1 32191376 | 26 | 321094306 | 321160277 | Schistocerca americana 7009 | CAG|GTGAGCTCTT...GTGTTTTTGTTT/AATGAAATGAGT...CGCAG|TAC | 0 | 1 | 49.596 |
| 179732508 | GT-AG | 0 | 1.000000099473604e-05 | 107142 | rna-XM_047136016.1 32191376 | 27 | 321160383 | 321267524 | Schistocerca americana 7009 | AAG|GTCAGTCACG...CAGCATTTAACA/CATCTGTTTACG...CGCAG|TTT | 0 | 1 | 50.683 |
| 179732509 | GT-AG | 0 | 1.000000099473604e-05 | 15117 | rna-XM_047136016.1 32191376 | 28 | 321267660 | 321282776 | Schistocerca americana 7009 | TAT|GTAAGTATCC...GGTCCCTGTACG/TGTGTGCTGATT...ATCAG|GCG | 0 | 1 | 52.081 |
| 179732510 | GT-AG | 0 | 1.000000099473604e-05 | 71044 | rna-XM_047136016.1 32191376 | 29 | 321283088 | 321354131 | Schistocerca americana 7009 | CGG|GTGCGTACAG...TTTTCCATAGAT/TGGTATTTTACT...TACAG|CAC | 2 | 1 | 55.3 |
| 179732511 | GT-AG | 0 | 1.000000099473604e-05 | 2558 | rna-XM_047136016.1 32191376 | 30 | 321354262 | 321356819 | Schistocerca americana 7009 | GAG|GTTGGCAAAT...TTCTTCTTAAAA/TTTCTTCTTAAA...TCTAG|TTA | 0 | 1 | 56.646 |
| 179732512 | GT-AG | 0 | 3.6235179719313976e-05 | 216 | rna-XM_047136016.1 32191376 | 31 | 321356981 | 321357196 | Schistocerca americana 7009 | CAG|GTATGATATA...GTGCTCATAATA/TTTGTGCTCATA...AACAG|GTG | 2 | 1 | 58.313 |
| 179732513 | GT-AG | 0 | 1.000000099473604e-05 | 15004 | rna-XM_047136016.1 32191376 | 32 | 321357484 | 321372487 | Schistocerca americana 7009 | CAG|GTAAATGGAA...TTTTTTTTAATT/TTTTTTTTAATT...TATAG|AAC | 1 | 1 | 61.284 |
| 179732514 | GT-AG | 0 | 1.000000099473604e-05 | 4800 | rna-XM_047136016.1 32191376 | 33 | 321372587 | 321377386 | Schistocerca americana 7009 | GAG|GTGAGTTCCA...GTTTTTCTAACA/GTTTTTCTAACA...CTCAG|GGG | 1 | 1 | 62.308 |
| 179732515 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_047136016.1 32191376 | 34 | 321377541 | 321377636 | Schistocerca americana 7009 | CAG|GTGATAGATG...ATAATTTTATTA/AATAATTTTATT...TATAG|GCA | 2 | 1 | 63.903 |
| 179732516 | GT-AG | 0 | 1.000000099473604e-05 | 3516 | rna-XM_047136016.1 32191376 | 35 | 321377848 | 321381363 | Schistocerca americana 7009 | CAG|GTAAGATAGG...TTTTTGTTAACT/TTTTTGTTAACT...TACAG|GTG | 0 | 1 | 66.087 |
| 179732517 | GT-AG | 0 | 0.0002939542542604 | 5438 | rna-XM_047136016.1 32191376 | 36 | 321381463 | 321386900 | Schistocerca americana 7009 | CAG|GTAACTGTCT...TGAATTTTATTT/TTTTATTTTACA...TTTAG|GGA | 0 | 1 | 67.112 |
| 179732518 | GT-AG | 0 | 1.000000099473604e-05 | 12503 | rna-XM_047136016.1 32191376 | 37 | 321387202 | 321399704 | Schistocerca americana 7009 | AAG|GTAAAGCTGT...TTTTGTTTATTA/TTTTTGTTTATT...TACAG|ATA | 1 | 1 | 70.228 |
| 179732519 | GT-AG | 0 | 0.000686991466201 | 80 | rna-XM_047136016.1 32191376 | 38 | 321400049 | 321400128 | Schistocerca americana 7009 | AAG|GTATATTTAC...TGTAAGTTAATT/TGTAAGTTAATT...TGCAG|GGC | 0 | 1 | 73.789 |
| 179732520 | GT-AG | 0 | 1.000000099473604e-05 | 3473 | rna-XM_047136016.1 32191376 | 39 | 321400327 | 321403799 | Schistocerca americana 7009 | CAG|GTAAGGGTAA...TTATTTGTGAAT/GAAAATTTCACT...TCCAG|GTT | 0 | 1 | 75.839 |
| 179732521 | GT-AG | 0 | 0.0003479728616195 | 16172 | rna-XM_047136016.1 32191376 | 40 | 321403968 | 321420139 | Schistocerca americana 7009 | GAG|GTAGGTTTTA...ATTATCTTATCT/ATGTATTTCATT...CTTAG|GTA | 0 | 1 | 77.578 |
| 179732522 | GT-AG | 0 | 1.000000099473604e-05 | 1281 | rna-XM_047136016.1 32191376 | 41 | 321420281 | 321421561 | Schistocerca americana 7009 | GAG|GTAAGAAATT...ATATCCTTTTTT/TTGCTAGTAATT...ACCAG|GGT | 0 | 1 | 79.037 |
| 179732523 | GT-AG | 0 | 1.000000099473604e-05 | 7095 | rna-XM_047136016.1 32191376 | 42 | 321421731 | 321428825 | Schistocerca americana 7009 | CAG|GTAATGTGAT...ATTAGTTTAATT/TTTAATTTCATT...CCTAG|GCA | 1 | 1 | 80.787 |
| 179732524 | GT-AG | 0 | 0.3636469302598589 | 1989 | rna-XM_047136016.1 32191376 | 43 | 321429011 | 321430999 | Schistocerca americana 7009 | CAG|GTTTCCATCT...TGTATTTTAATA/TGTATTTTAATA...TTCAG|AGT | 0 | 1 | 82.702 |
| 179732525 | GT-AG | 0 | 3.146170181673447e-05 | 34299 | rna-XM_047136016.1 32191376 | 44 | 321431075 | 321465373 | Schistocerca americana 7009 | TCT|GTAAGAATTC...CTAGTCTTGATG/CTAGTCTTGATG...TTTAG|GTG | 0 | 1 | 83.478 |
| 179732526 | GT-AG | 0 | 0.0003892426491319 | 81 | rna-XM_047136016.1 32191376 | 45 | 321465557 | 321465637 | Schistocerca americana 7009 | GAA|GTAAGTTTCA...GTTACTTTATTT/CGTTACTTTATT...TTCAG|GGT | 0 | 1 | 85.373 |
| 179732527 | GT-AG | 0 | 0.0065726022025043 | 83 | rna-XM_047136016.1 32191376 | 46 | 321465764 | 321465846 | Schistocerca americana 7009 | ATG|GTATATTGAA...ATAATTTTAAAC/CTCTGTTTTATA...TTCAG|AGA | 0 | 1 | 86.677 |
| 179732528 | GT-AG | 0 | 2.125658529176177e-05 | 10423 | rna-XM_047136016.1 32191376 | 47 | 321466029 | 321476451 | Schistocerca americana 7009 | TTC|GTAAGTACTA...TTCACCTTATGA/TAATATTTCACC...TCTAG|TGG | 2 | 1 | 88.561 |
| 179732529 | GT-AG | 0 | 6.889529631576481e-05 | 4122 | rna-XM_047136016.1 32191376 | 48 | 321476642 | 321480763 | Schistocerca americana 7009 | AAG|GTAACTACAG...GTCTCTTTCATT/GTCTCTTTCATT...TACAG|GAT | 0 | 1 | 90.528 |
| 179732530 | GT-AG | 0 | 1.000000099473604e-05 | 2588 | rna-XM_047136016.1 32191376 | 49 | 321481028 | 321483615 | Schistocerca americana 7009 | GAG|GTAAGTAATT...GTTTCTTTGTTA/ATGTGATTAAAA...TTCAG|TGT | 0 | 1 | 93.261 |
| 179732531 | GT-AG | 0 | 2.6040344100772783e-05 | 11099 | rna-XM_047136016.1 32191376 | 50 | 321483808 | 321494906 | Schistocerca americana 7009 | CAG|GTATTTGAGA...ACAGCCTTGCAT/TAAAAATTAAAT...TGTAG|TTG | 0 | 1 | 95.248 |
| 179732532 | GT-AG | 0 | 1.000000099473604e-05 | 12018 | rna-XM_047136016.1 32191376 | 51 | 321495113 | 321507130 | Schistocerca americana 7009 | CAG|GTAATAACTG...TGACTCTTACAT/GTAATATTTACT...TGCAG|AGG | 2 | 1 | 97.381 |
| 179732533 | GT-AG | 0 | 1.000000099473604e-05 | 819 | rna-XM_047136016.1 32191376 | 52 | 321507232 | 321508050 | Schistocerca americana 7009 | CAG|GTCAGTATGT...TGAACTCTGACT/TGAACTCTGACT...TTTAG|ATG | 1 | 1 | 98.427 |
| 179750101 | GT-AG | 0 | 0.000215479049692 | 16278 | rna-XM_047136016.1 32191376 | 1 | 320357584 | 320373861 | Schistocerca americana 7009 | TCG|GTATGAAGCT...CTATTTTTAATT/CTATTTTTAATT...TTCAG|ATT | 0 | 0.714 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);