introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
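The column descriptions do not state whether `start` and `end` are inclusive. As a quick sanity check, the sketch below takes three (length, start, end) triples copied from the rows listed on this page for transcript 32210488 and tests them against the assumption that both endpoints are inclusive. `span_length` is a hypothetical helper written for this illustration, not part of the database.

```python
# (length, start, end) copied from introns 179852601, 179852602 and 179852606.
SAMPLE_ROWS = [
    (19253, 496013661, 496032913),
    (8322, 496033058, 496041379),
    (108, 496082674, 496082781),
]


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming both endpoints are inclusive."""
    return end - start + 1


# Each sampled row satisfies length == end - start + 1.
for length, start, end in SAMPLE_ROWS:
    assert span_length(start, end) == length
```

The check only covers the sampled rows; it suggests, but does not prove, that the convention holds for the whole table.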
52 rows where transcript_id = 32210488
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179852601 | GT-AG | 0 | 0.0001146078226969 | 19253 | rna-XM_047260068.1 32210488 | 1 | 496013661 | 496032913 | Schistocerca piceifrons 274613 | CAG\|GTATAGTAAA...ATTTTCTTGTTT/TTTGTGTTGAAT...TTCAG\|TTG | 0 | 1 | 0.715 |
| 179852602 | GT-AG | 0 | 1.000000099473604e-05 | 8322 | rna-XM_047260068.1 32210488 | 2 | 496033058 | 496041379 | Schistocerca piceifrons 274613 | ATG\|GTAAGTTCAA...GGTGTTTTACAT/TTTACACTTACA...TTCAG\|TTT | 0 | 1 | 2.429 |
| 179852603 | GT-AG | 0 | 4.532104940304052e-05 | 6441 | rna-XM_047260068.1 32210488 | 3 | 496041561 | 496048001 | Schistocerca piceifrons 274613 | TAG\|GTATGATCCC...AAACTCATATTC/CTGAAACTCATA...TTCAG\|ATG | 1 | 1 | 4.585 |
| 179852604 | GT-AG | 0 | 0.0001471673711098 | 28147 | rna-XM_047260068.1 32210488 | 4 | 496048114 | 496076260 | Schistocerca piceifrons 274613 | AAG\|GTATGTATAA...CATGATTTAATT/CATGATTTAATT...CCCAG\|GTT | 2 | 1 | 5.919 |
| 179852605 | GT-AG | 0 | 4.745120731975517e-05 | 6187 | rna-XM_047260068.1 32210488 | 5 | 496076368 | 496082554 | Schistocerca piceifrons 274613 | ATG\|GTAAGTTTGT...ATGACTTTATTG/AATGACTTTATT...TACAG\|AAA | 1 | 1 | 7.193 |
| 179852606 | GT-AG | 0 | 1.1254879496116464e-05 | 108 | rna-XM_047260068.1 32210488 | 6 | 496082674 | 496082781 | Schistocerca piceifrons 274613 | CAG\|GTAAGTTTGT...TGTTCATTAGTG/GTTGTTTTCATA...CACAG\|AAA | 0 | 1 | 8.61 |
| 179852607 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-XM_047260068.1 32210488 | 7 | 496082911 | 496083023 | Schistocerca piceifrons 274613 | CCA\|GTCAGTATTT...GAATTTCTAGCT/CTGTGTGTAATT...AGCAG\|AAA | 0 | 1 | 10.146 |
| 179852608 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_047260068.1 32210488 | 8 | 496083267 | 496083349 | Schistocerca piceifrons 274613 | AAG\|GTGAGGATAA...GATCTCTTATTG/GGATCTCTTATT...TTCAG\|GCA | 0 | 1 | 13.04 |
| 179852609 | GT-AG | 0 | 1.000000099473604e-05 | 33660 | rna-XM_047260068.1 32210488 | 9 | 496083541 | 496117200 | Schistocerca piceifrons 274613 | CAG\|GTGAGAATAT...CTTTTTTTAGAA/CCTTTTTTTAGA...TACAG\|GAT | 2 | 1 | 15.315 |
| 179852610 | GT-AG | 0 | 0.0091622577625306 | 12297 | rna-XM_047260068.1 32210488 | 10 | 496117346 | 496129642 | Schistocerca piceifrons 274613 | ACG\|GTATGTTGTA...TATTCATTGATT/ATTGTATTCATT...TGCAG\|AGC | 0 | 1 | 17.042 |
| 179852611 | GT-AG | 0 | 1.000000099473604e-05 | 4379 | rna-XM_047260068.1 32210488 | 11 | 496129772 | 496134150 | Schistocerca piceifrons 274613 | AAT\|GTAAGTGTAA...ATTTACATATCT/GATCCATTTACA...AACAG\|AAT | 0 | 1 | 18.578 |
| 179852612 | GT-AG | 0 | 1.000000099473604e-05 | 5197 | rna-XM_047260068.1 32210488 | 12 | 496134345 | 496139541 | Schistocerca piceifrons 274613 | GAG\|GTGAGTAGAA...AGTATTTTAATC/AGTATTTTAATC...TTCAG\|TTC | 2 | 1 | 20.888 |
| 179852613 | GT-AG | 0 | 1.000000099473604e-05 | 9189 | rna-XM_047260068.1 32210488 | 13 | 496139726 | 496148914 | Schistocerca piceifrons 274613 | GAG\|GTACAGAACA...ATTCTCTTTCCA/CATTTCATCATT...TTTAG\|GTT | 0 | 1 | 23.08 |
| 179852614 | GT-AG | 0 | 1.000000099473604e-05 | 3539 | rna-XM_047260068.1 32210488 | 14 | 496149102 | 496152640 | Schistocerca piceifrons 274613 | CAG\|GTTTGTGTCA...TACTTCTTGCCT/ATATTTTTCAAT...CACAG\|CCA | 1 | 1 | 25.307 |
| 179852615 | GT-AG | 0 | 1.000000099473604e-05 | 2584 | rna-XM_047260068.1 32210488 | 15 | 496152683 | 496155266 | Schistocerca piceifrons 274613 | AGG\|GTAAGATTGT...CTATTTTTGAAA/CTATTTTTGAAA...TACAG\|GAC | 1 | 1 | 25.807 |
| 179852616 | GT-AG | 0 | 0.0004021100454735 | 3919 | rna-XM_047260068.1 32210488 | 16 | 496155341 | 496159259 | Schistocerca piceifrons 274613 | CCA\|GTAAGTGTCT...GTTTCCTTAGTT/ATGTATTTTATT...CTAAG\|GCA | 0 | 1 | 26.688 |
| 179852617 | GT-AG | 0 | 2.0521472680284625e-05 | 269 | rna-XM_047260068.1 32210488 | 17 | 496159413 | 496159681 | Schistocerca piceifrons 274613 | GAG\|GTAATTTTAT...AATGCATTAATG/TAATGTTTGATT...TGCAG\|GAC | 0 | 1 | 28.51 |
| 179852618 | GT-AG | 0 | 1.2192085872988244e-05 | 4120 | rna-XM_047260068.1 32210488 | 18 | 496159864 | 496163983 | Schistocerca piceifrons 274613 | ACA\|GTAAGTCATT...GTAACATTGACA/CAATGATTCAAT...TGCAG\|GTT | 2 | 1 | 30.678 |
| 179852619 | GT-AG | 0 | 1.000000099473604e-05 | 3570 | rna-XM_047260068.1 32210488 | 19 | 496164156 | 496167725 | Schistocerca piceifrons 274613 | CAG\|GTCAGTGAAA...GGATTTTTACAG/GGGATTTTTACA...TTCAG\|GTT | 0 | 1 | 32.726 |
| 179852620 | GT-AG | 0 | 1.000000099473604e-05 | 5814 | rna-XM_047260068.1 32210488 | 20 | 496167866 | 496173679 | Schistocerca piceifrons 274613 | AAG\|GTAGAGTATA...AAGTGCTTATTA/GAAGTGCTTATT...TACAG\|GTA | 2 | 1 | 34.393 |
| 179852621 | GT-AG | 0 | 1.000000099473604e-05 | 570 | rna-XM_047260068.1 32210488 | 21 | 496173896 | 496174465 | Schistocerca piceifrons 274613 | GAG\|GTAATAACAT...TATTTCTTATCC/TTATTTCTTATC...GATAG\|AGA | 2 | 1 | 36.966 |
| 179852622 | GT-AG | 0 | 1.000000099473604e-05 | 3853 | rna-XM_047260068.1 32210488 | 22 | 496174583 | 496178435 | Schistocerca piceifrons 274613 | TAA\|GTAAGTAAAT...CATCTTTTAATT/CATCTTTTAATT...TCCAG\|GTA | 2 | 1 | 38.359 |
| 179852623 | GT-AG | 0 | 0.0003103255795942 | 8735 | rna-XM_047260068.1 32210488 | 23 | 496178615 | 496187349 | Schistocerca piceifrons 274613 | TTG\|GTAATCCATT...GTGTTTTTATTT/TTTTTATTTATA...CATAG\|ATC | 1 | 1 | 40.491 |
| 179852624 | GC-AG | 0 | 1.000000099473604e-05 | 9075 | rna-XM_047260068.1 32210488 | 24 | 496187562 | 496196636 | Schistocerca piceifrons 274613 | ATG\|GCAAGTATAA...ATTTTGTTATAT/AATTTTGTTATA...TTCAG\|GAC | 0 | 1 | 43.015 |
| 179852625 | GT-AG | 0 | 1.000000099473604e-05 | 2146 | rna-XM_047260068.1 32210488 | 25 | 496196894 | 496199039 | Schistocerca piceifrons 274613 | AAG\|GTAAGTTGCT...CATTTGTTAAAC/ATGGCTCTCATT...CACAG\|ATT | 2 | 1 | 46.076 |
| 179852626 | GT-AG | 0 | 3.952927874394477e-05 | 21814 | rna-XM_047260068.1 32210488 | 26 | 496199133 | 496220946 | Schistocerca piceifrons 274613 | TAA\|GTAAGTTCCA...TGAGGCTTAAAA/ACTATAATCATA...TTCAG\|ATT | 2 | 1 | 47.184 |
| 179852627 | GT-AG | 0 | 1.000000099473604e-05 | 223 | rna-XM_047260068.1 32210488 | 27 | 496221081 | 496221303 | Schistocerca piceifrons 274613 | TTG\|GTGAGTTCGT...ATAGTTTTCACA/ATAGTTTTCACA...CTCAG\|TTT | 1 | 1 | 48.779 |
| 179852628 | GT-AG | 0 | 0.0130890213440314 | 77 | rna-XM_047260068.1 32210488 | 28 | 496221479 | 496221555 | Schistocerca piceifrons 274613 | AAG\|GTATATATTT...ATTACTTTATTT/TATTACTTTATT...TTCAG\|ATT | 2 | 1 | 50.863 |
| 179852629 | GT-AG | 0 | 0.0043116197874235 | 99 | rna-XM_047260068.1 32210488 | 29 | 496221796 | 496221894 | Schistocerca piceifrons 274613 | ACA\|GTAAGTTTAT...TTTTTCTTAAAT/ATTTTTCTTAAA...CGTAG\|TTT | 2 | 1 | 53.722 |
| 179852630 | GT-AG | 0 | 1.000000099473604e-05 | 3394 | rna-XM_047260068.1 32210488 | 30 | 496222087 | 496225480 | Schistocerca piceifrons 274613 | CAA\|GTGAAGTATT...TCAGTTTTGACT/TCAGTTTTGACT...TACAG\|GAT | 2 | 1 | 56.008 |
| 179852631 | GT-AG | 0 | 1.000000099473604e-05 | 137 | rna-XM_047260068.1 32210488 | 31 | 496225604 | 496225740 | Schistocerca piceifrons 274613 | TAG\|GTAGGTTAAT...AGTTCGCTAATG/AGTTCGCTAATG...TGCAG\|GAC | 2 | 1 | 57.473 |
| 179852632 | GT-AG | 0 | 1.000000099473604e-05 | 27537 | rna-XM_047260068.1 32210488 | 32 | 496225880 | 496253416 | Schistocerca piceifrons 274613 | AAG\|GTATGACCTA...GTGTACTAAACG/ATGGTTATTATT...TTCAG\|GGA | 0 | 1 | 59.128 |
| 179852633 | GT-AG | 0 | 1.000000099473604e-05 | 16877 | rna-XM_047260068.1 32210488 | 33 | 496253582 | 496270458 | Schistocerca piceifrons 274613 | AAG\|GTAAGATACT...GACATTTTAAAT/GACATTTTAAAT...TACAG\|GTT | 0 | 1 | 61.093 |
| 179852634 | GT-AG | 0 | 1.000000099473604e-05 | 13833 | rna-XM_047260068.1 32210488 | 34 | 496270699 | 496284531 | Schistocerca piceifrons 274613 | CCA\|GTAAGTGTTT...GTTATTTTGTTT/CAAAAGTTGACA...TCTAG\|GAT | 0 | 1 | 63.951 |
| 179852635 | GT-AG | 0 | 1.000000099473604e-05 | 5319 | rna-XM_047260068.1 32210488 | 35 | 496284713 | 496290031 | Schistocerca piceifrons 274613 | AAG\|GTAAATTATT...TGTAAATTAATG/ATGTATATCATT...TGTAG\|GTC | 1 | 1 | 66.107 |
| 179852636 | GT-AG | 0 | 1.000000099473604e-05 | 185 | rna-XM_047260068.1 32210488 | 36 | 496290216 | 496290400 | Schistocerca piceifrons 274613 | GAG\|GTAAAATTAC...CTCTGTTTGATT/TTGATTTTCATT...TTCAG\|GTT | 2 | 1 | 68.298 |
| 179852637 | GT-AG | 0 | 1.000000099473604e-05 | 13177 | rna-XM_047260068.1 32210488 | 37 | 496290537 | 496303713 | Schistocerca piceifrons 274613 | CAA\|GTGAGTGTAA...ATTATCGTGATC/ATTATCGTGATC...TTCAG\|GTA | 0 | 1 | 69.918 |
| 179852638 | GT-AG | 0 | 1.0216184120986432e-05 | 418 | rna-XM_047260068.1 32210488 | 38 | 496303876 | 496304293 | Schistocerca piceifrons 274613 | CGG\|GTAAATATCA...TATTCCTAGATG/ATGTGATTGATT...TCTAG\|GTG | 0 | 1 | 71.847 |
| 179852639 | GT-AG | 0 | 1.000000099473604e-05 | 14117 | rna-XM_047260068.1 32210488 | 39 | 496304553 | 496318669 | Schistocerca piceifrons 274613 | CAG\|GTAAGTGCAA...TCAGTTTCAGTA/ATCAGTTTCAGT...TTCAG\|AAG | 1 | 1 | 74.932 |
| 179852640 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_047260068.1 32210488 | 40 | 496318809 | 496318924 | Schistocerca piceifrons 274613 | ATG\|GTAAGTATGT...CTGTCTGTACAA/ATAAAAATTACA...TTTAG\|GTA | 2 | 1 | 76.587 |
| 179852641 | GT-AG | 0 | 1.000000099473604e-05 | 15169 | rna-XM_047260068.1 32210488 | 41 | 496319028 | 496334196 | Schistocerca piceifrons 274613 | GAG\|GTAAAGCCAA...ATATTTTTTCCG/TACTGCTTCAGT...TGCAG\|GCT | 0 | 1 | 77.814 |
| 179852642 | GT-AG | 0 | 1.000000099473604e-05 | 5194 | rna-XM_047260068.1 32210488 | 42 | 496334321 | 496339514 | Schistocerca piceifrons 274613 | TAA\|GTGAGTGTTA...TTATGTTTAATA/TTATGTTTAATA...AACAG\|GTA | 1 | 1 | 79.29 |
| 179852643 | GT-AG | 0 | 5.026592251110367e-05 | 4406 | rna-XM_047260068.1 32210488 | 43 | 496339649 | 496344054 | Schistocerca piceifrons 274613 | TCT\|GTAAGTAAAG...GAATTTTTGACT/GAATTTTTGACT...TTCAG\|GAA | 0 | 1 | 80.886 |
| 179852644 | GT-AG | 0 | 0.0019773850442337 | 19458 | rna-XM_047260068.1 32210488 | 44 | 496344196 | 496363653 | Schistocerca piceifrons 274613 | AAG\|GTAAACTTTC...TTCTTTTTGAAA/TTCTTTTTGAAA...TTCAG\|TCT | 0 | 1 | 82.565 |
| 179852645 | GT-AG | 0 | 0.0014432621788161 | 8781 | rna-XM_047260068.1 32210488 | 45 | 496363742 | 496372522 | Schistocerca piceifrons 274613 | AAG\|GTATTTTGCA...TTTTTCTGATAT/ATTTTTCTGATA...TACAG\|CAA | 1 | 1 | 83.613 |
| 179852646 | GT-AG | 0 | 5.613505656186955e-05 | 14422 | rna-XM_047260068.1 32210488 | 46 | 496372694 | 496387115 | Schistocerca piceifrons 274613 | CAG\|GTAAATTTCT...ATGGCTTTAGAA/TAGATTCTGATT...CTCAG\|CAC | 1 | 1 | 85.65 |
| 179852647 | GT-AG | 0 | 1.000000099473604e-05 | 2468 | rna-XM_047260068.1 32210488 | 47 | 496387244 | 496389711 | Schistocerca piceifrons 274613 | CAA\|GTGAGTAACA...GTAACTTTAAAT/CTTTAAATAATT...TTCAG\|CCA | 0 | 1 | 87.174 |
| 179852648 | GT-AG | 0 | 1.000000099473604e-05 | 48206 | rna-XM_047260068.1 32210488 | 48 | 496389904 | 496438109 | Schistocerca piceifrons 274613 | CAG\|GTAATTATTG...AACCTTTTAAGG/AGTGAATTGACA...TGTAG\|GTT | 0 | 1 | 89.461 |
| 179852649 | GT-AG | 0 | 1.000000099473604e-05 | 15080 | rna-XM_047260068.1 32210488 | 49 | 496438281 | 496453360 | Schistocerca piceifrons 274613 | AAA\|GTAAGGGGCT...CATTTTATAATA/CCATTTTTTATG...TCCAG\|AAT | 0 | 1 | 91.497 |
| 179852650 | GT-AG | 0 | 1.000000099473604e-05 | 13805 | rna-XM_047260068.1 32210488 | 50 | 496453564 | 496467368 | Schistocerca piceifrons 274613 | TAT\|GTAAGTACCT...CCAGTCATAATT/ATTTTTGTCATC...TGCAG\|ACA | 2 | 1 | 93.914 |
| 179852651 | GT-AG | 0 | 1.1941481430817735e-05 | 93 | rna-XM_047260068.1 32210488 | 51 | 496467556 | 496467648 | Schistocerca piceifrons 274613 | AAG\|GTATGAAGGA...GTTCTTTTATTC/CGTTCTTTTATT...AACAG\|AGC | 0 | 1 | 96.141 |
| 179852652 | GT-AG | 0 | 1.000000099473604e-05 | 33646 | rna-XM_047260068.1 32210488 | 52 | 496467836 | 496501481 | Schistocerca piceifrons 274613 | AAG\|GTTAGTTAAC...AATTTTTTATGT/ATTTAATTAATT...TACAG\|AAC | 1 | 1 | 98.368 |
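In the rows above, the `dinucleotide_pair` value appears to be recoverable from `scored_motifs`: the two bases right after the first `|` and the two bases right before the last `|` match it. The sketch below illustrates that reading on two values copied from introns 179852601 and 179852624. It assumes that the text between the first and last `|` is the intron and the short flanks outside are exon sequence, which the page does not state explicitly; `terminal_dinucleotides` is a hypothetical helper.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return e.g. 'GT-AG' from a scored_motifs string.

    Assumes the layout seen in the listed rows: the text between the
    first and last '|' is the intron, with exon flanks outside.
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    intron = scored_motifs[first + 1:last]
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs -> dinucleotide_pair, as listed for the two introns.
examples = {
    "CAG|GTATAGTAAA...ATTTTCTTGTTT/TTTGTGTTGAAT...TTCAG|TTG": "GT-AG",
    "ATG|GCAAGTATAA...ATTTTGTTATAT/AATTTTGTTATA...TTCAG|GAC": "GC-AG",
}
for motifs, expected in examples.items():
    assert terminal_dinucleotides(motifs) == expected
```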
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
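To illustrate how the `transcript_id` index supports the filter used on this page (`transcript_id = 32210488`), here is a minimal sketch using Python's standard `sqlite3` module. It builds an in-memory table with only a subset of the columns above, inserts three rows copied from the listing, and asks SQLite for its query plan. The trimmed schema and the in-memory database are assumptions made for the example, not the layout of the real database file.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Trimmed copy of the schema: only the columns needed for this example.
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    dinucleotide_pair TEXT,
    is_minor INTEGER,
    score REAL,
    length INTEGER,
    transcript_id INTEGER,
    ordinal_index INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

# Three rows copied from the listing for transcript 32210488.
rows = [
    (179852601, "GT-AG", 0, 0.0001146078226969, 19253, 32210488, 1),
    (179852602, "GT-AG", 0, 1.000000099473604e-05, 8322, 32210488, 2),
    (179852624, "GC-AG", 0, 1.000000099473604e-05, 9075, 32210488, 24),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", rows)

query = (
    "SELECT id, dinucleotide_pair FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
result = conn.execute(query, (32210488,)).fetchall()

# The last column of each EXPLAIN QUERY PLAN row is the human-readable detail.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (32210488,)).fetchall()
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)
```

The exact wording of the plan differs between SQLite versions, so the sketch prints it rather than relying on a fixed string; the point is only that the equality filter on `transcript_id` can be served by `idx_introns_transcript_id`.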