introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
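As a rough illustration of how these columns might map onto typed values in client code, here is a minimal Python sketch. The `Intron` class and its `from_row` helper are hypothetical names introduced for this example and are not part of the database; the sample values are copied from the first row of the table on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:  # hypothetical container, not defined by the database
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 0/1
    score: float              # probability of being minor, on a 0-100 scale
    length: int               # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # None (SQL NULL) for introns outside the CDS
    in_cds: bool              # stored as INTEGER 0/1
    relative_position: float

    @classmethod
    def from_row(cls, row: tuple) -> "Intron":
        """Build an Intron from a 14-column row in table column order."""
        values = list(row)
        values[2] = bool(values[2])    # is_minor
        values[12] = bool(values[12])  # in_cds
        return cls(*values)


# Values taken from the row with id 179852293 on this page.
first = Intron.from_row((
    179852293, "GT-AG", 0, 5.931942881828481e-05, 955, 32210480, 1,
    408123687, 408124641, 274613,
    "TAT|GTAAGTTATC...AACTCGTTATTC/TCAGTTCTAATG...TGCAG|ATG",
    1, 1, 0.432,
))
print(first.is_minor, first.phase, first.length)  # False 1 955
```

Converting the 0/1 flags to `bool` and keeping `phase` optional mirrors the column descriptions above; whether that is the most convenient representation depends on the consuming code.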
51 rows where transcript_id = 32210480
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179852293 | GT-AG | 0 | 5.931942881828481e-05 | 955 | rna-XM_047256101.1 32210480 | 1 | 408123687 | 408124641 | Schistocerca piceifrons 274613 | TAT\|GTAAGTTATC...AACTCGTTATTC/TCAGTTCTAATG...TGCAG\|ATG | 1 | 1 | 0.432 |
| 179852294 | GT-AG | 0 | 1.000000099473604e-05 | 1229 | rna-XM_047256101.1 32210480 | 2 | 408124742 | 408125970 | Schistocerca piceifrons 274613 | GAG\|GTAAGAGCAA...TTGTTCTTGCTT/TATTGTTTCACT...TACAG\|GCA | 2 | 1 | 1.435 |
| 179852295 | GT-AG | 0 | 1.000000099473604e-05 | 18000 | rna-XM_047256101.1 32210480 | 3 | 408126155 | 408144154 | Schistocerca piceifrons 274613 | AAG\|GTAATTCTGC...TTTGCTTCAATG/ATTTGCTTCAAT...TGCAG\|GTA | 0 | 1 | 3.282 |
| 179852296 | GT-AG | 0 | 5.0688837419267366e-05 | 7015 | rna-XM_047256101.1 32210480 | 4 | 408144320 | 408151334 | Schistocerca piceifrons 274613 | AAG\|GTATGTACAT...AATTCCTTCCTT/AGTGTGCTAAAT...CCTAG\|GCT | 0 | 1 | 4.938 |
| 179852297 | GT-AG | 0 | 1.000000099473604e-05 | 425 | rna-XM_047256101.1 32210480 | 5 | 408151552 | 408151976 | Schistocerca piceifrons 274613 | CAG\|GTAATGTAAT...AATTTTTTGGTA/GTAATTCTGATT...TCCAG\|ATT | 1 | 1 | 7.116 |
| 179852298 | GT-AG | 0 | 1.000000099473604e-05 | 8847 | rna-XM_047256101.1 32210480 | 6 | 408152249 | 408161095 | Schistocerca piceifrons 274613 | GAG\|GTATGGAAAA...GTATCCTTGCAT/GTAACATTCATT...TCTAG\|GTA | 0 | 1 | 9.846 |
| 179852299 | GT-AG | 0 | 1.1092608873995822e-05 | 2458 | rna-XM_047256101.1 32210480 | 7 | 408161315 | 408163772 | Schistocerca piceifrons 274613 | CAG\|GTAAATATTT...TGCTACTTAATT/TTATATCTCATG...TTCAG\|AGC | 0 | 1 | 12.045 |
| 179852300 | GT-AG | 0 | 1.2160025699547529e-05 | 2958 | rna-XM_047256101.1 32210480 | 8 | 408163983 | 408166940 | Schistocerca piceifrons 274613 | CAG\|GTCTGTATTA...TTTTTTTTTTTT/ACAGCATTTATT...TTCAG\|GGC | 0 | 1 | 14.152 |
| 179852301 | GT-AG | 0 | 1.000000099473604e-05 | 25863 | rna-XM_047256101.1 32210480 | 9 | 408167059 | 408192921 | Schistocerca piceifrons 274613 | ACG\|GTGAGTGCAG...TTGTTTATAATT/ATATTTTTCATG...TCTAG\|GAA | 1 | 1 | 15.337 |
| 179852302 | GT-AG | 0 | 1.000000099473604e-05 | 16350 | rna-XM_047256101.1 32210480 | 10 | 408193028 | 408209377 | Schistocerca piceifrons 274613 | AGT\|GTGAGTATTA...ATTGCATTACTA/ATCTTGCTGAAA...TTTAG\|ATC | 2 | 1 | 16.401 |
| 179852303 | GT-AG | 0 | 1.000000099473604e-05 | 1458 | rna-XM_047256101.1 32210480 | 11 | 408209518 | 408210975 | Schistocerca piceifrons 274613 | AAG\|GTTAGTTTTG...TTTTTCTTTCCA/AGTACTCTCATT...AACAG\|GAA | 1 | 1 | 17.806 |
| 179852304 | GT-AG | 0 | 0.1441053512762322 | 7064 | rna-XM_047256101.1 32210480 | 12 | 408211151 | 408218214 | Schistocerca piceifrons 274613 | GAG\|GTATGCTTCT...CTCATTTTAGCA/GGTTCGCTCATT...TCCAG\|AGA | 2 | 1 | 19.562 |
| 179852305 | GT-AG | 0 | 0.0005650746450929 | 3585 | rna-XM_047256101.1 32210480 | 13 | 408218374 | 408221958 | Schistocerca piceifrons 274613 | CAG\|GTATATTAAA...TCTCACTTATCC/TGTTCTCTCACT...TACAG\|GGA | 2 | 1 | 21.158 |
| 179852306 | GT-AG | 0 | 1.000000099473604e-05 | 3185 | rna-XM_047256101.1 32210480 | 14 | 408222104 | 408225288 | Schistocerca piceifrons 274613 | CAG\|GTAGGGTGTT...TGTGTTTTATTT/TTTTATTTTATT...AGCAG\|GCT | 0 | 1 | 22.614 |
| 179852307 | GT-AG | 0 | 2.5593038443781955e-05 | 27105 | rna-XM_047256101.1 32210480 | 15 | 408225439 | 408252543 | Schistocerca piceifrons 274613 | ATT\|GTGAGTTTTG...GATGTCTTATCT/AAATTTCTGATG...TACAG\|GGT | 0 | 1 | 24.119 |
| 179852308 | GT-AG | 0 | 1.000000099473604e-05 | 17516 | rna-XM_047256101.1 32210480 | 16 | 408252672 | 408270187 | Schistocerca piceifrons 274613 | CTG\|GTTAGTACAT...TGTTTGTTGAAT/TGTTTGTTGAAT...TGCAG\|GTC | 2 | 1 | 25.404 |
| 179852309 | GT-AG | 0 | 1.000000099473604e-05 | 23280 | rna-XM_047256101.1 32210480 | 17 | 408270443 | 408293722 | Schistocerca piceifrons 274613 | CAG\|GTCAGTTCAA...CTGCTTTTGAAT/CAGATTCTCATG...AACAG\|GTT | 2 | 1 | 27.963 |
| 179852310 | GT-AG | 0 | 1.000000099473604e-05 | 4331 | rna-XM_047256101.1 32210480 | 18 | 408294004 | 408298334 | Schistocerca piceifrons 274613 | CTG\|GTAAGTGTAG...TTTTTTTTCTCT/TCCTTAGTGACA...TCCAG\|ATG | 1 | 1 | 30.784 |
| 179852311 | GT-AG | 0 | 1.000000099473604e-05 | 10392 | rna-XM_047256101.1 32210480 | 19 | 408298537 | 408308928 | Schistocerca piceifrons 274613 | CAG\|GTAAGACCAA...TTGTTCTTAAAC/CCCATTTTCACA...CACAG\|ATA | 2 | 1 | 32.811 |
| 179852312 | GT-AG | 0 | 0.0001577376616851 | 27308 | rna-XM_047256101.1 32210480 | 20 | 408309233 | 408336540 | Schistocerca piceifrons 274613 | CAG\|GTATAGTTAA...TCTCTCTCACCA/CTCTCTCTCACC...TTCAG\|CTT | 0 | 1 | 35.863 |
| 179852313 | GT-AG | 0 | 1.000000099473604e-05 | 26336 | rna-XM_047256101.1 32210480 | 21 | 408336733 | 408363068 | Schistocerca piceifrons 274613 | GGG\|GTAAGTACAC...AATTTCTTGAAA/AATTTCTTGAAA...TCCAG\|CAC | 0 | 1 | 37.79 |
| 179852314 | GT-AG | 0 | 1.000000099473604e-05 | 2461 | rna-XM_047256101.1 32210480 | 22 | 408363360 | 408365820 | Schistocerca piceifrons 274613 | AAG\|GTAAGCAACA...TTTCTCTTACAC/ATTTCTCTTACA...TTCAG\|GCC | 0 | 1 | 40.711 |
| 179852315 | GT-AG | 0 | 1.5845450220275457e-05 | 38766 | rna-XM_047256101.1 32210480 | 23 | 408365954 | 408404719 | Schistocerca piceifrons 274613 | CTG\|GTAGGTTTGA...GTTGTTTTTTTT/CAGACTCTGAAG...TACAG\|GCG | 1 | 1 | 42.046 |
| 179852316 | GT-AG | 0 | 1.2540748480299195e-05 | 5492 | rna-XM_047256101.1 32210480 | 24 | 408404911 | 408410402 | Schistocerca piceifrons 274613 | GAT\|GTAAGTATAT...GATTTGTTATTT/GTGTATTTCAGT...ACAAG\|GAC | 0 | 1 | 43.963 |
| 179852317 | GT-AG | 0 | 3.6665577617315295e-05 | 3625 | rna-XM_047256101.1 32210480 | 25 | 408410523 | 408414147 | Schistocerca piceifrons 274613 | CAG\|GTTTGTTGCA...TTTTCTATATTA/AGCACTCTTACA...CTCAG\|GTT | 0 | 1 | 45.167 |
| 179852318 | GT-AG | 0 | 1.000000099473604e-05 | 5076 | rna-XM_047256101.1 32210480 | 26 | 408414409 | 408419484 | Schistocerca piceifrons 274613 | GAG\|GTAAAGAATT...CTCCTCTTACTT/CCTCCTCTTACT...TTTAG\|ACC | 0 | 1 | 47.787 |
| 179852319 | GT-AG | 0 | 1.000000099473604e-05 | 6514 | rna-XM_047256101.1 32210480 | 27 | 408419597 | 408426110 | Schistocerca piceifrons 274613 | AAG\|GTAGGTGAGA...GCTGTCTTGATT/TCATTTTTGATA...TACAG\|AAA | 1 | 1 | 48.911 |
| 179852320 | GT-AG | 0 | 1.000000099473604e-05 | 229 | rna-XM_047256101.1 32210480 | 28 | 408426323 | 408426551 | Schistocerca piceifrons 274613 | GAG\|GTAATTTATA...ATCTTTTCAATG/GATCTTTTCAAT...TCCAG\|GTT | 0 | 1 | 51.039 |
| 179852321 | GT-AG | 0 | 0.0021611533011305 | 6761 | rna-XM_047256101.1 32210480 | 29 | 408426951 | 408433711 | Schistocerca piceifrons 274613 | CTT\|GTATGTATGA...AGAACTATGACT/ATTTGAGTAATA...TATAG\|GAG | 0 | 1 | 55.044 |
| 179852322 | GT-AG | 0 | 1.000000099473604e-05 | 737 | rna-XM_047256101.1 32210480 | 30 | 408433916 | 408434652 | Schistocerca piceifrons 274613 | CTA\|GTGAGTATTA...ATCATTTTCATT/ATCATTTTCATT...CACAG\|GAT | 0 | 1 | 57.091 |
| 179852323 | GT-AG | 0 | 1.000000099473604e-05 | 6217 | rna-XM_047256101.1 32210480 | 31 | 408434826 | 408441042 | Schistocerca piceifrons 274613 | AAG\|GTGAGTAATT...AAACTTTTATTT/TTGGTTTTCATG...TTCAG\|GGC | 2 | 1 | 58.828 |
| 179852324 | GT-AG | 0 | 1.0819450318318565e-05 | 28624 | rna-XM_047256101.1 32210480 | 32 | 408441210 | 408469833 | Schistocerca piceifrons 274613 | AAG\|GTAATCCACA...ATTTTCTTCATA/ATTTTCTTCATA...TACAG\|AGG | 1 | 1 | 60.504 |
| 179852325 | GT-AG | 0 | 2.3406650112332856e-05 | 21985 | rna-XM_047256101.1 32210480 | 33 | 408470115 | 408492099 | Schistocerca piceifrons 274613 | ACA\|GTAAGTAATG...TTCCCCTCAATT/TTTCCCCTCAAT...TTTAG\|AGC | 0 | 1 | 63.324 |
| 179852326 | GT-AG | 0 | 1.000000099473604e-05 | 7418 | rna-XM_047256101.1 32210480 | 34 | 408492402 | 408499819 | Schistocerca piceifrons 274613 | CAA\|GTAAGTACTA...TTTTGATTGATA/TTTTGATTGATA...TCCAG\|CTT | 2 | 1 | 66.356 |
| 179852327 | GT-AG | 0 | 1.000000099473604e-05 | 2987 | rna-XM_047256101.1 32210480 | 35 | 408499947 | 408502933 | Schistocerca piceifrons 274613 | AAG\|GTAAATAGAA...GTGCCCTTTTTG/GAAACTTTCAAG...TATAG\|CCA | 0 | 1 | 67.63 |
| 179852328 | GT-AG | 0 | 1.000000099473604e-05 | 2866 | rna-XM_047256101.1 32210480 | 36 | 408503122 | 408505987 | Schistocerca piceifrons 274613 | AAG\|GTAATAACGA...AGATTGGTAACA/TAACATTTCAAA...GATAG\|GCC | 2 | 1 | 69.517 |
| 179852329 | GT-AG | 0 | 1.000000099473604e-05 | 24705 | rna-XM_047256101.1 32210480 | 37 | 408506106 | 408530810 | Schistocerca piceifrons 274613 | TTG\|GTGAGTTCCT...TTTTTATTAACT/TTTTTATTAACT...TACAG\|ATG | 0 | 1 | 70.702 |
| 179852330 | GT-AG | 0 | 5.3510946963846965e-05 | 9078 | rna-XM_047256101.1 32210480 | 38 | 408530976 | 408540053 | Schistocerca piceifrons 274613 | TAC\|GTAAGTATCT...AACTTTTTAGTA/TATTGTTTCATT...AAAAG\|GTG | 0 | 1 | 72.358 |
| 179852331 | GT-AG | 0 | 1.000000099473604e-05 | 345 | rna-XM_047256101.1 32210480 | 39 | 408540237 | 408540581 | Schistocerca piceifrons 274613 | AAG\|GTAATTATTA...ATAATCTCAATG/AATAATCTCAAT...TGCAG\|GGG | 0 | 1 | 74.195 |
| 179852332 | GT-AG | 0 | 1.000000099473604e-05 | 4370 | rna-XM_047256101.1 32210480 | 40 | 408540729 | 408545098 | Schistocerca piceifrons 274613 | GAG\|GTAGAGTAGA...CATGTTGTAATT/GTACTATTCACT...CACAG\|TTG | 0 | 1 | 75.67 |
| 179852333 | GT-AG | 0 | 1.571353438924114e-05 | 3032 | rna-XM_047256101.1 32210480 | 41 | 408545352 | 408548383 | Schistocerca piceifrons 274613 | CTG\|GTACGTATAA...GATGTTTTCTTT/TATGAGATAAGA...TTCAG\|CTG | 1 | 1 | 78.209 |
| 179852334 | GT-AG | 0 | 4.353289635396652e-05 | 14660 | rna-XM_047256101.1 32210480 | 42 | 408548638 | 408563297 | Schistocerca piceifrons 274613 | ATG\|GTTTGTTCAT...GTTCTCATAGCT/TTAGTTCTCATA...TACAG\|TTT | 0 | 1 | 80.759 |
| 179852335 | GT-AG | 0 | 1.000000099473604e-05 | 10240 | rna-XM_047256101.1 32210480 | 43 | 408563489 | 408573728 | Schistocerca piceifrons 274613 | ACT\|GTGAGTCTGA...TTAACCTAAATC/CTGTATTTAACC...TCTAG\|GGC | 2 | 1 | 82.676 |
| 179852336 | GT-AG | 0 | 1.000000099473604e-05 | 194 | rna-XM_047256101.1 32210480 | 44 | 408573907 | 408574100 | Schistocerca piceifrons 274613 | GCA\|GTGAGTTACT...TATTTCTCAAAT/TTATTTCTCAAA...TCCAG\|GCT | 0 | 1 | 84.463 |
| 179852337 | GT-AG | 0 | 1.000000099473604e-05 | 1736 | rna-XM_047256101.1 32210480 | 45 | 408574221 | 408575956 | Schistocerca piceifrons 274613 | CAG\|GTTTGTAGTA...CTAATCTTATAC/TTAAAACTAATC...TGCAG\|GTT | 0 | 1 | 85.667 |
| 179852338 | GT-AG | 0 | 3.3268675102772155e-05 | 1449 | rna-XM_047256101.1 32210480 | 46 | 408576074 | 408577522 | Schistocerca piceifrons 274613 | GAG\|GTAAGCCTCT...TTCTTCTTTTCT/TAGGTATTAATG...AACAG\|GCA | 0 | 1 | 86.841 |
| 179852339 | GT-AG | 0 | 0.0006249919251181 | 4517 | rna-XM_047256101.1 32210480 | 47 | 408577758 | 408582274 | Schistocerca piceifrons 274613 | TAG\|GTAAATTTCT...AATTTCTTATCC/TTTATATTAATT...TACAG\|AAC | 1 | 1 | 89.2 |
| 179852340 | GT-AG | 0 | 1.000000099473604e-05 | 9050 | rna-XM_047256101.1 32210480 | 48 | 408582508 | 408591557 | Schistocerca piceifrons 274613 | AAG\|GTCAGCTACA...TCTGTTTTAAAT/TCTGTTTTAAAT...TCTAG\|GGA | 0 | 1 | 91.539 |
| 179852341 | GT-AG | 0 | 1.000000099473604e-05 | 19766 | rna-XM_047256101.1 32210480 | 49 | 408591762 | 408611527 | Schistocerca piceifrons 274613 | CAG\|GTTAGAATTC...AGATCCTTCATA/TACTTTCTAATT...TTTAG\|CAC | 0 | 1 | 93.586 |
| 179852342 | GT-AG | 0 | 1.000000099473604e-05 | 121 | rna-XM_047256101.1 32210480 | 50 | 408611666 | 408611786 | Schistocerca piceifrons 274613 | AAG\|GTAAGTTCAG...TATTTCTAACCA/ATATTTCTAACC...TTCAG\|GTG | 0 | 1 | 94.971 |
| 179852343 | GT-AG | 0 | 1.000000099473604e-05 | 27898 | rna-XM_047256101.1 32210480 | 51 | 408612040 | 408639937 | Schistocerca piceifrons 274613 | CGG\|GTTCGTATTG...GGTAACTTGATG/TTGGTGGTAACT...TACAG\|ATG | 1 | 1 | 97.511 |
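
In the rows above, `length` appears to equal `end - start + 1`, which would be consistent with 1-based inclusive coordinates. That reading is an inference from the displayed values, not something the page states. The short Python sketch below checks it on the first three rows and also computes the gap between consecutive introns, which is presumably the intervening exon. The variable names are illustrative only.

```python
# (id, length, start, end) copied from the first three rows of the table.
rows = [
    (179852293, 955, 408123687, 408124641),
    (179852294, 1229, 408124742, 408125970),
    (179852295, 18000, 408126155, 408144154),
]

# Check the assumed relation length == end - start + 1 for each row.
for intron_id, length, start, end in rows:
    assert length == end - start + 1, f"unexpected length for intron {intron_id}"

# Bases between the end of one intron and the start of the next.
gaps = [nxt[2] - prev[3] - 1 for prev, nxt in zip(rows, rows[1:])]
print(gaps)  # [100, 184]
```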
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
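
To illustrate how the `transcript_id` index supports the filter used on this page, the following sketch recreates the table in an in-memory SQLite database and asks for the query plan. It assumes Python's standard `sqlite3` module, keeps only the one relevant index for brevity, and leaves the table empty. The exact plan wording varies between SQLite versions, so only the index name is checked.

```python
import sqlite3

# Table definition as shown above, with only the transcript_id index kept.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# The same filter as "rows where transcript_id = 32210480".
query = "SELECT id, ordinal_index, length FROM introns WHERE transcript_id = ?"
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (32210480,)).fetchall()

# The human-readable description is the last column of each plan row.
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
conn.close()
```

If the plan mentions `idx_introns_transcript_id`, SQLite is looking rows up through the index rather than scanning the whole table.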