introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotides of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron within the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript, as a percentage of coding length
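In the rows listed for this transcript, `length` equals `end - start + 1`, which suggests that `start` and `end` are both inclusive. The following is a minimal Python sketch of that check, using three rows copied from the table; the helper name `inclusive_length` is illustrative and not part of the database.

```python
# (start, end, length) for introns 1-3 of transcript 32210526, copied from the table.
rows = [
    (43899142, 43899214, 73),
    (43899353, 43941743, 42391),
    (43942009, 43942127, 119),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end positions are both included."""
    return end - start + 1


# Every sampled row agrees with the inclusive-coordinate reading.
all_consistent = all(inclusive_length(s, e) == n for s, e, n in rows)
print(all_consistent)
```

The check covers only the sampled rows; it does not establish the convention for the whole database.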
34 rows where transcript_id = 32210526
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179853531 | GT-AG | 0 | 1.0115894213089664e-05 | 73 | rna-XM_047260813.1 32210526 | 1 | 43899142 | 43899214 | Schistocerca piceifrons 274613 | CAG\|GTAGGCATGA...CCAACCTTTACG/CGTTTGTTTACG...TGTAG\|GAG | 0 | 1 | 0.411 |
| 179853532 | GT-AG | 0 | 1.000000099473604e-05 | 42391 | rna-XM_047260813.1 32210526 | 2 | 43899353 | 43941743 | Schistocerca piceifrons 274613 | AAG\|GTAAATACAA...ACTGTTTAAACT/TACTGTTTAAAC...GGCAG\|AAT | 0 | 1 | 2.776 |
| 179853533 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_047260813.1 32210526 | 3 | 43942009 | 43942127 | Schistocerca piceifrons 274613 | AAG\|GTATGTACAA...TGTGTTGTAGAA/ATCTCACTAAAG...TACAG\|GGG | 1 | 1 | 7.318 |
| 179853534 | GT-AG | 0 | 1.471707920074268e-05 | 19466 | rna-XM_047260813.1 32210526 | 4 | 43942223 | 43961688 | Schistocerca piceifrons 274613 | GAG\|GTTTGTATTT...AACATCTTGAGG/AGGTTTGTCAAT...TCCAG\|GTG | 0 | 1 | 8.946 |
| 179853535 | GT-AG | 0 | 1.000000099473604e-05 | 12472 | rna-XM_047260813.1 32210526 | 5 | 43961846 | 43974317 | Schistocerca piceifrons 274613 | ATG\|GTAAGTCATT...CTTTTTTTTCCT/CCATAATTAACA...TGCAG\|TGA | 1 | 1 | 11.637 |
| 179853536 | GT-AG | 0 | 4.323914182261153e-05 | 1347 | rna-XM_047260813.1 32210526 | 6 | 43974449 | 43975795 | Schistocerca piceifrons 274613 | GAG\|GTAATTATTC...TGTTTTTTATTT/CTGTTTTTTATT...ACTAG\|GAA | 0 | 1 | 13.882 |
| 179853537 | GT-AG | 0 | 1.000000099473604e-05 | 21480 | rna-XM_047260813.1 32210526 | 7 | 43976068 | 43997547 | Schistocerca piceifrons 274613 | TGG\|GTGAGTTATA...TAATTTTTAACA/TAATTTTTAATT...TGCAG\|GAA | 2 | 1 | 18.543 |
| 179853538 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_047260813.1 32210526 | 8 | 43997568 | 43997648 | Schistocerca piceifrons 274613 | CAG\|GTGAGGCATT...CTCTTCTTGCCG/CCGTCGTCCATC...TGCAG\|ACT | 1 | 1 | 18.886 |
| 179853539 | GT-AG | 0 | 1.000000099473604e-05 | 18540 | rna-XM_047260813.1 32210526 | 9 | 43997749 | 44016288 | Schistocerca piceifrons 274613 | AAG\|GTAAGTAGTT...TAGTTCATATAA/AGGTAGTTCATA...TATAG\|GGA | 2 | 1 | 20.6 |
| 179853540 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_047260813.1 32210526 | 10 | 44016476 | 44016566 | Schistocerca piceifrons 274613 | CCG\|GTAAGTGTCT...TATATTTTAAAA/TATATTTTAAAA...TGCAG\|AAG | 0 | 1 | 23.805 |
| 179853541 | GT-AG | 0 | 1.000000099473604e-05 | 8653 | rna-XM_047260813.1 32210526 | 11 | 44016716 | 44025368 | Schistocerca piceifrons 274613 | GAG\|GTAAGATTTG...GTTTGCATAACT/GTTTGCATAACT...CTTAG\|ACG | 2 | 1 | 26.358 |
| 179853542 | GC-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_047260813.1 32210526 | 12 | 44025592 | 44025692 | Schistocerca piceifrons 274613 | TTG\|GCAAGTTGTG...GCTTTCTTAAAA/AGCTTTCTTAAA...TACAG\|GAA | 0 | 1 | 30.18 |
| 179853543 | GT-AG | 0 | 0.0001900567967234 | 19212 | rna-XM_047260813.1 32210526 | 13 | 44025754 | 44044965 | Schistocerca piceifrons 274613 | TAG\|GTATGTCAGA...CAAGCGTTAATT/CGTTAATTAATA...TACAG\|TTA | 1 | 1 | 31.225 |
| 179853544 | GT-AG | 0 | 1.000000099473604e-05 | 7965 | rna-XM_047260813.1 32210526 | 14 | 44045116 | 44053080 | Schistocerca piceifrons 274613 | TAG\|GTGACTGTGC...GCTTCTTTTGTT/TAGAGTATAATT...CTTAG\|ATA | 1 | 1 | 33.796 |
| 179853545 | GT-AG | 0 | 1.000000099473604e-05 | 20501 | rna-XM_047260813.1 32210526 | 15 | 44053413 | 44073913 | Schistocerca piceifrons 274613 | GAG\|GTAAGTGTGC...TTTTTTTCAATT/GTTTTTTTCAAT...CATAG\|GAC | 0 | 1 | 39.486 |
| 179853546 | GT-AG | 0 | 1.000000099473604e-05 | 5041 | rna-XM_047260813.1 32210526 | 16 | 44074061 | 44079101 | Schistocerca piceifrons 274613 | AAG\|GTATTGCCGT...TGTGTATTACTG/AGGCGACTCACT...TACAG\|GAA | 0 | 1 | 42.005 |
| 179853547 | GT-AG | 0 | 1.000000099473604e-05 | 2750 | rna-XM_047260813.1 32210526 | 17 | 44079270 | 44082019 | Schistocerca piceifrons 274613 | CAG\|GTAAGTCATA...GTGCTCTTACTT/TGTGCTCTTACT...TGTAG\|TTA | 0 | 1 | 44.884 |
| 179853548 | GT-AG | 0 | 2.2110035716945315e-05 | 8858 | rna-XM_047260813.1 32210526 | 18 | 44082201 | 44091058 | Schistocerca piceifrons 274613 | TGG\|GTAAGCTAAA...CATTGTTTGAAA/TCTGTTGTCATT...TACAG\|ATA | 1 | 1 | 47.986 |
| 179853549 | GT-AG | 0 | 1.000000099473604e-05 | 17245 | rna-XM_047260813.1 32210526 | 19 | 44091334 | 44108578 | Schistocerca piceifrons 274613 | GAG\|GTTAGTTGCT...TCAGTCTTAAAT/TAGCTACTGACT...ATCAG\|CAG | 0 | 1 | 52.699 |
| 179853550 | GT-AG | 0 | 1.000000099473604e-05 | 2859 | rna-XM_047260813.1 32210526 | 20 | 44108767 | 44111625 | Schistocerca piceifrons 274613 | AAG\|GTAAGTAATG...ACATCTTTGCTT/TGTGTGTTCACA...TTCAG\|GGG | 2 | 1 | 55.921 |
| 179853551 | GT-AG | 0 | 1.000000099473604e-05 | 12609 | rna-XM_047260813.1 32210526 | 21 | 44111864 | 44124472 | Schistocerca piceifrons 274613 | AAG\|GTGTTGTGCA...AGTGTTTTATTT/CAGTGTTTTATT...CACAG\|GGC | 0 | 1 | 60.0 |
| 179853552 | GT-AG | 0 | 1.000000099473604e-05 | 3915 | rna-XM_047260813.1 32210526 | 22 | 44124636 | 44128550 | Schistocerca piceifrons 274613 | GAG\|GTGAGAACAT...AGTCTGTTAACT/AGTCTGTTAACT...TTTAG\|GCA | 1 | 1 | 62.793 |
| 179853553 | GT-AG | 0 | 1.000000099473604e-05 | 7988 | rna-XM_047260813.1 32210526 | 23 | 44128731 | 44136718 | Schistocerca piceifrons 274613 | TAG\|GTACTATTAT...AGCTCATTTGTT/TGGTAGCTCATT...TACAG\|GAC | 1 | 1 | 65.878 |
| 179853554 | GT-AG | 0 | 6.521149782758894e-05 | 88 | rna-XM_047260813.1 32210526 | 24 | 44136902 | 44136989 | Schistocerca piceifrons 274613 | CAG\|GTCTGTTCTT...TTCCTATTAACA/TTCCTATTAACA...TTTAG\|GCT | 1 | 1 | 69.015 |
| 179853555 | GT-AG | 0 | 1.000000099473604e-05 | 77 | rna-XM_047260813.1 32210526 | 25 | 44137117 | 44137193 | Schistocerca piceifrons 274613 | CAG\|GTTTGTGGAG...TTTGTGTTTAAT/TATATACTCACA...TTCAG\|GGC | 2 | 1 | 71.191 |
| 179853556 | GT-AG | 0 | 1.1312236950079354e-05 | 16721 | rna-XM_047260813.1 32210526 | 26 | 44137419 | 44154139 | Schistocerca piceifrons 274613 | GAA\|GTAAGTATAT...AATGCCATAATC/AATTATTTGATT...TACAG\|TCA | 2 | 1 | 75.047 |
| 179853557 | GT-AG | 0 | 0.0004470847593959 | 8820 | rna-XM_047260813.1 32210526 | 27 | 44154342 | 44163161 | Schistocerca piceifrons 274613 | ATG\|GTATGTATTT...GTTATATTAAAT/GTTATATTAAAT...TATAG\|GTG | 0 | 1 | 78.509 |
| 179853558 | GT-AG | 0 | 1.000000099473604e-05 | 15548 | rna-XM_047260813.1 32210526 | 28 | 44163395 | 44178942 | Schistocerca piceifrons 274613 | CAG\|GTGATGTCAT...GTGTGCTAAATA/TGTGTGCTAAAT...TACAG\|GTA | 2 | 1 | 82.502 |
| 179853559 | GT-AG | 0 | 2.9177314362312496e-05 | 241 | rna-XM_047260813.1 32210526 | 29 | 44179139 | 44179379 | Schistocerca piceifrons 274613 | GAG\|GTATGTGGGG...GAATTATTAACT/GAATTATTAACT...TATAG\|GTC | 0 | 1 | 85.861 |
| 179853560 | GT-AG | 0 | 1.0204787324039952e-05 | 27347 | rna-XM_047260813.1 32210526 | 30 | 44179448 | 44206794 | Schistocerca piceifrons 274613 | AAG\|GTAGATATTT...AGAACATTGAAA/TTGAAAGTGATT...AACAG\|AAA | 2 | 1 | 87.027 |
| 179853561 | GT-AG | 0 | 1.000000099473604e-05 | 1001 | rna-XM_047260813.1 32210526 | 31 | 44206934 | 44207934 | Schistocerca piceifrons 274613 | AGG\|GTAAGTGTGA...TACATTTTATAT/TTACATTTTATA...TTCAG\|GCT | 0 | 1 | 89.409 |
| 179853562 | GT-AG | 0 | 1.000000099473604e-05 | 16031 | rna-XM_047260813.1 32210526 | 32 | 44208124 | 44224154 | Schistocerca piceifrons 274613 | AAG\|GTAAGTGAGT...GTCGTCATAATC/GAATCTCTCATA...AACAG\|ACT | 0 | 1 | 92.648 |
| 179853563 | GT-AG | 0 | 1.000000099473604e-05 | 16580 | rna-XM_047260813.1 32210526 | 33 | 44224339 | 44240918 | Schistocerca piceifrons 274613 | CAG\|GTAAGCAGCA...TTGTTTATAATT/TTATAATTAATT...AACAG\|ATC | 1 | 1 | 95.801 |
| 179853564 | GT-AG | 0 | 1.000000099473604e-05 | 7172 | rna-XM_047260813.1 32210526 | 34 | 44241051 | 44248222 | Schistocerca piceifrons 274613 | CAG\|GTAATTTGCT...TTTGTCTTCATT/TTTGTCTTCATT...TACAG\|CAA | 1 | 1 | 98.063 |
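The `phase`, `start`, and `end` columns can be cross-checked against each other. The sketch below assumes the transcript lies on the forward strand, that coordinates are inclusive, and that the exons between these introns are entirely coding; none of this is stated in the table itself. Under those assumptions, the phase of each intron should equal the previous intron's phase plus the length of the intervening exon, modulo 3. The function names are illustrative.

```python
# (ordinal_index, start, end, phase) for introns 1-5, copied from the table.
introns = [
    (1, 43899142, 43899214, 0),
    (2, 43899353, 43941743, 0),
    (3, 43942009, 43942127, 1),
    (4, 43942223, 43961688, 0),
    (5, 43961846, 43974317, 1),
]


def exon_lengths(rows):
    """Lengths of the exons lying between consecutive introns (inclusive coordinates)."""
    return [nxt[1] - prev[2] - 1 for prev, nxt in zip(rows, rows[1:])]


def predicted_phases(rows):
    """Propagate the first intron's phase forward through the internal exon lengths."""
    phases = [rows[0][3]]
    for exon_len in exon_lengths(rows):
        phases.append((phases[-1] + exon_len) % 3)
    return phases


print(exon_lengths(introns))      # [138, 265, 95, 157]
print(predicted_phases(introns))  # [0, 0, 1, 0, 1]
```

For these five rows the predicted phases match the `phase` column, which is consistent with the assumptions above but does not confirm them for other transcripts.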
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
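The filter used on this page (`transcript_id = 32210526`) is the kind of lookup that `idx_introns_transcript_id` can serve. The following is a hedged sketch using Python's standard `sqlite3` module with an in-memory database. It uses a reduced version of the schema (five columns only), two rows copied from the table, and one made-up row for a different transcript; it is not a copy of the real database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced, illustrative schema: only the columns this example needs.
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    dinucleotide_pair TEXT,
    length INTEGER,
    transcript_id INTEGER,
    ordinal_index INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (179853531, "GT-AG", 73, 32210526, 1),
        (179853542, "GC-AG", 101, 32210526, 12),
        (1, "GT-AG", 500, 99, 1),  # made-up row for a different transcript
    ],
)

query = (
    "SELECT id, dinucleotide_pair FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
matches = conn.execute(query, (32210526,)).fetchall()

# The fourth column of each EXPLAIN QUERY PLAN row is a human-readable detail string.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (32210526,)).fetchall()
plan_text = " ".join(row[3] for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text

print(matches)
print(uses_index)
```

The exact wording of the query plan varies between SQLite versions, so the sketch only checks whether the index name appears in it.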