introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
34 rows where transcript_id = 32210510
This data as json, CSV (advanced)
Suggested facets: is_minor, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179853143 | GT-AG | 0 | 1.000000099473604e-05 | 206688 | rna-XM_047265991.1 32210510 | 1 | 658368999 | 658575686 | Schistocerca piceifrons 274613 | AAG|GTAAGCTCGG...CTGTCTGTACCT/CTTCCGCTCATG...TGCAG|GTT | 1 | 1 | 0.339 |
| 179853144 | GT-AG | 1 | 99.99968629017218 | 210117 | rna-XM_047265991.1 32210510 | 2 | 658158775 | 658368891 | Schistocerca piceifrons 274613 | TAC|GTATCCTTTC...TTTTCCTTGACT/CCTTGACTGACT...AACAG|ACC | 0 | 1 | 1.988 |
| 179853145 | GT-AG | 0 | 1.000000099473604e-05 | 154258 | rna-XM_047265991.1 32210510 | 3 | 658004454 | 658158711 | Schistocerca piceifrons 274613 | ACG|GTAAGTACAG...CTGTTGTTACCT/GCTGTTGTTACC...TGCAG|GAG | 0 | 1 | 2.959 |
| 179853146 | GT-AG | 0 | 1.000000099473604e-05 | 4329 | rna-XM_047265991.1 32210510 | 4 | 657999872 | 658004200 | Schistocerca piceifrons 274613 | TCG|GTCAGTGCTC...ATCTTTTTATTG/TATCTTTTTATT...TTCAG|GCA | 1 | 1 | 6.858 |
| 179853147 | GT-AG | 0 | 1.000000099473604e-05 | 11496 | rna-XM_047265991.1 32210510 | 5 | 657988185 | 657999680 | Schistocerca piceifrons 274613 | AAG|GTGAGCCAAT...GAGATCACAATA/AATATAGTGAGT...TTCAG|AAC | 0 | 1 | 9.801 |
| 179853148 | GT-AG | 0 | 0.0004304844839298 | 7664 | rna-XM_047265991.1 32210510 | 6 | 657980302 | 657987965 | Schistocerca piceifrons 274613 | GAG|GTATTGTCTT...AACTTTTTAGTA/GTTATTCTCATT...TACAG|GAT | 0 | 1 | 13.176 |
| 179853149 | GT-AG | 0 | 1.000000099473604e-05 | 5888 | rna-XM_047265991.1 32210510 | 7 | 657974249 | 657980136 | Schistocerca piceifrons 274613 | GAG|GTCAGATACA...ATGTTTTTGTTG/TGTAAGCTCATT...TGCAG|GCA | 0 | 1 | 15.719 |
| 179853150 | GT-AG | 0 | 0.0002717764738291 | 11437 | rna-XM_047265991.1 32210510 | 8 | 657962569 | 657974005 | Schistocerca piceifrons 274613 | ATT|GTAAGTTTTG...ATGTCTTTTTTT/TATGTTTTCATG...TGCAG|TAC | 0 | 1 | 19.464 |
| 179853151 | GT-AG | 0 | 5.229330486513344e-05 | 3226 | rna-XM_047265991.1 32210510 | 9 | 657959217 | 657962442 | Schistocerca piceifrons 274613 | AAG|GTAAGCATAC...ATATTCTTAATG/TTTTTTTTCAAA...GCTAG|GGA | 0 | 1 | 21.405 |
| 179853152 | GT-AG | 0 | 0.0003323326064341 | 625 | rna-XM_047265991.1 32210510 | 10 | 657958352 | 657958976 | Schistocerca piceifrons 274613 | CAG|GTATAGTATG...TTCTTGTTAACC/TTCTTGTTAACC...AACAG|GAT | 0 | 1 | 25.104 |
| 179853153 | GT-AG | 0 | 1.000000099473604e-05 | 3612 | rna-XM_047265991.1 32210510 | 11 | 657954606 | 657958217 | Schistocerca piceifrons 274613 | ATG|GTGTGTAAAT...ATGCTATTAATG/ATGCTATTAATG...GGCAG|GTA | 2 | 1 | 27.169 |
| 179853154 | GT-AG | 0 | 0.0002680507774497 | 2090 | rna-XM_047265991.1 32210510 | 12 | 657952297 | 657954386 | Schistocerca piceifrons 274613 | TAA|GTAAGTTGTC...CATCTTTTAATA/CATCTTTTAATA...TGCAG|ATC | 2 | 1 | 30.544 |
| 179853155 | GT-AG | 0 | 1.000000099473604e-05 | 1055 | rna-XM_047265991.1 32210510 | 13 | 657951072 | 657952126 | Schistocerca piceifrons 274613 | GGG|GTGAGTATTA...TATATTTTCATT/TATATTTTCATT...AACAG|AGT | 1 | 1 | 33.164 |
| 179853156 | GT-AG | 0 | 0.0028223276070611 | 22031 | rna-XM_047265991.1 32210510 | 14 | 657928904 | 657950934 | Schistocerca piceifrons 274613 | GAA|GTATGTTCAG...AAGTTTGTAATA/TATTGTTTCATT...ACTAG|GTT | 0 | 1 | 35.275 |
| 179853157 | GT-AG | 0 | 1.000000099473604e-05 | 15501 | rna-XM_047265991.1 32210510 | 15 | 657913242 | 657928742 | Schistocerca piceifrons 274613 | AAG|GTGAGTTTCT...TATTTTTCAAAC/AGAAGATTCATC...TTCAG|GGC | 2 | 1 | 37.756 |
| 179853158 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_047265991.1 32210510 | 16 | 657912941 | 657913039 | Schistocerca piceifrons 274613 | GAG|GTAATTCATT...CTTCTTTTGGCA/CTCCAATTTATT...TGCAG|CTG | 0 | 1 | 40.869 |
| 179853159 | GT-AG | 0 | 0.0042513575350763 | 12809 | rna-XM_047265991.1 32210510 | 17 | 657899979 | 657912787 | Schistocerca piceifrons 274613 | TCA|GTAAGCTTTT...TATTCTATATTA/ATAAATTTAATT...ACTAG|GTT | 0 | 1 | 43.227 |
| 179853160 | GT-AG | 0 | 1.000000099473604e-05 | 919 | rna-XM_047265991.1 32210510 | 18 | 657898865 | 657899783 | Schistocerca piceifrons 274613 | GGG|GTCAGCTACT...GATTTTTTACTT/TGATTTTTTACT...AACAG|GGT | 0 | 1 | 46.232 |
| 179853161 | GT-AG | 0 | 1.4127303080326326e-05 | 9291 | rna-XM_047265991.1 32210510 | 19 | 657889372 | 657898662 | Schistocerca piceifrons 274613 | AAG|GTAAGTTTTT...GTAACATTATCT/CTATGTATCATT...CACAG|GTA | 1 | 1 | 49.345 |
| 179853162 | GT-AG | 0 | 1.000000099473604e-05 | 5123 | rna-XM_047265991.1 32210510 | 20 | 657884094 | 657889216 | Schistocerca piceifrons 274613 | CGG|GTAAGTGCTT...ATTTTCTCATTT/AATTTTCTCATT...CTAAG|GAC | 0 | 1 | 51.734 |
| 179853163 | GT-AG | 0 | 1.000000099473604e-05 | 157 | rna-XM_047265991.1 32210510 | 21 | 657883712 | 657883868 | Schistocerca piceifrons 274613 | AAG|GTACTAAGTA...TGGATCTGAATG/AAATTATTTACT...TTCAG|TAT | 0 | 1 | 55.201 |
| 179853164 | GT-AG | 0 | 1.000000099473604e-05 | 2686 | rna-XM_047265991.1 32210510 | 22 | 657880891 | 657883576 | Schistocerca piceifrons 274613 | GCT|GTAAGTAGCT...TTATGTTTAAAA/ATGATTTTTATG...TACAG|GCA | 0 | 1 | 57.282 |
| 179853165 | GT-AG | 0 | 1.000000099473604e-05 | 344 | rna-XM_047265991.1 32210510 | 23 | 657880317 | 657880660 | Schistocerca piceifrons 274613 | GAT|GTTAGTAATT...ACAGCTTTGTTA/AGAAAGCTCATT...TTCAG|GAA | 2 | 1 | 60.826 |
| 179853166 | GT-AG | 0 | 1.000000099473604e-05 | 12452 | rna-XM_047265991.1 32210510 | 24 | 657867684 | 657880135 | Schistocerca piceifrons 274613 | AAG|GTCAGAGACT...TATGTTTTGAAA/TGTAAATTCACT...TTCAG|TGT | 0 | 1 | 63.615 |
| 179853167 | GT-AG | 0 | 1.000000099473604e-05 | 23238 | rna-XM_047265991.1 32210510 | 25 | 657844281 | 657867518 | Schistocerca piceifrons 274613 | TGG|GTAAGTAGTT...CGTTGTTTGATT/CGTTGTTTGATT...TTCAG|GTT | 0 | 1 | 66.158 |
| 179853168 | GT-AG | 0 | 1.000000099473604e-05 | 4018 | rna-XM_047265991.1 32210510 | 26 | 657840115 | 657844132 | Schistocerca piceifrons 274613 | ATG|GTAAGTGACA...ATTGCTTTATTA/TATTGCTTTATT...TTCAG|GCA | 1 | 1 | 68.439 |
| 179853169 | GT-AG | 0 | 0.000409403995221 | 15793 | rna-XM_047265991.1 32210510 | 27 | 657824041 | 657839833 | Schistocerca piceifrons 274613 | AAG|GTATGTTGTT...TAATCTTTGTAT/ATTAATATTACA...TGCAG|GTG | 0 | 1 | 72.769 |
| 179853170 | GT-AG | 0 | 0.0001853082276248 | 10548 | rna-XM_047265991.1 32210510 | 28 | 657813356 | 657823903 | Schistocerca piceifrons 274613 | AAG|GTTTGTTTAT...TTGATTTTAGCT/ATTTTGTTAATT...AACAG|GTT | 2 | 1 | 74.881 |
| 179853171 | GT-AG | 0 | 1.000000099473604e-05 | 13370 | rna-XM_047265991.1 32210510 | 29 | 657799777 | 657813146 | Schistocerca piceifrons 274613 | ATG|GTAAAGATAG...CTATTTTTGACT/CTATTTTTGACT...CTCAG|GTG | 1 | 1 | 78.101 |
| 179853172 | GT-AG | 0 | 1.000000099473604e-05 | 822 | rna-XM_047265991.1 32210510 | 30 | 657798656 | 657799477 | Schistocerca piceifrons 274613 | AAG|GTGAGCTTAA...AATATTATAATT/AATATTATAATT...CCCAG|TGT | 0 | 1 | 82.709 |
| 179853173 | GT-AG | 0 | 1.000000099473604e-05 | 457 | rna-XM_047265991.1 32210510 | 31 | 657797991 | 657798447 | Schistocerca piceifrons 274613 | TGG|GTGAGTTTTG...TTTGTTTTGAAC/TTTGTTTTGAAC...TTCAG|GAT | 1 | 1 | 85.915 |
| 179853174 | GT-AG | 0 | 0.0004608171157301 | 4356 | rna-XM_047265991.1 32210510 | 32 | 657793330 | 657797685 | Schistocerca piceifrons 274613 | GCT|GTAAGTCTGC...TAATCTTTAATA/TAATCTTTAATA...TGTAG|GTT | 0 | 1 | 90.615 |
| 179853175 | GT-AG | 0 | 0.0051932601429796 | 10660 | rna-XM_047265991.1 32210510 | 33 | 657782430 | 657793089 | Schistocerca piceifrons 274613 | ATG|GTATGTTCTT...GTTTTCTTTGTG/TGTTTTTTCAAA...TTCAG|CAA | 0 | 1 | 94.313 |
| 179853176 | GT-AG | 0 | 1.000000099473604e-05 | 5764 | rna-XM_047265991.1 32210510 | 34 | 657776462 | 657782225 | Schistocerca piceifrons 274613 | CAG|GTTAGTTACA...TGTTTCTTTTTT/AGAATATTCATT...TTCAG|GCA | 0 | 1 | 97.457 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);