introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 32191410
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179733424 | GT-AG | 0 | 1.000000099473604e-05 | 381909 | rna-XM_047142002.1 32191410 | 1 | 846673056 | 847054964 | Schistocerca americana 7009 | AGG|GTGAGTACAC...TTCTGCTTGTCT/CACATTCTCACA...TGCAG|GCG | 0 | 1 | 2.037 |
| 179733425 | GT-AG | 0 | 1.000000099473604e-05 | 210736 | rna-XM_047142002.1 32191410 | 2 | 846462239 | 846672974 | Schistocerca americana 7009 | AAG|GTGAGTTCTT...TAGTGTTTGATT/TAGTGTTTGATT...TGCAG|AGA | 0 | 1 | 3.346 |
| 179733426 | GT-AG | 0 | 1.000000099473604e-05 | 77320 | rna-XM_047142002.1 32191410 | 3 | 846384853 | 846462172 | Schistocerca americana 7009 | CAC|GTGAGTACCA...TTTGTCTTACAT/GTTTGTCTTACA...TGCAG|CCC | 0 | 1 | 4.413 |
| 179733427 | GC-AG | 0 | 1.000000099473604e-05 | 4074 | rna-XM_047142002.1 32191410 | 4 | 846380590 | 846384663 | Schistocerca americana 7009 | AAG|GCATGTACCT...GCTCATTTAAAT/TTACAGCTCATT...TACAG|GGG | 0 | 1 | 7.468 |
| 179733428 | GT-AG | 0 | 1.000000099473604e-05 | 145 | rna-XM_047142002.1 32191410 | 5 | 846380369 | 846380513 | Schistocerca americana 7009 | GAG|GTGAATGTTA...ACATTTTTCATT/ACATTTTTCATT...ACTAG|CAA | 1 | 1 | 8.697 |
| 179733429 | GT-AG | 0 | 1.000000099473604e-05 | 1904 | rna-XM_047142002.1 32191410 | 6 | 846378270 | 846380173 | Schistocerca americana 7009 | AAG|GTAATAAAAT...TACTTCTTTGTT/TTATTATTTACT...TAAAG|GGT | 1 | 1 | 11.849 |
| 179733430 | GT-AG | 0 | 1.000000099473604e-05 | 1935 | rna-XM_047142002.1 32191410 | 7 | 846376131 | 846378065 | Schistocerca americana 7009 | AAG|GTGAAATCTA...GTGTGCTTAAAA/CATGGTTTCATA...AACAG|ATA | 1 | 1 | 15.147 |
| 179733431 | GT-AG | 0 | 1.000000099473604e-05 | 545 | rna-XM_047142002.1 32191410 | 8 | 846375284 | 846375828 | Schistocerca americana 7009 | AAT|GTTAGTACCT...GTTTTCATATTT/CGTGTTTTCATA...TCAAG|GTA | 0 | 1 | 20.029 |
| 179733432 | GT-AG | 0 | 1.000000099473604e-05 | 1729 | rna-XM_047142002.1 32191410 | 9 | 846373414 | 846375142 | Schistocerca americana 7009 | AAG|GTGTGATATT...CCAATTTTAATT/TATGTATTTATT...TTTAG|GAA | 0 | 1 | 22.308 |
| 179733433 | GT-AG | 0 | 1.000000099473604e-05 | 1834 | rna-XM_047142002.1 32191410 | 10 | 846371418 | 846373251 | Schistocerca americana 7009 | GAT|GTAAGTAGTT...TGAGCTGTATTT/GTGTAGCTAATG...TACAG|ACA | 0 | 1 | 24.927 |
| 179733434 | GT-AG | 0 | 7.420121210998767e-05 | 7399 | rna-XM_047142002.1 32191410 | 11 | 846363843 | 846371241 | Schistocerca americana 7009 | ATT|GTAAGTTTTT...GCAAATTTAGTA/AATTTAGTAATG...TGTAG|GTT | 2 | 1 | 27.772 |
| 179733435 | GT-AG | 0 | 1.0424988560081874e-05 | 1358 | rna-XM_047142002.1 32191410 | 12 | 846362284 | 846363641 | Schistocerca americana 7009 | CAG|GTAATTTGTT...GACTTCTTAAGT/TGCAGCCTCATT...TATAG|GTC | 2 | 1 | 31.022 |
| 179733436 | GT-AG | 0 | 5.6699707148008167e-05 | 9835 | rna-XM_047142002.1 32191410 | 13 | 846352298 | 846362132 | Schistocerca americana 7009 | GGT|GTAAGTTCTA...TGTGCTGTGAAT/ACATTTTTCAAA...TTCAG|GTC | 0 | 1 | 33.463 |
| 179733437 | GT-AG | 0 | 1.000000099473604e-05 | 2298 | rna-XM_047142002.1 32191410 | 14 | 846349832 | 846352129 | Schistocerca americana 7009 | AAG|GTAAAAAGAT...AGTTCCTTAAAT/AAGTTCCTTAAA...TTCAG|GTA | 0 | 1 | 36.178 |
| 179733438 | GT-AG | 0 | 1.000000099473604e-05 | 10010 | rna-XM_047142002.1 32191410 | 15 | 846339730 | 846349739 | Schistocerca americana 7009 | AAG|GTAAGAATGC...TTATCCATACTA/GAGTAGGTTATC...TGTAG|TTA | 2 | 1 | 37.666 |
| 179733439 | GT-AG | 0 | 1.000000099473604e-05 | 396 | rna-XM_047142002.1 32191410 | 16 | 846339232 | 846339627 | Schistocerca americana 7009 | CAG|GTAAGTTCTG...GTATTTGTACTC/TTTGTACTCACA...TGCAG|ATA | 2 | 1 | 39.315 |
| 179733440 | GT-AG | 0 | 0.0020506317984343 | 9648 | rna-XM_047142002.1 32191410 | 17 | 846329347 | 846338994 | Schistocerca americana 7009 | CAG|GTATGTGTGT...TATTCTTTAATA/TATTCTTTAATA...TCCAG|AGT | 2 | 1 | 43.146 |
| 179733441 | GT-AG | 0 | 1.000000099473604e-05 | 154 | rna-XM_047142002.1 32191410 | 18 | 846328958 | 846329111 | Schistocerca americana 7009 | CAG|GTAAAGGGAA...AAATTCTGATTA/CAAATTCTGATT...CACAG|GTT | 0 | 1 | 46.945 |
| 179733442 | GT-AG | 0 | 1.000000099473604e-05 | 284 | rna-XM_047142002.1 32191410 | 19 | 846326398 | 846326681 | Schistocerca americana 7009 | AAG|GTAAGACCCA...GTTATTTTATTG/AGTTATTTTATT...CTCAG|GCT | 2 | 1 | 83.737 |
| 179733443 | GT-AG | 0 | 1.000000099473604e-05 | 1634 | rna-XM_047142002.1 32191410 | 20 | 846324661 | 846326294 | Schistocerca americana 7009 | CTT|GTAAGAATTA...TTATACTTAAAA/CATTTTGTTATT...CATAG|ATT | 0 | 1 | 85.403 |
| 179733444 | GT-AG | 0 | 1.000000099473604e-05 | 2375 | rna-XM_047142002.1 32191410 | 21 | 846322076 | 846324450 | Schistocerca americana 7009 | AGG|GTAAGACAAT...CTGATTTTAACA/TGATTTCTGATT...AACAG|GCT | 0 | 1 | 88.797 |
| 179733445 | GT-AG | 0 | 0.0050639300535349 | 81 | rna-XM_047142002.1 32191410 | 22 | 846321847 | 846321927 | Schistocerca americana 7009 | CAG|GTATGCATTA...TGTGTGTTAAAT/TGTGTGTTAAAT...TTCAG|CCC | 1 | 1 | 91.19 |
| 179733446 | GT-AG | 0 | 1.90996582281e-05 | 2174 | rna-XM_047142002.1 32191410 | 23 | 846319549 | 846321722 | Schistocerca americana 7009 | TGA|GTAAGTCCAA...TTATTCTTATTA/ATTATTCTTATT...TGCAG|GGC | 2 | 1 | 93.194 |
| 179733447 | GT-AG | 0 | 0.0006020034849562 | 1036 | rna-XM_047142002.1 32191410 | 24 | 846318360 | 846319395 | Schistocerca americana 7009 | AAG|GTACACAGTA...ATTTTTTTATTA/TATTTTTTTATT...TCCAG|GAA | 2 | 1 | 95.668 |
| 179733448 | GT-AG | 0 | 0.0031681378733158 | 18537 | rna-XM_047142002.1 32191410 | 25 | 846299672 | 846318208 | Schistocerca americana 7009 | GCT|GTAAGTTTAT...GAATTTTTAGCT/CATGTACTAATC...TTCAG|CTT | 0 | 1 | 98.109 |
| 179733449 | GT-AG | 0 | 2.318610128584496e-05 | 2977 | rna-XM_047142002.1 32191410 | 26 | 846296584 | 846299560 | Schistocerca americana 7009 | CAG|GTTTGTTGGC...GTGTTGTTGATC/GTGTTGTTGATC...CACAG|AAT | 0 | 1 | 99.903 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);