introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
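The column descriptions above can be mirrored in a small typed record. The following is a minimal sketch, not part of the database itself: the `Intron` class and the `intron_from_row` helper are hypothetical names introduced here for illustration, and the sketch assumes rows arrive as 14-element tuples in the column order listed above. The 0/1 flags become booleans, and `phase` stays optional because it is null outside the coding sequence.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Intron:
    """Hypothetical typed view of one row of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: bool          # stored as INTEGER 1/0
    score: float            # probability of being minor, 0-100 (%)
    length: int             # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]    # 0, 1, 2, or None outside the coding sequence
    in_cds: bool            # stored as INTEGER 1/0
    relative_position: float


def intron_from_row(row: Sequence) -> Intron:
    """Convert a 14-column tuple (in table column order) to an Intron."""
    (id_, pair, is_minor, score, length, transcript_id, ordinal_index,
     start, end, taxonomy_id, motifs, phase, in_cds, rel_pos) = row
    return Intron(
        id=id_,
        dinucleotide_pair=pair,
        is_minor=bool(is_minor),
        score=score,
        length=length,
        transcript_id=transcript_id,
        ordinal_index=ordinal_index,
        start=start,
        end=end,
        taxonomy_id=taxonomy_id,
        scored_motifs=motifs,
        phase=phase,
        in_cds=bool(in_cds),
        relative_position=rel_pos,
    )


# First intron of transcript 32191441, using the raw integer foreign keys.
EXAMPLE_ROW = (
    179734159, "GT-AG", 0, 1.000000099473604e-05, 53211, 32191441, 1,
    381432901, 381486111, 7009,
    "CAT|GTAAGAACGC...ATTTACTTGAAT/CTGTCATTTACT...AACAG|GTG", 1, 1, 0.882,
)
intron = intron_from_row(EXAMPLE_ROW)
```

Keeping the flags as booleans and `phase` as `Optional[int]` makes the null case explicit for downstream code, at the cost of one conversion step per row.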
25 rows where transcript_id = 32191441
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179734159 | GT-AG | 0 | 1.000000099473604e-05 | 53211 | rna-XM_047136226.1 32191441 | 1 | 381432901 | 381486111 | Schistocerca americana 7009 | CAT\|GTAAGAACGC...ATTTACTTGAAT/CTGTCATTTACT...AACAG\|GTG | 1 | 1 | 0.882 |
| 179734160 | GT-AG | 0 | 0.0009803174454871 | 9985 | rna-XM_047136226.1 32191441 | 2 | 381486297 | 381496281 | Schistocerca americana 7009 | GAG\|GTATGTGTTT...ATTTTTTTACTA/TATTTTTTTACT...TGTAG\|ATA | 0 | 1 | 4.43 |
| 179734161 | GT-AG | 0 | 8.249440146171499e-05 | 19706 | rna-XM_047136226.1 32191441 | 3 | 381496439 | 381516144 | Schistocerca americana 7009 | ATG\|GTAAGCTGCA...TGTATTTTAAAA/TCTTCTTTCATT...TGCAG\|GTA | 1 | 1 | 7.442 |
| 179734162 | GT-AG | 0 | 1.000000099473604e-05 | 467 | rna-XM_047136226.1 32191441 | 4 | 381516386 | 381516852 | Schistocerca americana 7009 | TGG\|GTGAGAAGAC...GATTACGTACTA/AAAATGCTGACG...TACAG\|GTA | 2 | 1 | 12.064 |
| 179734163 | GT-AG | 0 | 1.000000099473604e-05 | 3983 | rna-XM_047136226.1 32191441 | 5 | 381516980 | 381520962 | Schistocerca americana 7009 | AAG\|GTGATCTTTA...TTCCTCTAAATT/TCTAAATTCAAT...TATAG\|GTA | 0 | 1 | 14.499 |
| 179734164 | GT-AG | 0 | 0.0868107222089987 | 20126 | rna-XM_047136226.1 32191441 | 6 | 381521156 | 381541281 | Schistocerca americana 7009 | ACA\|GTATGTATTT...TTTTCCTTTTTT/TGCAACTTCAGA...TTCAG\|GTG | 1 | 1 | 18.201 |
| 179734165 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_047136226.1 32191441 | 7 | 381541454 | 381541543 | Schistocerca americana 7009 | CAA\|GTAAGGAGTG...TTTTCTGTACTG/GACTTGCTTATT...TGCAG\|GGT | 2 | 1 | 21.5 |
| 179734166 | GT-AG | 0 | 2.2305233927973835e-05 | 3103 | rna-XM_047136226.1 32191441 | 8 | 381541778 | 381544880 | Schistocerca americana 7009 | ATG\|GTAAGTTCCA...GTAACTTTATTT/TGTAACTTTATT...TCCAG\|GGA | 2 | 1 | 25.988 |
| 179734167 | GT-AG | 0 | 1.000000099473604e-05 | 4767 | rna-XM_047136226.1 32191441 | 9 | 381545026 | 381549792 | Schistocerca americana 7009 | AAG\|GTGAGTTTTA...CAATTTTTGATC/GTTTATTTCATT...TCCAG\|GTA | 0 | 1 | 28.769 |
| 179734168 | GT-AG | 0 | 1.000000099473604e-05 | 4239 | rna-XM_047136226.1 32191441 | 10 | 381550041 | 381554279 | Schistocerca americana 7009 | CAG\|GTCAGTTCCA...GGCTCCTTCATC/AATGTGTTTATA...TTCAG\|CAC | 2 | 1 | 33.525 |
| 179734169 | GT-AG | 0 | 0.0003077256876114 | 73 | rna-XM_047136226.1 32191441 | 11 | 381554482 | 381554554 | Schistocerca americana 7009 | AAG\|GTATATGATA...TATCTTTTGAAA/TTTGGATTGATT...TGTAG\|ACA | 0 | 1 | 37.399 |
| 179734170 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_047136226.1 32191441 | 12 | 381554668 | 381554762 | Schistocerca americana 7009 | AAG\|GTAAGAATAA...TATTTTTTATTT/TTATTTTTTATT...TACAG\|TGT | 2 | 1 | 39.567 |
| 179734171 | GT-AG | 0 | 1.000000099473604e-05 | 44690 | rna-XM_047136226.1 32191441 | 13 | 381554875 | 381599564 | Schistocerca americana 7009 | AAG\|GTAATACATT...CTGTTTTTATTT/TCTGTTTTTATT...CACAG\|GAA | 0 | 1 | 41.715 |
| 179734172 | GT-AG | 0 | 0.0004881119383701 | 3937 | rna-XM_047136226.1 32191441 | 14 | 381600870 | 381604806 | Schistocerca americana 7009 | AAG\|GTATGTTGCT...TGTTTGTTATTT/TTGTTATTCATC...TTCAG\|GTC | 0 | 1 | 66.743 |
| 179734173 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_047136226.1 32191441 | 15 | 381604993 | 381605075 | Schistocerca americana 7009 | GAG\|GTAATATTTT...ACGCCATTGATG/GATGTTTTCATT...TGCAG\|GTG | 0 | 1 | 70.311 |
| 179734174 | GT-AG | 0 | 1.000000099473604e-05 | 18022 | rna-XM_047136226.1 32191441 | 16 | 381605169 | 381623190 | Schistocerca americana 7009 | CAA\|GTAAGAGCAA...TAATTATTAAAT/TAATTATTAAAT...TCCAG\|GTT | 0 | 1 | 72.094 |
| 179734175 | GT-AG | 0 | 1.000000099473604e-05 | 627 | rna-XM_047136226.1 32191441 | 17 | 381623401 | 381624027 | Schistocerca americana 7009 | GAG\|GTGAGCCAAT...GGCTTCTAAATG/AATGCTTTCAAA...ATTAG\|GTT | 0 | 1 | 76.122 |
| 179734176 | GT-AG | 0 | 0.0002252727147481 | 8010 | rna-XM_047136226.1 32191441 | 18 | 381624181 | 381632190 | Schistocerca americana 7009 | GTG\|GTAAGTTTGG...GTTTCTTTATAT/CTGGTATTTATT...AACAG\|AGA | 0 | 1 | 79.056 |
| 179734177 | GT-AG | 0 | 0.0026573682597647 | 1662 | rna-XM_047136226.1 32191441 | 19 | 381632309 | 381633970 | Schistocerca americana 7009 | TAG\|GTAACTCTAA...TGAATTTTGACA/TGAATTTTGACA...TGCAG\|ATT | 1 | 1 | 81.32 |
| 179734178 | GC-AG | 0 | 1.000000099473604e-05 | 3898 | rna-XM_047136226.1 32191441 | 20 | 381634111 | 381638008 | Schistocerca americana 7009 | CAG\|GCAAGACTTC...AATTTGTTAATT/AATTTGTTAATT...TTTAG\|GGT | 0 | 1 | 84.005 |
| 179734179 | GT-AG | 0 | 0.0043626147720938 | 954 | rna-XM_047136226.1 32191441 | 21 | 381638136 | 381639089 | Schistocerca americana 7009 | TTG\|GTATGTTATT...TTGTCCATGAAC/GTCAAGATGATT...TTCAG\|GGG | 1 | 1 | 86.44 |
| 179734180 | GT-AG | 0 | 1.000000099473604e-05 | 1523 | rna-XM_047136226.1 32191441 | 22 | 381639158 | 381640680 | Schistocerca americana 7009 | CAG\|GTACAGAATA...GCTGTCATAATA/TTGTGTCTTATG...TCCAG\|GTG | 0 | 1 | 87.745 |
| 179734181 | GT-AG | 0 | 1.000000099473604e-05 | 8817 | rna-XM_047136226.1 32191441 | 23 | 381640855 | 381649671 | Schistocerca americana 7009 | AAG\|GTTAGTATAT...TCTTCCGTAACA/TAAATTTTTATT...TTCAG\|GGA | 0 | 1 | 91.082 |
| 179734182 | GT-AG | 0 | 2.933664293928149e-05 | 19450 | rna-XM_047136226.1 32191441 | 24 | 381649833 | 381669282 | Schistocerca americana 7009 | CAA\|GTAAGCCAAA...TGTACTTTGATT/TGTACTTTGATT...TGCAG\|GGC | 2 | 1 | 94.17 |
| 179734183 | GT-AG | 0 | 0.0001933844091229 | 15231 | rna-XM_047136226.1 32191441 | 25 | 381669427 | 381684657 | Schistocerca americana 7009 | AAG\|GTAAATTTTT...TTAATTTTAATC/TTAATTTTAATC...TCAAG\|GGA | 2 | 1 | 96.931 |
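In the rows listed above, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. This is an observation from the listed rows, not a documented guarantee. The sketch below shows one way to check it with Python's standard `sqlite3` module. It assumes a trimmed, in-memory copy of the table holding only three of the rows above (introns 1, 2, and 7) and only the columns needed for the check.

```python
import sqlite3

# (id, length, transcript_id, ordinal_index, start, end) copied from the table.
ROWS = [
    (179734159, 53211, 32191441, 1, 381432901, 381486111),
    (179734160, 9985, 32191441, 2, 381486297, 381496281),
    (179734165, 90, 32191441, 7, 381541454, 381541543),
]

conn = sqlite3.connect(":memory:")
# Trimmed stand-in for the real introns table (assumption: subset of columns).
conn.execute(
    'CREATE TABLE introns ('
    ' id INTEGER PRIMARY KEY, length INTEGER, transcript_id INTEGER,'
    ' ordinal_index INTEGER, "start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", ROWS)

# Same filter as the page (transcript_id = 32191441), keeping only rows
# whose stored length disagrees with the inclusive-coordinate formula.
mismatches = conn.execute(
    'SELECT id FROM introns'
    ' WHERE transcript_id = ? AND length != "end" - "start" + 1'
    ' ORDER BY ordinal_index',
    (32191441,),
).fetchall()

# Number of sample rows matched by the transcript filter.
row_count = conn.execute(
    "SELECT COUNT(*) FROM introns WHERE transcript_id = ?", (32191441,)
).fetchone()[0]
```

An empty `mismatches` list means every sampled row is consistent with inclusive coordinates. The column names `start` and `end` are double-quoted because `END` is an SQL keyword.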
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
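The index on `transcript_id` is what should make per-transcript lookups, such as the 25-row filter on this page, cheap. The sketch below is one way to confirm that with `EXPLAIN QUERY PLAN`. It assumes an empty in-memory database built from a copy of the schema above with the foreign key clauses dropped for brevity and only the one relevant index created. The exact wording of the plan text varies between SQLite versions, so the check looks only for the index name.

```python
import sqlite3

# Copy of the introns schema without the foreign keys, plus one index.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would run the per-transcript filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32191441,),
).fetchall()

# The human-readable detail is the last column of each plan row.
plan_text = " ".join(str(row[-1]) for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text
```

If `uses_index` is true, the filter is served by an index search rather than a full table scan. The same approach applies to the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`).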