introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
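The column descriptions above can be mirrored in a typed record. The following is a minimal sketch, not part of the database itself: the `Intron` class and `intron_from_row` helper are hypothetical names, and treating the 0/1 flags as booleans and `phase` as an optional integer is an assumption based only on the descriptions. The sample row is the first row of the result table on this page, with the bare foreign key values for `transcript_id` and `taxonomy_id`.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Intron:  # hypothetical name; one field per documented column
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 1/0
    score: float              # probability of being minor, 0-100%
    length: int               # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # 0, 1, 2, or None outside the coding sequence
    in_cds: bool              # stored as INTEGER 1/0
    relative_position: float


def intron_from_row(row: Tuple) -> Intron:
    """Convert a raw 14-column row into an Intron, checking the phase value."""
    (id_, pair, is_minor, score, length, transcript_id, ordinal_index,
     start, end, taxonomy_id, scored_motifs, phase, in_cds, rel_pos) = row
    if phase not in (None, 0, 1, 2):
        raise ValueError(f"unexpected phase: {phase!r}")
    return Intron(id_, pair, bool(is_minor), float(score), length,
                  transcript_id, ordinal_index, start, end, taxonomy_id,
                  scored_motifs, phase, bool(in_cds), float(rel_pos))


# First row of the result table on this page.
row = (179854343, "GT-AG", 0, 6.30133032470261e-05, 17433, 32210566, 1,
       43056183, 43073615, 274613,
       "ATC|GTAACAATAG...TCGTCCTCTTCG/GATGCGCTAAGT...TGTAG|GGT", 0, 1, 0.998)
intron = intron_from_row(row)
print(intron.is_minor, intron.in_cds, intron.phase)  # False True 0
```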
25 rows where transcript_id = 32210566
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179854343 | GT-AG | 0 | 6.30133032470261e-05 | 17433 | rna-XM_047250724.1 32210566 | 1 | 43056183 | 43073615 | Schistocerca piceifrons 274613 | ATC\|GTAACAATAG...TCGTCCTCTTCG/GATGCGCTAAGT...TGTAG\|GGT | 0 | 1 | 0.998 |
| 179854344 | GC-AG | 0 | 1.000000099473604e-05 | 21322 | rna-XM_047250724.1 32210566 | 2 | 43073828 | 43095149 | Schistocerca piceifrons 274613 | GCG\|GCGCGCTGAC...GGACTGGTGATG/GGACTGGTGATG...TGCAG\|GTG | 2 | 1 | 5.407 |
| 179854345 | GT-AG | 0 | 1.000000099473604e-05 | 20859 | rna-XM_047250724.1 32210566 | 3 | 43095383 | 43116241 | Schistocerca piceifrons 274613 | GGG\|GTGAGTCCGC...CTAGTTTTGAAG/CTAGTTTTGAAG...TGCAG\|GAG | 1 | 1 | 10.252 |
| 179854346 | GT-AG | 0 | 2.4243819312196024e-05 | 27777 | rna-XM_047250724.1 32210566 | 4 | 43116369 | 43144145 | Schistocerca piceifrons 274613 | TGC\|GTAAGTACCG...TTTACCTAATCT/AAATTGCTCATT...TTCAG\|GTT | 2 | 1 | 12.892 |
| 179854347 | GT-AG | 0 | 1.000000099473604e-05 | 4892 | rna-XM_047250724.1 32210566 | 5 | 43144391 | 43149282 | Schistocerca piceifrons 274613 | CAG\|GTAAATATCT...AAGTTCTTCATT/AAGTTCTTCATT...TGCAG\|ATT | 1 | 1 | 17.987 |
| 179854348 | GT-AG | 0 | 1.000000099473604e-05 | 3638 | rna-XM_047250724.1 32210566 | 6 | 43149427 | 43153064 | Schistocerca piceifrons 274613 | CAG\|GTAGGGATCC...AATGACTTATCT/AAAATGTTCACT...TAAAG\|GTT | 1 | 1 | 20.981 |
| 179854349 | GT-AG | 0 | 1.000000099473604e-05 | 6542 | rna-XM_047250724.1 32210566 | 7 | 43153233 | 43159774 | Schistocerca piceifrons 274613 | TGG\|GTGAGTGATA...TATACCTGAAAA/TAATAATTTATA...ATTAG\|GAC | 1 | 1 | 24.475 |
| 179854350 | GT-AG | 0 | 1.098653960829836e-05 | 4950 | rna-XM_047250724.1 32210566 | 8 | 43159968 | 43164917 | Schistocerca piceifrons 274613 | CAA\|GTAAGTTAGG...TTAAGCTTAAAG/ATTAAGCTTAAA...CACAG\|GTG | 2 | 1 | 28.488 |
| 179854351 | GT-AG | 0 | 1.000000099473604e-05 | 483 | rna-XM_047250724.1 32210566 | 9 | 43165135 | 43165617 | Schistocerca piceifrons 274613 | CAG\|GTTAGAATGA...ATTTTTTTGACT/ATTTTTTTGACT...TGCAG\|GAC | 0 | 1 | 33.001 |
| 179854352 | GT-AG | 0 | 1.000000099473604e-05 | 177 | rna-XM_047250724.1 32210566 | 10 | 43165774 | 43165950 | Schistocerca piceifrons 274613 | GAG\|GTTAGACATA...CGTCACTTAATT/ATTGTGTTTACC...TACAG\|TCA | 0 | 1 | 36.245 |
| 179854353 | GT-AG | 0 | 0.00024485400269 | 86 | rna-XM_047250724.1 32210566 | 11 | 43166129 | 43166214 | Schistocerca piceifrons 274613 | AAG\|GTCCACTTTT...CATTGCTTATTA/TCATTGCTTATT...GATAG\|GTC | 1 | 1 | 39.946 |
| 179854354 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_047250724.1 32210566 | 12 | 43166384 | 43166499 | Schistocerca piceifrons 274613 | CAG\|GTTAATTTAC...CTGATTTTCACA/CTGATTTTCACA...CACAG\|AAA | 2 | 1 | 43.46 |
| 179854355 | GT-AG | 0 | 0.000306252598798 | 90 | rna-XM_047250724.1 32210566 | 13 | 43166670 | 43166759 | Schistocerca piceifrons 274613 | AGG\|GTAACTACAT...AACTTCTTTGCT/ATTATGCTAAGA...ACTAG\|AAC | 1 | 1 | 46.995 |
| 179854356 | GT-AG | 0 | 1.5413847687094387e-05 | 85 | rna-XM_047250724.1 32210566 | 14 | 43166869 | 43166953 | Schistocerca piceifrons 274613 | CCT\|GTAAGTAGTA...GTTATCTAAACC/TAGTCACTCATA...TATAG\|GAA | 2 | 1 | 49.262 |
| 179854357 | GT-AG | 0 | 1.000000099473604e-05 | 146 | rna-XM_047250724.1 32210566 | 15 | 43167127 | 43167272 | Schistocerca piceifrons 274613 | AAA\|GTGAGTAAAA...GCATTCTTAATC/GCATTCTTAATC...TCAAG\|GGC | 1 | 1 | 52.859 |
| 179854358 | GT-AG | 0 | 1.000000099473604e-05 | 15713 | rna-XM_047250724.1 32210566 | 16 | 43167412 | 43183124 | Schistocerca piceifrons 274613 | AGG\|GTAAGAAGTA...CATTTCTGAAAC/ATTTTTGTGAAT...TACAG\|ACA | 2 | 1 | 55.75 |
| 179854359 | GT-AG | 0 | 0.0001691811145772 | 1552 | rna-XM_047250724.1 32210566 | 17 | 43183385 | 43184936 | Schistocerca piceifrons 274613 | ATG\|GTATTTAAAG...CTGTTCTGAAAG/ACTGTTCTGAAA...TTCAG\|GTG | 1 | 1 | 61.156 |
| 179854360 | GT-AG | 0 | 1.000000099473604e-05 | 7354 | rna-XM_047250724.1 32210566 | 18 | 43185111 | 43192464 | Schistocerca piceifrons 274613 | AAG\|GTGAGTGCCT...GTTATTGTAATG/GTTATTGTAATG...TCAAG\|GAT | 1 | 1 | 64.774 |
| 179854361 | GT-AG | 0 | 0.0052725395559103 | 84 | rna-XM_047250724.1 32210566 | 19 | 43192645 | 43192728 | Schistocerca piceifrons 274613 | TGG\|GTATTTATCT...ATTCTCTCAATA/TATTCTCTCAAT...AACAG\|GGT | 1 | 1 | 68.517 |
| 179854362 | GT-AG | 0 | 1.000000099473604e-05 | 190 | rna-XM_047250724.1 32210566 | 20 | 43192936 | 43193125 | Schistocerca piceifrons 274613 | CAG\|GTGATACTTC...ATGATGTTAAAT/CAATGGCTAATT...AACAG\|GTG | 1 | 1 | 72.822 |
| 179854363 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_047250724.1 32210566 | 21 | 43193381 | 43193499 | Schistocerca piceifrons 274613 | AAG\|GTAGGAAAAC...GATAACTTAACG/TCATATATCACC...TTCAG\|GTT | 1 | 1 | 78.124 |
| 179854364 | GT-AG | 0 | 0.0001185638500009 | 111 | rna-XM_047250724.1 32210566 | 22 | 43193827 | 43193937 | Schistocerca piceifrons 274613 | GAG\|GTACACACAA...AAACTCTCAACA/TGCTTTCTAATA...TTCAG\|GTA | 1 | 1 | 84.924 |
| 179854365 | GT-AG | 0 | 1.000000099473604e-05 | 80 | rna-XM_047250724.1 32210566 | 23 | 43194200 | 43194279 | Schistocerca piceifrons 274613 | TTG\|GTAAGTCATG...TTTTCATTAACA/GTAATTTTCATT...TCCAG\|GTA | 2 | 1 | 90.372 |
| 179854366 | GT-AG | 0 | 4.614857023465411e-05 | 95 | rna-XM_047250724.1 32210566 | 24 | 43194427 | 43194521 | Schistocerca piceifrons 274613 | CTT\|GTAAGTATGA...ATTTTCTTTGTA/TTCTTTGTAAAA...AACAG\|GAA | 2 | 1 | 93.429 |
| 179854367 | GT-AG | 0 | 0.0007542713148632 | 1866 | rna-XM_047250724.1 32210566 | 25 | 43194673 | 43196538 | Schistocerca piceifrons 274613 | GGG\|GTATGTCTGC...TTACATTTATTT/ATTACATTTATT...GACAG\|CCA | 0 | 1 | 96.569 |
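In the rows shown, `length` equals `end - start + 1`, so the coordinates appear to be inclusive on both ends. The sketch below checks that relationship for three rows copied from the table, using a cut-down in-memory stand-in for `introns` (an assumption for illustration; the real table has more columns). Note that `end` is an SQL keyword and is quoted as an identifier here.

```python
import sqlite3

# Three rows copied from the table above: (id, length, start, end).
ROWS = [
    (179854343, 17433, 43056183, 43073615),
    (179854353, 86, 43166129, 43166214),
    (179854367, 1866, 43194673, 43196538),
]

conn = sqlite3.connect(":memory:")
# Cut-down stand-in for the introns table; only the columns needed here.
conn.execute(
    'CREATE TABLE introns (id INTEGER PRIMARY KEY, length INTEGER, '
    '"start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", ROWS)

# Select any row whose stored length disagrees with its coordinates.
mismatches = conn.execute(
    'SELECT id FROM introns WHERE "end" - "start" + 1 <> length'
).fetchall()
print(mismatches)  # [] for these three rows
```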
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
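The index on `transcript_id` is what should make a filter such as `transcript_id = 32210566` cheap. As a rough sketch, the snippet below builds a reduced copy of the schema in memory (three columns and one index, an assumption made to keep the example short) and asks SQLite for its query plan. The exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced stand-in for the schema above: just enough to exercise the index.
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    ordinal_index INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT id, ordinal_index FROM introns WHERE transcript_id = ?",
    (32210566,),
).fetchall()

# The fourth column of each plan row holds the human-readable detail text.
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details)
print(uses_index)
```

The same check can be repeated for the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`) by adding the matching `CREATE INDEX` statement and changing the `WHERE` clause.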