introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
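In the rows listed on this page, `length` matches `end - start + 1`, which suggests that `start` and `end` are both inclusive. The following is a minimal sketch of that check, assuming inclusive coordinates; the helper name `intron_length` is hypothetical and the sample values are copied from the first three rows of the table on this page.

```python
# (start, end, length) for the first three introns of transcript 623784,
# copied from the table on this page.
sample_rows = [
    (909204, 909277, 74),
    (909140, 909189, 50),
    (908972, 909021, 50),
]


def intron_length(start: int, end: int) -> int:
    """Length in base pairs, assuming start and end are both inclusive."""
    return end - start + 1


# Every sampled row should agree with its stored length column.
for start, end, length in sample_rows:
    assert intron_length(start, end) == length
```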
21 rows where transcript_id = 623784
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3437310 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-EDS130_LOCUS319 623784 | 1 | 909204 | 909277 | Adineta ricciae 249248 | TTG\|GTAAATGATT...TGAATTTTAATT/TGAATTTTAATT...CACAG\|GTC | 0 | 1 | 0.87 |
| 3437311 | GT-AG | 0 | 0.0001014014758202 | 50 | rna-EDS130_LOCUS319 623784 | 2 | 909140 | 909189 | Adineta ricciae 249248 | TTG\|GTAATTTTGT...CGGCTTTTAAAA/CCGATATTTATT...AACAG\|GTC | 2 | 1 | 1.32 |
| 3437312 | GT-AG | 0 | 0.000160646305057 | 50 | rna-EDS130_LOCUS319 623784 | 3 | 908972 | 909021 | Adineta ricciae 249248 | AAA\|GTAAACATAT...CACTTCTTCGCA/CTTCTTCGCAAT...TGTAG\|ACG | 0 | 1 | 5.121 |
| 3437313 | GT-AG | 0 | 6.009735178514819e-05 | 52 | rna-EDS130_LOCUS319 623784 | 4 | 908868 | 908919 | Adineta ricciae 249248 | ATT\|GTAAATAATA...CTGCACTTGATA/CTGCACTTGATA...TTTAG\|TAA | 1 | 1 | 6.795 |
| 3437314 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-EDS130_LOCUS319 623784 | 5 | 908738 | 908789 | Adineta ricciae 249248 | TTG\|GTAGGATTGC...TAATCGTTGACG/TGTTTCGTAATC...TGCAG\|GAA | 1 | 1 | 9.308 |
| 3437315 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-EDS130_LOCUS319 623784 | 6 | 908599 | 908647 | Adineta ricciae 249248 | ATG\|GTAAGAGACT...CTCTCGTTAGTT/TCGTTAGTTACC...TGTAG\|ATT | 1 | 1 | 12.206 |
| 3437316 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-EDS130_LOCUS319 623784 | 7 | 908458 | 908514 | Adineta ricciae 249248 | CAT\|GTCAGTATGA...TTAATTTCAGTT/ATTAATTTCAGT...TCTAG\|ATT | 1 | 1 | 14.911 |
| 3437317 | GT-AG | 0 | 1.0403026045918775e-05 | 53 | rna-EDS130_LOCUS319 623784 | 8 | 908271 | 908323 | Adineta ricciae 249248 | ACT\|GTTCGTAGGA...GTTTGTTTGATG/GTTTGTTTGATG...TGTAG\|TCA | 0 | 1 | 19.227 |
| 3437318 | GT-AG | 0 | 1.000000099473604e-05 | 63 | rna-EDS130_LOCUS319 623784 | 9 | 908068 | 908130 | Adineta ricciae 249248 | AAA\|GTACTGGCAA...AGAATTTTGAAT/AGAATTTTGAAT...CATAG\|GTT | 2 | 1 | 23.736 |
| 3437319 | GT-AG | 0 | 1.000000099473604e-05 | 46 | rna-EDS130_LOCUS319 623784 | 10 | 907965 | 908010 | Adineta ricciae 249248 | AAA\|GTAAATCATC...GGCTTCTCGAAT/TCTCGAATTATA...CTTAG\|GAA | 2 | 1 | 25.572 |
| 3437320 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-EDS130_LOCUS319 623784 | 11 | 907711 | 907759 | Adineta ricciae 249248 | CGA\|GTAAGATTCA...CTATTCATGAAC/ATACTATTCATG...AGTAG\|GTT | 0 | 1 | 32.174 |
| 3437321 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-EDS130_LOCUS319 623784 | 12 | 907532 | 907584 | Adineta ricciae 249248 | GAA\|GTAAGAGATT...CTTGTTTTGATA/CTTGTTTTGATA...TTTAG\|GGA | 0 | 1 | 36.232 |
| 3437322 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-EDS130_LOCUS319 623784 | 13 | 907357 | 907413 | Adineta ricciae 249248 | TCG\|GTTAGTTACT...GTTCTCTCAACC/TGTTCTCTCAAC...TCTAG\|AAC | 1 | 1 | 40.032 |
| 3437323 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-EDS130_LOCUS319 623784 | 14 | 907077 | 907135 | Adineta ricciae 249248 | GAC\|GTGAGTAAAA...TTTCTCTTATCT/GTTTCTCTTATC...TATAG\|AAA | 0 | 1 | 47.15 |
| 3437324 | GT-AG | 0 | 0.0001246713622975 | 55 | rna-EDS130_LOCUS319 623784 | 15 | 906823 | 906877 | Adineta ricciae 249248 | TTG\|GTATGTAACC...CTGTTCATATTC/TTTCTGTTCATA...CCCAG\|GTT | 1 | 1 | 53.559 |
| 3437325 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-EDS130_LOCUS319 623784 | 16 | 906515 | 906563 | Adineta ricciae 249248 | CTG\|GTAAGATAAT...ATGACGTTAATT/ATGACGTTAATT...TTTAG\|TGT | 2 | 1 | 61.9 |
| 3437326 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-EDS130_LOCUS319 623784 | 17 | 906063 | 906117 | Adineta ricciae 249248 | CGA\|GTAAGATTCA...AATACTGTGAGG/AATACTGTGAGG...TCTAG\|GAT | 0 | 1 | 74.686 |
| 3437327 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-EDS130_LOCUS319 623784 | 18 | 905850 | 905903 | Adineta ricciae 249248 | GAT\|GTAAGTCAGA...TTGTCATTGCTT/GAAATTGTCATT...TTTAG\|GCA | 0 | 1 | 79.807 |
| 3437328 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna-EDS130_LOCUS319 623784 | 19 | 905678 | 905725 | Adineta ricciae 249248 | AAG\|GTAAGAAGAC...ATAGTCAAGAAA/AAATGAGTGAAT...TTTAG\|ACG | 1 | 1 | 83.8 |
| 3437329 | GT-AG | 0 | 5.300773568809149e-05 | 45 | rna-EDS130_LOCUS319 623784 | 20 | 905445 | 905489 | Adineta ricciae 249248 | CAA\|GTAAATCTCT...AACTTCTTGCTT/ACTTTCGTAAAA...TCTAG\|ACT | 0 | 1 | 89.855 |
| 3437330 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-EDS130_LOCUS319 623784 | 21 | 905271 | 905332 | Adineta ricciae 249248 | CAG\|GTTTGATGAT...TCCGTTTTATTT/TTCCGTTTTATT...TGCAG\|GCG | 1 | 1 | 93.462 |
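In every row above, the text between the first and last `|` of `scored_motifs` begins with GT and ends with AG, matching `dinucleotide_pair`. The sketch below recovers the pair from a `scored_motifs` string on the assumption that the two `|` characters mark the exon/intron boundaries; this page does not document the format, so that reading is a guess, and the function name `terminal_dinucleotides` is hypothetical. The meaning of the middle segments (around `...` and `/`) is not stated here and is left unparsed.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the first and last two bases of the intron part, e.g. 'GT-AG'.

    Assumption: everything between the first and the last '|' is intron
    sequence (with elided stretches shown as '...').
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    intron_part = scored_motifs[first + 1:last]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# scored_motifs value of intron 3437310, copied from the table above.
example = "TTG|GTAAATGATT...TGAATTTTAATT/TGAATTTTAATT...CACAG|GTC"
print(terminal_dinucleotides(example))  # GT-AG
```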
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
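As a rough illustration of how this schema behaves, the sketch below rebuilds the table in an in-memory SQLite database with Python's standard `sqlite3` module, loads two rows copied from the table on this page, and runs the same `transcript_id = 623784` filter. It is a simplified setup, not the database's actual deployment: the foreign keys and all indexes except `idx_introns_transcript_id` are omitted, and only a subset of columns is populated.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
""")

# (id, transcript_id, ordinal_index, start, end, length), inserted out of order
# on purpose so that the ORDER BY below has something to do.
sample = [
    (3437311, 623784, 2, 909140, 909189, 50),
    (3437310, 623784, 1, 909204, 909277, 74),
]
conn.executemany(
    'INSERT INTO introns (id, transcript_id, ordinal_index, "start", "end", length) '
    "VALUES (?, ?, ?, ?, ?, ?)",
    sample,
)

# "end" is an SQL keyword, so it stays quoted in queries.
query = (
    'SELECT ordinal_index, "start", "end", length FROM introns '
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
rows = conn.execute(query, (623784,)).fetchall()

# The last field of each EXPLAIN QUERY PLAN row is a human-readable step.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (623784,)).fetchall()
uses_index = any("idx_introns_transcript_id" in step[-1] for step in plan)

print(rows)
print(uses_index)
```

The exact wording of the query plan differs between SQLite versions, so the sketch only checks whether the index name appears in it.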