introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
22 rows where transcript_id = 12917258
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68847394 | GT-AG | 0 | 1.000000099473604e-05 | 6556 | rna-XM_035895848.1 12917258 | 1 | 2820625 | 2827180 | Egretta garzetta 188379 | ACA|GTAATGCTGT...ACTCACTTAGGA/AGTGCACTCACT...CTGAG|ATG | 1 | 1 | 8.656 |
| 68847395 | GT-AG | 0 | 1.000000099473604e-05 | 5907 | rna-XM_035895848.1 12917258 | 2 | 2814623 | 2820529 | Egretta garzetta 188379 | GAG|GTACAAGAAA...AAAGCTTGAATT/ACAAAACTCAAA...TATAG|GTT | 0 | 1 | 10.843 |
| 68847396 | GT-AG | 0 | 2.925315132115022e-05 | 2747 | rna-XM_035895848.1 12917258 | 3 | 2811750 | 2814496 | Egretta garzetta 188379 | GAG|GTATAGAGAA...TACCCTTTAGTC/TGTGCATTTACC...TTAAG|GAT | 0 | 1 | 13.743 |
| 68847397 | GT-AG | 0 | 1.000000099473604e-05 | 537 | rna-XM_035895848.1 12917258 | 4 | 2811081 | 2811617 | Egretta garzetta 188379 | AAG|GTAAGCAGTG...ATTTCCTCAACT/TATTTCCTCAAC...TTCAG|GAG | 0 | 1 | 16.782 |
| 68847398 | GT-AG | 0 | 1.000000099473604e-05 | 3771 | rna-XM_035895848.1 12917258 | 5 | 2807145 | 2810915 | Egretta garzetta 188379 | AAG|GTAAGCCTAA...TTGTTCTCACTT/GTTGTTCTCACT...CCTAG|GTA | 0 | 1 | 20.58 |
| 68847399 | GT-AG | 0 | 1.084816294570944e-05 | 505 | rna-XM_035895848.1 12917258 | 6 | 2806430 | 2806934 | Egretta garzetta 188379 | CAT|GTAAGTATTG...TCCCTGTTGATT/TCCCTGTTGATT...TGCAG|GAT | 0 | 1 | 25.414 |
| 68847400 | GT-AG | 0 | 1.000000099473604e-05 | 3915 | rna-XM_035895848.1 12917258 | 7 | 2802371 | 2806285 | Egretta garzetta 188379 | GAG|GTAAGAATGA...TATTCTGTATTT/CAGATATTAAAT...CTTAG|GAA | 0 | 1 | 28.729 |
| 68847401 | GT-AG | 0 | 2.503593887280622e-05 | 109 | rna-XM_035895848.1 12917258 | 8 | 2802151 | 2802259 | Egretta garzetta 188379 | CAG|GTAATTTTCT...GTTGCTTCAATA/CTGAAACTTATT...TCTAG|TCA | 0 | 1 | 31.285 |
| 68847402 | GT-AG | 0 | 1.000000099473604e-05 | 12385 | rna-XM_035895848.1 12917258 | 9 | 2789667 | 2802051 | Egretta garzetta 188379 | AAA|GTAAGAACGT...TTGTATTTGACT/TTGTATTTGACT...CACAG|CAA | 0 | 1 | 33.564 |
| 68847403 | GT-AG | 0 | 2.788993076066321e-05 | 972 | rna-XM_035895848.1 12917258 | 10 | 2788567 | 2789538 | Egretta garzetta 188379 | TAG|GTAAACATGT...TTTCTCTCACCC/CTTTCTCTCACC...TGAAG|TGA | 2 | 1 | 36.51 |
| 68847404 | GT-AG | 0 | 1.000000099473604e-05 | 4958 | rna-XM_035895848.1 12917258 | 11 | 2782953 | 2787910 | Egretta garzetta 188379 | CAG|GTGAGGCACT...TAATTGCTGACA/TAATTGCTGACA...CTTAG|ATC | 1 | 1 | 51.611 |
| 68847405 | GT-AG | 0 | 0.0010709248061484 | 406 | rna-XM_035895848.1 12917258 | 12 | 2782465 | 2782870 | Egretta garzetta 188379 | AAG|GTATGCCACT...CTAGTGTTAATA/CTAGTGTTAATA...GCTAG|ATT | 2 | 1 | 53.499 |
| 68847406 | GT-AG | 0 | 1.000000099473604e-05 | 535 | rna-XM_035895848.1 12917258 | 13 | 2781807 | 2782341 | Egretta garzetta 188379 | ACG|GTGAGTATTG...TTATCCTTTGCT/TGCTTACTTATC...TGCAG|AGT | 2 | 1 | 56.331 |
| 68847407 | GT-AG | 0 | 1.5053755443494405e-05 | 1128 | rna-XM_035895848.1 12917258 | 14 | 2780575 | 2781702 | Egretta garzetta 188379 | GAG|GTACTTACCA...GTAACTTTCTCT/TAAATAGTAACT...GCCAG|GCA | 1 | 1 | 58.725 |
| 68847408 | GT-AG | 0 | 0.0301716207880642 | 674 | rna-XM_035895848.1 12917258 | 15 | 2779678 | 2780351 | Egretta garzetta 188379 | CAG|GTAACTTTCC...CTGCTTTTAATT/CTGCTTTTAATT...TCTAG|AGC | 2 | 1 | 63.858 |
| 68847409 | GT-AG | 0 | 1.213301787478243e-05 | 327 | rna-XM_035895848.1 12917258 | 16 | 2779231 | 2779557 | Egretta garzetta 188379 | AAA|GTAAGTGATT...AATGCTTTAATA/AATGCTTTAATA...TTTAG|ATG | 2 | 1 | 66.621 |
| 68847410 | GT-AG | 0 | 1.000000099473604e-05 | 343 | rna-XM_035895848.1 12917258 | 17 | 2778725 | 2779067 | Egretta garzetta 188379 | GAG|GTACAGTGAA...TACACCTTCACC/ATGTGTTTCATC...TTTAG|CAA | 0 | 1 | 70.373 |
| 68847411 | GT-AG | 0 | 1.000000099473604e-05 | 1350 | rna-XM_035895848.1 12917258 | 18 | 2777257 | 2778606 | Egretta garzetta 188379 | TTG|GTAAGTATGT...TTCTCCTTGTTT/TTGTTTTGTACT...TCTAG|GGA | 1 | 1 | 73.089 |
| 68847412 | GT-AG | 0 | 1.000000099473604e-05 | 968 | rna-XM_035895848.1 12917258 | 19 | 2776116 | 2777083 | Egretta garzetta 188379 | AGG|GTAAGTGGCA...AGTAGCTTAGCT/GCTTAGCTGACA...TTCAG|AGA | 0 | 1 | 77.072 |
| 68847413 | GT-AG | 0 | 1.000000099473604e-05 | 138 | rna-XM_035895848.1 12917258 | 20 | 2775861 | 2775998 | Egretta garzetta 188379 | TTT|GTTAGTATTC...GGCCTTTTGATT/TTTTGTTTGATA...TTCAG|CCC | 0 | 1 | 79.765 |
| 68847414 | GT-AG | 0 | 1.7615788977604386e-05 | 1074 | rna-XM_035895848.1 12917258 | 21 | 2774522 | 2775595 | Egretta garzetta 188379 | CAA|GTAGGTGTTA...TTGTCATTATCA/TTTTTTTTTATG...ACTAG|GAT | 1 | 1 | 85.866 |
| 68847415 | GT-AG | 0 | 1.000000099473604e-05 | 1355 | rna-XM_035895848.1 12917258 | 22 | 2773048 | 2774402 | Egretta garzetta 188379 | AAA|GTAAGGAAGG...TGTTCTTTACTG/CTGTTCTTTACT...CTCAG|ATT | 0 | 1 | 88.605 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);