introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
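The relationship between `start`, `end`, and `length` is not stated in the column descriptions. Judging only from the rows listed on this page, the coordinates appear to be 1-based and inclusive, so that `length = end - start + 1`. The sketch below checks that inference against three rows copied from the table for transcript 15198749; the helper name `intron_length` is hypothetical and not part of the database.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# (start, end, length) copied from the first three rows shown for
# transcript_id = 15198749 on this page.
listed_rows = [
    (25166373, 25221505, 55133),
    (25162356, 25166261, 3906),
    (25160164, 25162274, 2111),
]

# Collect any row where the assumed formula disagrees with the stored length.
mismatches = [
    (start, end, length)
    for start, end, length in listed_rows
    if intron_length(start, end) != length
]
```

An empty `mismatches` list supports the inclusive-coordinate reading for these rows, though it does not prove the convention for the whole database.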
20 rows where transcript_id = 15198749
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82238704 | GT-AG | 0 | 1.000000099473604e-05 | 55133 | rna-XM_005415561.3 15198749 | 1 | 25166373 | 25221505 | Geospiza fortis 48883 | CTG\|GTAAGTGACT...AATTCTTTAATT/CTCTTTTTTATT...TAAAG\|CTG | 1 | 1 | 8.381 |
| 82238705 | GT-AG | 0 | 0.0276922271301791 | 3906 | rna-XM_005415561.3 15198749 | 2 | 25162356 | 25166261 | Geospiza fortis 48883 | CAG\|GTATGCTATT...ACAGTTTTAACT/ACAGTTTTAACT...TAAAG\|GTG | 1 | 1 | 11.932 |
| 82238706 | GT-AG | 0 | 0.0114038157989954 | 2111 | rna-XM_005415561.3 15198749 | 3 | 25160164 | 25162274 | Geospiza fortis 48883 | CTG\|GTATTTTCAC...AGTGTTTTAATT/AGTGTTTTAATT...TGCAG\|GTT | 1 | 1 | 14.523 |
| 82238707 | GT-AG | 0 | 5.159843286048628e-05 | 1976 | rna-XM_005415561.3 15198749 | 4 | 25158035 | 25160010 | Geospiza fortis 48883 | CTG\|GTAAGCTTTG...GTGGTTTCACCT/TGTGGTTTCACC...TGCAG\|GCA | 1 | 1 | 19.418 |
| 82238708 | GT-AG | 0 | 1.000000099473604e-05 | 7028 | rna-XM_005415561.3 15198749 | 5 | 25150889 | 25157916 | Geospiza fortis 48883 | TGG\|GTAAGTGTGC...AAAACTTTCATT/TTGTATTTCAAT...TACAG\|GTG | 2 | 1 | 23.193 |
| 82238709 | GT-AG | 0 | 1.000000099473604e-05 | 1575 | rna-XM_005415561.3 15198749 | 6 | 25149135 | 25150709 | Geospiza fortis 48883 | CTG\|GTGAGATATT...TTATCTTTAGAG/TTATCAATCATT...TGTAG\|GTC | 1 | 1 | 28.919 |
| 82238710 | GT-AG | 0 | 1.000000099473604e-05 | 2906 | rna-XM_005415561.3 15198749 | 7 | 25146123 | 25149028 | Geospiza fortis 48883 | AAG\|GTTGGAGTCT...TTATTTTTATTT/TTTATTTTTATT...TTTAG\|GGG | 2 | 1 | 32.31 |
| 82238711 | GT-AG | 0 | 2.8408445395151224e-05 | 6351 | rna-XM_005415561.3 15198749 | 8 | 25139647 | 25145997 | Geospiza fortis 48883 | CAG\|GTATTGCACC...ATATTTTTACTG/TATATTTTTACT...TTCAG\|CCT | 1 | 1 | 36.308 |
| 82238712 | GT-AG | 0 | 0.0118416047286966 | 4600 | rna-XM_005415561.3 15198749 | 9 | 25134931 | 25139530 | Geospiza fortis 48883 | AAG\|GTATTTTAAT...TTTCTCTTATTT/TTTTCTCTTATT...TGCAG\|ATT | 0 | 1 | 40.019 |
| 82238713 | GT-AG | 0 | 0.0004414661113977 | 1198 | rna-XM_005415561.3 15198749 | 10 | 25133630 | 25134827 | Geospiza fortis 48883 | TTG\|GTATGCAACT...TTGGCTTCAGAG/TTTGGCTTCAGA...TTTAG\|GCA | 1 | 1 | 43.314 |
| 82238714 | GT-AG | 0 | 2.684723785744825e-05 | 921 | rna-XM_005415561.3 15198749 | 11 | 25132592 | 25133512 | Geospiza fortis 48883 | AAG\|GTAAGCCTCT...CCATTATTGACA/CCATTATTGACA...ATCAG\|CTA | 1 | 1 | 47.057 |
| 82238715 | GT-AG | 0 | 1.000000099473604e-05 | 5897 | rna-XM_005415561.3 15198749 | 12 | 25126549 | 25132445 | Geospiza fortis 48883 | GAG\|GTAAAGCCTT...GTAATTTTGAAA/CTGACATTTATT...TACAG\|ATT | 0 | 1 | 51.727 |
| 82238716 | GT-AG | 0 | 1.000000099473604e-05 | 1440 | rna-XM_005415561.3 15198749 | 13 | 25124913 | 25126352 | Geospiza fortis 48883 | AAG\|GTAAGTGCTA...AATTATTTAACA/AATTATTTAACA...GGCAG\|AGG | 1 | 1 | 57.997 |
| 82238717 | GT-AG | 0 | 8.102452893259843e-05 | 2880 | rna-XM_005415561.3 15198749 | 14 | 25121907 | 25124786 | Geospiza fortis 48883 | AAG\|GTACAGTGTG...TCTCTTTTAATT/TCTCTTTTAATT...TACAG\|CTG | 1 | 1 | 62.028 |
| 82238718 | GT-AG | 0 | 1.4158156486667892e-05 | 2982 | rna-XM_005415561.3 15198749 | 15 | 25118764 | 25121745 | Geospiza fortis 48883 | GAA\|GTAAGTAACC...TTTATTTTGATT/TTTATTTTGATT...CTCAG\|GTT | 0 | 1 | 67.179 |
| 82238719 | GT-AG | 0 | 1.000000099473604e-05 | 6488 | rna-XM_005415561.3 15198749 | 16 | 25112095 | 25118582 | Geospiza fortis 48883 | CAG\|GTATGGAACA...CTCACCTGAGCT/TGTGTGCTCACC...TGTAG\|ACA | 1 | 1 | 72.969 |
| 82238720 | GT-AG | 0 | 1.000000099473604e-05 | 1378 | rna-XM_005415561.3 15198749 | 17 | 25110591 | 25111968 | Geospiza fortis 48883 | AAG\|GTGAGCAGGA...TACTTCTGAATG/CTACTTCTGAAT...CACAG\|CTG | 1 | 1 | 76.999 |
| 82238721 | GT-AG | 0 | 1.000000099473604e-05 | 4701 | rna-XM_005415561.3 15198749 | 18 | 25105762 | 25110462 | Geospiza fortis 48883 | CTG\|GTGAGTCTCT...GTAATTTTACTT/GGTAATTTTACT...CCTAG\|ACC | 0 | 1 | 81.094 |
| 82238722 | GT-AG | 0 | 1.000000099473604e-05 | 5025 | rna-XM_005415561.3 15198749 | 19 | 25100523 | 25105547 | Geospiza fortis 48883 | CAG\|GTCAGAGATC...AGCTTCTTCATC/AGCTTCTTCATC...TTTAG\|AGT | 1 | 1 | 87.94 |
| 82238723 | GT-AG | 0 | 1.000000099473604e-05 | 1650 | rna-XM_005415561.3 15198749 | 20 | 25098622 | 25100271 | Geospiza fortis 48883 | GGG\|GTAAGTAAAT...TTTCTTTTATTA/CTTTTATTAATT...TTTAG\|CCA | 0 | 1 | 95.969 |
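The listing above corresponds to a filter on `transcript_id`. A minimal sketch of an equivalent query against a local SQLite copy might look like the following. It assumes a reduced, in-memory version of the `introns` table holding only a few of the columns; the function name `introns_for_transcript` and the extra row for transcript 999 are hypothetical and exist only to show that the filter excludes other transcripts.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is kept.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "phase" INTEGER)'
)
sample = [
    # First three rows copied from the listing for transcript 15198749.
    (82238704, 15198749, 1, 25166373, 25221505, 1),
    (82238705, 15198749, 2, 25162356, 25166261, 1),
    (82238706, 15198749, 3, 25160164, 25162274, 1),
    # Hypothetical row belonging to a different transcript.
    (1, 999, 1, 100, 200, 0),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript, ordered by id as in the listing."""
    # "end" is an SQL keyword, so it has to be quoted as an identifier.
    return conn.execute(
        'SELECT id, ordinal_index, start, "end", phase FROM introns '
        "WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    ).fetchall()


rows = introns_for_transcript(conn, 15198749)
```

Passing the transcript id as a bound parameter rather than formatting it into the SQL string keeps the query safe to reuse with untrusted input.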
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
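The indexes above suggest which filters are expected to be cheap. As a rough illustration, the sketch below asks SQLite for its query plan on a cut-down copy of the table: one query filters on the indexed `transcript_id`, the other on `length`, which has no index in the schema. The reduced table and the helper `plan_details` are assumptions made for this example, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Cut-down copy of the schema: a few columns and two of the listed indexes.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "phase" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
    """
)


def plan_details(conn, sql, params=()):
    """Return the human-readable 'detail' text of each query-plan step."""
    # The detail string is the last column of every EXPLAIN QUERY PLAN row.
    return [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql, params)]


# Equality filter on an indexed column.
indexed = plan_details(
    conn, "SELECT id FROM introns WHERE transcript_id = ?", (15198749,)
)
# Range filter on a column without an index.
unindexed = plan_details(
    conn, 'SELECT id FROM introns WHERE "length" > ?', (1000,)
)
```

Here `indexed` should mention `idx_introns_transcript_id`, while `unindexed` should describe a scan of the table. On the full database the planner may choose differently once statistics are available.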