introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
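The column descriptions above map onto a typed record. The following is a minimal sketch and not part of the database's own tooling: the `Intron` class and its `from_row` helper are hypothetical names, `relative_position` is assumed to be nullable like `phase`, and the sample tuple is copied from the first row listed for transcript 27368745, keeping only the numeric transcript and taxonomy IDs.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical record mirroring the columns of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: bool
    score: float                 # probability of being minor, 0-100
    length: int                  # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]         # 0, 1, 2, or None outside the coding sequence
    in_cds: bool
    relative_position: Optional[float]  # assumed nullable for this sketch

    @classmethod
    def from_row(cls, row: tuple) -> "Intron":
        """Build an Intron from a 14-value row in table column order."""
        values = list(row)
        values[2] = bool(values[2])    # is_minor is stored as 0/1
        values[12] = bool(values[12])  # in_cds is stored as 0/1
        return cls(*values)


row = (
    152380171, "GT-AG", 0, 1.0628826405734684, 112674, 27368745, 1,
    60553861, 60666534, 42100,
    "AT|GTGAGTAGGA...ATTCTGTTAATT/ATTCTGTTAATT...TGTAG|GAT",
    2, 1, 0.042,
)
first = Intron.from_row(row)
print(first.dinucleotide_pair, first.is_minor, first.phase)
```

Converting the 0/1 flags to booleans at the boundary keeps later filtering code readable, while `phase` stays optional so that introns outside the coding sequence can be represented.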
26 rows where transcript_id = 27368745
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 152380171 | GT-AG | 0 | 1.0628826405734684 | 112674 | rna-XM_032604391.1 27368745 | 1 | 60553861 | 60666534 | Phocoena sinus 42100 | AT\|GTGAGTAGGA...ATTCTGTTAATT/ATTCTGTTAATT...TGTAG\|GAT | 2 | 1 | 0.042 |
| 152380172 | GT-AG | 0 | 2.6159523773609067e-05 | 83133 | rna-XM_032604391.1 27368745 | 2 | 60666633 | 60749765 | Phocoena sinus 42100 | AGT\|GTAAGTATGT...TGTTTTTTTTCT/AGGTAGTTAAGC...TCTAG\|TGC | 1 | 1 | 2.115 |
| 152380173 | GT-AG | 0 | 1.2207991531440448e-05 | 28513 | rna-XM_032604391.1 27368745 | 3 | 60749969 | 60778481 | Phocoena sinus 42100 | AAG\|GTAATTTGTG...GTGGATTTAACC/GTGGATTTAACC...TACAG\|CAA | 0 | 1 | 6.409 |
| 152380174 | GT-AG | 0 | 1.000000099473604e-05 | 22184 | rna-XM_032604391.1 27368745 | 4 | 60778600 | 60800783 | Phocoena sinus 42100 | ATG\|GTAAGATTCT...TATTGTTTAGTA/TATTTTCTTAAT...TTTAG\|GTG | 1 | 1 | 8.904 |
| 152380175 | GT-AG | 0 | 1.000000099473604e-05 | 12586 | rna-XM_032604391.1 27368745 | 5 | 60800863 | 60813448 | Phocoena sinus 42100 | TAA\|GTGAGTATGA...AAACCTTTAACT/AAACCTTTAACT...TCTAG\|ACT | 2 | 1 | 10.575 |
| 152380176 | GT-AG | 0 | 1.000000099473604e-05 | 3820 | rna-XM_032604391.1 27368745 | 6 | 60813539 | 60817358 | Phocoena sinus 42100 | AAG\|GTAAGAATAA...CAATTTTTAAAT/TTTAAATTTATT...TACAG\|ACT | 2 | 1 | 12.479 |
| 152380177 | GT-AG | 0 | 1.000000099473604e-05 | 74459 | rna-XM_032604391.1 27368745 | 7 | 60817416 | 60891874 | Phocoena sinus 42100 | AAA\|GTAAGTGGCT...ATATTTTTCTCC/CACTGCCTCACA...TATAG\|GTC | 2 | 1 | 13.684 |
| 152380178 | GT-AG | 0 | 1.000000099473604e-05 | 35446 | rna-XM_032604391.1 27368745 | 8 | 60891939 | 60927384 | Phocoena sinus 42100 | CTG\|GTAAGCAAAT...GTTTTCTTCTCC/AAAAGACTAATT...CCTAG\|CCT | 0 | 1 | 15.038 |
| 152380179 | GT-AG | 0 | 0.0005937516169216 | 6280 | rna-XM_032604391.1 27368745 | 9 | 60927460 | 60933739 | Phocoena sinus 42100 | GGG\|GTATGTAAGT...AGTTTCTCAACT/AAGTTTCTCAAC...TCAAG\|TCT | 0 | 1 | 16.624 |
| 152380180 | GT-AG | 0 | 1.000000099473604e-05 | 5593 | rna-XM_032604391.1 27368745 | 10 | 60933885 | 60939477 | Phocoena sinus 42100 | TAG\|GTGAGAAAGT...CTATTCTTATAT/TCTATTCTTATA...TTTAG\|GTT | 1 | 1 | 19.691 |
| 152380181 | GT-AG | 0 | 1.000000099473604e-05 | 14372 | rna-XM_032604391.1 27368745 | 11 | 60939551 | 60953922 | Phocoena sinus 42100 | AAA\|GTAAGTGACA...GACCTTTTAAAA/ATGTATTTAATT...TTCAG\|TTT | 2 | 1 | 21.235 |
| 152380182 | GT-AG | 0 | 1.000000099473604e-05 | 995 | rna-XM_032604391.1 27368745 | 12 | 60954053 | 60955047 | Phocoena sinus 42100 | GAA\|GTGAGGAATA...GATTCTTTACTT/TGATTCTTTACT...CATAG\|ATT | 0 | 1 | 23.985 |
| 152380183 | GT-AG | 0 | 0.0099246611456232 | 3547 | rna-XM_032604391.1 27368745 | 13 | 60955164 | 60958710 | Phocoena sinus 42100 | CAG\|GTATTTCTGC...CTTTTCTTATTT/ACTTTTCTTATT...TGCAG\|ATT | 2 | 1 | 26.438 |
| 152380184 | GT-AG | 0 | 1.000000099473604e-05 | 1954 | rna-XM_032604391.1 27368745 | 14 | 60958781 | 60960734 | Phocoena sinus 42100 | CAG\|GTAAAAATTT...TATTCTTTATAC/ATTAATCTAATT...TCCAG\|TCC | 0 | 1 | 27.919 |
| 152380185 | GT-AG | 0 | 6.0479848347808047e-05 | 2318 | rna-XM_032604391.1 27368745 | 15 | 60960835 | 60963152 | Phocoena sinus 42100 | AAG\|GTAAACAGTC...TGTTTTTTAAAA/ATGTTTTTTAAA...TGTAG\|ATT | 1 | 1 | 30.034 |
| 152380186 | GT-AG | 0 | 1.000000099473604e-05 | 2077 | rna-XM_032604391.1 27368745 | 16 | 60963278 | 60965354 | Phocoena sinus 42100 | AAG\|GTGAACCTTT...GATTCCTTCTCC/TTTTGCTACAGT...CACAG\|GAT | 0 | 1 | 32.678 |
| 152380187 | GT-AG | 0 | 1.2169147027683465e-05 | 5367 | rna-XM_032604391.1 27368745 | 17 | 60965604 | 60970970 | Phocoena sinus 42100 | CAG\|GTAGGCCTAG...TTCTTTTTACAA/TTTCTTTTTACA...TCTAG\|GTT | 0 | 1 | 37.944 |
| 152380188 | GT-AG | 0 | 0.0005203562919656 | 7120 | rna-XM_032604391.1 27368745 | 18 | 60971172 | 60978291 | Phocoena sinus 42100 | GAG\|GTATTTCAAG...CTTTCCTTGTCC/TAAATGTTTACT...CTCAG\|GAT | 0 | 1 | 42.195 |
| 152380189 | GT-AG | 0 | 8.665204907043814e-05 | 212 | rna-XM_032604391.1 27368745 | 19 | 60978404 | 60978615 | Phocoena sinus 42100 | AAG\|GTAACATTGT...TTATACTTGTCA/TCAATGTTCACA...TTCAG\|AAT | 1 | 1 | 44.564 |
| 152380190 | GT-AG | 0 | 1.531204723216895e-05 | 1679 | rna-XM_032604391.1 27368745 | 20 | 60978797 | 60980475 | Phocoena sinus 42100 | CAG\|GTGTGTTTAG...TCTTCCTCATTT/ATCTTCCTCATT...CACAG\|GAT | 2 | 1 | 48.393 |
| 152380191 | GT-AG | 0 | 0.0016976922070307 | 3870 | rna-XM_032604391.1 27368745 | 21 | 60982157 | 60986026 | Phocoena sinus 42100 | AGG\|GTATGTTTGG...TGTCACATGATT/CATGTGTTCACA...CTCAG\|AAT | 0 | 1 | 83.947 |
| 152380192 | GT-AG | 0 | 1.000000099473604e-05 | 9404 | rna-XM_032604391.1 27368745 | 22 | 60986168 | 60995571 | Phocoena sinus 42100 | AAA\|GTAAGTATGG...AATGCCATACCA/TATTACATCACT...AATAG\|ACC | 0 | 1 | 86.929 |
| 152380193 | GT-AG | 0 | 0.0011356374344735 | 19147 | rna-XM_032604391.1 27368745 | 23 | 60995692 | 61014838 | Phocoena sinus 42100 | AAG\|GTAACCAATT...ATCCTTTCAACA/AATCTTCTGACA...CACAG\|GCA | 0 | 1 | 89.467 |
| 152380194 | GT-AG | 0 | 1.000000099473604e-05 | 21582 | rna-XM_032604391.1 27368745 | 24 | 61015061 | 61036642 | Phocoena sinus 42100 | CAG\|GTGAGTAGAG...TAGCTTTTATAT/ATGTATTTGATT...CATAG\|TTT | 0 | 1 | 94.162 |
| 152380195 | GT-AG | 0 | 1.000000099473604e-05 | 17390 | rna-XM_032604391.1 27368745 | 25 | 61036736 | 61054125 | Phocoena sinus 42100 | AAG\|GTAAGAACTG...TAGTTCTTGAAA/TTACTTCTGACT...TATAG\|GGT | 0 | 1 | 96.129 |
| 152380196 | GT-AG | 0 | 1.000000099473604e-05 | 12976 | rna-XM_032604391.1 27368745 | 26 | 61054201 | 61067176 | Phocoena sinus 42100 | CAG\|GTAAGACATA...CCTGTTTTATTT/CTTACATTGATT...TACAG\|GCA | 0 | 1 | 97.716 |
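The coordinates in these rows can be cross-checked against the `length` and `phase` columns. The sketch below uses values copied from the first three rows and assumes that `start` and `end` are inclusive, and that coordinates increase with `ordinal_index`, as they do for this transcript. The helper names are illustrative only. The phase check relies on the general rule that the phase of the next intron equals the current phase plus the length of the intervening coding exon, modulo 3.

```python
# (ordinal_index, start, end, length, phase) copied from the first three rows
rows = [
    (1, 60553861, 60666534, 112674, 2),
    (2, 60666633, 60749765, 83133, 1),
    (3, 60749969, 60778481, 28513, 0),
]


def intron_length(start: int, end: int) -> int:
    """Length in base pairs, treating both coordinates as inclusive."""
    return end - start + 1


def exon_lengths(rows):
    """Sizes of the exons lying between consecutive introns."""
    ordered = sorted(rows)
    return [nxt[1] - prev[2] - 1 for prev, nxt in zip(ordered, ordered[1:])]


def phases_consistent(rows) -> bool:
    """Check next_phase == (phase + exon_length) % 3 for each adjacent pair."""
    ordered = sorted(rows)
    gaps = exon_lengths(ordered)
    return all(
        (prev[4] + gap) % 3 == nxt[4]
        for prev, nxt, gap in zip(ordered, ordered[1:], gaps)
    )


lengths_match = all(intron_length(s, e) == n for _, s, e, n, _ in rows)
print(lengths_match)            # True
print(exon_lengths(rows))       # [98, 203]
print(phases_consistent(rows))  # True
```

For a transcript on the reverse strand the ordering and phase arithmetic would likely need adjusting, so treat this as a check for the rows shown here rather than a general validator.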
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
  ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
  ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
  ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
  ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
  ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
  ON [introns] ([in_cds]);
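A filter such as `transcript_id = 27368745` can be served by `idx_introns_transcript_id`. The sketch below illustrates this with Python's standard `sqlite3` module on an in-memory database. It assumes a cut-down version of the table (four columns only), two rows copied from the listing, and one made-up row for a different transcript; it is not a copy of the real database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    ordinal_index INTEGER,
    length INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (152380171, 27368745, 1, 112674),
        (152380172, 27368745, 2, 83133),
        (1, 999, 1, 100),  # made-up row for a different transcript
    ],
)

query = (
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
rows = conn.execute(query, (27368745,)).fetchall()

# The last column of each EXPLAIN QUERY PLAN row is the human-readable detail.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (27368745,)).fetchall()
plan_text = " ".join(str(step[-1]) for step in plan)

print(rows)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so it is safer to look for the index name in the output than to compare the whole string.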