introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
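The rows listed for this transcript suggest two relationships between these columns: `length` equals `end - start + 1` (1-based, inclusive coordinates), and `phase` advances by the length of the intervening exon modulo 3. The sketch below checks both against values copied from the first four rows. The helper names are hypothetical, and the phase check assumes a forward-strand transcript whose intervening exons are entirely coding; it is an observation about these rows, not a documented guarantee of the database.

```python
# (ordinal_index, start, end, length, phase) copied from the first four rows
# listed for transcript_id = 3555658.
ROWS = [
    (1, 122341035, 122341361, 327, 0),
    (2, 122341612, 122344938, 3327, 1),
    (3, 122345067, 122347548, 2482, 0),
    (4, 122347709, 122351565, 3857, 1),
]


def intron_length(start, end):
    """Length under the assumption of 1-based, inclusive coordinates."""
    return end - start + 1


def exon_lengths(rows):
    """Lengths of the exons between consecutive introns (assumes forward strand)."""
    return [nxt[1] - cur[2] - 1 for cur, nxt in zip(rows, rows[1:])]


def predicted_phases(rows):
    """Propagate phase across coding exons: next = (phase + exon_len) % 3."""
    phases = [rows[0][4]]
    for exon_len in exon_lengths(rows):
        phases.append((phases[-1] + exon_len) % 3)
    return phases


# True when every stored length agrees with end - start + 1.
lengths_match = all(intron_length(s, e) == n for _, s, e, n, _ in ROWS)
```

For these four rows the stored lengths agree with the coordinates, and the predicted phases match the `phase` column.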
30 rows where transcript_id = 3555658
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17675752 | GT-AG | 0 | 1.000000099473604e-05 | 327 | rna-XM_042054063.1 3555658 | 1 | 122341035 | 122341361 | Arvicola amphibius 1047088 | AAG\|GTAAGTGGAG...TATGCACTAACA/TATGCACTAACA...TGCAG\|AGG | 0 | 1 | 1.154 |
| 17675753 | GT-AG | 0 | 1.000000099473604e-05 | 3327 | rna-XM_042054063.1 3555658 | 2 | 122341612 | 122344938 | Arvicola amphibius 1047088 | CAG\|GTTAGAAATT...CAAGCTTTAGAC/TTTATGTTTATG...CACAG\|TTC | 1 | 1 | 5.963 |
| 17675754 | GT-AG | 0 | 1.000000099473604e-05 | 2482 | rna-XM_042054063.1 3555658 | 3 | 122345067 | 122347548 | Arvicola amphibius 1047088 | CAG\|GTGAGAAAGT...TTGTCTTTGACT/TTGTCTTTGACT...TCTAG\|GTA | 0 | 1 | 8.425 |
| 17675755 | GT-AG | 0 | 1.000000099473604e-05 | 3857 | rna-XM_042054063.1 3555658 | 4 | 122347709 | 122351565 | Arvicola amphibius 1047088 | CTG\|GTTAGTTAGC...GTTTCTTTGTTT/CTGAAATTAACA...CTTAG\|GAT | 1 | 1 | 11.502 |
| 17675756 | GT-AG | 0 | 1.000000099473604e-05 | 8306 | rna-XM_042054063.1 3555658 | 5 | 122351826 | 122360131 | Arvicola amphibius 1047088 | AAG\|GTAATTTCAT...TTTTTTATGACA/GTGTTTTTTATG...TTCAG\|GAA | 0 | 1 | 16.503 |
| 17675757 | GC-AG | 0 | 1.000000099473604e-05 | 659 | rna-XM_042054063.1 3555658 | 6 | 122360249 | 122360907 | Arvicola amphibius 1047088 | AAG\|GCAAGTATCC...TTCATCATAACT/AGCTTTCTAATT...ATCAG\|ATT | 0 | 1 | 18.754 |
| 17675758 | GT-AG | 0 | 1.000000099473604e-05 | 271 | rna-XM_042054063.1 3555658 | 7 | 122361026 | 122361296 | Arvicola amphibius 1047088 | AAG\|GTGAGTCGGA...TTTTCCTTCTCC/TGTCGACTGACC...CCCAG\|CGT | 1 | 1 | 21.023 |
| 17675759 | GT-AG | 0 | 1.000000099473604e-05 | 9822 | rna-XM_042054063.1 3555658 | 8 | 122361471 | 122371292 | Arvicola amphibius 1047088 | AAG\|GTGAGTCTCC...TTTGCTTCAGCC/GTTTGCTTCAGC...TTCAG\|GGG | 1 | 1 | 24.37 |
| 17675760 | GT-AG | 0 | 1.000000099473604e-05 | 5487 | rna-XM_042054063.1 3555658 | 9 | 122371475 | 122376961 | Arvicola amphibius 1047088 | CAG\|GTGAGTGCAC...AATACCATGATG/ATGATGATTATT...TTTAG\|AAA | 0 | 1 | 27.871 |
| 17675761 | GT-AG | 0 | 0.0074864504467251 | 4100 | rna-XM_042054063.1 3555658 | 10 | 122377094 | 122381193 | Arvicola amphibius 1047088 | AAG\|GTATTTATTC...TAGTCCTTGTTT/CCAACTCTGAAT...TACAG\|AAA | 0 | 1 | 30.41 |
| 17675762 | GT-AG | 0 | 1.000000099473604e-05 | 9631 | rna-XM_042054063.1 3555658 | 11 | 122381324 | 122390954 | Arvicola amphibius 1047088 | CAG\|GTGTGTATCT...ATGTCTTTGCTT/TCTTTGCTTAAT...TCTAG\|GTT | 1 | 1 | 32.91 |
| 17675763 | GT-AG | 0 | 1.000000099473604e-05 | 198 | rna-XM_042054063.1 3555658 | 12 | 122391110 | 122391307 | Arvicola amphibius 1047088 | ATG\|GTGAGCATTT...TTGTTCTAAGCA/ATTGTTCTAAGC...TTTAG\|ATA | 0 | 1 | 35.892 |
| 17675764 | GT-AG | 0 | 1.000000099473604e-05 | 581 | rna-XM_042054063.1 3555658 | 13 | 122391464 | 122392044 | Arvicola amphibius 1047088 | AAG\|GTAAATAACA...GGTTACTTACCC/TGGTTACTTACC...CCTAG\|GTG | 0 | 1 | 38.892 |
| 17675765 | GT-AG | 0 | 1.000000099473604e-05 | 1222 | rna-XM_042054063.1 3555658 | 14 | 122392256 | 122393477 | Arvicola amphibius 1047088 | ATG\|GTTCGTCTAT...CCGTGTTTGATC/CCGTGTTTGATC...TTCAG\|GTG | 1 | 1 | 42.951 |
| 17675766 | GT-AG | 0 | 1.000000099473604e-05 | 5149 | rna-XM_042054063.1 3555658 | 15 | 122393629 | 122398777 | Arvicola amphibius 1047088 | CAG\|GTAAGGACCC...ATGTTTTTACAA/TATGTTTTTACA...TTTAG\|GTT | 2 | 1 | 45.855 |
| 17675767 | GT-AG | 0 | 1.000000099473604e-05 | 811 | rna-XM_042054063.1 3555658 | 16 | 122398877 | 122399687 | Arvicola amphibius 1047088 | CAA\|GTGAGTACCA...TGTATGTTTGTT/TGTGTATGTATG...TGTAG\|GGT | 2 | 1 | 47.759 |
| 17675768 | GT-AG | 0 | 1.000000099473604e-05 | 2345 | rna-XM_042054063.1 3555658 | 17 | 122399851 | 122402195 | Arvicola amphibius 1047088 | GGG\|GTAAGCACCT...GCTGCCTTCTCT/AAATCTGTGACC...TGTAG\|TGG | 0 | 1 | 50.894 |
| 17675769 | GT-AG | 0 | 1.000000099473604e-05 | 5836 | rna-XM_042054063.1 3555658 | 18 | 122402494 | 122408329 | Arvicola amphibius 1047088 | AAG\|GTAAGTCTCT...GATCTCATGACC/GTGATATTTATT...TCCAG\|GTG | 1 | 1 | 56.626 |
| 17675770 | GT-AG | 0 | 0.0209051156624947 | 139 | rna-XM_042054063.1 3555658 | 19 | 122408583 | 122408721 | Arvicola amphibius 1047088 | GGA\|GTATGTATGT...GTGTCCTTTATG/AATATTATCATC...TGTAG\|ATC | 2 | 1 | 61.493 |
| 17675771 | GT-AG | 0 | 1.000000099473604e-05 | 4152 | rna-XM_042054063.1 3555658 | 20 | 122408927 | 122413078 | Arvicola amphibius 1047088 | CTG\|GTGAGTGTTT...ATTTGCTTTGTG/CAGAAGCTAACA...CTCAG\|CTG | 0 | 1 | 65.436 |
| 17675772 | GT-AG | 0 | 1.000000099473604e-05 | 4850 | rna-XM_042054063.1 3555658 | 21 | 122413302 | 122418151 | Arvicola amphibius 1047088 | CAG\|GTAAGGGATT...TATCTTTTAAAT/CATTTTCTCAGT...CATAG\|ACA | 1 | 1 | 69.725 |
| 17675773 | GT-AG | 0 | 1.000000099473604e-05 | 353 | rna-XM_042054063.1 3555658 | 22 | 122418313 | 122418665 | Arvicola amphibius 1047088 | TTG\|GTAAGTATTT...AGGTTTGTGACT/AGGTTTGTGACT...TAAAG\|GGT | 0 | 1 | 72.822 |
| 17675774 | GT-AG | 0 | 1.000000099473604e-05 | 6569 | rna-XM_042054063.1 3555658 | 23 | 122418725 | 122425293 | Arvicola amphibius 1047088 | GAG\|GTAAGCATTC...TTTCTGCTAATA/TTTCTGCTAATA...TGAAG\|GGG | 2 | 1 | 73.957 |
| 17675775 | GT-AG | 0 | 1.000000099473604e-05 | 2921 | rna-XM_042054063.1 3555658 | 24 | 122425556 | 122428476 | Arvicola amphibius 1047088 | AAG\|GTAAGTAACA...CTCATCTTCACT/TCCATTCTCATC...TACAG\|GCT | 0 | 1 | 78.996 |
| 17675776 | GT-AG | 0 | 1.000000099473604e-05 | 208 | rna-XM_042054063.1 3555658 | 25 | 122428603 | 122428810 | Arvicola amphibius 1047088 | AAG\|GTAATGTCTT...TTTTCTTTGTTT/GTATTTTGTATT...TTAAG\|ATA | 0 | 1 | 81.42 |
| 17675777 | GT-AG | 0 | 0.0309815473260545 | 2351 | rna-XM_042054063.1 3555658 | 26 | 122429006 | 122431356 | Arvicola amphibius 1047088 | AAG\|GTATGCTAGT...GTTGTCTTATCT/AGTTGTCTTATC...TTTAG\|GTA | 0 | 1 | 85.17 |
| 17675778 | GT-AG | 0 | 1.000000099473604e-05 | 629 | rna-XM_042054063.1 3555658 | 27 | 122431545 | 122432173 | Arvicola amphibius 1047088 | GAG\|GTGAGATCAT...CCCTTCCTGACT/CCCTTCCTGACT...TGAAG\|TGG | 2 | 1 | 88.786 |
| 17675779 | GT-AG | 0 | 1.000000099473604e-05 | 348 | rna-XM_042054063.1 3555658 | 28 | 122432345 | 122432692 | Arvicola amphibius 1047088 | CAG\|GTGTGGCTCA...ATAGTTTTGCCT/ATTCAAATAACA...TCTAG\|TAT | 2 | 1 | 92.075 |
| 17675780 | GT-AG | 0 | 1.000000099473604e-05 | 8582 | rna-XM_042054063.1 3555658 | 29 | 122432884 | 122441465 | Arvicola amphibius 1047088 | CAG\|GTAAGTTGGG...GCAGTTTTGAAT/AGATCACTCACC...CCTAG\|GTA | 1 | 1 | 95.749 |
| 17675781 | GT-AG | 0 | 1.000000099473604e-05 | 394 | rna-XM_042054063.1 3555658 | 30 | 122441540 | 122441933 | Arvicola amphibius 1047088 | AAG\|GTGACGCTCA...GGGGTCTGGAAG/TCAATGGTGACT...TGAAG\|GTG | 0 | 1 | 97.173 |
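The filtered view above corresponds to a plain `WHERE transcript_id = 3555658` query. As a minimal sketch, the snippet below reproduces that filter with Python's standard `sqlite3` module against an in-memory table. The table here keeps only a subset of the columns, the three real rows are copied from the listing above, and the fourth row (transcript 42) is a hypothetical filler added only to show that the filter excludes other transcripts.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced, illustrative version of the introns table (subset of columns only).
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, score REAL, "
    "transcript_id INTEGER, ordinal_index INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (17675752, "GT-AG", 1.000000099473604e-05, 3555658, 1),
        (17675757, "GC-AG", 1.000000099473604e-05, 3555658, 6),
        (17675761, "GT-AG", 0.0074864504467251, 3555658, 10),
        (1, "GT-AG", 0.5, 42, 1),  # hypothetical row from another transcript
    ],
)

# Same filter as the page: all introns of one transcript, ordered by id.
rows = conn.execute(
    "SELECT id, ordinal_index FROM introns WHERE transcript_id = ? ORDER BY id",
    (3555658,),
).fetchall()

# Tally terminal dinucleotide pairs within that transcript.
pair_counts = dict(
    conn.execute(
        "SELECT dinucleotide_pair, COUNT(*) FROM introns "
        "WHERE transcript_id = ? GROUP BY dinucleotide_pair",
        (3555658,),
    ).fetchall()
)
```

Run against the full table, the same `GROUP BY` would count all 30 introns of the transcript rather than the three sample rows used here.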
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
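Two practical consequences of this schema can be sketched with Python's standard `sqlite3` module: a lookup by `transcript_id` can be served by `idx_introns_transcript_id`, and the foreign keys only reject orphan rows when enforcement is switched on, because SQLite leaves it off by default. The `transcripts` and `genomes` tables below are hypothetical one-column stubs (their real definitions are not shown here), and the `introns` table is reduced to the columns needed for the demonstration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite does not enforce foreign keys unless this pragma is enabled.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript(
    """
    CREATE TABLE transcripts (id INTEGER PRIMARY KEY);       -- hypothetical stub
    CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);  -- hypothetical stub
    CREATE TABLE introns (
      id INTEGER PRIMARY KEY,
      transcript_id INTEGER REFERENCES transcripts(id),
      taxonomy_id INTEGER REFERENCES genomes(taxonomy_id)
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    INSERT INTO transcripts VALUES (3555658);
    INSERT INTO genomes VALUES (1047088);
    INSERT INTO introns VALUES (17675752, 3555658, 1047088);
    """
)

# Ask the planner how it would run the transcript lookup; the last field of
# each EXPLAIN QUERY PLAN row is the human-readable detail string.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (3555658,),
).fetchall()
plan_detail = " ".join(row[-1] for row in plan)

# An intron pointing at a transcript that does not exist should be rejected.
try:
    conn.execute("INSERT INTO introns VALUES (1, 999, 1047088)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True
```

The plan detail names `idx_introns_transcript_id`, and the orphan insert raises `sqlite3.IntegrityError`. The exact wording of the plan text varies between SQLite versions, so checking for the index name is more robust than matching the whole string.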