introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
51 rows where transcript_id = 3555637
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, is_minor, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17675247 | GT-AG | 0 | 1.000000099473604e-05 | 50032 | rna-XM_038330219.1 3555637 | 1 | 136729361 | 136779392 | Arvicola amphibius 1047088 | TGG|GTGAGCAGCG...TTCTTCTTCTTC/CCTGTGCTAATG...AACAG|CTT | 1 | 1 | 0.813 |
| 17675248 | GT-AG | 0 | 1.000000099473604e-05 | 5168 | rna-XM_038330219.1 3555637 | 2 | 136779477 | 136784644 | Arvicola amphibius 1047088 | AAG|GTGAGTGTCG...ACCATTTCAACT/CCTGTACTAACC...TTTAG|GGT | 1 | 1 | 2.299 |
| 17675249 | GT-AG | 0 | 1.000000099473604e-05 | 3234 | rna-XM_038330219.1 3555637 | 3 | 136784686 | 136787919 | Arvicola amphibius 1047088 | AAG|GTAAGCCATT...TTCACATTGATA/TCAGTATTCACA...CAAAG|GGT | 0 | 1 | 3.024 |
| 17675250 | GT-AG | 0 | 1.000000099473604e-05 | 6778 | rna-XM_038330219.1 3555637 | 4 | 136787976 | 136794753 | Arvicola amphibius 1047088 | AGG|GTGAGTCTTG...AACTCCCAATCA/CACAGAATCAGC...TCCAG|GCA | 2 | 1 | 4.014 |
| 17675251 | GT-AG | 0 | 1.000000099473604e-05 | 2855 | rna-XM_038330219.1 3555637 | 5 | 136794851 | 136797705 | Arvicola amphibius 1047088 | GTG|GTAAGAAATG...TTTTTTTTAAAC/TTTTTTTTAAAC...TATAG|CAA | 0 | 1 | 5.729 |
| 17675252 | GT-AG | 0 | 7.472353820418753e-05 | 2801 | rna-XM_038330219.1 3555637 | 6 | 136797855 | 136800655 | Arvicola amphibius 1047088 | CAG|GTTTGTTTAT...ACTCCCTGGATT/CCTGGATTCATC...TCTAG|AAT | 2 | 1 | 8.364 |
| 17675253 | GT-AG | 0 | 1.000000099473604e-05 | 538 | rna-XM_038330219.1 3555637 | 7 | 136800792 | 136801329 | Arvicola amphibius 1047088 | AAA|GTGAGTTCAC...CTCACATTGACT/AGGTTGCTCACA...TGTAG|TCT | 0 | 1 | 10.769 |
| 17675254 | GT-AG | 0 | 1.000000099473604e-05 | 1681 | rna-XM_038330219.1 3555637 | 8 | 136801488 | 136803168 | Arvicola amphibius 1047088 | CAG|GTAGGTGACT...GTTTTTTTTTCC/GGAAGATTAATT...CCAAG|TGA | 2 | 1 | 13.563 |
| 17675255 | GT-AG | 0 | 1.000000099473604e-05 | 382 | rna-XM_038330219.1 3555637 | 9 | 136803251 | 136803632 | Arvicola amphibius 1047088 | ACG|GTGAGTCCAT...TTCTCCATAATA/CTTGTGTTGAAT...TCTAG|GAT | 0 | 1 | 15.013 |
| 17675256 | GT-AG | 0 | 1.000000099473604e-05 | 7395 | rna-XM_038330219.1 3555637 | 10 | 136803769 | 136811163 | Arvicola amphibius 1047088 | CTG|GTAAAATCCA...TTTTTTTTCACT/TTTTTTTTCACT...CCCAG|TGA | 1 | 1 | 17.418 |
| 17675257 | GT-AG | 0 | 6.603828144406104e-05 | 3417 | rna-XM_038330219.1 3555637 | 11 | 136811237 | 136814653 | Arvicola amphibius 1047088 | GCC|GTAAGTATGG...GTGGTCTTGGTT/GGTTGCCCCATG...TCTAG|GCT | 2 | 1 | 18.709 |
| 17675258 | GT-AG | 0 | 0.000157111229494 | 4276 | rna-XM_038330219.1 3555637 | 12 | 136814797 | 136819072 | Arvicola amphibius 1047088 | AAG|GTATGTATCT...TAAACGTTACTT/ACGTTACTTACG...TGCAG|GTT | 1 | 1 | 21.238 |
| 17675259 | GT-AG | 0 | 1.000000099473604e-05 | 4469 | rna-XM_038330219.1 3555637 | 13 | 136819199 | 136823667 | Arvicola amphibius 1047088 | CAG|GTAAGGTTAG...CACGCTGTGAAT/AAGGATATAATT...CCCAG|GCG | 1 | 1 | 23.466 |
| 17675260 | GT-AG | 0 | 1.1802854819353858 | 1242 | rna-XM_038330219.1 3555637 | 14 | 136823793 | 136825034 | Arvicola amphibius 1047088 | GAG|GTATCTCTTT...TCTTTCTTTATG/TCTTTCTTTATG...TAAAG|CAT | 0 | 1 | 25.676 |
| 17675261 | GT-AG | 0 | 0.0012063907928933 | 880 | rna-XM_038330219.1 3555637 | 15 | 136825134 | 136826013 | Arvicola amphibius 1047088 | AAG|GTATTGTGTT...CCCCTCTTGATT/AAGGCACTCACA...TCAAG|GTG | 0 | 1 | 27.427 |
| 17675262 | GT-AG | 0 | 1.000000099473604e-05 | 4614 | rna-XM_038330219.1 3555637 | 16 | 136826087 | 136830700 | Arvicola amphibius 1047088 | ACT|GTGAGTAATG...TTGTCCTCAGTA/TTTGTCCTCAGT...TGCAG|CTA | 1 | 1 | 28.718 |
| 17675263 | GT-AG | 0 | 1.000000099473604e-05 | 716 | rna-XM_038330219.1 3555637 | 17 | 136830805 | 136831520 | Arvicola amphibius 1047088 | AAG|GTGCGCACAT...GTGACCTCGATG/TACTGTCTGATC...CACAG|GCC | 0 | 1 | 30.557 |
| 17675264 | GT-AG | 0 | 1.000000099473604e-05 | 4792 | rna-XM_038330219.1 3555637 | 18 | 136831705 | 136836496 | Arvicola amphibius 1047088 | ATG|GTGAGTTCTA...TGATTCTTGCAT/GGGGAGCTGATT...TGCAG|TGG | 1 | 1 | 33.811 |
| 17675265 | GT-AG | 0 | 1.623325925188899e-05 | 4742 | rna-XM_038330219.1 3555637 | 19 | 136836595 | 136841336 | Arvicola amphibius 1047088 | AAG|GTAACATGGG...TGTTCACTGACC/TGTTCACTGACC...TACAG|TTC | 0 | 1 | 35.544 |
| 17675266 | GT-AG | 0 | 1.000000099473604e-05 | 319 | rna-XM_038330219.1 3555637 | 20 | 136841427 | 136841745 | Arvicola amphibius 1047088 | CTG|GTAAGGGGGC...TTTTCCTTGTGC/GTTTTGGTTACA...GACAG|GTC | 0 | 1 | 37.135 |
| 17675267 | GT-AG | 0 | 1.000000099473604e-05 | 6736 | rna-XM_038330219.1 3555637 | 21 | 136841847 | 136848582 | Arvicola amphibius 1047088 | CAC|GTGAGTTCTC...GAACTCTTGATT/TCTTGATTGACG...ACTAG|GAA | 2 | 1 | 38.921 |
| 17675268 | GT-AG | 0 | 1.000000099473604e-05 | 15797 | rna-XM_038330219.1 3555637 | 22 | 136848718 | 136864514 | Arvicola amphibius 1047088 | CCA|GTAAGTGTTC...GTCTCCTCAGGT/CGTCTCCTCAGG...TGCAG|GCT | 2 | 1 | 41.309 |
| 17675269 | GT-AG | 0 | 1.000000099473604e-05 | 46314 | rna-XM_038330219.1 3555637 | 23 | 136864624 | 136910937 | Arvicola amphibius 1047088 | AAG|GTAGGTGCCA...TCTGCCTTAATT/TTAATTCTGATC...TGCAG|GGG | 0 | 1 | 43.236 |
| 17675270 | GT-AG | 0 | 0.0004275072184093 | 2716 | rna-XM_038330219.1 3555637 | 24 | 136911009 | 136913724 | Arvicola amphibius 1047088 | CAG|GTAACTGTCC...TGCTTCTGAACA/ATGCTTCTGAAC...CATAG|CAA | 2 | 1 | 44.492 |
| 17675271 | GT-AG | 0 | 1.000000099473604e-05 | 22020 | rna-XM_038330219.1 3555637 | 25 | 136913832 | 136935851 | Arvicola amphibius 1047088 | ACG|GTGAGTGGAT...GCTTCCTTGGTG/CTTCTGCTGACT...TGCAG|ACT | 1 | 1 | 46.384 |
| 17675272 | GT-AG | 0 | 1.000000099473604e-05 | 1980 | rna-XM_038330219.1 3555637 | 26 | 136935980 | 136937959 | Arvicola amphibius 1047088 | GTG|GTAAGTCATG...TTGGCTTTGATT/TTGGCTTTGATT...TCCAG|GGG | 0 | 1 | 48.647 |
| 17675273 | GT-AG | 0 | 1.000000099473604e-05 | 110190 | rna-XM_038330219.1 3555637 | 27 | 136938056 | 137048245 | Arvicola amphibius 1047088 | ATT|GTAAGTGCCC...ATTCCTCTATCT/CTCTATCTAATT...TGCAG|GGC | 0 | 1 | 50.345 |
| 17675274 | GT-AG | 0 | 1.000000099473604e-05 | 6881 | rna-XM_038330219.1 3555637 | 28 | 137048348 | 137055228 | Arvicola amphibius 1047088 | GTG|GTAAGTGTCA...ACTGCCTTGCTG/GGCTGCCTCACT...TGCAG|GAT | 0 | 1 | 52.149 |
| 17675275 | GT-AG | 0 | 0.0005497268630998 | 78687 | rna-XM_038330219.1 3555637 | 29 | 137055324 | 137134010 | Arvicola amphibius 1047088 | CAA|GTAAGTTTAG...TTTCTCTTGATC/TTTCTCTTGATC...TTCAG|AGT | 2 | 1 | 53.828 |
| 17675276 | GT-AG | 0 | 1.000000099473604e-05 | 1978 | rna-XM_038330219.1 3555637 | 30 | 137134090 | 137136067 | Arvicola amphibius 1047088 | CAG|GTAAAAGGAG...CACAACTTAGCA/CCACAACTTAGC...TTCAG|CTC | 0 | 1 | 55.225 |
| 17675277 | GT-AG | 0 | 1.000000099473604e-05 | 10175 | rna-XM_038330219.1 3555637 | 31 | 137136169 | 137146343 | Arvicola amphibius 1047088 | CAA|GTAAGTCATT...CTTTCTTTCCCT/TGTAGACTGACG...TCTAG|GTA | 2 | 1 | 57.011 |
| 17675278 | GT-AG | 0 | 1.000000099473604e-05 | 9155 | rna-XM_038330219.1 3555637 | 32 | 137146403 | 137155557 | Arvicola amphibius 1047088 | TGG|GTGAGTAGCT...TTCTTTTTAAAT/TTCTTTTTAAAT...GAAAG|GTC | 1 | 1 | 58.055 |
| 17675279 | GT-AG | 0 | 1.000000099473604e-05 | 10090 | rna-XM_038330219.1 3555637 | 33 | 137155707 | 137165796 | Arvicola amphibius 1047088 | ATG|GTAAGGATGT...GCTTCCTCTGTT/ACTATGCTAATT...TATAG|TTT | 0 | 1 | 60.69 |
| 17675280 | GT-AG | 0 | 1.000000099473604e-05 | 183 | rna-XM_038330219.1 3555637 | 34 | 137165883 | 137166065 | Arvicola amphibius 1047088 | AAT|GTAAGTGTTG...GTTTCTTTCTTG/GTGTGATTAACA...TTCAG|CCT | 2 | 1 | 62.21 |
| 17675281 | GT-AG | 0 | 1.000000099473604e-05 | 7252 | rna-XM_038330219.1 3555637 | 35 | 137166223 | 137173474 | Arvicola amphibius 1047088 | CTG|GTGAGCAAGA...CATGGCTTATTT/CCATGGCTTATT...TTCAG|AAT | 0 | 1 | 64.987 |
| 17675282 | GT-AG | 0 | 0.0011408006856426 | 1879 | rna-XM_038330219.1 3555637 | 36 | 137173516 | 137175394 | Arvicola amphibius 1047088 | AAG|GTATGTGTGA...GTGGCCTTATTT/CCTTATTTCATC...ACCAG|GTA | 2 | 1 | 65.712 |
| 17675283 | GT-AG | 0 | 1.000000099473604e-05 | 3540 | rna-XM_038330219.1 3555637 | 37 | 137175486 | 137179025 | Arvicola amphibius 1047088 | AAG|GTAATCAAAA...CGCTCCTTGTTT/ATATTATTCAAC...CTCAG|TGG | 0 | 1 | 67.321 |
| 17675284 | GT-AG | 0 | 1.000000099473604e-05 | 11103 | rna-XM_038330219.1 3555637 | 38 | 137179146 | 137190248 | Arvicola amphibius 1047088 | AAG|GTAAGGCACA...TCTGTTTTATCT/CTCTGTTTTATC...CGCAG|ATG | 0 | 1 | 69.443 |
| 17675285 | GT-AG | 0 | 6.640551790540297e-05 | 717 | rna-XM_038330219.1 3555637 | 39 | 137190339 | 137191055 | Arvicola amphibius 1047088 | CTG|GTAGGCCTTC...CAGCCCTGAAAT/TGAGTTTTCATT...TCCAG|AAA | 0 | 1 | 71.034 |
| 17675286 | GT-AG | 0 | 1.000000099473604e-05 | 4795 | rna-XM_038330219.1 3555637 | 40 | 137191161 | 137195955 | Arvicola amphibius 1047088 | CGG|GTGAGTCTGA...TGTTATTTAACA/TGTTATTTAACA...TCCAG|GGA | 0 | 1 | 72.891 |
| 17675287 | GT-AG | 1 | 99.99973955373736 | 121 | rna-XM_038330219.1 3555637 | 41 | 137196098 | 137196218 | Arvicola amphibius 1047088 | AGT|GTATCCTTTA...TGGTCCTTAACG/GTGGTCCTTAAC...CTCAG|ACA | 1 | 1 | 75.402 |
| 17675288 | GT-AG | 0 | 0.0044914528204419 | 1896 | rna-XM_038330219.1 3555637 | 42 | 137196298 | 137198193 | Arvicola amphibius 1047088 | AAG|GTAACCATGC...GAAATCTTATGT/AGAAATCTTATG...TACAG|TTT | 2 | 1 | 76.799 |
| 17675289 | GT-AG | 0 | 1.000000099473604e-05 | 3624 | rna-XM_038330219.1 3555637 | 43 | 137198279 | 137201902 | Arvicola amphibius 1047088 | GCG|GTAATTAAAA...CCCATCCTGACA/CCCATCCTGACA...TACAG|AAT | 0 | 1 | 78.302 |
| 17675290 | GT-AG | 0 | 1.000000099473604e-05 | 1926 | rna-XM_038330219.1 3555637 | 44 | 137201990 | 137203915 | Arvicola amphibius 1047088 | ATG|GTAAGGAGAC...TGTTGCCTACCA/CTTGGGCACACT...TGTAG|GTA | 0 | 1 | 79.841 |
| 17675291 | GC-AG | 0 | 1.000000099473604e-05 | 903 | rna-XM_038330219.1 3555637 | 45 | 137204093 | 137204995 | Arvicola amphibius 1047088 | AAG|GCACGTGACA...GTCTCCCTGTCT/ACAGCTGTCACA...TGCAG|GCT | 0 | 1 | 82.971 |
| 17675292 | GT-AG | 0 | 1.000000099473604e-05 | 4999 | rna-XM_038330219.1 3555637 | 46 | 137205080 | 137210078 | Arvicola amphibius 1047088 | CAG|GTTGGTATGG...TTTTTCTTACCT/GTTTTTCTTACC...TTCAG|ATT | 0 | 1 | 84.456 |
| 17675293 | GT-AG | 0 | 1.000000099473604e-05 | 6274 | rna-XM_038330219.1 3555637 | 47 | 137210217 | 137216490 | Arvicola amphibius 1047088 | ATG|GTGAGTGGCC...TGGGTGTTAACA/TGGGTGTTAACA...CCCAG|CCC | 0 | 1 | 86.897 |
| 17675294 | GT-AG | 0 | 1.000000099473604e-05 | 4978 | rna-XM_038330219.1 3555637 | 48 | 137216637 | 137221614 | Arvicola amphibius 1047088 | TGG|GTAAGTGGGC...TGCCCTGTAACG/CAGTGATTCAGA...TTTAG|GTT | 2 | 1 | 89.478 |
| 17675295 | GT-AG | 0 | 3.41672205250426e-05 | 3974 | rna-XM_038330219.1 3555637 | 49 | 137221814 | 137225787 | Arvicola amphibius 1047088 | ACG|GTAATTTAAC...TAGACCTTTGTT/TCTTTCTGTACC...CACAG|ATA | 0 | 1 | 92.997 |
| 17675296 | GT-AG | 0 | 1.000000099473604e-05 | 1614 | rna-XM_038330219.1 3555637 | 50 | 137225942 | 137227555 | Arvicola amphibius 1047088 | CCA|GTAAGTGGAA...TCCATTGTAACT/TCCATTGTAACT...TCCAG|GTC | 1 | 1 | 95.721 |
| 17675297 | GT-AG | 0 | 1.000000099473604e-05 | 4434 | rna-XM_038330219.1 3555637 | 51 | 137227702 | 137232135 | Arvicola amphibius 1047088 | CGG|GTAAGTGGGA...GCTTCTTTATAC/GGCTTCTTTATA...CCTAG|CAA | 0 | 1 | 98.302 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);