introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
56 rows where transcript_id = 3555608
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17674290 | GT-AG | 0 | 1.000000099473604e-05 | 517 | rna-XM_042055078.1 3555608 | 1 | 72925585 | 72926101 | Arvicola amphibius 1047088 | CAG|GTTGGTATCT...TGTGCCCTAGCA/TACACATTCATG...TGCAG|GGC | 0 | 1 | 1.512 |
| 17674291 | GT-AG | 0 | 1.000000099473604e-05 | 1101 | rna-XM_042055078.1 3555608 | 3 | 72924329 | 72925429 | Arvicola amphibius 1047088 | TGG|GTCTGGCTTT...GCCATCTTGAGA/GCCATCTTGAGA...CTCAG|CAT | 0 | 1 | 3.189 |
| 17674292 | GT-AG | 0 | 5.004470446736087e-05 | 4770 | rna-XM_042055078.1 3555608 | 4 | 72919324 | 72924093 | Arvicola amphibius 1047088 | AAG|GTATGGTGCA...TTTGTCTTCTTT/GTCTGTTTGATA...TCCAG|AAG | 1 | 1 | 5.764 |
| 17674293 | GT-AG | 0 | 1.000000099473604e-05 | 1941 | rna-XM_042055078.1 3555608 | 5 | 72917212 | 72919152 | Arvicola amphibius 1047088 | AAG|GTGAGACCTG...AATGGCTTAACA/AATGGCTTAACA...TCTAG|GCA | 1 | 1 | 7.638 |
| 17674294 | GT-AG | 0 | 0.0037097883825664 | 475 | rna-XM_042055078.1 3555608 | 6 | 72916617 | 72917091 | Arvicola amphibius 1047088 | AAT|GTATGTCTGT...CAGCCCTTTGTG/TTTGTGCTGATG...CACAG|CTG | 1 | 1 | 8.952 |
| 17674295 | GT-AG | 0 | 1.000000099473604e-05 | 446 | rna-XM_042055078.1 3555608 | 7 | 72916002 | 72916447 | Arvicola amphibius 1047088 | GAG|GTTGGTCAAT...GTGTTCTCATCT/CGTGTTCTCATC...CTCAG|GTA | 2 | 1 | 10.804 |
| 17674296 | GT-AG | 0 | 1.000000099473604e-05 | 2437 | rna-XM_042055078.1 3555608 | 8 | 72913325 | 72915761 | Arvicola amphibius 1047088 | CAG|GTGAGCCATG...GCAACCTTGATC/GCAACCTTGATC...TCCAG|GGA | 2 | 1 | 13.434 |
| 17674297 | GT-AG | 0 | 1.000000099473604e-05 | 3703 | rna-XM_042055078.1 3555608 | 9 | 72909489 | 72913191 | Arvicola amphibius 1047088 | CAG|GTAGGGCTGG...CTCACGTCAATT/CGTCCTCTCACG...CCCAG|ATA | 0 | 1 | 14.892 |
| 17674298 | GT-AG | 0 | 1.000000099473604e-05 | 1453 | rna-XM_042055078.1 3555608 | 10 | 72907829 | 72909281 | Arvicola amphibius 1047088 | AAG|GTGGGCTCAC...GCTTTCTTATCC/CGCTTTCTTATC...CTCAG|GTT | 0 | 1 | 17.16 |
| 17674299 | GT-AG | 0 | 1.000000099473604e-05 | 2670 | rna-XM_042055078.1 3555608 | 11 | 72904977 | 72907646 | Arvicola amphibius 1047088 | CAG|GTAAGAAGCC...GAGACTTTAACA/GAGATGCTAACC...AACAG|GTA | 2 | 1 | 19.154 |
| 17674300 | GT-AG | 0 | 0.0391185337821052 | 1786 | rna-XM_042055078.1 3555608 | 12 | 72902969 | 72904754 | Arvicola amphibius 1047088 | CAG|GTACCCTAGC...TATTTTCCAACA/TATTTTCCAACA...TCCAG|ATT | 2 | 1 | 21.587 |
| 17674301 | GT-AG | 0 | 1.000000099473604e-05 | 1888 | rna-XM_042055078.1 3555608 | 13 | 72900882 | 72902769 | Arvicola amphibius 1047088 | AGG|GTTGTTCTGA...GTTTCCTTCCCT/TACTTGTTCACA...TTTAG|GTC | 0 | 1 | 23.767 |
| 17674302 | GT-AG | 0 | 1.000000099473604e-05 | 129 | rna-XM_042055078.1 3555608 | 14 | 72900554 | 72900682 | Arvicola amphibius 1047088 | AGG|GTTAGTGACA...TGAACCCTAGTA/TGGAATTCCACT...CCCAG|ACA | 1 | 1 | 25.948 |
| 17674303 | GT-AG | 0 | 0.0001852801188608 | 285 | rna-XM_042055078.1 3555608 | 15 | 72900055 | 72900339 | Arvicola amphibius 1047088 | CAG|GTAGACATGC...GGAATTTTAACC/GGAATTTTAACC...TGCAG|CAC | 2 | 1 | 28.293 |
| 17674304 | GT-AG | 0 | 1.000000099473604e-05 | 847 | rna-XM_042055078.1 3555608 | 16 | 72898864 | 72899710 | Arvicola amphibius 1047088 | ACG|GTAAGTGCCA...CACTCATTAACT/ACAGCACTCATT...TGTAG|ATT | 1 | 1 | 32.062 |
| 17674305 | GT-AG | 0 | 2.470336326242456e-05 | 2626 | rna-XM_042055078.1 3555608 | 17 | 72896178 | 72898803 | Arvicola amphibius 1047088 | CAG|GTGACCACTT...CTTTCTTTTTCT/GCATGGTTTATC...CCTAG|GGA | 1 | 1 | 32.72 |
| 17674306 | GT-AG | 0 | 1.000000099473604e-05 | 1362 | rna-XM_042055078.1 3555608 | 18 | 72894639 | 72896000 | Arvicola amphibius 1047088 | CAG|GTACGAACAT...TACTCCTGAGAG/AGCCCACTGACT...TGTAG|GTA | 1 | 1 | 34.659 |
| 17674307 | GT-AG | 0 | 0.0001087251789876 | 2181 | rna-XM_042055078.1 3555608 | 19 | 72892380 | 72894560 | Arvicola amphibius 1047088 | TAG|GTATGTCAAC...CTCTTCTTTTCT/TGCTGGCTGAAA...CTTAG|CTT | 1 | 1 | 35.514 |
| 17674308 | GT-AG | 0 | 2.1552181488146254e-05 | 738 | rna-XM_042055078.1 3555608 | 20 | 72891499 | 72892236 | Arvicola amphibius 1047088 | CCG|GTATGTGAAA...TTATTTTTGCTC/TTCCTGTTTATT...CGTAG|GAC | 0 | 1 | 37.081 |
| 17674309 | GT-AG | 0 | 1.000000099473604e-05 | 4462 | rna-XM_042055078.1 3555608 | 21 | 72886916 | 72891377 | Arvicola amphibius 1047088 | AAG|GTAAGCCACA...TCTGTCCTATTC/CTCTGTCCTATT...CACAG|TCA | 1 | 1 | 38.407 |
| 17674310 | GT-AG | 0 | 1.000000099473604e-05 | 4575 | rna-XM_042055078.1 3555608 | 22 | 72882229 | 72886803 | Arvicola amphibius 1047088 | CCT|GTAAGTAAGA...AAAGTTTTACAA/TAAAGTTTTACA...TTCAG|ATA | 2 | 1 | 39.634 |
| 17674311 | GT-AG | 0 | 0.0001447197413499 | 1335 | rna-XM_042055078.1 3555608 | 23 | 72880695 | 72882029 | Arvicola amphibius 1047088 | CAG|GTATGTCCCT...GGTTTATTGACT/GGTTTATTGACT...TTTAG|GAA | 0 | 1 | 41.815 |
| 17674312 | GT-AG | 0 | 1.000000099473604e-05 | 4911 | rna-XM_042055078.1 3555608 | 24 | 72875727 | 72880637 | Arvicola amphibius 1047088 | AAG|GTAAGCACTA...ATCCCCTTTTCC/AGTCTACTCACC...TGCAG|TTG | 0 | 1 | 42.439 |
| 17674313 | GT-AG | 0 | 3.113901219509949e-05 | 1090 | rna-XM_042055078.1 3555608 | 25 | 72874427 | 72875516 | Arvicola amphibius 1047088 | CTA|GTAAGATGTT...TCCTCCTTACCC/CATTTGCTCATT...TGCAG|ACC | 0 | 1 | 44.74 |
| 17674314 | GT-AG | 0 | 1.000000099473604e-05 | 1628 | rna-XM_042055078.1 3555608 | 26 | 72872556 | 72874183 | Arvicola amphibius 1047088 | CAG|GTACGGCACA...TTTTCTTTGTTC/GTTGGCTTCATT...CCAAG|GTG | 0 | 1 | 47.403 |
| 17674315 | GC-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_042055078.1 3555608 | 27 | 72872327 | 72872435 | Arvicola amphibius 1047088 | ACA|GCTGACCACT...TCTTTCCTATCG/AACCGCCTGACT...ACCAG|AGA | 0 | 1 | 48.718 |
| 17674316 | GT-AG | 0 | 1.000000099473604e-05 | 1298 | rna-XM_042055078.1 3555608 | 28 | 72870880 | 72872177 | Arvicola amphibius 1047088 | AAG|GTGTGTGAAG...TACCTCTTGTTT/AGCTGTCTCACT...CCTAG|GTT | 2 | 1 | 50.351 |
| 17674317 | GT-AG | 0 | 1.000000099473604e-05 | 312 | rna-XM_042055078.1 3555608 | 29 | 72870467 | 72870778 | Arvicola amphibius 1047088 | AAG|GTCAGAGCGT...CATGTTTTAATG/ATATGTTTTATC...TCTAG|GAG | 1 | 1 | 51.457 |
| 17674318 | GT-AG | 0 | 1.000000099473604e-05 | 2226 | rna-XM_042055078.1 3555608 | 30 | 72868048 | 72870273 | Arvicola amphibius 1047088 | TAG|GTACGACTAT...CTGGACTTAGTC/GCTGGACTTAGT...TCTAG|CTA | 2 | 1 | 53.572 |
| 17674319 | GT-AG | 0 | 1.000000099473604e-05 | 2078 | rna-XM_042055078.1 3555608 | 31 | 72865886 | 72867963 | Arvicola amphibius 1047088 | GAG|GTGAGTACCA...TCTCTCTTTGTT/ATCCCATTTATT...TCCAG|TGA | 2 | 1 | 54.493 |
| 17674320 | GT-AG | 0 | 1.000000099473604e-05 | 2459 | rna-XM_042055078.1 3555608 | 32 | 72863219 | 72865677 | Arvicola amphibius 1047088 | AAG|GTAACAGGAA...ATAGCCTCAGCC/TCAGCCCTCATT...TTAAG|GTT | 0 | 1 | 56.772 |
| 17674321 | GT-AG | 0 | 0.0012349455072506 | 1408 | rna-XM_042055078.1 3555608 | 33 | 72861704 | 72863111 | Arvicola amphibius 1047088 | GAG|GTACTCCTGA...CATTTCTTTGCG/CCGGTGCTGAAC...ATTAG|CAC | 2 | 1 | 57.944 |
| 17674322 | GT-AG | 0 | 1.000000099473604e-05 | 2330 | rna-XM_042055078.1 3555608 | 34 | 72859157 | 72861486 | Arvicola amphibius 1047088 | AAG|GTAGGCCCCG...GTGTCCCTGTCT/TGCACTCACATG...TTCAG|CTT | 0 | 1 | 60.322 |
| 17674323 | GT-AG | 0 | 1.000000099473604e-05 | 1093 | rna-XM_042055078.1 3555608 | 35 | 72857890 | 72858982 | Arvicola amphibius 1047088 | CAG|GTGGGAGGCA...GAATTCTTAATT/GAATTCTTAATT...TGCAG|CGC | 0 | 1 | 62.229 |
| 17674324 | GT-AG | 0 | 1.4432251545078844e-05 | 688 | rna-XM_042055078.1 3555608 | 36 | 72857091 | 72857778 | Arvicola amphibius 1047088 | AAG|GTATGTGGCC...AGCTCCTGAAGT/CAGCTCCTGAAG...CATAG|TCA | 0 | 1 | 63.445 |
| 17674325 | GT-AG | 0 | 1.000000099473604e-05 | 1489 | rna-XM_042055078.1 3555608 | 37 | 72855530 | 72857018 | Arvicola amphibius 1047088 | CAG|GTAAGTTGAC...TGGGCCTAACCC/CTGGGCCTAACC...TCCAG|CAT | 0 | 1 | 64.234 |
| 17674326 | GT-AG | 0 | 1.000000099473604e-05 | 1202 | rna-XM_042055078.1 3555608 | 38 | 72854198 | 72855399 | Arvicola amphibius 1047088 | AAG|GTAACGGAAC...TGATCTCTATCT/GGTCTGCTAACC...CTTAG|CTC | 1 | 1 | 65.659 |
| 17674327 | GT-AG | 0 | 1.000000099473604e-05 | 1299 | rna-XM_042055078.1 3555608 | 39 | 72852763 | 72854061 | Arvicola amphibius 1047088 | CAG|GTGAGTGAGT...ACTCCCATGATG/ATGATGCCGACC...GACAG|GTT | 2 | 1 | 67.149 |
| 17674328 | GT-AG | 0 | 1.000000099473604e-05 | 810 | rna-XM_042055078.1 3555608 | 40 | 72851868 | 72852677 | Arvicola amphibius 1047088 | ATG|GTATGAAATC...ACACCCGTGCCA/CAAATAGACACC...TCCAG|ATC | 0 | 1 | 68.08 |
| 17674329 | GT-AG | 0 | 1.000000099473604e-05 | 1209 | rna-XM_042055078.1 3555608 | 41 | 72850482 | 72851690 | Arvicola amphibius 1047088 | AGG|GTGAGCAGGA...TGACTCTTATAT/CCAGTGCTGACT...CACAG|GTC | 0 | 1 | 70.02 |
| 17674330 | GT-AG | 0 | 1.000000099473604e-05 | 2888 | rna-XM_042055078.1 3555608 | 42 | 72847475 | 72850362 | Arvicola amphibius 1047088 | GAG|GTGAGTGCAG...CTAACTTTCAAA/AGGATGCTAACT...CCCAG|AGG | 2 | 1 | 71.324 |
| 17674331 | GT-AG | 0 | 1.000000099473604e-05 | 4519 | rna-XM_042055078.1 3555608 | 43 | 72842845 | 72847363 | Arvicola amphibius 1047088 | AAG|GTAGGAGGTT...TGGTTCTTATGG/ATGGTTCTTATG...TTAAG|AAA | 2 | 1 | 72.54 |
| 17674332 | GT-AG | 0 | 1.000000099473604e-05 | 944 | rna-XM_042055078.1 3555608 | 44 | 72841780 | 72842723 | Arvicola amphibius 1047088 | CAG|GTGGGCATGC...ATATTCTTATAC/AATATTCTTATA...TTCAG|CCT | 0 | 1 | 73.866 |
| 17674333 | GT-AG | 0 | 1.000000099473604e-05 | 514 | rna-XM_042055078.1 3555608 | 45 | 72841182 | 72841695 | Arvicola amphibius 1047088 | ACG|GTAAGTGCAC...GGTTACTTGATT/GGTTACTTGATT...TCTAG|CCT | 0 | 1 | 74.786 |
| 17674334 | GT-AG | 0 | 1.000000099473604e-05 | 930 | rna-XM_042055078.1 3555608 | 46 | 72840103 | 72841032 | Arvicola amphibius 1047088 | AAG|GTAAGCGAGC...TCCCTCTGAACC/CTCCCTCTGAAC...TCCAG|GCA | 2 | 1 | 76.419 |
| 17674335 | GT-AG | 0 | 1.000000099473604e-05 | 2540 | rna-XM_042055078.1 3555608 | 47 | 72837331 | 72839870 | Arvicola amphibius 1047088 | GAG|GTAAGTCCAC...AACTACTTACCT/GTTTGTCTCACT...CCCAG|CTG | 0 | 1 | 78.961 |
| 17674336 | GT-AG | 0 | 0.050792145258037 | 811 | rna-XM_042055078.1 3555608 | 48 | 72836415 | 72837225 | Arvicola amphibius 1047088 | GAG|GTAACCTCTC...CCTCCCTCAGCC/TCCTCCCTCAGC...TGCAG|CTC | 0 | 1 | 80.112 |
| 17674337 | GT-AG | 0 | 0.0396272558106298 | 4699 | rna-XM_042055078.1 3555608 | 49 | 72831575 | 72836273 | Arvicola amphibius 1047088 | CAG|GTATGCTAGG...AGCTCCTTATTG/GAGCTCCTTATT...CTTAG|AGG | 0 | 1 | 81.657 |
| 17674338 | GT-AG | 0 | 1.000000099473604e-05 | 2385 | rna-XM_042055078.1 3555608 | 50 | 72829055 | 72831439 | Arvicola amphibius 1047088 | GCA|GTAAGAATCC...TCTTCCCCAAAG/AGTCAGTTGATG...TGCAG|CTG | 0 | 1 | 83.136 |
| 17674339 | GT-AG | 0 | 1.000000099473604e-05 | 2265 | rna-XM_042055078.1 3555608 | 51 | 72826559 | 72828823 | Arvicola amphibius 1047088 | ATG|GTACGTGGAT...CTCTCTTTCTCT/AGCTTGGTAACA...TCCAG|CTG | 0 | 1 | 85.667 |
| 17674340 | GT-AG | 0 | 0.0001833391338055 | 783 | rna-XM_042055078.1 3555608 | 52 | 72825617 | 72826399 | Arvicola amphibius 1047088 | CAG|GTAAGCTGTG...ATGCCTTTACCA/TGCATTCTTATA...CTAAG|AAT | 0 | 1 | 87.41 |
| 17674341 | GT-AG | 0 | 1.000000099473604e-05 | 5270 | rna-XM_042055078.1 3555608 | 53 | 72820176 | 72825445 | Arvicola amphibius 1047088 | GCG|GTAAGACTAT...AAAATGTTGAAA/AAAATGTTGAAA...TCTAG|TTG | 0 | 1 | 89.283 |
| 17674342 | GT-AG | 0 | 1.000000099473604e-05 | 3314 | rna-XM_042055078.1 3555608 | 54 | 72816725 | 72820038 | Arvicola amphibius 1047088 | ACA|GTGAGTGCAG...TGTGTATTAACT/TGTGTATTAACT...TCCAG|CAG | 2 | 1 | 90.785 |
| 17674343 | GT-AG | 0 | 1.000000099473604e-05 | 6191 | rna-XM_042055078.1 3555608 | 55 | 72810444 | 72816634 | Arvicola amphibius 1047088 | AAG|GTGAACACTC...CAGGCTTTGGTG/GGCTGTGTCACT...TTCAG|AGG | 2 | 1 | 91.771 |
| 17674344 | GT-AG | 0 | 0.0001532883776835 | 7350 | rna-XM_042055078.1 3555608 | 56 | 72802844 | 72810193 | Arvicola amphibius 1047088 | CTT|GTAGAGTCTC...CAGTCATTGACC/CAGTCATTGACC...TCTAG|TTT | 0 | 1 | 94.51 |
| 17674345 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_042055078.1 3555608 | 57 | 72802579 | 72802678 | Arvicola amphibius 1047088 | CAA|GTGAGCCGCA...CAGTTCTTAGCT/TGAATACTCACC...CACAG|GGA | 0 | 1 | 96.318 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);