introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
46 rows where transcript_id = 3555644
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17675488 | GT-AG | 0 | 3.66071654105206e-05 | 19847 | rna-XM_038313184.2 3555644 | 1 | 92652856 | 92672702 | Arvicola amphibius 1047088 | CAC|GTAAGTTGGC...CTGCTTTTAAAG/CTGCTTTTAAAG...TGTAG|GTA | 2 | 1 | 3.463 |
| 17675489 | GT-AG | 0 | 1.000000099473604e-05 | 934 | rna-XM_038313184.2 3555644 | 2 | 92672844 | 92673777 | Arvicola amphibius 1047088 | AAA|GTAAGTCTAC...ATTTTCTTTTTT/TTTACATTTATT...ATAAG|GAA | 2 | 1 | 5.481 |
| 17675490 | GT-AG | 0 | 2.147733735835508e-05 | 12587 | rna-XM_038313184.2 3555644 | 3 | 92673895 | 92686481 | Arvicola amphibius 1047088 | TAA|GTAAGTATTT...CTGTGTTTAAAT/ATGTAATTTATC...TCCAG|TTC | 2 | 1 | 7.155 |
| 17675491 | GT-AG | 0 | 0.0002815066516253 | 910 | rna-XM_038313184.2 3555644 | 4 | 92686527 | 92687436 | Arvicola amphibius 1047088 | ACC|GTAAGTAGCT...TTTACTTTAATT/TTTACTTTAATT...TAAAG|ATA | 2 | 1 | 7.799 |
| 17675492 | GT-AG | 0 | 1.1024232687481636e-05 | 644 | rna-XM_038313184.2 3555644 | 5 | 92687587 | 92688230 | Arvicola amphibius 1047088 | TAA|GTAAGTGTGC...CTATTCATAACA/TTGCTATTCATA...TTCAG|ACT | 2 | 1 | 9.946 |
| 17675493 | GT-AG | 0 | 2.125104818529953e-05 | 1062 | rna-XM_038313184.2 3555644 | 6 | 92688295 | 92689356 | Arvicola amphibius 1047088 | GGG|GTAAGTCTTC...ATCTACTTAACA/TTTTTGTTTACT...TTCAG|CAA | 0 | 1 | 10.861 |
| 17675494 | GT-AG | 0 | 1.000000099473604e-05 | 1176 | rna-XM_038313184.2 3555644 | 7 | 92689432 | 92690607 | Arvicola amphibius 1047088 | AAG|GTAATTAGAA...AACATTTTATTT/TTGATACTTATA...AAAAG|ATA | 0 | 1 | 11.935 |
| 17675495 | GT-AG | 0 | 1.000000099473604e-05 | 2002 | rna-XM_038313184.2 3555644 | 8 | 92690731 | 92692732 | Arvicola amphibius 1047088 | ATG|GTTTGTAATA...CTCTTCTGGTCT/CTCTCTCTCTCT...GACAG|GGT | 0 | 1 | 13.695 |
| 17675496 | GT-AG | 0 | 1.000000099473604e-05 | 759 | rna-XM_038313184.2 3555644 | 9 | 92692826 | 92693584 | Arvicola amphibius 1047088 | CTG|GTAAGTATGC...CCTGCCTAGAAC/GAAAGTTGCACC...CCCAG|AAC | 0 | 1 | 15.026 |
| 17675497 | GT-AG | 0 | 1.000000099473604e-05 | 1752 | rna-XM_038313184.2 3555644 | 10 | 92693851 | 92695602 | Arvicola amphibius 1047088 | AAG|GTAAGTTTAG...GTATATTTAATA/TTTCTTCTAATC...TGTAG|AAC | 2 | 1 | 18.832 |
| 17675498 | GT-AG | 0 | 1.000000099473604e-05 | 1014 | rna-XM_038313184.2 3555644 | 11 | 92695790 | 92696803 | Arvicola amphibius 1047088 | ATG|GTGAGCCTAT...AATTCTTTATCT/GAATTACTAATT...CATAG|ATC | 0 | 1 | 21.508 |
| 17675499 | GT-AG | 0 | 1.000000099473604e-05 | 1586 | rna-XM_038313184.2 3555644 | 12 | 92696894 | 92698479 | Arvicola amphibius 1047088 | GAA|GTAAGGATGC...TATTTTTTTTCC/TTTTTTTCCACG...TATAG|ATT | 0 | 1 | 22.796 |
| 17675500 | GT-AG | 0 | 0.0018301243652064 | 264 | rna-XM_038313184.2 3555644 | 13 | 92698545 | 92698808 | Arvicola amphibius 1047088 | CAG|GTATGTTGAT...TATGCTTTATTT/TTGTTTCTAAAA...TTTAG|ATG | 2 | 1 | 23.726 |
| 17675501 | GT-AG | 0 | 1.000000099473604e-05 | 2487 | rna-XM_038313184.2 3555644 | 14 | 92698960 | 92701446 | Arvicola amphibius 1047088 | ATG|GTAAGAGAGC...GTTTTCATATCT/GAATTACTGATT...AACAG|GTG | 0 | 1 | 25.887 |
| 17675502 | GT-AG | 0 | 6.855465288926821e-05 | 234 | rna-XM_038313184.2 3555644 | 15 | 92701546 | 92701779 | Arvicola amphibius 1047088 | AAG|GTAAGTTTCA...GAAACCTTATTT/TGAATATTTACC...TTCAG|TGC | 0 | 1 | 27.304 |
| 17675503 | GT-AG | 0 | 1.000000099473604e-05 | 736 | rna-XM_038313184.2 3555644 | 16 | 92701850 | 92702585 | Arvicola amphibius 1047088 | TGA|GTAAGTCAGA...GCTGTCTTGCCA/ATTCTTTTTATG...TTTAG|ATG | 1 | 1 | 28.306 |
| 17675504 | GT-AG | 0 | 6.61625029415016e-05 | 3360 | rna-XM_038313184.2 3555644 | 17 | 92702654 | 92706013 | Arvicola amphibius 1047088 | GAG|GTATTGTAAG...GCTGCCTTCGAG/CCACTACCCAGC...TTCAG|ATT | 0 | 1 | 29.279 |
| 17675505 | GC-AG | 0 | 1.000000099473604e-05 | 1006 | rna-XM_038313184.2 3555644 | 18 | 92706230 | 92707235 | Arvicola amphibius 1047088 | AAG|GCAAGCATTT...TACAACTTAGTG/TTACAACTTAGT...CCTAG|GAC | 0 | 1 | 32.37 |
| 17675506 | GT-AG | 0 | 1.000000099473604e-05 | 1539 | rna-XM_038313184.2 3555644 | 19 | 92707394 | 92708932 | Arvicola amphibius 1047088 | TAG|GTTGGTTTTC...ATCTTCTAAAAT/AATCTTCTAAAA...TATAG|AGA | 2 | 1 | 34.631 |
| 17675507 | GT-AG | 0 | 1.4798114225592e-05 | 7348 | rna-XM_038313184.2 3555644 | 20 | 92709029 | 92716376 | Arvicola amphibius 1047088 | CTT|GTAAGTAAAG...TTTGTTTTGATA/TTTGTTTTGATA...AACAG|GGT | 2 | 1 | 36.005 |
| 17675508 | GT-AG | 0 | 6.580496894466751e-05 | 1785 | rna-XM_038313184.2 3555644 | 21 | 92716433 | 92718217 | Arvicola amphibius 1047088 | ATG|GTATGTGAAT...TTATTATTATTT/TTATTATTTATT...AATAG|ATC | 1 | 1 | 36.806 |
| 17675509 | GT-AG | 0 | 3.224720123862894e-05 | 169 | rna-XM_038313184.2 3555644 | 22 | 92718275 | 92718443 | Arvicola amphibius 1047088 | TTA|GTAAGTATAT...TTTTTCTTCTCC/TGTGTAGTAATT...TTTAG|ACC | 1 | 1 | 37.622 |
| 17675510 | GT-AG | 0 | 1.4327413010383328e-05 | 100 | rna-XM_038313184.2 3555644 | 23 | 92718500 | 92718599 | Arvicola amphibius 1047088 | AAG|GTAGAATTCC...GATTTCTGAATA/TTGGGTTTTACT...CACAG|ATT | 0 | 1 | 38.423 |
| 17675511 | GT-AG | 0 | 1.000000099473604e-05 | 1540 | rna-XM_038313184.2 3555644 | 24 | 92718699 | 92720238 | Arvicola amphibius 1047088 | CGG|GTCAGTATCT...GCTGTTTTGTTT/AAGTTACTCAGT...TCCAG|CTC | 0 | 1 | 39.84 |
| 17675512 | GC-AG | 0 | 1.000000099473604e-05 | 112 | rna-XM_038313184.2 3555644 | 25 | 92720302 | 92720413 | Arvicola amphibius 1047088 | GAG|GCAAGTGTCC...AAGACTTTATTT/CTTTATTTTATA...TAAAG|TTG | 0 | 1 | 40.741 |
| 17675513 | GT-AG | 0 | 1.000000099473604e-05 | 1037 | rna-XM_038313184.2 3555644 | 26 | 92720510 | 92721546 | Arvicola amphibius 1047088 | CAG|GTGAGGCTCT...ATCTTTTTAAAT/TTATGTTTCATT...CCCAG|GTC | 0 | 1 | 42.115 |
| 17675514 | GT-AG | 0 | 1.000000099473604e-05 | 1796 | rna-XM_038313184.2 3555644 | 27 | 92721679 | 92723474 | Arvicola amphibius 1047088 | AAG|GTAGAGGTTC...TGCCTCTTAGTT/TGTAAATTTATT...TATAG|GGC | 0 | 1 | 44.004 |
| 17675515 | GT-AG | 0 | 0.0003717649970093 | 2201 | rna-XM_038313184.2 3555644 | 28 | 92723685 | 92725885 | Arvicola amphibius 1047088 | ACA|GTAAGCAAAC...TACTTTTTAAAT/TACTTTTTAAAT...ATAAG|ATT | 0 | 1 | 47.009 |
| 17675516 | GT-AG | 0 | 1.000000099473604e-05 | 1046 | rna-XM_038313184.2 3555644 | 29 | 92726023 | 92727068 | Arvicola amphibius 1047088 | AAG|GTAATTGAAC...TTTATCTTAGTA/CTAGTGTTTATC...TAAAG|GAA | 2 | 1 | 48.97 |
| 17675517 | GT-AG | 0 | 1.000000099473604e-05 | 415 | rna-XM_038313184.2 3555644 | 30 | 92727123 | 92727537 | Arvicola amphibius 1047088 | CCT|GTGAGTATAA...TTTTTCTCAACC/CTTTTTCTCAAC...TCCAG|GCC | 2 | 1 | 49.742 |
| 17675518 | GT-AG | 0 | 0.0005183180272328 | 1876 | rna-XM_038313184.2 3555644 | 31 | 92727668 | 92729543 | Arvicola amphibius 1047088 | AAG|GTACACACTT...CGAATTTTGATG/GATGTGTTAATC...TGTAG|ATG | 0 | 1 | 51.603 |
| 17675519 | GT-AG | 0 | 1.000000099473604e-05 | 1509 | rna-XM_038313184.2 3555644 | 32 | 92729626 | 92731134 | Arvicola amphibius 1047088 | TCA|GTAAGTAATA...TGAATCTGAATG/TTCAAATTGATT...TTTAG|GTG | 1 | 1 | 52.776 |
| 17675520 | GT-AG | 0 | 1.000000099473604e-05 | 433 | rna-XM_038313184.2 3555644 | 33 | 92731289 | 92731721 | Arvicola amphibius 1047088 | AAA|GTAAGTTACA...TAAACTTTTGCT/AGCTCTATCATT...TGTAG|GAA | 2 | 1 | 54.98 |
| 17675521 | GT-AG | 0 | 1.000000099473604e-05 | 1765 | rna-XM_038313184.2 3555644 | 34 | 92731792 | 92733556 | Arvicola amphibius 1047088 | GAG|GTAAGCGTTC...TTTTTTTCAACT/TTTTTTTTCAAC...TGTAG|GCA | 0 | 1 | 55.982 |
| 17675522 | GT-AG | 0 | 1.000000099473604e-05 | 358 | rna-XM_038313184.2 3555644 | 35 | 92733683 | 92734040 | Arvicola amphibius 1047088 | AAG|GTAATTGTGT...TTGTCTTTGTTT/ATTGAATTAACT...TGTAG|GGT | 0 | 1 | 57.785 |
| 17675523 | GT-AG | 0 | 1.000000099473604e-05 | 1229 | rna-XM_038313184.2 3555644 | 36 | 92734194 | 92735422 | Arvicola amphibius 1047088 | AAG|GTGATGGTAT...ATTTTCATAATT/CTATGTTTTACT...AGAAG|GTG | 0 | 1 | 59.974 |
| 17675524 | GT-AG | 0 | 0.0001141877955998 | 496 | rna-XM_038313184.2 3555644 | 37 | 92735528 | 92736023 | Arvicola amphibius 1047088 | TGT|GTAAGTACTG...TTTTGCTTAATT/TAATTATTAATT...CACAG|GAA | 0 | 1 | 61.477 |
| 17675525 | GT-AG | 0 | 1.000000099473604e-05 | 1320 | rna-XM_038313184.2 3555644 | 38 | 92736116 | 92737435 | Arvicola amphibius 1047088 | ATG|GTAAATAAAA...CTGTCCTTAAAT/CCTGTCCTTAAA...TATAG|CCG | 2 | 1 | 62.793 |
| 17675526 | GT-AG | 0 | 0.0005024789609565 | 368 | rna-XM_038313184.2 3555644 | 39 | 92737568 | 92737935 | Arvicola amphibius 1047088 | AAG|GTATGTGTGA...TCAGTTTTAATT/TTAATTCTCATG...AACAG|TGT | 2 | 1 | 64.682 |
| 17675527 | GT-AG | 0 | 1.000000099473604e-05 | 1271 | rna-XM_038313184.2 3555644 | 40 | 92738145 | 92739415 | Arvicola amphibius 1047088 | CCA|GTGAGTCTTA...TGGTCCCTAATT/GCTTGACTGATT...AATAG|TAT | 1 | 1 | 67.673 |
| 17675528 | GT-AG | 0 | 1.000000099473604e-05 | 7566 | rna-XM_038313184.2 3555644 | 41 | 92739502 | 92747067 | Arvicola amphibius 1047088 | AAG|GTAAGACTAT...TTTGTTTTGTTC/CCTGTACTCACT...TCGAG|ATC | 0 | 1 | 68.904 |
| 17675529 | GT-AG | 0 | 1.000000099473604e-05 | 9630 | rna-XM_038313184.2 3555644 | 42 | 92747197 | 92756826 | Arvicola amphibius 1047088 | CAA|GTAAGTATAT...AATTCTGTATTA/TCATTTCTCATT...CACAG|ACA | 0 | 1 | 70.75 |
| 17675530 | GT-AG | 0 | 0.0003190469014353 | 8675 | rna-XM_038313184.2 3555644 | 43 | 92756983 | 92765657 | Arvicola amphibius 1047088 | GAG|GTAAACTTTA...AAAACCTCAGTT/TCAGTTCTAATA...TCCAG|GTT | 0 | 1 | 72.982 |
| 17675531 | GT-AG | 0 | 1.000000099473604e-05 | 2598 | rna-XM_038313184.2 3555644 | 44 | 92765821 | 92768418 | Arvicola amphibius 1047088 | CAG|GTAATATCAC...ATGTTTTTCTCA/TGAAAATTGATT...TGTAG|AGT | 1 | 1 | 75.315 |
| 17675532 | GT-AG | 0 | 0.0018832066516965 | 1393 | rna-XM_038313184.2 3555644 | 45 | 92768553 | 92769945 | Arvicola amphibius 1047088 | GAG|GTATGTTCTG...CGGTATTTGACT/CGGTATTTGACT...TACAG|ATG | 0 | 1 | 77.232 |
| 17692005 | GT-AG | 0 | 1.000000099473604e-05 | 1137 | rna-XM_038313184.2 3555644 | 46 | 92770085 | 92771221 | Arvicola amphibius 1047088 | GAG|GTGAGTTACA...AGTGCTTTATTA/TAGTGCTTTATT...TGCAG|ATG | 0 | 79.222 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);