introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
45 rows where transcript_id = 3555620
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17674724 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_038349363.1 3555620 | 1 | 100017222 | 100017306 | Arvicola amphibius 1047088 | CCG|GTAAGGCCAA...CTGTTCATCACG/TGTCTGTTCATC...CCCAG|ATC | 0 | 1 | 1.005 |
| 17674725 | GT-AG | 0 | 1.000000099473604e-05 | 297 | rna-XM_038349363.1 3555620 | 2 | 100016831 | 100017127 | Arvicola amphibius 1047088 | AAT|GTAAGACCCA...GGTGCCTCTTCC/GGGAGGGTGACT...CCCAG|GCC | 1 | 1 | 2.438 |
| 17674726 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_038349363.1 3555620 | 3 | 100016605 | 100016688 | Arvicola amphibius 1047088 | CCT|GTGGGTAAAG...GGGACTTGAGCC/TGGGACTTGAGC...CCCAG|GAT | 2 | 1 | 4.601 |
| 17674727 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_038349363.1 3555620 | 4 | 100016383 | 100016491 | Arvicola amphibius 1047088 | GAG|GTAGAGGATG...TGAGCTTTGCCT/CTTTGCCTAACA...CTCAG|CCT | 1 | 1 | 6.322 |
| 17674728 | GT-AG | 0 | 1.000000099473604e-05 | 273 | rna-XM_038349363.1 3555620 | 5 | 100016036 | 100016308 | Arvicola amphibius 1047088 | GGG|GTGAGGAGGG...GCCACTTTGTTC/GAGCAGTCCACC...TGCAG|GCA | 0 | 1 | 7.45 |
| 17674729 | GT-AG | 0 | 1.000000099473604e-05 | 385 | rna-XM_038349363.1 3555620 | 6 | 100015570 | 100015954 | Arvicola amphibius 1047088 | GAG|GTGAGAGGCC...CACCTGGTGATC/CACCTGGTGATC...TCTAG|CTC | 0 | 1 | 8.684 |
| 17674730 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_038349363.1 3555620 | 7 | 100015269 | 100015358 | Arvicola amphibius 1047088 | TCA|GTGAGTCACT...TTCATGTTACCC/TGGGGCTTCATG...ACCAG|GTC | 1 | 1 | 11.898 |
| 17674731 | GT-AG | 0 | 1.000000099473604e-05 | 623 | rna-XM_038349363.1 3555620 | 8 | 100014506 | 100015128 | Arvicola amphibius 1047088 | CAG|GTGAGACACA...CTGATCTCAGAT/CTCGAGCTGATC...TGCAG|GTG | 0 | 1 | 14.031 |
| 17674732 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_038349363.1 3555620 | 9 | 100014290 | 100014388 | Arvicola amphibius 1047088 | CAG|GTGGGCTGGG...CCCTCCTCACCT/CCCCTCCTCACC...CCCAG|AGG | 0 | 1 | 15.814 |
| 17674733 | GT-AG | 0 | 1.000000099473604e-05 | 446 | rna-XM_038349363.1 3555620 | 10 | 100013670 | 100014115 | Arvicola amphibius 1047088 | GAG|GTGAGGGGTT...AGAGTCTGGAAT/CTGGAATTCATG...CCCAG|TGT | 0 | 1 | 18.464 |
| 17674734 | GT-AG | 0 | 1.000000099473604e-05 | 323 | rna-XM_038349363.1 3555620 | 11 | 100013117 | 100013439 | Arvicola amphibius 1047088 | CAA|GTGAGGGGGA...ATTTCTTTACTT/CATTTCTTTACT...CCTAG|GTT | 2 | 1 | 21.968 |
| 17674735 | GT-AG | 0 | 1.000000099473604e-05 | 269 | rna-XM_038349363.1 3555620 | 12 | 100012671 | 100012939 | Arvicola amphibius 1047088 | CAT|GTAAGAGCTC...CACTCTGTGCTG/CACTATCCCACC...TATAG|GTT | 2 | 1 | 24.665 |
| 17674736 | GT-AG | 0 | 1.000000099473604e-05 | 123 | rna-XM_038349363.1 3555620 | 13 | 100012325 | 100012447 | Arvicola amphibius 1047088 | AAG|GTGAGGGATC...GAATCCTGAAAG/GAAAGTCTTATG...CCCAG|CTA | 0 | 1 | 28.062 |
| 17674737 | GT-AG | 0 | 1.000000099473604e-05 | 80 | rna-XM_038349363.1 3555620 | 14 | 100012023 | 100012102 | Arvicola amphibius 1047088 | GCG|GTGAGGGATG...AGCTCTCTGACC/AGCTCTCTGACC...CCCAG|AGC | 0 | 1 | 31.444 |
| 17674738 | GT-AG | 0 | 1.000000099473604e-05 | 372 | rna-XM_038349363.1 3555620 | 15 | 100011449 | 100011820 | Arvicola amphibius 1047088 | CAG|GTGGGGACCG...AAGTTCTCACAT/TCACATCTCATG...ACCAG|GCC | 1 | 1 | 34.522 |
| 17674739 | GT-AG | 0 | 1.000000099473604e-05 | 496 | rna-XM_038349363.1 3555620 | 16 | 100010842 | 100011337 | Arvicola amphibius 1047088 | AGG|GTGAGGCCCT...CTGACCGTGTCC/TCAGAGCTGACC...CGCAG|TTC | 1 | 1 | 36.213 |
| 17674740 | GT-AG | 0 | 1.000000099473604e-05 | 280 | rna-XM_038349363.1 3555620 | 17 | 100010390 | 100010669 | Arvicola amphibius 1047088 | ACT|GTGAGCTGCC...CCGTCCGTGAAG/ACTGCTCTGAGA...CACAG|GTC | 2 | 1 | 38.833 |
| 17674741 | GT-AG | 0 | 1.000000099473604e-05 | 234 | rna-XM_038349363.1 3555620 | 18 | 100010024 | 100010257 | Arvicola amphibius 1047088 | CAT|GTGAGTCCTG...GTATCCGTCACC/GTATCCGTCACC...CACAG|GCT | 2 | 1 | 40.844 |
| 17674742 | GT-AG | 0 | 1.000000099473604e-05 | 124 | rna-XM_038349363.1 3555620 | 19 | 100009760 | 100009883 | Arvicola amphibius 1047088 | CCG|GTGAGCCAGA...GTCACATTAGGA/AGGACTGTCACT...CCCAG|GTG | 1 | 1 | 42.977 |
| 17674743 | GT-AG | 0 | 1.000000099473604e-05 | 188 | rna-XM_038349363.1 3555620 | 20 | 100009434 | 100009621 | Arvicola amphibius 1047088 | AAG|GTAAGAGCTG...ATGCACTGGACC/CAGAGGTTTACA...TCCAG|ATC | 1 | 1 | 45.079 |
| 17674744 | GT-AG | 0 | 1.000000099473604e-05 | 78 | rna-XM_038349363.1 3555620 | 21 | 100009168 | 100009245 | Arvicola amphibius 1047088 | GAG|GTGAGGGCTA...TTCTCTTTCTCT/CCTGTGCCCAGC...TGAAG|GGA | 0 | 1 | 47.943 |
| 17674745 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_038349363.1 3555620 | 22 | 100009005 | 100009109 | Arvicola amphibius 1047088 | CGG|GTGAGTGGCT...TAACCCATAGCC/ACAGAACTCACG...TCCAG|TGT | 1 | 1 | 48.827 |
| 17674746 | GT-AG | 0 | 1.000000099473604e-05 | 174 | rna-XM_038349363.1 3555620 | 23 | 100008544 | 100008717 | Arvicola amphibius 1047088 | GAG|GTATGGGACC...AGTCTCTTGCTC/ATGGGGATCACT...CTCAG|ATC | 0 | 1 | 53.199 |
| 17674747 | GT-AG | 0 | 0.0003124701220265 | 205 | rna-XM_038349363.1 3555620 | 24 | 100008302 | 100008506 | Arvicola amphibius 1047088 | AAG|GTACGCCATA...CTCCCCTTGAAC/TGAACTCTGACT...CCCAG|GAG | 1 | 1 | 53.763 |
| 17674748 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_038349363.1 3555620 | 25 | 100008079 | 100008163 | Arvicola amphibius 1047088 | TGG|GTGAGTCTCT...AGGAGTTTAACC/TAACCTCTCATC...CACAG|CCA | 1 | 1 | 55.865 |
| 17674749 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_038349363.1 3555620 | 26 | 100007808 | 100007926 | Arvicola amphibius 1047088 | CAG|GTGACAAAGC...ACAGCCGTGTCC/TGGGTGCTGACA...CCCAG|ATT | 0 | 1 | 58.181 |
| 17674750 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_038349363.1 3555620 | 27 | 100007602 | 100007682 | Arvicola amphibius 1047088 | CAG|GTGGGTGCTG...TGGCCTGTCACC/TGGCCTGTCACC...CACAG|TGA | 2 | 1 | 60.085 |
| 17674751 | GT-AG | 0 | 1.000000099473604e-05 | 160 | rna-XM_038349363.1 3555620 | 28 | 100007343 | 100007502 | Arvicola amphibius 1047088 | CAG|GTGAGGCTAA...CATCCCTTCCCG/CCCTTCCCGACT...CACAG|GGG | 2 | 1 | 61.594 |
| 17674752 | GT-AG | 0 | 1.087934277787228e-05 | 107 | rna-XM_038349363.1 3555620 | 29 | 100006948 | 100007054 | Arvicola amphibius 1047088 | GGG|GTAAGCACCC...TCCCATTTGACT/GACTGTTTCACT...CACAG|CCT | 2 | 1 | 65.981 |
| 17674753 | GT-AG | 0 | 1.000000099473604e-05 | 97 | rna-XM_038349363.1 3555620 | 30 | 100006818 | 100006914 | Arvicola amphibius 1047088 | CAG|GTGAGGTGGG...TCCACCTCAAAC/CCGCTCCTCACT...CCCAG|ATA | 2 | 1 | 66.484 |
| 17674754 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_038349363.1 3555620 | 31 | 100006554 | 100006639 | Arvicola amphibius 1047088 | AAG|GTGGGAGCTG...CTTCTCTGGACC/CTGGACCTCATG...CTCAG|ATC | 0 | 1 | 69.196 |
| 17674755 | GT-AG | 0 | 1.000000099473604e-05 | 862 | rna-XM_038349363.1 3555620 | 32 | 100005522 | 100006383 | Arvicola amphibius 1047088 | GCT|GTGAGTCCAT...CTTCTCTGAACC/TCTTCTCTGAAC...CCCAG|GAT | 2 | 1 | 71.785 |
| 17674756 | GT-AG | 0 | 0.0003918808670106 | 93 | rna-XM_038349363.1 3555620 | 33 | 100005251 | 100005343 | Arvicola amphibius 1047088 | ATG|GTATGTGTGG...GTCCTCTGAGCT/ACTGAGGTAATC...TACAG|TGT | 0 | 1 | 74.497 |
| 17674757 | GT-AG | 0 | 1.000000099473604e-05 | 606 | rna-XM_038349363.1 3555620 | 34 | 100004529 | 100005134 | Arvicola amphibius 1047088 | TGG|GTGAGAGCCC...CAGTGCTTAACG/CTTAACGTCACC...TTCAG|CTG | 2 | 1 | 76.264 |
| 17674758 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_038349363.1 3555620 | 35 | 100004288 | 100004383 | Arvicola amphibius 1047088 | CAG|GTGGGATCTT...TTGGGCTTAACC/TTGGGCTTAACC...AACAG|AAC | 0 | 1 | 78.473 |
| 17674759 | GT-AG | 0 | 1.000000099473604e-05 | 304 | rna-XM_038349363.1 3555620 | 36 | 100003860 | 100004163 | Arvicola amphibius 1047088 | TGG|GTGAGAATGT...ATAGTCTGGACA/ATAGTCTGGACA...CACAG|GAG | 1 | 1 | 80.363 |
| 17674760 | GT-AG | 0 | 1.000000099473604e-05 | 71 | rna-XM_038349363.1 3555620 | 37 | 100003659 | 100003729 | Arvicola amphibius 1047088 | ACA|GTCAGTGGGG...GCGTCCTTCCCT/GGGTCATTCACT...CTCAG|ACC | 2 | 1 | 82.343 |
| 17674761 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_038349363.1 3555620 | 38 | 100003457 | 100003537 | Arvicola amphibius 1047088 | AAG|GTAGGGCCAG...CTGTCCTTTCTC/TCCTTTCTCAAG...TGCAG|GTG | 0 | 1 | 84.186 |
| 17674762 | GT-AG | 0 | 1.000000099473604e-05 | 437 | rna-XM_038349363.1 3555620 | 39 | 100002957 | 100003393 | Arvicola amphibius 1047088 | GAG|GTGAGTGCTG...CCACCCTGGATA/AGGGTTGTGATT...TTCAG|TGT | 0 | 1 | 85.146 |
| 17674763 | GT-AG | 0 | 1.000000099473604e-05 | 210 | rna-XM_038349363.1 3555620 | 40 | 100002640 | 100002849 | Arvicola amphibius 1047088 | CAA|GTAAGGGGCT...CAGCTCTTGAAC/CAGCTCTTGAAC...CCCAG|CGT | 2 | 1 | 86.776 |
| 17674764 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_038349363.1 3555620 | 41 | 100002394 | 100002497 | Arvicola amphibius 1047088 | CAG|GTGAACAAGC...CCATCCCCACCC/CCCACGCTCACC...TGCAG|ACA | 0 | 1 | 88.94 |
| 17674765 | GT-AG | 0 | 1.000000099473604e-05 | 273 | rna-XM_038349363.1 3555620 | 42 | 100001986 | 100002258 | Arvicola amphibius 1047088 | CTG|GTGCGTGGAG...GGGGCTGTCACT/GGGGCTGTCACT...TGCAG|GAT | 0 | 1 | 90.996 |
| 17674766 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_038349363.1 3555620 | 43 | 100001782 | 100001881 | Arvicola amphibius 1047088 | CAG|GTGAGTTCCG...GGCCCCTGAACA/GCTGTGCTCACA...CACAG|TAT | 2 | 1 | 92.581 |
| 17674767 | GT-AG | 0 | 1.000000099473604e-05 | 481 | rna-XM_038349363.1 3555620 | 44 | 100001208 | 100001688 | Arvicola amphibius 1047088 | CAG|GTGAGCAGGG...GCATCCCTGCTC/GAGGCCTGCATC...TCTAG|GTT | 2 | 1 | 93.998 |
| 17674768 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-XM_038349363.1 3555620 | 45 | 100000855 | 100000966 | Arvicola amphibius 1047088 | GAG|GTTACATAGG...CCCTCCTTGTCA/CTCCTTGTCACC...GTCAG|GTG | 0 | 1 | 97.669 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);