introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in coding sequence (0, 1, or 2; null for introns outside of coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
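The rows listed on this page suggest that `start` and `end` are both inclusive, so that `length = end - start + 1`. The following is a minimal sketch checking that relationship against three rows copied from this page; the helper name `inclusive_length` is hypothetical, and the check says nothing about rows not shown here.

```python
# (ordinal_index, start, end, length) copied from the first three rows
# listed for transcript_id = 22607844 on this page.
rows = [
    (1, 18915930, 18916403, 474),
    (2, 18916465, 18916652, 188),
    (3, 18916714, 18920262, 3549),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end coordinates are both inclusive."""
    return end - start + 1


# Collect any row whose stored length disagrees with the inclusive formula.
mismatches = [
    ordinal
    for ordinal, start, end, length in rows
    if inclusive_length(start, end) != length
]
print("rows that disagree with end - start + 1:", mismatches)
```

An empty list means every sampled row matches the inclusive-coordinate assumption.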
55 rows where transcript_id = 22607844
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122606014 | GT-AG | 0 | 1.000000099473604e-05 | 474 | rna-XM_021205247.1 22607844 | 1 | 18915930 | 18916403 | Mus pahari 10093 | AGG\|GTGAGAAGGG...GAGCCCCCAGCC/GTGGAAATTATG...CCCAG\|CCT | 1 | 1 | 1.042 |
| 122606015 | GT-AG | 0 | 1.000000099473604e-05 | 188 | rna-XM_021205247.1 22607844 | 2 | 18916465 | 18916652 | Mus pahari 10093 | CAG\|GTAAGTCACT...CTGCTCCTAACT/CTGCTCCTAACT...TCTAG\|CCA | 2 | 1 | 1.741 |
| 122606016 | GT-AG | 0 | 1.000000099473604e-05 | 3549 | rna-XM_021205247.1 22607844 | 3 | 18916714 | 18920262 | Mus pahari 10093 | CAG\|GTAAGAGATG...TTGTTCTCTGTT/ACAGAACTAATT...AAAAG\|GAG | 0 | 1 | 2.439 |
| 122606017 | GT-AG | 0 | 3.524184765880476e-05 | 1188 | rna-XM_021205247.1 22607844 | 4 | 18920336 | 18921523 | Mus pahari 10093 | CGT\|GTAAGTTGCG...CTTGCCTCTGCT/GCTGGTGCCATG...TGCAG\|ACT | 1 | 1 | 3.275 |
| 122606018 | GT-AG | 0 | 1.000000099473604e-05 | 149 | rna-XM_021205247.1 22607844 | 5 | 18921617 | 18921765 | Mus pahari 10093 | TGG\|GTAGGTCTGG...CTCATTTTGCCC/TCTAGGCTCATT...CTCAG\|TGT | 1 | 1 | 4.34 |
| 122606019 | GT-AG | 0 | 1.000000099473604e-05 | 2225 | rna-XM_021205247.1 22607844 | 6 | 18921921 | 18924145 | Mus pahari 10093 | CAG\|GTGAGGCCTC...TCCCTCTGAACA/TTCCCTCTGAAC...ATCAG\|GTG | 0 | 1 | 6.115 |
| 122606020 | GT-AG | 0 | 0.0001007413986475 | 772 | rna-XM_021205247.1 22607844 | 7 | 18924265 | 18925036 | Mus pahari 10093 | AAG\|GTAACTATGT...CAGCCCTGGGCT/TGTGTGGGCACT...CTCAG\|GGT | 2 | 1 | 7.477 |
| 122606021 | GT-AG | 0 | 1.000000099473604e-05 | 823 | rna-XM_021205247.1 22607844 | 8 | 18925243 | 18926065 | Mus pahari 10093 | ACG\|GTGAGTCTGA...GAGCCCTCAGCC/ACCCACCTCATC...TGCAG\|GGA | 1 | 1 | 9.836 |
| 122606022 | GT-AG | 0 | 1.000000099473604e-05 | 192 | rna-XM_021205247.1 22607844 | 9 | 18926197 | 18926388 | Mus pahari 10093 | CAG\|GTCTGTGTGG...GTCCCCCCAGCT/CCCCAGCTCAGC...CGCAG\|GGT | 0 | 1 | 11.336 |
| 122606023 | GT-AG | 0 | 1.000000099473604e-05 | 604 | rna-XM_021205247.1 22607844 | 10 | 18926496 | 18927099 | Mus pahari 10093 | CCA\|GTAAGTGGGC...ACTTTCTCACAT/GACTTTCTCACA...CACAG\|ATC | 2 | 1 | 12.562 |
| 122606024 | GT-AG | 0 | 0.0004328474037938 | 408 | rna-XM_021205247.1 22607844 | 11 | 18927210 | 18927617 | Mus pahari 10093 | GCA\|GTAGGTTCAA...CCTCCCTTCCCT/CCATAACCTATC...CCAAG\|CTG | 1 | 1 | 13.821 |
| 122606025 | GT-AG | 0 | 1.000000099473604e-05 | 1425 | rna-XM_021205247.1 22607844 | 12 | 18927747 | 18929171 | Mus pahari 10093 | ATG\|GTTGGTTGTG...TGGGTCTTTGCT/TGGCTTTTGAGC...CCTAG\|GGC | 1 | 1 | 15.298 |
| 122606026 | GT-AG | 0 | 1.4871868250379768e-05 | 243 | rna-XM_021205247.1 22607844 | 13 | 18929281 | 18929523 | Mus pahari 10093 | CTG\|GTCTGTGTCC...ACCTTCATAATC/ACCTTCATAATC...TTCAG\|TAC | 2 | 1 | 16.546 |
| 122606027 | GT-AG | 0 | 6.625054767412595e-05 | 434 | rna-XM_021205247.1 22607844 | 14 | 18929571 | 18930004 | Mus pahari 10093 | CAG\|GTACATCTGA...GCTGCCTTCTTT/AGCTTGCTCAGG...CTTAG\|CCG | 1 | 1 | 17.085 |
| 122606028 | GT-AG | 0 | 1.000000099473604e-05 | 4650 | rna-XM_021205247.1 22607844 | 15 | 18930151 | 18934800 | Mus pahari 10093 | CTG\|GTAAGGCTGG...TGGCCCTTCGTT/CACGGACTAATG...TCTAG\|AAC | 0 | 1 | 18.756 |
| 122606029 | GT-AG | 0 | 4.375024174106688e-05 | 554 | rna-XM_021205247.1 22607844 | 16 | 18934934 | 18935487 | Mus pahari 10093 | ACG\|GTATGGCTCA...TGGGCTGTACTA/GCTGTACTAACA...GCCAG\|ACG | 1 | 1 | 20.279 |
| 122606030 | GT-AG | 0 | 4.252427187911884e-05 | 1886 | rna-XM_021205247.1 22607844 | 17 | 18935666 | 18937551 | Mus pahari 10093 | CCT\|GTAAGTGTTC...GTCGCTTTGCCA/CATGTCTGCATG...CACAG\|GTC | 2 | 1 | 22.318 |
| 122606031 | GT-AG | 0 | 1.000000099473604e-05 | 801 | rna-XM_021205247.1 22607844 | 18 | 18937677 | 18938477 | Mus pahari 10093 | CTG\|GTGAGGACTG...CATATCTTCACA/CATATCTTCACA...TCCAG\|CCT | 1 | 1 | 23.749 |
| 122606032 | GT-AG | 0 | 1.000000099473604e-05 | 1184 | rna-XM_021205247.1 22607844 | 19 | 18938691 | 18939874 | Mus pahari 10093 | GTG\|GTAAGTGCCC...GTCTCCTTTCTC/GAGAGACCAAGT...TACAG\|CTC | 1 | 1 | 26.188 |
| 122606033 | GT-AG | 0 | 1.000000099473604e-05 | 1449 | rna-XM_021205247.1 22607844 | 20 | 18940068 | 18941516 | Mus pahari 10093 | CAG\|GTGAGTGGGC...GGTGTCTTCATT/GGTGTCTTCATT...CATAG\|GAA | 2 | 1 | 28.398 |
| 122606034 | GT-AG | 0 | 1.000000099473604e-05 | 193 | rna-XM_021205247.1 22607844 | 21 | 18941592 | 18941784 | Mus pahari 10093 | CTG\|GTGAGCTTCC...GGGTCCCTAATG/GCTGGGCTGATA...TGCAG\|CCA | 2 | 1 | 29.257 |
| 122606035 | GT-AG | 0 | 1.000000099473604e-05 | 1242 | rna-XM_021205247.1 22607844 | 22 | 18941829 | 18943070 | Mus pahari 10093 | CAG\|GTAAGCAGGG...AACCCCCTTTCT/CGCCATGTAACC...TCCAG\|CGG | 1 | 1 | 29.761 |
| 122606036 | GT-AG | 0 | 0.0087411292735741 | 1862 | rna-XM_021205247.1 22607844 | 23 | 18943225 | 18945086 | Mus pahari 10093 | GGG\|GTATGTATCT...GCTCTTTTGACC/GCTCTTTTGACC...TCCAG\|CCT | 2 | 1 | 31.524 |
| 122606037 | GT-AG | 0 | 1.000000099473604e-05 | 1807 | rna-XM_021205247.1 22607844 | 24 | 18945195 | 18947001 | Mus pahari 10093 | CTG\|GTAAGTCCCT...TGTTCTGTGAGC/TGTTCTGTGAGC...TCCAG\|TGT | 2 | 1 | 32.761 |
| 122606038 | GT-AG | 0 | 1.000000099473604e-05 | 2277 | rna-XM_021205247.1 22607844 | 25 | 18947141 | 18949417 | Mus pahari 10093 | AAG\|GTGAGTTACA...GGCTTCTTCTCT/CCTGGCTGTACT...CTCAG\|AGT | 0 | 1 | 34.352 |
| 122606039 | GT-AG | 0 | 1.000000099473604e-05 | 258 | rna-XM_021205247.1 22607844 | 26 | 18949553 | 18949810 | Mus pahari 10093 | CCC\|GTAAGGGCCT...CTAATTCTATCT/TCGGGACTGACT...TCTAG\|AGT | 0 | 1 | 35.898 |
| 122606040 | GT-AG | 0 | 1.800440037031552e-05 | 275 | rna-XM_021205247.1 22607844 | 27 | 18949958 | 18950232 | Mus pahari 10093 | CAG\|GTAGTCATGA...CTCTACTAAACT/TATCAGCTAACA...CTCAG\|GGT | 0 | 1 | 37.582 |
| 122606041 | GT-AG | 0 | 1.000000099473604e-05 | 1988 | rna-XM_021205247.1 22607844 | 28 | 18950353 | 18952340 | Mus pahari 10093 | GAG\|GTAAGGCCTT...CTTCCTTTATCC/TCTGTGCTCAGT...CTCAG\|TGC | 0 | 1 | 38.956 |
| 122606042 | GT-AG | 0 | 1.000000099473604e-05 | 612 | rna-XM_021205247.1 22607844 | 29 | 18952458 | 18953069 | Mus pahari 10093 | GTG\|GTGAGTGCCT...ATCCTCTGAACT/TTGACATTCAGA...CCTAG\|GTT | 0 | 1 | 40.295 |
| 122606043 | GT-AG | 0 | 1.000000099473604e-05 | 3669 | rna-XM_021205247.1 22607844 | 30 | 18953227 | 18956895 | Mus pahari 10093 | GCC\|GTGAGTATTC...TCTCTCTTTCTT/GCCCTACTCATG...TTTAG\|CAT | 1 | 1 | 42.093 |
| 122606044 | GT-AG | 0 | 1.000000099473604e-05 | 2615 | rna-XM_021205247.1 22607844 | 31 | 18956923 | 18959537 | Mus pahari 10093 | AAG\|GTAAGCCAGT...ACTTTTTTTTCT/ACACTTGTGACA...TCCAG\|CGC | 1 | 1 | 42.402 |
| 122606045 | GT-AG | 0 | 1.000000099473604e-05 | 985 | rna-XM_021205247.1 22607844 | 32 | 18959706 | 18960690 | Mus pahari 10093 | ATG\|GTAAGGCCCA...CTGACCTAACTA/TGCATACTGACC...TGCAG\|ACC | 1 | 1 | 44.326 |
| 122606046 | GT-AG | 0 | 1.000000099473604e-05 | 569 | rna-XM_021205247.1 22607844 | 33 | 18960970 | 18961538 | Mus pahari 10093 | TGG\|GTAAGGGTCT...CCTCTCTCAGTC/CTGGTGTTCAGC...TCTAG\|ATG | 1 | 1 | 47.521 |
| 122606047 | GT-AG | 0 | 1.000000099473604e-05 | 875 | rna-XM_021205247.1 22607844 | 34 | 18961657 | 18962531 | Mus pahari 10093 | CAG\|GTGAGACCGG...TGTGCTCTATCC/GTTCTGGTCACT...CTTAG\|GGT | 2 | 1 | 48.872 |
| 122606048 | GT-AG | 0 | 7.337181326877443e-05 | 374 | rna-XM_021205247.1 22607844 | 35 | 18962612 | 18962985 | Mus pahari 10093 | ACT\|GTAAGCGGCC...TTTTTCTGAGCC/TCTTCTCTCATC...GCTAG\|GTG | 1 | 1 | 49.788 |
| 122606049 | GT-AG | 0 | 1.000000099473604e-05 | 674 | rna-XM_021205247.1 22607844 | 36 | 18964752 | 18965425 | Mus pahari 10093 | GTG\|GTGAGTGACT...CTCATGTTGACG/CTCATGTTGACG...CACAG\|CCC | 0 | 1 | 70.01 |
| 122606050 | GT-AG | 0 | 1.000000099473604e-05 | 252 | rna-XM_021205247.1 22607844 | 37 | 18965595 | 18965846 | Mus pahari 10093 | CCT\|GTGAGTTATG...TTGTTCTTTGCC/CTTGGAACCACT...TCCAG\|GCC | 1 | 1 | 71.945 |
| 122606051 | GT-AG | 0 | 1.000000099473604e-05 | 558 | rna-XM_021205247.1 22607844 | 38 | 18965993 | 18966550 | Mus pahari 10093 | CTG\|GTACGGTAAT...ACTCACCTAGCT/TCTGGGCTGATG...TGCAG\|GGA | 0 | 1 | 73.617 |
| 122606052 | GT-AG | 0 | 1.000000099473604e-05 | 5029 | rna-XM_021205247.1 22607844 | 39 | 18966641 | 18971669 | Mus pahari 10093 | AAG\|GTGAGCACAC...AATTTCTTACCC/CAATTTCTTACC...TTAAG\|GTG | 0 | 1 | 74.648 |
| 122606053 | GT-AG | 0 | 1.000000099473604e-05 | 3444 | rna-XM_021205247.1 22607844 | 40 | 18971854 | 18975297 | Mus pahari 10093 | GTG\|GTAAGGAGGG...ATCACTTTGTCC/TAGTGAGTAATA...TTCAG\|GTA | 1 | 1 | 76.755 |
| 122606054 | GT-AG | 0 | 1.000000099473604e-05 | 136 | rna-XM_021205247.1 22607844 | 41 | 18975519 | 18975654 | Mus pahari 10093 | TTT\|GTAGGTGCCC...TTAGGCTGAGCT/GCAGCACTGACT...CCCAG\|GTC | 0 | 1 | 79.285 |
| 122606055 | GT-AG | 0 | 1.000000099473604e-05 | 1550 | rna-XM_021205247.1 22607844 | 42 | 18975794 | 18977343 | Mus pahari 10093 | GCC\|GTGAGTAGGG...TGCCCATTATCC/CTAGGCCTGATG...TACAG\|CCT | 1 | 1 | 80.877 |
| 122606056 | GT-AG | 0 | 1.000000099473604e-05 | 150 | rna-XM_021205247.1 22607844 | 43 | 18977539 | 18977688 | Mus pahari 10093 | GTG\|GTAGGTCCCA...TGGTCCTGAGTC/CTGGTCCTGAGT...TGCAG\|CCT | 1 | 1 | 83.11 |
| 122606057 | GT-AG | 0 | 1.000000099473604e-05 | 510 | rna-XM_021205247.1 22607844 | 44 | 18977902 | 18978411 | Mus pahari 10093 | GCG\|GTGAGTCCCA...TGTGTCCTGATC/TGTGTCCTGATC...TGTAG\|TGT | 1 | 1 | 85.549 |
| 122606058 | GT-AG | 0 | 1.000000099473604e-05 | 155 | rna-XM_021205247.1 22607844 | 45 | 18978517 | 18978671 | Mus pahari 10093 | GTG\|GTAAGAGGTT...GTTCTGTTTTTT/TCAGAGCTTACG...CTCAG\|AGT | 1 | 1 | 86.751 |
| 122606059 | GT-AG | 0 | 1.000000099473604e-05 | 1176 | rna-XM_021205247.1 22607844 | 46 | 18978780 | 18979955 | Mus pahari 10093 | GTG\|GTAGGTGGCT...GGAATTTTAATG/ATTTTAATGACA...CACAG\|GCT | 1 | 1 | 87.988 |
| 122606060 | GT-AG | 0 | 1.000000099473604e-05 | 202 | rna-XM_021205247.1 22607844 | 47 | 18980058 | 18980259 | Mus pahari 10093 | GTG\|GTGAGTCCCC...GTCTCCCTACTC/CTGAGACTAATT...CCCAG\|TGT | 1 | 1 | 89.156 |
| 122606061 | GT-AG | 0 | 1.000000099473604e-05 | 922 | rna-XM_021205247.1 22607844 | 48 | 18980359 | 18981280 | Mus pahari 10093 | GTG\|GTAAGTCCCA...AGTGTCTGAACA/AAGTGTCTGAAC...TTCAG\|AGT | 1 | 1 | 90.29 |
| 122606062 | GT-AG | 0 | 1.000000099473604e-05 | 1004 | rna-XM_021205247.1 22607844 | 49 | 18981322 | 18982325 | Mus pahari 10093 | CTG\|GTATGGAGAC...GGACACTTGGCT/CCCTCTGTCACC...TGCAG\|TGG | 0 | 1 | 90.759 |
| 122606063 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_021205247.1 22607844 | 50 | 18982402 | 18982487 | Mus pahari 10093 | GTG\|GTGAGTGGGC...CTCCCCAGAACA/CCAGAACAGATC...TCCAG\|TGA | 1 | 1 | 91.629 |
| 122606064 | GT-AG | 0 | 1.000000099473604e-05 | 244 | rna-XM_021205247.1 22607844 | 51 | 18982667 | 18982910 | Mus pahari 10093 | GCG\|GTAGGGACAG...ATTACTTTCTCT/ATAAAGTTGATA...TTCAG\|AAC | 0 | 1 | 93.679 |
| 122606065 | GT-AG | 0 | 1.0107114278977374e-05 | 199 | rna-XM_021205247.1 22607844 | 52 | 18983016 | 18983214 | Mus pahari 10093 | TTG\|GTAAATATTC...CCCCACTTAGTC/GAAGCCCTCACT...TCTAG\|CCT | 0 | 1 | 94.881 |
| 122606066 | GT-AG | 0 | 1.000000099473604e-05 | 856 | rna-XM_021205247.1 22607844 | 53 | 18983335 | 18984190 | Mus pahari 10093 | AAG\|GTCAGTGTCT...GGCTCCTGAACT/TGGCTCCTGAAC...TCTAG\|GTC | 0 | 1 | 96.256 |
| 122606067 | GT-AG | 0 | 1.000000099473604e-05 | 2229 | rna-XM_021205247.1 22607844 | 54 | 18984237 | 18986465 | Mus pahari 10093 | CAT\|GTGAGCATGG...TAACTCTTGCCC/TGGGACATAACT...ACCAG\|GTA | 1 | 1 | 96.782 |
| 122606068 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_021205247.1 22607844 | 55 | 18986546 | 18986632 | Mus pahari 10093 | CCT\|GTGCGTGAAG...TCCCCCTTGATG/GACATGCTCATC...CCCAG\|GTG | 0 | 1 | 97.698 |
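Because consecutive introns of one transcript are listed with genomic coordinates, the distance between one intron's `end` and the next intron's `start` can be computed directly. The sketch below does this for introns 34 to 36 from the table above. It assumes inclusive coordinates and that adjacent introns flank a single exon; the function name `gaps_between` is hypothetical.

```python
# (ordinal_index, start, end) for introns 34-36, copied from the table above.
introns = [
    (34, 18961657, 18962531),
    (35, 18962612, 18962985),
    (36, 18964752, 18965425),
]


def gaps_between(rows):
    """Return (upstream ordinal, downstream ordinal, gap in bp) per adjacent pair.

    Assumes inclusive coordinates, so the gap excludes both intron boundaries.
    """
    ordered = sorted(rows, key=lambda row: row[1])  # sort by start coordinate
    return [
        (a[0], b[0], b[1] - a[2] - 1)
        for a, b in zip(ordered, ordered[1:])
    ]


gaps = gaps_between(introns)
for upstream, downstream, gap in gaps:
    print(f"between introns {upstream} and {downstream}: {gap} bp")
# between introns 34 and 35: 80 bp
# between introns 35 and 36: 1766 bp
```

The 1766 bp gap before intron 36 coincides with the jump in `relative_position` from 49.788 to 70.01 in the same rows.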
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
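As a rough illustration of how this schema can be queried, the sketch below uses Python's standard `sqlite3` module to rebuild the table in memory, load three rows copied from this page, and repeat the `transcript_id = 22607844` filter. The foreign key clauses are omitted here so the sketch does not need the `transcripts` and `genomes` tables, and only one of the indexes is recreated; both simplifications are assumptions of this example, not part of the published database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Same columns as the schema above; foreign keys left out for this sketch.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "dinucleotide_pair" TEXT,
      "is_minor" INTEGER,
      "score" REAL,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      "start" INTEGER,
      "end" INTEGER,
      "taxonomy_id" INTEGER,
      "scored_motifs" TEXT,
      "phase" INTEGER,
      "in_cds" INTEGER,
      "relative_position" REAL,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Three rows copied from this page (scored_motifs omitted, so it stays NULL).
sample = [
    (122606014, "GT-AG", 0, 1.000000099473604e-05, 474, 22607844, 1,
     18915930, 18916403, 10093, 1, 1, 1.042),
    (122606015, "GT-AG", 0, 1.000000099473604e-05, 188, 22607844, 2,
     18916465, 18916652, 10093, 2, 1, 1.741),
    (122606016, "GT-AG", 0, 1.000000099473604e-05, 3549, 22607844, 3,
     18916714, 18920262, 10093, 0, 1, 2.439),
]
conn.executemany(
    'INSERT INTO introns (id, dinucleotide_pair, is_minor, score, length, '
    'transcript_id, ordinal_index, "start", "end", taxonomy_id, phase, '
    'in_cds, relative_position) VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)',
    sample,
)

# "end" is an SQL keyword, so it must be quoted when used as a column name.
result = conn.execute(
    'SELECT ordinal_index, "start", "end", length, phase '
    "FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (22607844,),
).fetchall()

for row in result:
    print(row)
```

The parameterized `WHERE transcript_id = ?` form keeps the transcript identifier out of the SQL string, which is the usual choice when the value comes from user input.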