introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
  - INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
  - TEXT, terminal dinucleotide sequences of the intron
- is_minor
  - INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score
  - REAL, score representing the probability (0-100%) of the intron being minor
- length
  - INTEGER, length of the intron in base pairs
- transcript_id
  - INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
  - INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
  - INTEGER, start position of the intron in the genome
- end
  - INTEGER, end position of the intron in the genome
- taxonomy_id
  - INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs
  - TEXT, motifs scored for the intron
- phase
  - INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds
  - INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
  - REAL, relative position of the intron within the transcript (as a percentage of coding length)
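
The column descriptions above can be mirrored as a typed record. The following is a minimal sketch, not part of the database itself: the `Intron` class and its `validate` method are hypothetical names introduced for illustration, and the checks only encode what the descriptions state (0/1 flags, a 0-100 score, and a null phase outside the coding sequence).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical record type mirroring the documented introns columns."""
    id: int
    dinucleotide_pair: str
    is_minor: int              # 1 = minor intron, 0 = not
    score: float               # probability of being minor, in percent (0-100)
    length: int                # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]       # 0, 1, 2, or None outside the coding sequence
    in_cds: int                # 1 = within the coding sequence, 0 = not
    relative_position: float

    def validate(self) -> None:
        """Raise ValueError if a field contradicts the column descriptions."""
        if self.is_minor not in (0, 1):
            raise ValueError("is_minor must be 0 or 1")
        if self.in_cds not in (0, 1):
            raise ValueError("in_cds must be 0 or 1")
        if not 0.0 <= self.score <= 100.0:
            raise ValueError("score must lie in the range 0-100")
        if self.phase not in (None, 0, 1, 2):
            raise ValueError("phase must be 0, 1, 2, or None")
        # Assumption: the "null for introns outside the coding sequence"
        # rule is treated as strict here.
        if self.in_cds == 0 and self.phase is not None:
            raise ValueError("introns outside the CDS should have a null phase")


# Values copied from the row with id 122608323 in the table on this page.
example = Intron(
    122608323, "GT-AG", 0, 1.000000099473604e-05, 4085, 22607923, 3,
    151819389, 151823473, 10093,
    "CTG|GTAAGTCATT...TTCATCTTCTTT/CTTCTGTTCATC...TTCAG|AAT",
    1, 1, 5.581,
)
example.validate()  # passes silently for a consistent row
```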
34 rows where transcript_id = 22607923
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122608323 | GT-AG | 0 | 1.000000099473604e-05 | 4085 | rna-XM_029532511.1 22607923 | 3 | 151819389 | 151823473 | Mus pahari 10093 | CTG|GTAAGTCATT...TTCATCTTCTTT/CTTCTGTTCATC...TTCAG|AAT | 1 | 1 | 5.581 |
| 122608324 | GT-AG | 0 | 1.000000099473604e-05 | 3120 | rna-XM_029532511.1 22607923 | 4 | 151816200 | 151819319 | Mus pahari 10093 | AAG|GTAAGGCTGT...TGTTTCTGGATG/GCACCGTTTACC...TTCAG|CAG | 1 | 1 | 7.023 |
| 122608325 | GT-AG | 0 | 0.0308224198358038 | 2370 | rna-XM_029532511.1 22607923 | 5 | 151813737 | 151816106 | Mus pahari 10093 | CAG|GTATCGCTTT...GAAGTCTTGGCT/TTGGCTCTCACC...TGCAG|ACA | 1 | 1 | 8.967 |
| 122608326 | GT-AG | 0 | 1.000000099473604e-05 | 8707 | rna-XM_029532511.1 22607923 | 6 | 151804901 | 151813607 | Mus pahari 10093 | CAG|GTGTGTTGCC...TCCTTCTTGCTC/TTTGTTTTCATC...CACAG|GCC | 1 | 1 | 11.664 |
| 122608327 | GT-AG | 0 | 3.647370252771727e-05 | 122 | rna-XM_029532511.1 22607923 | 7 | 151804752 | 151804873 | Mus pahari 10093 | CAG|GTATGGCTTT...TGGGTGTTACCT/GTGGGTGTTACC...CTCAG|ATG | 1 | 1 | 12.228 |
| 122608328 | GT-AG | 0 | 1.000000099473604e-05 | 4212 | rna-XM_029532511.1 22607923 | 8 | 151800427 | 151804638 | Mus pahari 10093 | CAG|GTAGGTGTGT...CATATCTTGAAA/CATATCTTGAAA...TAAAG|GTT | 0 | 1 | 14.59 |
| 122608329 | GC-AG | 0 | 1.000000099473604e-05 | 634 | rna-XM_029532511.1 22607923 | 9 | 151799073 | 151799706 | Mus pahari 10093 | AAG|GCAAGAAACT...AACCCCTTATTT/CAACCCCTTATT...CACAG|GGT | 0 | 1 | 29.64 |
| 122608330 | GT-AG | 0 | 1.000000099473604e-05 | 180 | rna-XM_029532511.1 22607923 | 10 | 151798860 | 151799039 | Mus pahari 10093 | GAG|GTAATAGTTG...TCACCTCTGACC/GATGATTTCATT...TGCAG|AGG | 0 | 1 | 30.33 |
| 122608331 | GT-AG | 0 | 0.0147307674100187 | 3337 | rna-XM_029532511.1 22607923 | 11 | 151795364 | 151798700 | Mus pahari 10093 | ACC|GTAAGCTTGC...TTCTTTTTAATG/CTTGTATTTACA...TTCAG|GGC | 0 | 1 | 33.654 |
| 122608332 | GT-AG | 0 | 1.000000099473604e-05 | 3762 | rna-XM_029532511.1 22607923 | 12 | 151791460 | 151795221 | Mus pahari 10093 | GAG|GTAAGGCAGC...GTGTTCTTTTCT/ATACTACTTACA...CACAG|AGC | 1 | 1 | 36.622 |
| 122608333 | GT-AG | 0 | 1.000000099473604e-05 | 4557 | rna-XM_029532511.1 22607923 | 13 | 151786873 | 151791429 | Mus pahari 10093 | CAG|GTAGGTGAGC...CAGCTTGTGATT/TTGTTTGTCATT...TCCAG|ATC | 1 | 1 | 37.249 |
| 122608334 | GT-AG | 0 | 1.000000099473604e-05 | 1704 | rna-XM_029532511.1 22607923 | 14 | 151784996 | 151786699 | Mus pahari 10093 | CGG|GTACGGATAT...TTGTTTTTACCC/CAAGTTTTCATT...TCTAG|AAT | 0 | 1 | 40.865 |
| 122608335 | GT-AG | 0 | 0.0048797202837933 | 1683 | rna-XM_029532511.1 22607923 | 15 | 151783249 | 151784931 | Mus pahari 10093 | GAG|GTACCGCTTT...ATCTTCTTCTCC/CTCTCTCTCACA...CCTAG|ACA | 1 | 1 | 42.203 |
| 122608336 | GT-AG | 0 | 0.0037932552259729 | 8911 | rna-XM_029532511.1 22607923 | 16 | 151774263 | 151783173 | Mus pahari 10093 | AAG|GTACCATAAG...GTTTCCCTGATT/GTTTCCCTGATT...ACTAG|ATG | 1 | 1 | 43.771 |
| 122608337 | GT-AG | 0 | 1.000000099473604e-05 | 125 | rna-XM_029532511.1 22607923 | 17 | 151774090 | 151774214 | Mus pahari 10093 | ACA|GTAAGTGACT...TTCATCTGAGCC/TTAGTTTTCATC...TCCAG|CAA | 1 | 1 | 44.774 |
| 122608338 | GT-AG | 0 | 1.000000099473604e-05 | 2252 | rna-XM_029532511.1 22607923 | 18 | 151771708 | 151773959 | Mus pahari 10093 | GAG|GTGAGTGATT...CTGTTTTTATGA/GCTGTTTTTATG...TACAG|AAA | 2 | 1 | 47.492 |
| 122608339 | GT-AG | 0 | 1.000000099473604e-05 | 5235 | rna-XM_029532511.1 22607923 | 19 | 151766358 | 151771592 | Mus pahari 10093 | ATG|GTAATGTGCA...TTATCCTTCTCT/CATAATGTTATC...TATAG|AGT | 0 | 1 | 49.895 |
| 122608340 | GT-AG | 0 | 1.000000099473604e-05 | 3559 | rna-XM_029532511.1 22607923 | 20 | 151762715 | 151766273 | Mus pahari 10093 | CCG|GTAAGTGTAA...CATTTCTTCATT/TTTATGCTCATT...GTCAG|CCT | 0 | 1 | 51.651 |
| 122608341 | GT-AG | 0 | 1.000000099473604e-05 | 541 | rna-XM_029532511.1 22607923 | 21 | 151762108 | 151762648 | Mus pahari 10093 | AAG|GTAATTCCGT...TTTCTCTAAACT/TTCTTTCTCATC...ACCAG|ACT | 0 | 1 | 53.031 |
| 122608342 | GT-AG | 0 | 0.0016951283901521 | 3267 | rna-XM_029532511.1 22607923 | 22 | 151758739 | 151762005 | Mus pahari 10093 | CAG|GTATTTCCAG...CTTTCTTTATCA/TCTTTCTTTATC...TACAG|AGC | 0 | 1 | 55.163 |
| 122608343 | GT-AG | 0 | 1.5134137146850552e-05 | 9171 | rna-XM_029532511.1 22607923 | 23 | 151749518 | 151758688 | Mus pahari 10093 | ATA|GTAAGTAGCC...CTCTTTTTAAAA/CTCTTTTTAAAA...CATAG|TAT | 2 | 1 | 56.208 |
| 122608344 | GT-AG | 0 | 0.0211655632214684 | 2582 | rna-XM_029532511.1 22607923 | 24 | 151746763 | 151749344 | Mus pahari 10093 | AAG|GTAACCCACA...CTCGCTTTAACT/CTCGCTTTAACT...TGAAG|ATT | 1 | 1 | 59.824 |
| 122608345 | GT-AG | 0 | 1.000000099473604e-05 | 2952 | rna-XM_029532511.1 22607923 | 25 | 151743725 | 151746676 | Mus pahari 10093 | GAG|GTAAGTCTCC...CCTGTTTTATGC/AGATGATTCACT...GGCAG|AAA | 0 | 1 | 61.622 |
| 122608346 | GT-AG | 0 | 0.0020273920452502 | 3527 | rna-XM_029532511.1 22607923 | 26 | 151740030 | 151743556 | Mus pahari 10093 | GAG|GTCTGTTTTG...TGAGTTTTAACT/TGAGTTTTAACT...TTTAG|ATG | 0 | 1 | 65.134 |
| 122608347 | GT-AG | 0 | 1.000000099473604e-05 | 3197 | rna-XM_029532511.1 22607923 | 27 | 151736786 | 151739982 | Mus pahari 10093 | GAA|GTAAGTGCTG...CACATTTTGAAA/CACATTTTGAAA...AACAG|GGA | 2 | 1 | 66.116 |
| 122608348 | GT-AG | 0 | 0.006887274136199 | 131 | rna-XM_029532511.1 22607923 | 28 | 151736540 | 151736670 | Mus pahari 10093 | GAG|GTACCATAAT...CTTTCCTTCCTT/CTGTTCCTCCTC...CTCAG|CTT | 0 | 1 | 68.52 |
| 122608349 | GT-AG | 0 | 1.000000099473604e-05 | 2661 | rna-XM_029532511.1 22607923 | 29 | 151733753 | 151736413 | Mus pahari 10093 | AAG|GTAAGCCACA...TCTTGCTTGACT/TCTTGCTTGACT...TACAG|GGG | 0 | 1 | 71.154 |
| 122608350 | GT-AG | 0 | 0.1188592530512086 | 1389 | rna-XM_029532511.1 22607923 | 30 | 151732169 | 151733557 | Mus pahari 10093 | CAG|GTATCTAAGC...CTCTCCTTTTCT/TTTTTTTTTTTT...TAAAG|TAT | 0 | 1 | 75.23 |
| 122608351 | GT-AG | 0 | 4.6724522855580685e-05 | 9953 | rna-XM_029532511.1 22607923 | 31 | 151721463 | 151731415 | Mus pahari 10093 | CAG|GTAGGCCTGA...TCTCCCTTATGA/TCCCTTATGATT...CACAG|CCT | 0 | 1 | 90.97 |
| 122608352 | GT-AG | 0 | 1.000000099473604e-05 | 592 | rna-XM_029532511.1 22607923 | 32 | 151720811 | 151721402 | Mus pahari 10093 | GTG|GTAAGATGCC...CCCACATTGACT/CCCACATTGACT...CACAG|GAA | 0 | 1 | 92.224 |
| 122608353 | GT-AG | 0 | 1.000000099473604e-05 | 3104 | rna-XM_029532511.1 22607923 | 33 | 151717516 | 151720619 | Mus pahari 10093 | TTT|GTAAGTACAG...CATCCTCTGGTT/GAGCCACTCACT...TGTAG|GTG | 2 | 1 | 96.217 |
| 122608354 | GT-AG | 0 | 1.000000099473604e-05 | 1537 | rna-XM_029532511.1 22607923 | 34 | 151715866 | 151717402 | Mus pahari 10093 | TTG|GTAAGAATCC...GTTTTTTTGTTT/TTGTTTGTTATG...TTCAG|GCA | 1 | 1 | 98.579 |
| 122621612 | GT-AG | 0 | 1.000000099473604e-05 | 61162 | rna-XM_029532511.1 22607923 | 1 | 151874685 | 151935846 | Mus pahari 10093 | GAG|GTAAGCGGGC...TGTGGTTTACTC/TTGTGGTTTACT...TTCAG|GAG | | 0 | 3.595 |
| 122621613 | GT-AG | 0 | 1.000000099473604e-05 | 51073 | rna-XM_029532511.1 22607923 | 2 | 151823527 | 151874599 | Mus pahari 10093 | CTG|GTGAATATGT...GGGGACTTATCA/CTGCTGTTCATC...TGCAG|AGC | | 0 | 5.372 |
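
Two relationships can be read off the rows above: `length` matches `end - start + 1`, and the segment between the two `|` characters in `scored_motifs` begins and ends with the two halves of `dinucleotide_pair`. The sketch below checks both on three rows copied from the table. The helper name `check_row` is hypothetical, and treating `start`/`end` as inclusive coordinates and the pipe-delimited middle segment as the intron itself are assumptions inferred from the data, not documented guarantees.

```python
def check_row(pair, length, start, end, scored_motifs):
    """Return (length_ok, ends_ok) for one introns row.

    Assumptions (inferred from the rows, not from the documentation):
    - start and end are inclusive coordinates, so length = end - start + 1;
    - scored_motifs has the shape "flank|intron-part|flank".
    """
    length_ok = (end - start + 1) == length

    _upstream, intron_part, _downstream = scored_motifs.split("|")
    first, last = pair.split("-")
    ends_ok = intron_part.startswith(first) and intron_part.endswith(last)
    return length_ok, ends_ok


# (dinucleotide_pair, length, start, end, scored_motifs) copied from the table.
rows = [
    ("GT-AG", 4085, 151819389, 151823473,
     "CTG|GTAAGTCATT...TTCATCTTCTTT/CTTCTGTTCATC...TTCAG|AAT"),
    ("GC-AG", 634, 151799073, 151799706,
     "AAG|GCAAGAAACT...AACCCCTTATTT/CAACCCCTTATT...CACAG|GGT"),
    ("GT-AG", 61162, 151874685, 151935846,
     "GAG|GTAAGCGGGC...TGTGGTTTACTC/TTGTGGTTTACT...TTCAG|GAG"),
]

results = [check_row(*row) for row in rows]
for row, result in zip(rows, results):
    print(row[0], row[1], result)
```

Both checks hold for the three sampled rows; whether they hold for every row in the database is not established here.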
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
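
As a rough illustration of how this schema supports the per-transcript view shown on this page, the sketch below loads the table definition and the `transcript_id` index into an in-memory SQLite database using Python's standard `sqlite3` module, inserts three rows copied from the table above, and retrieves them in transcript order. The parent tables `transcripts` and `genomes` are not created, so foreign-key enforcement is switched off explicitly; that shortcut, and the variable names, are assumptions made only for this example.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
# The referenced parent tables are absent in this sketch, so keep
# foreign-key enforcement off (also SQLite's usual default).
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)

# Three rows copied from the table on this page, deliberately out of order.
sample = [
    (122608325, "GT-AG", 0, 0.0308224198358038, 2370, 22607923, 5,
     151813737, 151816106, 10093,
     "CAG|GTATCGCTTT...GAAGTCTTGGCT/TTGGCTCTCACC...TGCAG|ACA", 1, 1, 8.967),
    (122608323, "GT-AG", 0, 1.000000099473604e-05, 4085, 22607923, 3,
     151819389, 151823473, 10093,
     "CTG|GTAAGTCATT...TTCATCTTCTTT/CTTCTGTTCATC...TTCAG|AAT", 1, 1, 5.581),
    (122608324, "GT-AG", 0, 1.000000099473604e-05, 3120, 22607923, 4,
     151816200, 151819319, 10093,
     "AAG|GTAAGGCTGT...TGTTTCTGGATG/GCACCGTTTACC...TTCAG|CAG", 1, 1, 7.023),
]
conn.executemany(
    "INSERT INTO introns VALUES (?,?,?,?,?,?,?,?,?,?,?,?,?,?)", sample
)

QUERY = """
SELECT ordinal_index, length, phase
FROM introns
WHERE transcript_id = ?
ORDER BY ordinal_index
"""
ordered = conn.execute(QUERY, (22607923,)).fetchall()
print(ordered)

# Ask SQLite how it plans to run the query; the last column of each
# plan row is a human-readable description.
plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (22607923,)).fetchall()
uses_index = any("idx_introns_transcript_id" in step[-1] for step in plan)
print(uses_index)
```

The exact wording of the query plan varies between SQLite versions, so the sketch only looks for the index name in the plan text rather than matching a full string.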