introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
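The `start`, `end`, and `length` columns appear to be related by inclusive coordinates: in the rows listed for transcript 22607927, `length` equals `end - start + 1`. The sketch below checks that relation on three of those rows. The helper name `span_length` is illustrative, and the inclusive-coordinate convention is inferred from the data rather than stated by the database.

```python
def span_length(start: int, end: int) -> int:
    """Length of a span whose start and end are both inclusive (assumed convention)."""
    return end - start + 1


# (start, end, length) copied from three rows of the table for transcript 22607927.
rows = [
    (72252365, 72263827, 11463),
    (72264007, 72264553, 547),
    (72227915, 72245987, 18073),
]

# Collect any row whose stored length disagrees with the computed span.
mismatches = [r for r in rows if span_length(r[0], r[1]) != r[2]]
print(mismatches)  # []
```

An empty list means the three sampled rows agree with the inclusive reading; it does not prove the convention holds for every row in the database.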
34 rows where transcript_id = 22607927
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122608463 | GT-AG | 0 | 1.000000099473604e-05 | 11463 | rna-XM_021191782.2 22607927 | 3 | 72252365 | 72263827 | Mus pahari 10093 | CAG\|GTGAGTGACA...TGGGTCAGAACC/TGCCTGCTGACA...CTCAG\|CCT | 2 | 1 | 36.73 |
| 122608464 | GT-AG | 0 | 1.000000099473604e-05 | 547 | rna-XM_021191782.2 22607927 | 4 | 72264007 | 72264553 | Mus pahari 10093 | TTG\|GTGAGTGCTA...AGCTTCTGAGCC/TAGCTTCTGAGC...ACCAG\|ACG | 1 | 1 | 39.671 |
| 122608465 | GT-AG | 0 | 1.000000099473604e-05 | 532 | rna-XM_021191782.2 22607927 | 5 | 72264619 | 72265150 | Mus pahari 10093 | AAG\|GTAGGCAGGG...TGATCCTTTGGG/AAAAATGTGACA...GGCAG\|GAG | 0 | 1 | 40.74 |
| 122608466 | GT-AG | 0 | 1.000000099473604e-05 | 117 | rna-XM_021191782.2 22607927 | 6 | 72265276 | 72265392 | Mus pahari 10093 | CAA\|GTGAGTACCC...AAACCCTCAACC/CAAACCCTCAAC...TCTAG\|TGG | 2 | 1 | 42.794 |
| 122608467 | GT-AG | 0 | 1.000000099473604e-05 | 248 | rna-XM_021191782.2 22607927 | 7 | 72265543 | 72265790 | Mus pahari 10093 | GGG\|GTGAGTGAGG...CCCTCCTTGTTC/TCTTTGCTGAGC...TTTAG\|GTC | 2 | 1 | 45.259 |
| 122608468 | GT-AG | 0 | 1.000000099473604e-05 | 246 | rna-XM_021191782.2 22607927 | 8 | 72265861 | 72266106 | Mus pahari 10093 | AAG\|GTGAGGGCTC...CAGTTCTTCTCT/AGGGTTCCCACT...CCCAG\|GAC | 0 | 1 | 46.409 |
| 122608469 | GT-AG | 0 | 1.000000099473604e-05 | 912 | rna-XM_021191782.2 22607927 | 9 | 72266228 | 72267139 | Mus pahari 10093 | ATG\|GTGGGAGTTC...GTTACCTTCACC/GTTACCTTCACC...TGCAG\|TGG | 1 | 1 | 48.398 |
| 122608470 | GT-AG | 0 | 1.1518588544151876e-05 | 427 | rna-XM_021191782.2 22607927 | 10 | 72267343 | 72267769 | Mus pahari 10093 | GAG\|GTATGAAGGG...TCAGCCTTGCCC/CCTGTTGTCAGC...ATCAG\|GAA | 0 | 1 | 51.734 |
| 122608471 | GT-AG | 0 | 1.000000099473604e-05 | 1294 | rna-XM_021191782.2 22607927 | 11 | 72267877 | 72269170 | Mus pahari 10093 | CAG\|GTGAGCCCCT...GTTCTCCTACTT/CTCCTACTTATA...CCTAG\|CTT | 2 | 1 | 53.492 |
| 122608472 | GT-AG | 0 | 1.000000099473604e-05 | 1576 | rna-XM_021191782.2 22607927 | 12 | 72269371 | 72270946 | Mus pahari 10093 | CAG\|GTGCGTATGT...AGGGCTGTACCA/CTAGTGCTCACC...CTCAG\|GGG | 1 | 1 | 56.779 |
| 122608473 | GT-AG | 0 | 1.000000099473604e-05 | 1417 | rna-XM_021191782.2 22607927 | 13 | 72271033 | 72272449 | Mus pahari 10093 | CAG\|GTGGGCATGC...GCGTTTTTATCA/GTTTTTATCATC...CACAG\|CTC | 0 | 1 | 58.192 |
| 122608474 | GT-AG | 0 | 1.000000099473604e-05 | 1095 | rna-XM_021191782.2 22607927 | 14 | 72272633 | 72273727 | Mus pahari 10093 | AAG\|GTCAGCACAC...CACTCTTTCTCG/CTCGGATTGATC...CCTAG\|GCC | 0 | 1 | 61.2 |
| 122608475 | GT-AG | 0 | 1.000000099473604e-05 | 1207 | rna-XM_021191782.2 22607927 | 15 | 72273903 | 72275109 | Mus pahari 10093 | CAG\|GTGAGTGACC...ACCTTCTGGACA/CTTTCTTCCACT...CAAAG\|AGG | 1 | 1 | 64.076 |
| 122608476 | GT-AG | 0 | 1.000000099473604e-05 | 2108 | rna-XM_021191782.2 22607927 | 16 | 72275245 | 72277352 | Mus pahari 10093 | AAG\|GTGAGTACCC...GTTCCTCTAACT/GCTTTCTTCATG...CACAG\|AGT | 1 | 1 | 66.294 |
| 122608477 | GT-AG | 0 | 1.000000099473604e-05 | 252 | rna-XM_021191782.2 22607927 | 17 | 72277489 | 72277740 | Mus pahari 10093 | TGG\|GTGAGTACCC...CTGTCCTGGATG/GATTTGTGTATT...TGCAG\|CTT | 2 | 1 | 68.529 |
| 122608478 | GT-AG | 0 | 1.1751903921062964e-05 | 679 | rna-XM_021191782.2 22607927 | 18 | 72277844 | 72278522 | Mus pahari 10093 | AAG\|GTAGGCCTGG...CTGTGCTTACAC/GCTGTGCTTACA...ACTAG\|GCT | 0 | 1 | 70.222 |
| 122608479 | GT-AG | 0 | 0.0203571432538789 | 277 | rna-XM_021191782.2 22607927 | 19 | 72278719 | 72278995 | Mus pahari 10093 | TTT\|GTATGTGAAC...CTCACCTTGACT/CTGGGCCTCACC...CATAG\|CTA | 1 | 1 | 73.443 |
| 122608480 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_021191782.2 22607927 | 20 | 72279048 | 72279140 | Mus pahari 10093 | GAG\|GTGAGCCTTG...GGGACGGTGACT/GGGACGGTGACT...TCCAG\|GAC | 2 | 1 | 74.297 |
| 122608481 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_021191782.2 22607927 | 21 | 72279305 | 72279390 | Mus pahari 10093 | GTG\|GTGAGCCCAG...ACCCTCTAAAAA/AGTGGCATAACT...CCCAG\|GCT | 1 | 1 | 76.993 |
| 122608482 | GT-AG | 0 | 1.000000099473604e-05 | 264 | rna-XM_021191782.2 22607927 | 22 | 72279604 | 72279867 | Mus pahari 10093 | CTG\|GTGGGTCCCG...CAGTTTCTAACC/CAGTTTCTAACC...CCCAG\|AGA | 1 | 1 | 80.493 |
| 122608483 | GT-AG | 0 | 1.000000099473604e-05 | 412 | rna-XM_021191782.2 22607927 | 23 | 72279974 | 72280385 | Mus pahari 10093 | CTG\|GTGAGGACTG...GGCCCTCTAATT/CTAATTATCAGT...TCCAG\|TGT | 2 | 1 | 82.235 |
| 122608484 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_021191782.2 22607927 | 24 | 72280534 | 72280618 | Mus pahari 10093 | AGT\|GTGCGTCCCA...AGCCTCTCACAT/AAGCCTCTCACA...CCCAG\|GTG | 0 | 1 | 84.667 |
| 122608485 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_021191782.2 22607927 | 25 | 72280706 | 72280787 | Mus pahari 10093 | CAG\|GTGCAGGCCA...AGACCTGTAGCT/CTGTAGCTGAGC...TGCAG\|CAT | 0 | 1 | 86.097 |
| 122608486 | GT-AG | 0 | 1.000000099473604e-05 | 198 | rna-XM_021191782.2 22607927 | 26 | 72280857 | 72281054 | Mus pahari 10093 | AAG\|GTGGGGGCTG...ACCTTCTGGGTC/GACTGGCTGACC...TGCAG\|ATC | 0 | 1 | 87.231 |
| 122608487 | GT-AG | 0 | 1.000000099473604e-05 | 1777 | rna-XM_021191782.2 22607927 | 27 | 72281173 | 72282949 | Mus pahari 10093 | CAG\|GTGGGTGGCC...GACCCAGTGACA/GTGGCACTAAGC...CACAG\|AGC | 1 | 1 | 89.17 |
| 122608488 | GT-AG | 0 | 1.000000099473604e-05 | 197 | rna-XM_021191782.2 22607927 | 28 | 72283064 | 72283260 | Mus pahari 10093 | TGG\|GTAAGTGACG...GTAGCCTGTGCC/CCTGTGCCCACT...CTTAG\|CCA | 1 | 1 | 91.044 |
| 122608489 | GT-AG | 0 | 4.116652741393786e-05 | 496 | rna-XM_021191782.2 22607927 | 29 | 72283407 | 72283902 | Mus pahari 10093 | CGG\|GTAAGCCCCG...CCCTCCATAACC/CCCTCCATAACC...AACAG\|AGC | 0 | 1 | 93.443 |
| 122608490 | GT-AG | 0 | 1.000000099473604e-05 | 3409 | rna-XM_021191782.2 22607927 | 30 | 72283936 | 72287344 | Mus pahari 10093 | ACC\|GTGAGTAGCT...CTGACTGTACCC/ACCTGACTGATA...CACAG\|AGT | 0 | 1 | 93.985 |
| 122608491 | GT-AG | 0 | 1.000000099473604e-05 | 981 | rna-XM_021191782.2 22607927 | 31 | 72287428 | 72288408 | Mus pahari 10093 | TTG\|GTAAGTCAGG...TGTCCCATAAAT/AATCTTCGCAAA...TGCAG\|CTG | 2 | 1 | 95.349 |
| 122608492 | GT-AG | 0 | 1.000000099473604e-05 | 570 | rna-XM_021191782.2 22607927 | 32 | 72288460 | 72289029 | Mus pahari 10093 | GTG\|GTAAGCCAGG...TCTGCTGTGACA/GTGACACTCACA...TGCAG\|GTA | 2 | 1 | 96.187 |
| 122608493 | GT-AG | 0 | 0.0003668674066731 | 1651 | rna-XM_021191782.2 22607927 | 33 | 72289094 | 72290744 | Mus pahari 10093 | CAG\|GTACCAGCTG...GTGGCCTAGACC/TGCAAGCTGAGC...CCTAG\|CAC | 0 | 1 | 97.239 |
| 122608494 | GT-AG | 0 | 8.347070711964388e-05 | 235 | rna-XM_021191782.2 22607927 | 34 | 72290895 | 72291129 | Mus pahari 10093 | TCC\|GTAAGTGTCT...CCTTTTTTACTT/TTTACTTTCACA...TACAG\|CTC | 0 | 1 | 99.704 |
| 122621619 | GT-AG | 0 | 1.000000099473604e-05 | 18073 | rna-XM_021191782.2 22607927 | 1 | 72227915 | 72245987 | Mus pahari 10093 | CAG\|GTCAGTGAGG...CTCCTCCTAGCA/GCTTGCCTCATG...TGCAG\|CTC | | 0 | 27.083 |
| 122621620 | GT-AG | 0 | 1.000000099473604e-05 | 5753 | rna-XM_021191782.2 22607927 | 2 | 72246066 | 72251818 | Mus pahari 10093 | CAG\|GTAAGTGAGG...TCACCTGTGACC/GTGCCTCTCACC...TCCAG\|GTC | | 0 | 28.365 |
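In the rows above, each `scored_motifs` value contains two `|` characters, and the segment between them starts and ends with the two dinucleotides reported in `dinucleotide_pair`. The sketch below recovers the pair from the motif string on that basis. Reading the `|` characters as intron boundaries is an assumption drawn from these rows, not a documented format, and the function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return 'XX-YY' from the first and last two bases of the middle segment.

    Assumes the string has the shape 'flank|intron motifs|flank', as seen
    in the rows for transcript 22607927.
    """
    _upstream, intron, _downstream = scored_motifs.split("|")
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs values copied from the rows with ordinal_index 3 and 15.
examples = [
    "CAG|GTGAGTGACA...TGGGTCAGAACC/TGCCTGCTGACA...CTCAG|CCT",
    "CAG|GTGAGTGACC...ACCTTCTGGACA/CTTTCTTCCACT...CAAAG|AGG",
]

pairs = [terminal_dinucleotides(m) for m in examples]
print(pairs)  # ['GT-AG', 'GT-AG']
```

Both results match the `dinucleotide_pair` column for those rows. A string with a different number of `|` characters would raise a `ValueError` at the unpacking step, which is a reasonable failure mode for an undocumented format.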
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
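The schema and the `transcript_id` filter used on this page can be tried locally with Python's standard `sqlite3` module. The sketch below recreates the table and one of its indexes in an in-memory database, loads three rows copied from the table above (only a subset of columns), and lists them in transcript order. The `transcripts` and `genomes` tables are not shown on this page, so foreign-key enforcement is switched off explicitly. This is an illustration under those assumptions, not the database's own tooling.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
# The referenced parent tables are absent here, so keep enforcement off.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)

# (id, transcript_id, ordinal_index, start, end, length, phase, in_cds)
sample = [
    (122608463, 22607927, 3, 72252365, 72263827, 11463, 2, 1),
    (122621619, 22607927, 1, 72227915, 72245987, 18073, None, 0),
    (122621620, 22607927, 2, 72246066, 72251818, 5753, None, 0),
]
conn.executemany(
    'INSERT INTO introns (id, transcript_id, ordinal_index, "start", "end", '
    "length, phase, in_cds) VALUES (?, ?, ?, ?, ?, ?, ?, ?)",
    sample,
)

# Same filter as the page, but ordered by position in the transcript.
ordered = conn.execute(
    "SELECT ordinal_index, in_cds, phase FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (22607927,),
).fetchall()
print(ordered)  # [(1, 0, None), (2, 0, None), (3, 1, 2)]

# Check whether the planner reports the transcript_id index for this filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (22607927,),
).fetchall()
uses_index = any("idx_introns_transcript_id" in row[-1] for row in plan)
print(uses_index)
```

Ordering by `ordinal_index` rather than `id` matters here because the two UTR introns (ordinal 1 and 2) carry larger `id` values than the coding-sequence introns and would otherwise sort last.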