introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
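A few of these columns can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database's own tooling: the `Intron` class and both helper functions are hypothetical names, and it assumes that `start` and `end` are inclusive coordinates and that the first and last `|` in `scored_motifs` mark the intron boundaries. Both assumptions are inferred from the sample rows on this page rather than from documentation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Intron:
    """Subset of the introns columns used in this sketch (hypothetical class)."""
    dinucleotide_pair: str
    length: int
    start: int
    end: int
    scored_motifs: str


def span_length(intron: Intron) -> int:
    # Assumption: start and end are both inclusive, so the span is end - start + 1.
    return intron.end - intron.start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    # Assumption: the first and last "|" separate flanking exon bases from
    # the intron itself, so the intron sequence lies between them.
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    intron_part = scored_motifs[first + 1:last]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# Values copied from the row with id 122608267 on this page.
row = Intron(
    dinucleotide_pair="GT-AG",
    length=16583,
    start=153145500,
    end=153162082,
    scored_motifs="CCT|GTGAGTACGG...TTTTCCTTCCCC/GGACCACTCACA...CCCAG|GGA",
)

length_matches = span_length(row) == row.length
pair_matches = terminal_dinucleotides(row.scored_motifs) == row.dinucleotide_pair
print(length_matches, pair_matches)
```

If either check fails for other rows, one of the stated assumptions probably does not hold for that record and should be revisited before relying on it.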
36 rows where transcript_id = 22607921
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122608267 | GT-AG | 0 | 1.000000099473604e-05 | 16583 | rna-XM_021209708.2 22607921 | 1 | 153145500 | 153162082 | Mus pahari 10093 | CCT\|GTGAGTACGG...TTTTCCTTCCCC/GGACCACTCACA...CCCAG\|GGA | 2 | 1 | 4.286 |
| 122608268 | GT-AG | 0 | 1.000000099473604e-05 | 1961 | rna-XM_021209708.2 22607921 | 2 | 153143467 | 153145427 | Mus pahari 10093 | GTT\|GTGAGTATGG...GTCTCTTTGCCT/CCCTTGTTTAGC...CCCAG\|GCA | 2 | 1 | 5.853 |
| 122608269 | GT-AG | 0 | 1.000000099473604e-05 | 6188 | rna-XM_021209708.2 22607921 | 3 | 153137207 | 153143394 | Mus pahari 10093 | ACT\|GTGAGTATGG...CTCCTTTTACCT/TCTCCTTTTACC...TCCAG\|ACG | 2 | 1 | 7.419 |
| 122608270 | GT-AG | 0 | 1.000000099473604e-05 | 66601 | rna-XM_021209708.2 22607921 | 4 | 153070534 | 153137134 | Mus pahari 10093 | ACT\|GTGAGTTACA...TCCCCCCTGACT/TCCCCCCTGACT...CCCAG\|GGA | 2 | 1 | 8.986 |
| 122608271 | GT-AG | 0 | 1.000000099473604e-05 | 986 | rna-XM_021209708.2 22607921 | 5 | 153069476 | 153070461 | Mus pahari 10093 | TCT\|GTGAGTAGAG...CCCTCCTTACAG/CCCCTCCTTACA...TACAG\|ACA | 2 | 1 | 10.553 |
| 122608272 | GT-AG | 0 | 1.000000099473604e-05 | 360 | rna-XM_021209708.2 22607921 | 6 | 153069044 | 153069403 | Mus pahari 10093 | TCT\|GTGAGTAAAA...TCTCCTGTAAAT/CAGGTGTTAAAT...TTCAG\|GAC | 2 | 1 | 12.119 |
| 122608273 | GT-AG | 0 | 1.000000099473604e-05 | 332 | rna-XM_021209708.2 22607921 | 7 | 153068640 | 153068971 | Mus pahari 10093 | CTT\|GTGAGTCGCC...GGGGCTTTGTTC/AGGTGGCTGATG...TGTAG\|CCG | 2 | 1 | 13.686 |
| 122608274 | GT-AG | 0 | 1.000000099473604e-05 | 1174 | rna-XM_021209708.2 22607921 | 8 | 153067302 | 153068475 | Mus pahari 10093 | CAG\|GTGGGTGTCT...GGCTCTCTGACA/GGCTCTCTGACA...TGCAG\|GCC | 1 | 1 | 17.254 |
| 122608275 | GT-AG | 0 | 1.000000099473604e-05 | 382 | rna-XM_021209708.2 22607921 | 9 | 153066772 | 153067153 | Mus pahari 10093 | GAT\|GTGAGTGCAG...TGACCCTCAGTC/AGTCTCCTCACA...TGCAG\|CCG | 2 | 1 | 20.474 |
| 122608276 | GT-AG | 0 | 1.000000099473604e-05 | 422 | rna-XM_021209708.2 22607921 | 10 | 153066278 | 153066699 | Mus pahari 10093 | GAT\|GTGAGTGCCG...GCTGCTTTCTCT/TTTCTCTCCACT...CGCAG\|AGA | 2 | 1 | 22.041 |
| 122608277 | GT-AG | 0 | 1.000000099473604e-05 | 1373 | rna-XM_021209708.2 22607921 | 11 | 153064833 | 153066205 | Mus pahari 10093 | CCT\|GTGAGTAGAA...CCTCCTCTGACC/CCTCCTCTGACC...TCCAG\|GGT | 2 | 1 | 23.607 |
| 122608278 | GT-AG | 0 | 1.000000099473604e-05 | 820 | rna-XM_021209708.2 22607921 | 12 | 153063941 | 153064760 | Mus pahari 10093 | ACT\|GTAAGGAATG...CCTCTCTTACCT/CCCTCTCTTACC...TACAG\|GCT | 2 | 1 | 25.174 |
| 122608279 | GT-AG | 0 | 0.0001269440373463 | 6252 | rna-XM_021209708.2 22607921 | 13 | 153057545 | 153063796 | Mus pahari 10093 | ACT\|GTGAGCTTCC...TAGAGCTTAGCA/CGGTCTCTGAGA...GACAG\|GCA | 2 | 1 | 28.307 |
| 122608280 | GT-AG | 0 | 1.000000099473604e-05 | 794 | rna-XM_021209708.2 22607921 | 14 | 153056587 | 153057380 | Mus pahari 10093 | CAG\|GTAAGGACCT...GCTCCTATAACC/CCCCCCCTCACC...CCCAG\|CCA | 1 | 1 | 31.876 |
| 122608281 | GT-AG | 0 | 1.000000099473604e-05 | 169 | rna-XM_021209708.2 22607921 | 15 | 153056394 | 153056562 | Mus pahari 10093 | CAG\|GTAAGGGCAG...AGGCCCTCAACT/CAGGCCCTCAAC...TACAG\|GGA | 1 | 1 | 32.398 |
| 122608282 | GT-AG | 0 | 1.0275514373864554e-05 | 789 | rna-XM_021209708.2 22607921 | 16 | 153055460 | 153056248 | Mus pahari 10093 | GCT\|GTGAGCCCTG...TGCTCCTTGCCG/CTTAAGTTGACC...CACAG\|ACG | 2 | 1 | 35.553 |
| 122608283 | GT-AG | 0 | 1.000000099473604e-05 | 170 | rna-XM_021209708.2 22607921 | 17 | 153055215 | 153055384 | Mus pahari 10093 | AAT\|GTGAGTTTCC...GACCTCTGAGTG/GGACCTCTGAGT...CCCAG\|AAA | 2 | 1 | 37.185 |
| 122608284 | GT-AG | 0 | 1.000000099473604e-05 | 2917 | rna-XM_021209708.2 22607921 | 18 | 153052154 | 153055070 | Mus pahari 10093 | CCT\|GTGAGTGAGC...TGCCTCTTCTCT/CAGTGGATCATT...TGCAG\|GAT | 2 | 1 | 40.318 |
| 122608285 | GT-AG | 0 | 0.0001495366578342 | 286 | rna-XM_021209708.2 22607921 | 19 | 153051724 | 153052009 | Mus pahari 10093 | ACT\|GTAAGTCTCA...AGTGCCTTCATG/AGTGCCTTCATG...TCCAG\|AAA | 2 | 1 | 43.451 |
| 122608286 | GT-AG | 0 | 0.0003862049873363 | 2213 | rna-XM_021209708.2 22607921 | 20 | 153049344 | 153051556 | Mus pahari 10093 | AAG\|GTAAACTTGT...AAGTCCCTGACT/AAGTCCCTGACT...CCTAG\|GCC | 1 | 1 | 47.084 |
| 122608287 | GT-AG | 0 | 1.000000099473604e-05 | 799 | rna-XM_021209708.2 22607921 | 21 | 153048412 | 153049210 | Mus pahari 10093 | ACT\|GTAAGTAGCC...CTCTCCTTCTGT/ACGCGTGTGACA...CTCAG\|CTA | 2 | 1 | 49.978 |
| 122608288 | GT-AG | 0 | 1.000000099473604e-05 | 2274 | rna-XM_021209708.2 22607921 | 22 | 153046069 | 153048342 | Mus pahari 10093 | CGT\|GTGAGTATCC...CTTTCTTTACTG/TCTTTCTTTACT...TCCAG\|GGA | 2 | 1 | 51.48 |
| 122608289 | GT-AG | 0 | 1.000000099473604e-05 | 1676 | rna-XM_021209708.2 22607921 | 23 | 153044321 | 153045996 | Mus pahari 10093 | GCT\|GTGAGTGCCT...CCTGCCTTAGCT/CCTTAGCTTATT...TGCAG\|GAT | 2 | 1 | 53.046 |
| 122608290 | GT-AG | 0 | 9.516916989946404e-05 | 772 | rna-XM_021209708.2 22607921 | 24 | 153043477 | 153044248 | Mus pahari 10093 | GCT\|GTAAGTCTCT...AGCATCTTCTCT/GGGCATCCCATT...CTCAG\|GTC | 2 | 1 | 54.613 |
| 122608291 | GT-AG | 0 | 1.000000099473604e-05 | 7038 | rna-XM_021209708.2 22607921 | 25 | 153036367 | 153043404 | Mus pahari 10093 | CCT\|GTAAGTAATG...CTGCTTTTCTCT/AGCTTCCGAAAT...CCCAG\|GGC | 2 | 1 | 56.179 |
| 122608292 | GT-AG | 0 | 1.000000099473604e-05 | 1647 | rna-XM_021209708.2 22607921 | 26 | 153034556 | 153036202 | Mus pahari 10093 | AAG\|GTGAGGCCAG...GTGCCCTTCCTC/CCTCTGCTAATG...CACAG\|GTC | 1 | 1 | 59.748 |
| 122608293 | GT-AG | 0 | 1.000000099473604e-05 | 313 | rna-XM_021209708.2 22607921 | 27 | 153034118 | 153034430 | Mus pahari 10093 | AAG\|GTGAGTGGAG...GAGTCCTCACCG/GGAGTCCTCACC...TCCAG\|GGC | 0 | 1 | 62.467 |
| 122608294 | GT-AG | 0 | 1.000000099473604e-05 | 3045 | rna-XM_021209708.2 22607921 | 28 | 153030975 | 153034019 | Mus pahari 10093 | CAC\|GTGAGTTCCC...CAGGCTTTCCCA/CTCCTCTCCATC...CGCAG\|GTG | 2 | 1 | 64.6 |
| 122608295 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_021209708.2 22607921 | 29 | 153030735 | 153030834 | Mus pahari 10093 | CAG\|GTAAGCACTG...TGCCTCTTACTC/CTGCCTCTTACT...CCTAG\|GAA | 1 | 1 | 67.646 |
| 122608296 | GT-AG | 0 | 1.000000099473604e-05 | 2351 | rna-XM_021209708.2 22607921 | 30 | 153028290 | 153030640 | Mus pahari 10093 | CAG\|GTCAGTGCTG...TGAATGTTAGCT/TGTTAGCTGACG...CATAG\|GTG | 2 | 1 | 69.691 |
| 122608297 | GT-AG | 0 | 1.000000099473604e-05 | 2021 | rna-XM_021209708.2 22607921 | 31 | 153026131 | 153028151 | Mus pahari 10093 | CAG\|GTGAGACCCT...CTCTCTCTGACC/CTCTCTCTGACC...CTCAG\|TGG | 2 | 1 | 72.694 |
| 122608298 | GT-AG | 0 | 1.000000099473604e-05 | 1467 | rna-XM_021209708.2 22607921 | 32 | 153024435 | 153025901 | Mus pahari 10093 | CAG\|GTGTGTGCTT...TGTGCCTTTGCT/ACTGTGCTGATT...CCCAG\|GTC | 0 | 1 | 77.676 |
| 122608299 | GT-AG | 0 | 1.000000099473604e-05 | 421 | rna-XM_021209708.2 22607921 | 33 | 153023883 | 153024303 | Mus pahari 10093 | CAG\|GTAAGAATTT...CCTCTCCTATCT/GTTTTCCTGAGG...TCCAG\|TGC | 2 | 1 | 80.527 |
| 122608300 | GT-AG | 0 | 1.000000099473604e-05 | 474 | rna-XM_021209708.2 22607921 | 34 | 153023254 | 153023727 | Mus pahari 10093 | GAG\|GTGAGGAGCT...CCCAACTTAGCC/CCAACAGTCATT...CTCAG\|GGA | 1 | 1 | 83.899 |
| 122608301 | GT-AG | 0 | 4.035823353373675e-05 | 375 | rna-XM_021209708.2 22607921 | 35 | 153022590 | 153022964 | Mus pahari 10093 | CAA\|GTAAGCCTGG...GCTCTCTTCTCT/CCTCTCCTCACA...TCCAG\|GTG | 2 | 1 | 90.187 |
| 122608302 | GT-AG | 0 | 1.000000099473604e-05 | 793 | rna-XM_021209708.2 22607921 | 36 | 153021585 | 153022377 | Mus pahari 10093 | AAG\|GTCAGGGAGC...TGTGTCTTGGTG/TGTGCCCTGAGG...TCCAG\|AGT | 1 | 1 | 94.8 |
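In the table above, `start` decreases as `ordinal_index` increases, which suggests (but does not prove) that this transcript lies on the minus strand. The sketch below shows one way to measure the sequence between adjacent introns from the coordinates alone. It assumes inclusive coordinates, and the function name `gaps_between_introns` is hypothetical. Whether each gap corresponds exactly to one exon is an interpretation, not something the table states.

```python
# (ordinal_index, start, end) for the first four introns in the table above.
introns = [
    (1, 153145500, 153162082),
    (2, 153143467, 153145427),
    (3, 153137207, 153143394),
    (4, 153070534, 153137134),
]


def gaps_between_introns(rows):
    """Return sorted (ordinal_a, ordinal_b, gap_length) for genomically adjacent introns."""
    # Sort by genomic start so the result does not depend on strand.
    ordered = sorted(rows, key=lambda r: r[1])
    gaps = []
    for lower, upper in zip(ordered, ordered[1:]):
        # Assumption: coordinates are inclusive, so the bases strictly
        # between lower's end and upper's start number start - end - 1.
        gap = upper[1] - lower[2] - 1
        pair = sorted((lower[0], upper[0]))
        gaps.append((pair[0], pair[1], gap))
    return sorted(gaps)


for first, second, gap in gaps_between_introns(introns):
    print(f"between intron {first} and intron {second}: {gap} bp")
```

For these four rows each gap comes out at 72 bp.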
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
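The filter shown on this page (`transcript_id = 22607921`) matches one of the indexed columns. As a rough sketch, the following uses Python's standard `sqlite3` module with an in-memory database to run that filter and ask SQLite how it plans to execute it. The table here is a cut-down stand-in with only a few columns and three rows copied from the table above. Foreign keys are omitted because `transcripts` and `genomes` are not part of the sketch, and the exact plan text can differ between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the schema: only the columns this example needs.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_phase ON introns (phase);
    """
)

# Three rows copied from the table on this page.
sample_rows = [
    (122608267, 22607921, 1, 2, 1.000000099473604e-05),
    (122608274, 22607921, 8, 1, 1.000000099473604e-05),
    (122608293, 22607921, 27, 0, 1.000000099473604e-05),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", sample_rows)

query = (
    "SELECT id, ordinal_index, phase FROM introns "
    "WHERE transcript_id = ? ORDER BY id"
)
result = conn.execute(query, (22607921,)).fetchall()

# EXPLAIN QUERY PLAN returns rows whose fourth field is a human-readable detail string.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (22607921,)).fetchall()
plan_details = [row[3] for row in plan]

for row in result:
    print(row)
for detail in plan_details:
    print(detail)
```

If the plan detail names `idx_introns_transcript_id`, the lookup is using the index rather than scanning the whole table.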