introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
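
The column definitions above can be exercised with a small query. The following is a minimal sketch, not the database's own tooling: it assumes Python's standard sqlite3 module, recreates only a subset of the documented columns in an in-memory database, and loads three rows copied from this page. The helper name introns_for_transcript is hypothetical.

```python
import sqlite3

# In-memory stand-in for the real database (assumption: only a subset of the
# documented columns is recreated; the three rows are copied from this page).
conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row  # allows access to columns by name
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)",
    [
        (122606309, "GT-AG", 0, 1.000000099473604e-05, 902, 22607852, 3, 0, 1),
        (122606307, "GT-AG", 0, 1.000000099473604e-05, 3857, 22607852, 1, 0, 1),
        (122606308, "GT-AG", 0, 0.0013419048968862, 7854, 22607852, 2, 1, 1),
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript in transcript order (hypothetical helper)."""
    return conn.execute(
        "SELECT * FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    ).fetchall()


rows = introns_for_transcript(conn, 22607852)
for row in rows:
    # phase is NULL (None in Python) for introns outside the coding sequence
    phase = "n/a" if row["phase"] is None else row["phase"]
    print(row["ordinal_index"], row["dinucleotide_pair"], row["length"], phase)
```

Ordering by ordinal_index rather than by start keeps the introns in transcript order regardless of how the genomic coordinates run.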
35 rows where transcript_id = 22607852
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122606307 | GT-AG | 0 | 1.000000099473604e-05 | 3857 | rna-XM_021190187.1 22607852 | 1 | 369298 | 373154 | Mus pahari 10093 | CAG\|GTGAGCGGCG...GTCCTCTGAGCT/CCCGCACTAATC...TCCAG\|GCC | 0 | 1 | 1.648 |
| 122606308 | GT-AG | 0 | 0.0013419048968862 | 7854 | rna-XM_021190187.1 22607852 | 2 | 361257 | 369110 | Mus pahari 10093 | CAG\|GTACCTGGAA...CAGTCCTTGTTT/ACTGGGCTCAGT...TCCAG\|ATG | 1 | 1 | 4.036 |
| 122606309 | GT-AG | 0 | 1.000000099473604e-05 | 902 | rna-XM_021190187.1 22607852 | 3 | 360203 | 361104 | Mus pahari 10093 | CTG\|GTGAGAGGGC...TTGCTCTTGGCC/CCATCACTCAAT...CCTAG\|CCG | 0 | 1 | 5.977 |
| 122606310 | GT-AG | 0 | 5.806733804959495e-05 | 5301 | rna-XM_021190187.1 22607852 | 4 | 354728 | 360028 | Mus pahari 10093 | CAG\|GTGACCCCCC...AAGTGCTTGACC/AAGTGCTTGACC...CACAG\|ATT | 0 | 1 | 8.199 |
| 122606311 | GT-AG | 0 | 1.000000099473604e-05 | 1272 | rna-XM_021190187.1 22607852 | 5 | 353364 | 354635 | Mus pahari 10093 | CGG\|GTGAGCGCCC...CTGTTCTTGTCT/TGTCTTCTGACT...CCTAG\|TTA | 2 | 1 | 9.374 |
| 122606312 | GT-AG | 0 | 0.0007285783795725 | 2619 | rna-XM_021190187.1 22607852 | 6 | 350664 | 353282 | Mus pahari 10093 | CAG\|GTATCAGCTT...TGGCTCTGGAGC/TGGCTCTGGAGC...CCCAG\|GCC | 2 | 1 | 10.409 |
| 122606313 | GT-AG | 0 | 1.000000099473604e-05 | 4690 | rna-XM_021190187.1 22607852 | 7 | 345858 | 350547 | Mus pahari 10093 | AAG\|GTGAGCCCCA...CTGTGCTTCTCG/CCTAAGGTCACG...ACCAG\|ATG | 1 | 1 | 11.89 |
| 122606314 | GT-AG | 0 | 1.7359832265518575e-05 | 217 | rna-XM_021190187.1 22607852 | 8 | 345528 | 345744 | Mus pahari 10093 | AAG\|GTAGGCTGGA...CCATTTTTGTTG/ATGAAGCTGACT...CTCAG\|GTC | 0 | 1 | 13.333 |
| 122606315 | GT-AG | 0 | 1.000000099473604e-05 | 75 | rna-XM_021190187.1 22607852 | 9 | 345265 | 345339 | Mus pahari 10093 | CAA\|GTGAGGCTCG...GGACCCTGAACT/CCTGAACTCATG...CTCAG\|GTT | 2 | 1 | 15.734 |
| 122606316 | GT-AG | 0 | 1.000000099473604e-05 | 263 | rna-XM_021190187.1 22607852 | 10 | 344884 | 345146 | Mus pahari 10093 | AAG\|GTGAGTGAGG...CTCCTCTTGTCT/CAGTTACTGAAA...TCCAG\|GCA | 0 | 1 | 17.241 |
| 122606317 | GT-AG | 0 | 1.000000099473604e-05 | 893 | rna-XM_021190187.1 22607852 | 11 | 343832 | 344724 | Mus pahari 10093 | CAG\|GTACAAGGTT...GTGTCCGTGGCC/GACTTGCTGAGG...CCCAG\|GAC | 0 | 1 | 19.272 |
| 122606318 | GT-AG | 0 | 1.000000099473604e-05 | 1982 | rna-XM_021190187.1 22607852 | 12 | 341547 | 343528 | Mus pahari 10093 | CAG\|GTGAGGACCA...TGGCCCTGGACA/GCCCAGCTCAGT...GACAG\|ACT | 0 | 1 | 23.142 |
| 122606319 | GT-AG | 0 | 1.000000099473604e-05 | 5478 | rna-XM_021190187.1 22607852 | 13 | 335918 | 341395 | Mus pahari 10093 | AGG\|GTGAGTCTGG...CTCTTCTTACCT/TCTCTTCTTACC...CCCAG\|GCT | 1 | 1 | 25.07 |
| 122606320 | GT-AG | 0 | 1.000000099473604e-05 | 1350 | rna-XM_021190187.1 22607852 | 14 | 333634 | 334983 | Mus pahari 10093 | CCG\|GTGAGCTCAC...AGACCTCTGACT/AGACCTCTGACT...TGCAG\|ATT | 2 | 1 | 36.999 |
| 122606321 | GT-AG | 0 | 1.000000099473604e-05 | 3148 | rna-XM_021190187.1 22607852 | 15 | 330348 | 333495 | Mus pahari 10093 | CAG\|GTAGGTGGGT...ACCCTCTTTCCC/CTTCCAATCACC...AACAG\|GTG | 2 | 1 | 38.761 |
| 122606322 | GT-AG | 0 | 1.000000099473604e-05 | 1222 | rna-XM_021190187.1 22607852 | 16 | 328375 | 329596 | Mus pahari 10093 | CAG\|GTGCCTGATG...TGACCTGTATCC/GCCAAGCTGACC...CCCAG\|GAG | 0 | 1 | 48.352 |
| 122606323 | GT-AG | 0 | 1.000000099473604e-05 | 4933 | rna-XM_021190187.1 22607852 | 17 | 323239 | 328171 | Mus pahari 10093 | GAA\|GTAGGTCCCC...CATGCCTGAGCC/AGCACTCTCATC...CTTAG\|GAG | 2 | 1 | 50.945 |
| 122606324 | GT-AG | 0 | 4.331043947437322e-05 | 3710 | rna-XM_021190187.1 22607852 | 18 | 319438 | 323147 | Mus pahari 10093 | GAG\|GTAGGCCCTC...CCATCCTGAGCC/ACCATCCTGAGC...CACAG\|CTG | 0 | 1 | 52.107 |
| 122606325 | GT-AG | 0 | 1.000000099473604e-05 | 2854 | rna-XM_021190187.1 22607852 | 19 | 316437 | 319290 | Mus pahari 10093 | AGG\|GTGAGGAAGA...CTGTCTCTGACC/CTGTCTCTGACC...GCCAG\|GAG | 0 | 1 | 53.985 |
| 122606326 | GT-AG | 0 | 1.000000099473604e-05 | 22522 | rna-XM_021190187.1 22607852 | 20 | 293651 | 316172 | Mus pahari 10093 | CAG\|GTATGACCTA...ATGCCCCTGTCT/GGACCACTGACC...CACAG\|TCC | 0 | 1 | 57.356 |
| 122606327 | GT-AG | 0 | 1.000000099473604e-05 | 948 | rna-XM_021190187.1 22607852 | 21 | 292478 | 293425 | Mus pahari 10093 | CTG\|GTGAGGCGGG...TCCCCTGTATTC/CCCTCTCTCACG...TCCAG\|GCA | 0 | 1 | 60.23 |
| 122606328 | GT-AG | 0 | 1.000000099473604e-05 | 958 | rna-XM_021190187.1 22607852 | 22 | 291430 | 292387 | Mus pahari 10093 | CAG\|GTGAGCTGAA...CGTCTCTGATCT/GCGTCTCTGATC...CGCAG\|GGC | 0 | 1 | 61.379 |
| 122606329 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_021190187.1 22607852 | 23 | 291060 | 291150 | Mus pahari 10093 | AAG\|GTGCGCCCGG...GGCGCCTTGTGT/TGCGAACTCATC...TCCAG\|GAT | 0 | 1 | 64.943 |
| 122606330 | GT-AG | 0 | 1.000000099473604e-05 | 1101 | rna-XM_021190187.1 22607852 | 24 | 289828 | 290928 | Mus pahari 10093 | CAG\|GTGGGTGGGC...TCTCTCTTGTCC/AGAAGTCTCAGG...ATCAG\|TGA | 2 | 1 | 66.616 |
| 122606331 | GT-AG | 0 | 1.000000099473604e-05 | 805 | rna-XM_021190187.1 22607852 | 25 | 288818 | 289622 | Mus pahari 10093 | TTG\|GTGAGGATTC...CTTTTCTCACCC/TCTTTTCTCACC...CCCAG\|GTG | 0 | 1 | 69.234 |
| 122606332 | GT-AG | 0 | 1.000000099473604e-05 | 1560 | rna-XM_021190187.1 22607852 | 26 | 286877 | 288436 | Mus pahari 10093 | CAG\|GTGGGGTGGT...CCTGCCTTGTCT/CCGCGCTGGACC...CTCAG\|GTA | 0 | 1 | 74.1 |
| 122606333 | GT-AG | 0 | 1.000000099473604e-05 | 859 | rna-XM_021190187.1 22607852 | 27 | 285773 | 286631 | Mus pahari 10093 | CAG\|GTGCTCCTTA...GCAGCTTAGATC/ATGCAGCTTAGA...TGCAG\|GGA | 2 | 1 | 77.229 |
| 122606334 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_021190187.1 22607852 | 28 | 285539 | 285633 | Mus pahari 10093 | GAG\|GTGGGGGAGA...TGCTCCTTCCTT/ACCTAGCTCACC...CCCAG\|ATC | 0 | 1 | 79.004 |
| 122606335 | GT-AG | 0 | 1.000000099473604e-05 | 611 | rna-XM_021190187.1 22607852 | 29 | 284843 | 285453 | Mus pahari 10093 | AGA\|GTGAGTAAGG...AGTGCCTCATCT/TAGTGCCTCATC...CCCAG\|TGC | 1 | 1 | 80.089 |
| 122606336 | GT-AG | 0 | 1.000000099473604e-05 | 875 | rna-XM_021190187.1 22607852 | 30 | 283771 | 284645 | Mus pahari 10093 | ACG\|GTCAGACCCC...TTTCCCTCACCC/GTTTCCCTCACC...CCCAG\|ATC | 0 | 1 | 82.605 |
| 122606337 | GT-AG | 0 | 1.000000099473604e-05 | 1558 | rna-XM_021190187.1 22607852 | 31 | 281593 | 283150 | Mus pahari 10093 | AGG\|GTGAGGACCT...TCTCACTTATTT/CAGAATCTCACT...TCCAG\|GAA | 2 | 1 | 90.524 |
| 122606338 | GT-AG | 0 | 1.4121690612693391e-05 | 528 | rna-XM_021190187.1 22607852 | 32 | 280992 | 281519 | Mus pahari 10093 | GTG\|GTAAGTTGAA...CCTCCCTTCATT/CCTCCCTTCATT...CACAG\|CCG | 0 | 1 | 91.456 |
| 122606339 | GT-AG | 0 | 1.000000099473604e-05 | 1418 | rna-XM_021190187.1 22607852 | 33 | 279281 | 280698 | Mus pahari 10093 | CCG\|GTGAGCGCGT...CCTGCCTTCATC/CACCATCTCATC...CCCAG\|GTC | 2 | 1 | 95.198 |
| 122606340 | GT-AG | 0 | 1.000000099473604e-05 | 144 | rna-XM_021190187.1 22607852 | 34 | 278963 | 279106 | Mus pahari 10093 | TCA\|GTGAGTCCCT...GGTCCCTTGAGA/AGATAGCTGAAG...TGCAG\|AAC | 2 | 1 | 97.42 |
| 122606341 | GT-AG | 0 | 1.000000099473604e-05 | 2394 | rna-XM_021190187.1 22607852 | 35 | 276526 | 278919 | Mus pahari 10093 | GAG\|GTAAGATCTG...ATTCTCTTCCAA/GTGTGGTTGAGA...GCCAG\|GAG | 0 | 1 | 97.969 |
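
The rows above suggest two relationships between the coordinate columns, which the sketch below checks on the first four introns: length appears to equal end - start + 1 (inclusive coordinates), and start decreases as ordinal_index increases. The second observation would be consistent with a transcript on the minus strand, but that is an inference; the table has no strand column. The function names are hypothetical, and the exon-size calculation assumes adjacent introns are separated by exactly one exon.

```python
# (ordinal_index, start, end, length) copied from the first four rows of the table
rows = [
    (1, 369298, 373154, 3857),
    (2, 361257, 369110, 7854),
    (3, 360203, 361104, 902),
    (4, 354728, 360028, 5301),
]


def inclusive_span(start, end):
    """Number of bases covered when both start and end are included."""
    return end - start + 1


def gaps_between_introns(rows):
    """Bases between consecutive introns, assuming descending coordinates.

    Under the (unconfirmed) minus-strand reading, each gap is the exon that
    separates intron i from intron i + 1.
    """
    return [
        prev_start - next_end - 1
        for (_, prev_start, _, _), (_, _, next_end, _) in zip(rows, rows[1:])
    ]


# Every recorded length matches the inclusive span of its coordinates.
lengths_ok = all(inclusive_span(s, e) == length for _, s, e, length in rows)

# Start coordinates fall as the ordinal index rises.
starts = [s for _, s, _, _ in rows]
descending = all(a > b for a, b in zip(starts, starts[1:]))

gaps = gaps_between_introns(rows)
print(lengths_ok, descending, gaps)
```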
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
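
To see how the indexes above affect lookups, the schema can be loaded into an empty in-memory SQLite database and inspected with EXPLAIN QUERY PLAN. This is a sketch under stated assumptions: it uses Python's standard sqlite3 module, includes only two of the seven indexes for brevity, and omits the referenced transcripts and genomes tables (SQLite does not require them to exist when the table is created). The exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# Table definition and two of the indexes, taken from the schema on this page.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)


def plan_details(conn, sql, params=()):
    """Return the human-readable 'detail' column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[3] for row in rows]  # detail is the fourth column


# Filter on an indexed column: the planner can search the index.
indexed = plan_details(
    conn, "SELECT * FROM introns WHERE transcript_id = ?", (22607852,)
)

# Filter on a column with no index (length): the planner scans the table.
unindexed = plan_details(conn, "SELECT * FROM introns WHERE length > ?", (1000,))

print(indexed)
print(unindexed)
```

The filter on transcript_id behind the 35-row view on this page is the kind of lookup that idx_introns_transcript_id supports; filters on unindexed columns such as length fall back to a full table scan.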