introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
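The value ranges stated in the column descriptions can be turned into a small sanity check. The following is a minimal sketch, not part of the database itself: the function name `check_intron_row` is hypothetical, and the rule that an intron with in_cds = 0 carries a null phase is read directly from the phase description above.

```python
from typing import List, Optional


def check_intron_row(is_minor: int, score: float,
                     phase: Optional[int], in_cds: int) -> List[str]:
    """Return a list of problems; an empty list means the row matches
    the ranges given in the column descriptions."""
    problems = []
    if is_minor not in (0, 1):
        problems.append("is_minor must be 0 or 1")
    if not 0.0 <= score <= 100.0:
        problems.append("score must lie between 0 and 100")
    if in_cds not in (0, 1):
        problems.append("in_cds must be 0 or 1")
    if phase not in (None, 0, 1, 2):
        problems.append("phase must be 0, 1, 2, or null")
    # From the phase description: introns outside the coding
    # sequence are expected to have a null phase.
    if in_cds == 0 and phase is not None:
        problems.append("phase should be null when in_cds is 0")
    return problems


# Values taken from the first row of the table on this page.
print(check_intron_row(is_minor=0, score=1.000000099473604e-05,
                       phase=1, in_cds=1))
```

For the first row on this page the returned list is empty, since every value falls inside the documented range.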
16 rows where transcript_id = 9129089
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49484733 | GT-AG | 0 | 1.000000099473604e-05 | 4275 | rna-XM_004673704.2 9129089 | 1 | 41111178 | 41115452 | Condylura cristata 143302 | AAG\|GTAAGACGAT...TTGTGTTTAAAT/TAAATTTTTATG...TTCAG\|TAC | 1 | 1 | 3.237 |
49484734 | GT-AG | 0 | 1.000000099473604e-05 | 3681 | rna-XM_004673704.2 9129089 | 2 | 41107432 | 41111112 | Condylura cristata 143302 | GGG\|GTAAGTGCTT...CCTTTTTTAAAT/CCTTTTTTAAAT...TACAG\|TGG | 0 | 1 | 5.405 |
49484735 | GT-AG | 0 | 1.000000099473604e-05 | 48339 | rna-XM_004673704.2 9129089 | 3 | 41058423 | 41106761 | Condylura cristata 143302 | AAC\|GTAAGTAATA...TTTTCTTTTGTA/CCTATTTTCATT...CCTAG\|CCT | 1 | 1 | 27.761 |
49484736 | GT-AG | 0 | 1.631088599331809e-05 | 827 | rna-XM_004673704.2 9129089 | 4 | 41057440 | 41058266 | Condylura cristata 143302 | CAA\|GTAAGTTCCC...TTTTGTTTATTT/TGTTTATTCATT...CAAAG\|GAC | 1 | 1 | 32.966 |
49484737 | GT-AG | 0 | 0.0096556915947432 | 68637 | rna-XM_004673704.2 9129089 | 5 | 40988467 | 41057103 | Condylura cristata 143302 | CAG\|GTATGTTTTT...TGTTTATTGACA/AATTTTCTCATT...AACAG\|CTC | 1 | 1 | 44.178 |
49484738 | GT-AG | 0 | 1.000000099473604e-05 | 2666 | rna-XM_004673704.2 9129089 | 6 | 40985676 | 40988341 | Condylura cristata 143302 | AAA\|GTAAGTGCTA...TTCCCTGTATAT/GTTGCAATCACA...TCTAG\|GAT | 0 | 1 | 48.348 |
49484739 | GT-AG | 0 | 1.000000099473604e-05 | 11691 | rna-XM_004673704.2 9129089 | 7 | 40973801 | 40985491 | Condylura cristata 143302 | AAG\|GTAATTACCA...CATTCCTGAGTC/GCATTCCTGAGT...CACAG\|CTA | 1 | 1 | 54.488 |
49484740 | GT-AG | 0 | 1.000000099473604e-05 | 616 | rna-XM_004673704.2 9129089 | 8 | 40973076 | 40973691 | Condylura cristata 143302 | AAG\|GTAGAACATA...AGATCATTGATT/TCAAATTTTATT...TCTAG\|GCA | 2 | 1 | 58.125 |
49484741 | GT-AG | 0 | 7.020950254473691e-05 | 4352 | rna-XM_004673704.2 9129089 | 9 | 40968668 | 40973019 | Condylura cristata 143302 | ATT\|GTAAGTGTTT...GCTTTCTTACAT/CTTTGCTTCATC...TACAG\|TTA | 1 | 1 | 59.993 |
49484742 | GT-AG | 0 | 1.000000099473604e-05 | 800 | rna-XM_004673704.2 9129089 | 10 | 40967742 | 40968541 | Condylura cristata 143302 | CAG\|GTAAGGCCGT...TATTTCTGAAAT/GTATTTCTGAAA...TTCAG\|GAG | 1 | 1 | 64.198 |
49484743 | GT-AG | 0 | 0.0011432349683011 | 849 | rna-XM_004673704.2 9129089 | 11 | 40966707 | 40967555 | Condylura cristata 143302 | GAG\|GTAGCTATTT...TATTTTTTTAAT/TATTTTTTTAAT...TTTAG\|GAA | 1 | 1 | 70.404 |
49484744 | GT-AG | 0 | 1.000000099473604e-05 | 1778 | rna-XM_004673704.2 9129089 | 12 | 40964867 | 40966644 | Condylura cristata 143302 | AGG\|GTAAGTAACA...TATATTTTAATA/TATATTTTAATA...TACAG\|AAA | 0 | 1 | 72.472 |
49484745 | GT-AG | 0 | 1.000000099473604e-05 | 692 | rna-XM_004673704.2 9129089 | 13 | 40963965 | 40964656 | Condylura cristata 143302 | ACT\|GTAAGAAAAA...AATTTCTTTTCA/AAGTGACTTACA...AACAG\|GGT | 0 | 1 | 79.479 |
49484746 | GT-AG | 0 | 0.0003789389909161 | 5736 | rna-XM_004673704.2 9129089 | 14 | 40958079 | 40963814 | Condylura cristata 143302 | GAT\|GTAGGTGTTA...ATATCTTTAATG/ATATCTTTAATG...ATTAG\|GTT | 0 | 1 | 84.484 |
49484747 | GT-AG | 0 | 1.000000099473604e-05 | 955 | rna-XM_004673704.2 9129089 | 15 | 40956930 | 40957884 | Condylura cristata 143302 | TAG\|GTAAGACATA...CTTTCTTTGATA/TTCTGTTTCATT...TATAG\|GCC | 2 | 1 | 90.958 |
49484748 | GT-AG | 0 | 0.000519060507285 | 1276 | rna-XM_004673704.2 9129089 | 16 | 40955498 | 40956773 | Condylura cristata 143302 | TGA\|GTAAGCTTAA...TTCTCCATAAGG/AGGTAACTAATA...TTCAG\|GGA | 2 | 1 | 96.163 |
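Two relationships between columns can be checked against the rows above. The sketch below uses three rows copied from the table. It tests whether length equals end - start + 1 (an inclusive coordinate span), and whether dinucleotide_pair matches the first and last two bases of the intron portion of scored_motifs. The second check rests on an assumption the page does not state: that the two `|` characters in scored_motifs mark the exon/intron boundaries. The helper names are hypothetical.

```python
# (id, start, end, length, dinucleotide_pair, scored_motifs), copied from the table.
ROWS = [
    (49484733, 41111178, 41115452, 4275, "GT-AG",
     "AAG|GTAAGACGAT...TTGTGTTTAAAT/TAAATTTTTATG...TTCAG|TAC"),
    (49484740, 40973076, 40973691, 616, "GT-AG",
     "AAG|GTAGAACATA...AGATCATTGATT/TCAAATTTTATT...TCTAG|GCA"),
    (49484748, 40955498, 40956773, 1276, "GT-AG",
     "TGA|GTAAGCTTAA...TTCTCCATAAGG/AGGTAACTAATA...TTCAG|GGA"),
]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """First and last two bases of the text between the two '|' marks.

    Assumption: '|' separates flanking exon bases from intron sequence.
    """
    _, intron_part, _ = scored_motifs.split("|")
    return f"{intron_part[:2]}-{intron_part[-2:]}"


for intron_id, start, end, length, pair, motifs in ROWS:
    length_ok = span_length(start, end) == length
    pair_ok = terminal_dinucleotides(motifs) == pair
    print(intron_id, length_ok, pair_ok)
```

Both checks hold for the three sampled rows; for example, 41115452 - 41111178 + 1 = 4275. Start coordinates also decrease as ordinal_index increases, which would be consistent with a transcript annotated on the reverse strand, although the table has no strand column to confirm this.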
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
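As an illustration of how the filter shown on this page (transcript_id = 9129089) maps onto the schema, the sketch below loads the table definition into an in-memory SQLite database using Python's standard `sqlite3` module, inserts the first two rows from the table, and runs the equivalent query. It is a local reconstruction under stated assumptions, not the hosted database: only one of the indexes is created, and the referenced transcripts and genomes tables are absent, which SQLite tolerates here because foreign key enforcement is off by default.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
    "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
    "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
    "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
    "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

# First two rows of the table on this page, in schema column order.
SAMPLE_ROWS = [
    (49484733, "GT-AG", 0, 1.000000099473604e-05, 4275, 9129089, 1,
     41111178, 41115452, 143302,
     "AAG|GTAAGACGAT...TTGTGTTTAAAT/TAAATTTTTATG...TTCAG|TAC", 1, 1, 3.237),
    (49484734, "GT-AG", 0, 1.000000099473604e-05, 3681, 9129089, 2,
     41107432, 41111112, 143302,
     "GGG|GTAAGTGCTT...CCTTTTTTAAAT/CCTTTTTTAAAT...TACAG|TGG", 0, 1, 5.405),
]

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    SAMPLE_ROWS,
)

# Same filter and sort order as the page: rows for one transcript, by id.
result = conn.execute(
    "SELECT id, ordinal_index, length, phase FROM introns "
    "WHERE transcript_id = ? ORDER BY id",
    (9129089,),
).fetchall()
print(result)
conn.close()
```

Because "end" is an SQL keyword, any query that selects it by name should keep the double quotes used in the schema.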