introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron within the coding sequence (0, 1, or 2); null for introns outside the coding sequence
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
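The column descriptions above can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: the helper names `terminal_dinucleotides` and `span_length` are hypothetical, and the `scored_motifs` layout (`upstream|intron_start...x/y...intron_end|downstream`) is an assumption inferred from the rows shown for transcript 22607905, not a documented format. It also assumes `start` and `end` are both inclusive, which matches the `length` values in those rows.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a dinucleotide_pair-style string (e.g. 'GT-AG') from scored_motifs.

    Assumed layout (inferred from the displayed rows, not documented):
    upstream_exon|intron_start...x/y...intron_end|downstream_exon
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two bases of the intronic part
    return f"{intron[:2]}-{intron[-2:]}"


def span_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both inclusive (assumption)."""
    return end - start + 1


# Values copied from the rows with ordinal_index 1 and 23
motifs_1 = "GCT|GTAAGTTTAT...GTTATGTTATTT/GTTATTTTTATG...TGTAG|GTC"
motifs_23 = "TCA|GTATGTTTCT...GTGTCTATAAAT/GCACCATTTATT...GTGTA|TTT"

print(terminal_dinucleotides(motifs_1))   # GT-AG
print(terminal_dinucleotides(motifs_23))  # GT-TA
print(span_length(86173426, 86177425))    # 4000
```

If a row fails either check, that would suggest the assumed layout or coordinate convention does not hold for it, rather than that the data is wrong.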
31 rows where transcript_id = 22607905
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122607820 | GT-AG | 0 | 0.0002030934896647 | 4000 | rna-XM_029535813.1 22607905 | 1 | 86173426 | 86177425 | Mus pahari 10093 | GCT\|GTAAGTTTAT...GTTATGTTATTT/GTTATTTTTATG...TGTAG\|GTC | 2 | 1 | 0.641 |
| 122607821 | GT-AG | 0 | 0.0031549135371773 | 12153 | rna-XM_029535813.1 22607905 | 2 | 86160150 | 86172302 | Mus pahari 10093 | CAG\|GTATAGTCTC...AGATCTTTGACT/AGATCTTTGACT...TACAG\|AAA | 0 | 1 | 23.137 |
| 122607822 | GT-AG | 0 | 1.000000099473604e-05 | 11589 | rna-XM_029535813.1 22607905 | 3 | 86148457 | 86160045 | Mus pahari 10093 | GAA\|GTAAGGAATT...ACAGCTTTAAAG/ACACATTTAATA...AACAG\|ATT | 2 | 1 | 25.22 |
| 122607823 | GT-AG | 0 | 1.000000099473604e-05 | 2553 | rna-XM_029535813.1 22607905 | 4 | 86145746 | 86148298 | Mus pahari 10093 | ATG\|GTAAGTTTTG...AAAACGATAACT/CGATAACTAACT...TTCAG\|TGA | 1 | 1 | 28.385 |
| 122607824 | GT-AG | 0 | 4.601234460045662e-05 | 838 | rna-XM_029535813.1 22607905 | 5 | 86144787 | 86145624 | Mus pahari 10093 | GAA\|GTAAGTTTAC...TAACTTGTGATT/GGATTATTCATG...AACAG\|TAA | 2 | 1 | 30.809 |
| 122607825 | GT-AG | 0 | 0.0001686862008937 | 2297 | rna-XM_029535813.1 22607905 | 6 | 86142378 | 86144674 | Mus pahari 10093 | ACA\|GTAAGTGTTT...TTCTTCTTAAAC/TTATATTTGATT...TTTAG\|GCA | 0 | 1 | 33.053 |
| 122607826 | GT-AG | 0 | 3.828264705304531e-05 | 129 | rna-XM_029535813.1 22607905 | 7 | 86142169 | 86142297 | Mus pahari 10093 | AAG\|GTATAACATC...TTTTGTTTAAAA/TTTTGTTTAAAA...TTCAG\|ACA | 2 | 1 | 34.655 |
| 122607827 | GT-AG | 0 | 0.0008341764421426 | 9074 | rna-XM_029535813.1 22607905 | 8 | 86133031 | 86142104 | Mus pahari 10093 | GAA\|GTAAGTTTTG...TTATTTTTATTT/TTTTATTTTACT...TAAAG\|AAC | 0 | 1 | 35.938 |
| 122607828 | GT-AG | 0 | 1.000000099473604e-05 | 2054 | rna-XM_029535813.1 22607905 | 9 | 86130833 | 86132886 | Mus pahari 10093 | GAT\|GTGAGTATGT...TTTATATTAAAT/CAATTTCTGACT...TACAG\|GTG | 0 | 1 | 38.822 |
| 122607829 | GT-AG | 0 | 0.0008578713242401 | 80 | rna-XM_029535813.1 22607905 | 10 | 86130707 | 86130786 | Mus pahari 10093 | AGG\|GTATGTCTTT...TTTGCTTTTTCA/TGCTTTTTCAGA...TGTAG\|GGT | 1 | 1 | 39.744 |
| 122607830 | GT-AG | 0 | 1.000000099473604e-05 | 3232 | rna-XM_029535813.1 22607905 | 11 | 86127264 | 86130495 | Mus pahari 10093 | AAA\|GTAAGTAGCT...TAATTATTACCA/ATAATTATTACC...TTCAG\|TTA | 2 | 1 | 43.97 |
| 122607831 | GT-AG | 0 | 0.0001089102022095 | 1082 | rna-XM_029535813.1 22607905 | 12 | 86126059 | 86127140 | Mus pahari 10093 | ACT\|GTAAGTGGAA...TTTTTCTTAACA/TTTTTTCTTAAC...CACAG\|AAT | 2 | 1 | 46.434 |
| 122607832 | GT-AG | 0 | 1.000000099473604e-05 | 2093 | rna-XM_029535813.1 22607905 | 13 | 86123798 | 86125890 | Mus pahari 10093 | ACG\|GTGAGTGTTA...AATTTTTTAAAT/AATTTTTTAAAT...TTAAG\|GTT | 2 | 1 | 49.8 |
| 122607833 | GT-AG | 0 | 1.000000099473604e-05 | 569 | rna-XM_029535813.1 22607905 | 14 | 86123117 | 86123685 | Mus pahari 10093 | CAG\|GTAATGGTAG...TTTCTGTTAATT/TTTTTTTTCACT...TTCAG\|GTT | 0 | 1 | 52.043 |
| 122607834 | GT-AG | 0 | 0.3644546005356401 | 3654 | rna-XM_029535813.1 22607905 | 15 | 86119320 | 86122973 | Mus pahari 10093 | TGG\|GTATGCTTTG...TATATCATAATT/AGTAATTTAATT...TGCAG\|ACT | 2 | 1 | 54.908 |
| 122607835 | GT-AG | 0 | 1.000000099473604e-05 | 1583 | rna-XM_029535813.1 22607905 | 16 | 86117548 | 86119130 | Mus pahari 10093 | AAA\|GTAAGTCAAA...TAAATCTTGATT/ATTTTTCTGATA...TCTAG\|ATT | 2 | 1 | 58.694 |
| 122607836 | GT-AG | 0 | 1.000000099473604e-05 | 477 | rna-XM_029535813.1 22607905 | 17 | 86116977 | 86117453 | Mus pahari 10093 | CAG\|GTAGGAGTTT...TAATGCTTATTT/ATTGGTCTGACT...CATAG\|GCT | 0 | 1 | 60.577 |
| 122607837 | GT-AG | 0 | 1.000000099473604e-05 | 339 | rna-XM_029535813.1 22607905 | 18 | 86116543 | 86116881 | Mus pahari 10093 | TTG\|GTAAGACTTA...ATATTCTTGTTG/AGGAACTTCATC...ATTAG\|GCT | 2 | 1 | 62.48 |
| 122607838 | GT-AG | 0 | 5.765058909827663e-05 | 2620 | rna-XM_029535813.1 22607905 | 19 | 86113742 | 86116361 | Mus pahari 10093 | CAG\|GTATGTACCC...TAGATCTTCATT/TAGATCTTCATT...CATAG\|GTC | 0 | 1 | 66.106 |
| 122607839 | GT-AG | 0 | 1.000000099473604e-05 | 954 | rna-XM_029535813.1 22607905 | 20 | 86112683 | 86113636 | Mus pahari 10093 | AAG\|GTAACACAAT...CTAGTGTTAACG/ACGTTTCTAATG...TGCAG\|TCA | 0 | 1 | 68.209 |
| 122607840 | GT-AG | 0 | 1.000000099473604e-05 | 3543 | rna-XM_029535813.1 22607905 | 21 | 86109047 | 86112589 | Mus pahari 10093 | AAG\|GTGAGAAGAC...CTTTCTTTAATT/CTTTCTTTAATT...TCAAG\|GTT | 0 | 1 | 70.072 |
| 122607841 | GT-AG | 0 | 2.417149747308025e-05 | 2132 | rna-XM_029535813.1 22607905 | 22 | 86106785 | 86108916 | Mus pahari 10093 | GAG\|GTAGGTCTTT...CTAGCTTTATTT/CACATTTTTACC...TAAAG\|GCA | 1 | 1 | 72.676 |
| 122607842 | GT-TA | 0 | 0.3098511549571903 | 3441 | rna-XM_029535813.1 22607905 | 23 | 86103200 | 86106640 | Mus pahari 10093 | TCA\|GTATGTTTCT...GTGTCTATAAAT/GCACCATTTATT...GTGTA\|TTT | 1 | 1 | 75.561 |
| 122607843 | GT-AG | 0 | 0.0004894425263825 | 1419 | rna-XM_029535813.1 22607905 | 24 | 86101614 | 86103032 | Mus pahari 10093 | CTG\|GTAACAATCT...GAAGTTTTAACA/TTAATTCTAAGT...TACAG\|ATG | 0 | 1 | 78.906 |
| 122607844 | GT-AG | 0 | 1.000000099473604e-05 | 1902 | rna-XM_029535813.1 22607905 | 25 | 86099602 | 86101503 | Mus pahari 10093 | TAG\|GTATGGCATG...TCACTATTAACA/AAGTTTCTGATC...TGTAG\|GCT | 2 | 1 | 81.11 |
| 122607845 | GT-AG | 0 | 0.0004732844169115 | 1814 | rna-XM_029535813.1 22607905 | 26 | 86097580 | 86099393 | Mus pahari 10093 | TAT\|GTAAGTTTGT...GTTTCATTATTA/GGAAGTTTCATT...TTCAG\|ATT | 0 | 1 | 85.276 |
| 122607846 | GT-AG | 0 | 0.0023073191481024 | 2129 | rna-XM_029535813.1 22607905 | 27 | 86095326 | 86097454 | Mus pahari 10093 | TGG\|GTATGTTCCT...CTGTTTTTGTTT/TGCTTTCTGATG...TGTAG\|CTT | 2 | 1 | 87.78 |
| 122607847 | GT-AG | 0 | 9.56760972677588e-05 | 91 | rna-XM_029535813.1 22607905 | 28 | 86095117 | 86095207 | Mus pahari 10093 | GAG\|GTGACTTTCC...TTTCTCTTCAAT/TTTCTCTTCAAT...CTTAG\|TGT | 0 | 1 | 90.144 |
| 122607848 | GT-AG | 0 | 1.000000099473604e-05 | 361 | rna-XM_029535813.1 22607905 | 29 | 86094683 | 86095043 | Mus pahari 10093 | CAG\|GTGAGCTTAT...GACACATTGAGA/TTCAGACACATT...TTCAG\|GTG | 1 | 1 | 91.607 |
| 122607849 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_029535813.1 22607905 | 30 | 86094489 | 86094572 | Mus pahari 10093 | CTC\|GTGAGTGGTG...AAATTTGTACCT/CTACTATTAAGA...TTTAG\|GTG | 0 | 1 | 93.81 |
| 122607850 | GT-AG | 0 | 0.0003310680322153 | 1893 | rna-XM_029535813.1 22607905 | 31 | 86092470 | 86094362 | Mus pahari 10093 | ATG\|GTAAGCTAAG...CAATCCTTATCC/CCAATCCTTATC...TCTAG\|CTT | 0 | 1 | 96.334 |
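Two patterns in the rows above are easy to check programmatically: genomic coordinates fall as `ordinal_index` rises, and only a couple of introns have a score well above the rest. The sketch below uses four rows copied from the table; the `IntronRow` type and both helper functions are hypothetical names introduced for illustration. Descending coordinates would be consistent with a transcript annotated on the reverse strand, but this table has no strand column, so that reading is an inference rather than something the data states.

```python
from typing import List, NamedTuple


class IntronRow(NamedTuple):
    id: int
    ordinal_index: int
    start: int
    end: int
    score: float


# Subset of the rows shown for transcript_id = 22607905
rows = [
    IntronRow(122607820, 1, 86173426, 86177425, 0.0002030934896647),
    IntronRow(122607834, 15, 86119320, 86122973, 0.3644546005356401),
    IntronRow(122607842, 23, 86103200, 86106640, 0.3098511549571903),
    IntronRow(122607850, 31, 86092470, 86094362, 0.0003310680322153),
]


def coordinates_descend(rows: List[IntronRow]) -> bool:
    """True if each intron lies entirely below the previous one in ordinal order."""
    ordered = sorted(rows, key=lambda r: r.ordinal_index)
    return all(a.start > b.end for a, b in zip(ordered, ordered[1:]))


def top_scoring(rows: List[IntronRow]) -> IntronRow:
    """Row with the highest minor-intron score."""
    return max(rows, key=lambda r: r.score)


print(coordinates_descend(rows))  # True
print(top_scoring(rows).id)       # 122607834
```

Even the highest score here (about 0.36 on the 0-100% scale) is far from a minor-intron call, which agrees with `is_minor` being 0 for every row.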
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
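To illustrate how the schema and its `transcript_id` index serve the per-transcript view, here is a minimal sketch using Python's standard `sqlite3` module with an in-memory database. It is an illustration under stated assumptions, not how the site itself is implemented: the foreign-key clauses are omitted because the `transcripts` and `genomes` tables are not shown here, only one of the indexes is created, and the helper names are hypothetical. The two inserted rows are copied from the table for transcript 22607905 (with `scored_motifs` left out for brevity).

```python
import sqlite3

# Trimmed copy of the schema: foreign keys omitted (referenced tables not shown)
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ("id")
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
"""

INSERT = """
INSERT INTO introns ("id", "dinucleotide_pair", "is_minor", "score", "length",
                     "transcript_id", "ordinal_index", "start", "end",
                     "taxonomy_id", "phase", "in_cds", "relative_position")
VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)
"""

# Inserted out of ordinal order on purpose, to show the ORDER BY at work
ROWS = [
    (122607842, "GT-TA", 0, 0.3098511549571903, 3441, 22607905, 23,
     86103200, 86106640, 10093, 1, 1, 75.561),
    (122607820, "GT-AG", 0, 0.0002030934896647, 4000, 22607905, 1,
     86173426, 86177425, 10093, 2, 1, 0.641),
]

QUERY = """
SELECT id, ordinal_index, dinucleotide_pair, score
FROM introns
WHERE transcript_id = ?
ORDER BY ordinal_index
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database holding the two sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(INSERT, ROWS)
    return conn


def introns_for_transcript(conn: sqlite3.Connection, transcript_id: int):
    """All introns of one transcript, in transcript order."""
    return conn.execute(QUERY, (transcript_id,)).fetchall()


def query_plan(conn: sqlite3.Connection, transcript_id: int):
    """Human-readable detail strings from EXPLAIN QUERY PLAN."""
    plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (transcript_id,))
    return [row[-1] for row in plan.fetchall()]


conn = build_demo_db()
for row in introns_for_transcript(conn, 22607905):
    print(row)
for detail in query_plan(conn, 22607905):
    print(detail)
```

The plan output is where to look for `idx_introns_transcript_id`; its exact wording varies between SQLite versions, so it is printed rather than compared against a fixed string.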