introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
44 rows where transcript_id = 25387362
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140014674 | GT-AG | 0 | 1.000000099473604e-05 | 2630 | rna-XM_040241178.1 25387362 | 2 | 108611861 | 108614490 | Oryx dammah 59534 | AAG|GTAATGCTTT...CCTCTTCTGATA/CCTCTTCTGATA...TTCAG|AAG | 1 | 1 | 4.669 |
| 140014675 | GT-AG | 0 | 1.000000099473604e-05 | 4222 | rna-XM_040241178.1 25387362 | 3 | 108614622 | 108618843 | Oryx dammah 59534 | AAG|GTAAGTTACC...AAGCATTTATTC/CATTTATTCATT...ATTAG|TAT | 0 | 1 | 6.378 |
| 140014676 | GT-AG | 0 | 1.000000099473604e-05 | 1376 | rna-XM_040241178.1 25387362 | 4 | 108618975 | 108620350 | Oryx dammah 59534 | CTG|GTGAGTAAAT...TTGTACTTAATA/TTTGATTTTACC...TCTAG|CAA | 2 | 1 | 8.087 |
| 140014677 | GT-AG | 0 | 1.000000099473604e-05 | 1915 | rna-XM_040241178.1 25387362 | 5 | 108620493 | 108622407 | Oryx dammah 59534 | TCG|GTAAGTTATT...GATGTTTAAATT/TTTAAATTCAAT...TTTAG|CTC | 0 | 1 | 9.939 |
| 140014678 | GT-AG | 0 | 1.000000099473604e-05 | 2894 | rna-XM_040241178.1 25387362 | 6 | 108622595 | 108625488 | Oryx dammah 59534 | TAG|GTAAGGTGAC...CTTTCTTCAAAT/TCAATTTTTACT...GGTAG|AAG | 1 | 1 | 12.378 |
| 140014679 | GT-AG | 0 | 1.000000099473604e-05 | 1948 | rna-XM_040241178.1 25387362 | 7 | 108625683 | 108627630 | Oryx dammah 59534 | CTG|GTAAGTCCGT...GCTTCTTTCTCC/CTCCAGGTGATG...TTTAG|CCC | 0 | 1 | 14.908 |
| 140014680 | GT-AG | 0 | 0.0047257674008599 | 3093 | rna-XM_040241178.1 25387362 | 8 | 108627685 | 108630777 | Oryx dammah 59534 | AAG|GTAGTCTGTC...TGTTTCTTATCA/TTGTTTCTTATC...AACAG|CTA | 0 | 1 | 15.612 |
| 140014681 | GT-AG | 0 | 1.000000099473604e-05 | 909 | rna-XM_040241178.1 25387362 | 9 | 108630898 | 108631806 | Oryx dammah 59534 | GAG|GTGAGTTATT...CTCTGTTTATTT/CAGTTATTTACT...TCCAG|TAT | 0 | 1 | 17.178 |
| 140014682 | GT-AG | 0 | 1.000000099473604e-05 | 1512 | rna-XM_040241178.1 25387362 | 10 | 108632036 | 108633547 | Oryx dammah 59534 | CAG|GTTAAAACAA...TGGTTTTTAAAT/TGGTTTTTAAAT...TGTAG|CAC | 1 | 1 | 20.164 |
| 140014683 | GT-AG | 0 | 1.000000099473604e-05 | 1997 | rna-XM_040241178.1 25387362 | 11 | 108633658 | 108635654 | Oryx dammah 59534 | AAA|GTAGGTAAAA...CTGTCCTTTGCT/CCTTTGCTAATT...ATTAG|ATT | 0 | 1 | 21.599 |
| 140014684 | GT-AG | 0 | 1.000000099473604e-05 | 5526 | rna-XM_040241178.1 25387362 | 12 | 108635850 | 108641375 | Oryx dammah 59534 | CAT|GTAAGTAAAT...TTACTTTTATAG/ATTACTTTTATA...TTTAG|TGC | 0 | 1 | 24.142 |
| 140014685 | GT-AG | 0 | 1.000000099473604e-05 | 1016 | rna-XM_040241178.1 25387362 | 13 | 108641530 | 108642545 | Oryx dammah 59534 | AAG|GTGAGAATTC...TTTCCCTTTCTC/TAGATCATCACT...CTTAG|GCC | 1 | 1 | 26.151 |
| 140014686 | GT-AG | 0 | 0.0001902809740645 | 3153 | rna-XM_040241178.1 25387362 | 14 | 108642767 | 108645919 | Oryx dammah 59534 | AAG|GTATGATTTG...CTGTACTTAAAC/TCTGTACTTAAA...TGCAG|GAG | 0 | 1 | 29.034 |
| 140014687 | GT-AG | 0 | 1.000000099473604e-05 | 3289 | rna-XM_040241178.1 25387362 | 15 | 108646097 | 108649385 | Oryx dammah 59534 | CAG|GTGAGTAAGA...TTTTCCTTTTTT/TTTTGTATCATT...AATAG|CTA | 0 | 1 | 31.342 |
| 140014688 | GT-AG | 0 | 1.000000099473604e-05 | 5711 | rna-XM_040241178.1 25387362 | 16 | 108649521 | 108655231 | Oryx dammah 59534 | CAG|GTAGGGACTT...TAACTTTTACCC/CCCCTCCTCATT...CATAG|GAT | 0 | 1 | 33.103 |
| 140014689 | GT-AG | 0 | 1.000000099473604e-05 | 589 | rna-XM_040241178.1 25387362 | 17 | 108655378 | 108655966 | Oryx dammah 59534 | GAG|GTAATAGAAG...ATGTTTATAACA/ATGTTTATAACA...ACCAG|AAT | 2 | 1 | 35.007 |
| 140014690 | GT-AG | 0 | 1.000000099473604e-05 | 694 | rna-XM_040241178.1 25387362 | 18 | 108656175 | 108656868 | Oryx dammah 59534 | AGG|GTGAGAGATG...TCATTTTTGAAA/TCATTTTTGAAA...CGCAG|ATT | 0 | 1 | 37.72 |
| 140014691 | GT-AG | 0 | 1.000000099473604e-05 | 1789 | rna-XM_040241178.1 25387362 | 19 | 108657049 | 108658837 | Oryx dammah 59534 | AAG|GTGGGCAAGA...TTTTTTTTTTTA/TCTTAGTTCATT...CTCAG|AAG | 0 | 1 | 40.068 |
| 140014692 | GT-AG | 0 | 1.000000099473604e-05 | 708 | rna-XM_040241178.1 25387362 | 20 | 108659018 | 108659725 | Oryx dammah 59534 | GAG|GTGAGTCCCA...GCTGCCTTCCCC/GCCTTCCCCACG...AGCAG|GAC | 0 | 1 | 42.416 |
| 140014693 | GT-AG | 0 | 1.000000099473604e-05 | 290 | rna-XM_040241178.1 25387362 | 21 | 108659843 | 108660132 | Oryx dammah 59534 | CAG|GTCAGTGCTG...AACTTCTTATAT/AGAATATTCACT...AATAG|CTT | 0 | 1 | 43.942 |
| 140014694 | GT-AG | 0 | 1.000000099473604e-05 | 427 | rna-XM_040241178.1 25387362 | 22 | 108660278 | 108660704 | Oryx dammah 59534 | CTG|GTACAGTACT...ATAACTTTCCTT/GATGATTTCATG...TACAG|ATA | 1 | 1 | 45.833 |
| 140014695 | GT-AG | 0 | 1.000000099473604e-05 | 1623 | rna-XM_040241178.1 25387362 | 23 | 108660854 | 108662476 | Oryx dammah 59534 | AAG|GTAGGAAGTC...GTCTTTTTATTT/AGTCTTTTTATT...TGTAG|GTT | 0 | 1 | 47.776 |
| 140014696 | GT-AG | 0 | 1.000000099473604e-05 | 1057 | rna-XM_040241178.1 25387362 | 24 | 108662658 | 108663714 | Oryx dammah 59534 | AAG|GTAAGTCAGA...GTTTACCTAGTA/TGTTTACCTAGT...TGAAG|ATG | 1 | 1 | 50.137 |
| 140014697 | GT-AG | 0 | 1.000000099473604e-05 | 1526 | rna-XM_040241178.1 25387362 | 25 | 108664029 | 108665554 | Oryx dammah 59534 | TTA|GTAAGTGGGG...TTGCATTTATTT/ATTGCATTTATT...TTCAG|GAG | 0 | 1 | 54.232 |
| 140014698 | GT-AG | 0 | 2.7704632014551308e-05 | 1897 | rna-XM_040241178.1 25387362 | 26 | 108665761 | 108667657 | Oryx dammah 59534 | AAA|GTAATTATTA...TCACATTTAAAT/TCACATTTAAAT...TTTAG|AGA | 2 | 1 | 56.919 |
| 140014699 | GT-AG | 0 | 0.0080222395090035 | 2867 | rna-XM_040241178.1 25387362 | 27 | 108667851 | 108670717 | Oryx dammah 59534 | AGA|GTATGTGCTG...TTGCTTTTAATG/TTGCTTTTAATG...TCTAG|ACA | 0 | 1 | 59.437 |
| 140014700 | GT-AG | 0 | 1.000000099473604e-05 | 142 | rna-XM_040241178.1 25387362 | 28 | 108670849 | 108670990 | Oryx dammah 59534 | CAG|GTAGGCAGAC...TGACACGTCACG/AATAAAGTGATG...TTCAG|ATT | 2 | 1 | 61.145 |
| 140014701 | GT-AG | 0 | 1.000000099473604e-05 | 103 | rna-XM_040241178.1 25387362 | 29 | 108671132 | 108671234 | Oryx dammah 59534 | AGA|GTAAGTACGG...GTCTCCCTACCG/TCCCTACCGATT...TGTAG|GCT | 2 | 1 | 62.984 |
| 140014702 | GT-AG | 0 | 1.000000099473604e-05 | 749 | rna-XM_040241178.1 25387362 | 30 | 108671362 | 108672110 | Oryx dammah 59534 | CAG|GTACGGCAGC...TATTCTTTCCCT/CTCCATCTCAGC...CGCAG|GTG | 0 | 1 | 64.641 |
| 140014703 | GT-AG | 0 | 1.000000099473604e-05 | 686 | rna-XM_040241178.1 25387362 | 31 | 108672276 | 108672961 | Oryx dammah 59534 | AGG|GTGTGTGTCT...CTATTCATAACT/CTATTCATAACT...CATAG|GAA | 0 | 1 | 66.793 |
| 140014704 | GT-AG | 0 | 2.1617923228601173e-05 | 1452 | rna-XM_040241178.1 25387362 | 32 | 108673122 | 108674573 | Oryx dammah 59534 | CTG|GTAAGTTTAA...ATAGCCTAGACC/ATTCAATTAATA...GTTAG|AAC | 1 | 1 | 68.88 |
| 140014705 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_040241178.1 25387362 | 33 | 108674690 | 108674773 | Oryx dammah 59534 | GAG|GTAATATACA...CCTCTTTTACCT/TCCTCTTTTACC...TTTAG|GAG | 0 | 1 | 70.393 |
| 140014706 | GT-AG | 0 | 1.000000099473604e-05 | 3273 | rna-XM_040241178.1 25387362 | 34 | 108674992 | 108678264 | Oryx dammah 59534 | ACG|GTAAGAGCAG...ACAGTCTTAACA/TTGAGTTTGACA...CGTAG|GGC | 2 | 1 | 73.236 |
| 140014707 | GT-AG | 0 | 1.000000099473604e-05 | 1111 | rna-XM_040241178.1 25387362 | 35 | 108678423 | 108679533 | Oryx dammah 59534 | AGG|GTAAGAGGCG...TATTTCTTATTC/GTATTTCTTATT...TGTAG|AAA | 1 | 1 | 75.297 |
| 140014708 | GT-AG | 0 | 1.000000099473604e-05 | 1005 | rna-XM_040241178.1 25387362 | 36 | 108679608 | 108680612 | Oryx dammah 59534 | CAG|GTGAGCCTAA...TTGTTCTTTGCC/CGTGGTTTGAGT...TGAAG|GAC | 0 | 1 | 76.262 |
| 140014709 | GT-AG | 0 | 0.0044408000296897 | 380 | rna-XM_040241178.1 25387362 | 37 | 108680802 | 108681181 | Oryx dammah 59534 | ACG|GTACCGCACC...TGTTTGTTAGCT/TGGTGACTAACT...ATAAG|ATG | 0 | 1 | 78.727 |
| 140014710 | GT-AG | 0 | 0.0160055520128105 | 1478 | rna-XM_040241178.1 25387362 | 38 | 108681408 | 108682885 | Oryx dammah 59534 | CCC|GTATGTATCC...ACTACCTGAAAC/CTCGGGCTAAAC...TTCAG|AAC | 1 | 1 | 81.675 |
| 140014711 | GT-AG | 0 | 1.000000099473604e-05 | 931 | rna-XM_040241178.1 25387362 | 39 | 108683092 | 108684022 | Oryx dammah 59534 | CTC|GTGAGTGCCC...ACCTCCTGGGTT/GCAGCGCGCACC...GCTAG|GAG | 0 | 1 | 84.362 |
| 140014712 | GT-AG | 0 | 1.000000099473604e-05 | 1032 | rna-XM_040241178.1 25387362 | 40 | 108684164 | 108685195 | Oryx dammah 59534 | CAG|GTGAGCCAGC...TGGTATTTGACT/TTGACTCTCACT...TGTAG|ATG | 0 | 1 | 86.201 |
| 140014713 | GT-AG | 0 | 1.000000099473604e-05 | 168 | rna-XM_040241178.1 25387362 | 41 | 108685469 | 108685636 | Oryx dammah 59534 | ATG|GTAAGGGCTT...GTTCCCTTTTCC/GACTCTCTGATG...AAAAG|GAT | 0 | 1 | 89.761 |
| 140014714 | GT-AG | 0 | 0.0010882529428488 | 3166 | rna-XM_040241178.1 25387362 | 42 | 108685700 | 108688865 | Oryx dammah 59534 | AAG|GTTGCCCTCT...CTTCCCGTAACA/CTAGTTCTGACA...TTCAG|GCC | 0 | 1 | 90.583 |
| 140014715 | GT-AG | 0 | 1.000000099473604e-05 | 1934 | rna-XM_040241178.1 25387362 | 43 | 108689073 | 108691006 | Oryx dammah 59534 | CAG|GTAGGAGGCG...AACGTTTTATCT/TAACGTTTTATC...TCCAG|AGC | 0 | 1 | 93.283 |
| 140019922 | GT-AG | 0 | 1.000000099473604e-05 | 6344 | rna-XM_040241178.1 25387362 | 1 | 108605269 | 108611612 | Oryx dammah 59534 | GTG|GTGAGTGCCA...TTAACCTTGTTA/TAAAAATTAACC...TACAG|GTT | 0 | 1.839 | |
| 140019923 | GT-AG | 0 | 0.0004569238426467 | 277 | rna-XM_040241178.1 25387362 | 44 | 108691094 | 108691370 | Oryx dammah 59534 | AGG|GTACATGTTA...TCATCCTTAGTA/TACTTTCTCATC...TGTAG|GAC | 0 | 94.418 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);