introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
46 rows where transcript_id = 25387363
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140014716 | GT-AG | 0 | 1.000000099473604e-05 | 1916 | rna-XM_040240968.1 25387363 | 1 | 92954042 | 92955957 | Oryx dammah 59534 | CAT|GTAAGTACCG...CTTTTCTTGCTC/GCTGGTCTAACT...TGTAG|TGT | 2 | 1 | 4.829 |
| 140014717 | GT-AG | 0 | 7.696941679682839e-05 | 19211 | rna-XM_040240968.1 25387363 | 2 | 92934712 | 92953922 | Oryx dammah 59534 | CAC|GTAAGCAAAA...TTCTTTTTAGTT/TAGTTACTCATG...CACAG|GCT | 1 | 1 | 6.576 |
| 140014718 | GT-AG | 0 | 6.651527175741122e-05 | 2597 | rna-XM_040240968.1 25387363 | 3 | 92931993 | 92934589 | Oryx dammah 59534 | AAG|GTAAACTGCG...CTAGTTTTACTT/TCTAGTTTTACT...TATAG|GTA | 0 | 1 | 8.366 |
| 140014719 | GT-AG | 0 | 1.000000099473604e-05 | 9758 | rna-XM_040240968.1 25387363 | 4 | 92922058 | 92931815 | Oryx dammah 59534 | CTG|GTGAGTAGGC...AGTTATTTATTT/CAGTTATTTATT...TGCAG|ACA | 0 | 1 | 10.964 |
| 140014720 | GT-AG | 0 | 1.000000099473604e-05 | 4588 | rna-XM_040240968.1 25387363 | 5 | 92917377 | 92921964 | Oryx dammah 59534 | GAG|GTAAGTCATA...CTTTCCTTTCTA/ACCTTCCCCACT...CCCAG|CTG | 0 | 1 | 12.329 |
| 140014721 | GT-AG | 0 | 1.000000099473604e-05 | 1882 | rna-XM_040240968.1 25387363 | 6 | 92915254 | 92917135 | Oryx dammah 59534 | CAT|GTGAGTGTCT...TTGCCTTTCATC/TTGCCTTTCATC...CACAG|CTC | 1 | 1 | 15.867 |
| 140014722 | GT-AG | 0 | 1.000000099473604e-05 | 336 | rna-XM_040240968.1 25387363 | 7 | 92914778 | 92915113 | Oryx dammah 59534 | GAG|GTAAGCCACC...CCCTCTTTCATT/CCCTCTTTCATT...TCCAG|GTG | 0 | 1 | 17.922 |
| 140014723 | GT-AG | 0 | 1.000000099473604e-05 | 3921 | rna-XM_040240968.1 25387363 | 8 | 92910740 | 92914660 | Oryx dammah 59534 | CGG|GTGAGTGTCC...CAAGACTTAGTT/TACTTTGTCATT...GGCAG|ACA | 0 | 1 | 19.639 |
| 140014724 | GT-AG | 0 | 1.000000099473604e-05 | 771 | rna-XM_040240968.1 25387363 | 9 | 92909771 | 92910541 | Oryx dammah 59534 | GAG|GTGAGGAACT...GCATTCTTATGA/AGCATTCTTATG...TTCAG|TGT | 0 | 1 | 22.545 |
| 140014725 | GT-AG | 0 | 1.000000099473604e-05 | 792 | rna-XM_040240968.1 25387363 | 10 | 92908773 | 92909564 | Oryx dammah 59534 | CGG|GTAAGTGGAA...AATTTTTTGTTG/TGTTCAATCATC...TTAAG|GTA | 2 | 1 | 25.569 |
| 140014726 | GT-AG | 0 | 0.0004266410246217 | 1539 | rna-XM_040240968.1 25387363 | 11 | 92907057 | 92908595 | Oryx dammah 59534 | CAT|GTAAGTTACC...TTGCCTTTAACC/GCCTGGCTCACT...TCCAG|CTT | 2 | 1 | 28.167 |
| 140014727 | GT-AG | 0 | 1.000000099473604e-05 | 1968 | rna-XM_040240968.1 25387363 | 12 | 92904866 | 92906833 | Oryx dammah 59534 | AAA|GTAAGGCTGC...AGAGTCTTACTT/TCATTGCTTATT...CCCAG|TTA | 0 | 1 | 31.44 |
| 140014728 | GT-AG | 0 | 1.000000099473604e-05 | 1066 | rna-XM_040240968.1 25387363 | 13 | 92903578 | 92904643 | Oryx dammah 59534 | ATG|GTGAGTCGCT...TGAATTTTGCCT/GGATGAATGAAT...CACAG|AGC | 0 | 1 | 34.698 |
| 140014729 | GT-AG | 0 | 0.000157655100178 | 1134 | rna-XM_040240968.1 25387363 | 14 | 92902239 | 92903372 | Oryx dammah 59534 | CAG|GTACACTGGA...CTGTGCTTGGCT/GTTTGGCTCATG...TTCAG|GCC | 1 | 1 | 37.707 |
| 140014730 | GT-AG | 0 | 1.000000099473604e-05 | 693 | rna-XM_040240968.1 25387363 | 15 | 92901432 | 92902124 | Oryx dammah 59534 | AAA|GTGAGTTTTG...CACTTGTTAGTG/TGTGGGCTCATT...CACAG|TCT | 1 | 1 | 39.381 |
| 140014731 | GT-AG | 0 | 1.000000099473604e-05 | 1020 | rna-XM_040240968.1 25387363 | 16 | 92900240 | 92901259 | Oryx dammah 59534 | CAT|GTAAGAAGAG...TGTACTTTCACA/TGTACTTTCACA...CTCAG|GTC | 2 | 1 | 41.905 |
| 140014732 | GT-AG | 0 | 1.000000099473604e-05 | 849 | rna-XM_040240968.1 25387363 | 17 | 92899259 | 92900107 | Oryx dammah 59534 | CAT|GTGAGTACCA...GTGTTCTGTGTG/CCCAGTGTGACC...CCCAG|GCT | 2 | 1 | 43.843 |
| 140014733 | GT-AG | 0 | 1.000000099473604e-05 | 272 | rna-XM_040240968.1 25387363 | 18 | 92898844 | 92899115 | Oryx dammah 59534 | CAG|GTGAGGCCCA...CCTGTCTTCTCT/CCATCATTCACT...CGTAG|GTG | 1 | 1 | 45.942 |
| 140014734 | GT-AG | 0 | 1.000000099473604e-05 | 1214 | rna-XM_040240968.1 25387363 | 19 | 92897492 | 92898705 | Oryx dammah 59534 | AAG|GTCACTGATG...CCAGCCTCACCT/GCCAGCCTCACC...GGCAG|GCC | 1 | 1 | 47.967 |
| 140014735 | GT-AG | 0 | 1.000000099473604e-05 | 1329 | rna-XM_040240968.1 25387363 | 20 | 92895942 | 92897270 | Oryx dammah 59534 | AAG|GTGAGTTGCA...CGGTCCCTGACC/CCCATTCTCACC...ATCAG|GAG | 0 | 1 | 51.211 |
| 140014736 | GT-AG | 0 | 1.000000099473604e-05 | 846 | rna-XM_040240968.1 25387363 | 21 | 92895023 | 92895868 | Oryx dammah 59534 | TCG|GTAAGGACTC...GGCCTCTGAGCT/CCGTATCATATT...TACAG|ATG | 1 | 1 | 52.282 |
| 140014737 | GT-AG | 0 | 2.696126646096932e-05 | 1080 | rna-XM_040240968.1 25387363 | 22 | 92893740 | 92894819 | Oryx dammah 59534 | GAA|GTAAGTTCAG...TGGTGCTTATTA/CTCATTCTCATT...TTCAG|ATA | 0 | 1 | 55.262 |
| 140014738 | GT-AG | 0 | 0.0612194998071123 | 188 | rna-XM_040240968.1 25387363 | 23 | 92893503 | 92893690 | Oryx dammah 59534 | CAG|GTAACCCTCC...TTTGTCTTATTA/ATTTGTCTTATT...TGCAG|ATG | 1 | 1 | 55.981 |
| 140014739 | GT-AG | 0 | 1.000000099473604e-05 | 1076 | rna-XM_040240968.1 25387363 | 24 | 92892313 | 92893388 | Oryx dammah 59534 | CAG|GTCCGTTTGG...TAATCTATGACC/ATGACCTTCAGT...TGCAG|AAT | 1 | 1 | 57.654 |
| 140014740 | GT-AG | 0 | 1.000000099473604e-05 | 897 | rna-XM_040240968.1 25387363 | 25 | 92891267 | 92892163 | Oryx dammah 59534 | CAG|GTGAGAAGTG...CAATTCTTCTCC/CCTTTGTCCATC...TATAG|ATT | 0 | 1 | 59.841 |
| 140014741 | GT-AG | 0 | 0.0013197105890324 | 1229 | rna-XM_040240968.1 25387363 | 26 | 92889913 | 92891141 | Oryx dammah 59534 | CAG|GTATGCGTTT...CATGGTTTAAAT/CATGGTTTAAAT...TACAG|TAA | 2 | 1 | 61.676 |
| 140014742 | GT-AG | 0 | 1.000000099473604e-05 | 2092 | rna-XM_040240968.1 25387363 | 27 | 92887722 | 92889813 | Oryx dammah 59534 | CCC|GTGAGTGCCG...CTATCCTTGGTG/TCTTGGGCCATC...TGCAG|AGA | 2 | 1 | 63.129 |
| 140014743 | GT-AG | 0 | 1.000000099473604e-05 | 845 | rna-XM_040240968.1 25387363 | 28 | 92886687 | 92887531 | Oryx dammah 59534 | CAA|GTGAGTCCTT...TGGCCCATAAAG/TCTGTTCCAATT...GACAG|AAA | 0 | 1 | 65.918 |
| 140014744 | GT-AG | 0 | 1.000000099473604e-05 | 1390 | rna-XM_040240968.1 25387363 | 29 | 92885202 | 92886591 | Oryx dammah 59534 | AAG|GTCATCATTT...GTTTCTTTGTCT/TTTGGACTGATT...TACAG|CTT | 2 | 1 | 67.312 |
| 140014745 | GT-AG | 0 | 1.000000099473604e-05 | 1102 | rna-XM_040240968.1 25387363 | 30 | 92884067 | 92885168 | Oryx dammah 59534 | TAG|GTAAGTTGAC...TGTGTTTTAATG/TGTGTTTTAATG...TTTAG|ATA | 2 | 1 | 67.797 |
| 140014746 | GT-AG | 0 | 1.000000099473604e-05 | 966 | rna-XM_040240968.1 25387363 | 31 | 92882995 | 92883960 | Oryx dammah 59534 | AAG|GTACAGTACC...AGCTTTTTGTTG/TCCCAGCTGAGG...CACAG|GAC | 0 | 1 | 69.353 |
| 140014747 | GT-AG | 0 | 1.000000099473604e-05 | 477 | rna-XM_040240968.1 25387363 | 32 | 92882443 | 92882919 | Oryx dammah 59534 | AAG|GTAAAGCCCT...AGTACTTTGCAC/GTACTTTGCACC...TTCAG|GTG | 0 | 1 | 70.454 |
| 140014748 | GT-AG | 0 | 3.358820004529375e-05 | 1578 | rna-XM_040240968.1 25387363 | 33 | 92880695 | 92882272 | Oryx dammah 59534 | TCT|GTAAGTGTCA...GTGCCCTCAGCT/CGTGCTCTGATT...TTCAG|GAT | 2 | 1 | 72.949 |
| 140014749 | GT-AG | 0 | 1.000000099473604e-05 | 2015 | rna-XM_040240968.1 25387363 | 34 | 92878502 | 92880516 | Oryx dammah 59534 | ATG|GTAAGACCAC...GTGTCTCTGACT/CCTGGTTTCACT...TTCAG|TGC | 0 | 1 | 75.561 |
| 140014750 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_040240968.1 25387363 | 35 | 92878296 | 92878385 | Oryx dammah 59534 | TGG|GTAAGGCACC...TGTGTCTTTATG/TGTGTCTTTATG...ACCAG|GTG | 2 | 1 | 77.264 |
| 140014751 | GT-AG | 0 | 1.000000099473604e-05 | 1591 | rna-XM_040240968.1 25387363 | 36 | 92876560 | 92878150 | Oryx dammah 59534 | AAT|GTGAGTTCTG...TGTCTCTTATTT/CTGTCTCTTATT...TTTAG|AAG | 0 | 1 | 79.392 |
| 140014752 | GT-AG | 0 | 1.000000099473604e-05 | 1657 | rna-XM_040240968.1 25387363 | 37 | 92874779 | 92876435 | Oryx dammah 59534 | TTG|GTGAGTGACA...AGGGCTTTGGTC/TCCATTCTGATT...TCCAG|GGG | 1 | 1 | 81.212 |
| 140014753 | GT-AG | 0 | 6.503273566728002e-05 | 271 | rna-XM_040240968.1 25387363 | 38 | 92874378 | 92874648 | Oryx dammah 59534 | CAG|GTGAGCTTTC...TTCTTCTTAATT/TTCTTCTTAATT...CTAAG|ACC | 2 | 1 | 83.121 |
| 140014754 | GT-AG | 0 | 1.000000099473604e-05 | 773 | rna-XM_040240968.1 25387363 | 39 | 92873484 | 92874256 | Oryx dammah 59534 | AAG|GTGAGACAGT...CGGCTCATGACT/CCGTGAGTCACT...TTCAG|GTA | 0 | 1 | 84.897 |
| 140014755 | GT-AG | 0 | 1.000000099473604e-05 | 793 | rna-XM_040240968.1 25387363 | 40 | 92872628 | 92873420 | Oryx dammah 59534 | GAG|GTAAAGGCGC...TTTTCCCAATCA/ATATTACTGACC...TTCAG|TGC | 0 | 1 | 85.821 |
| 140014756 | GT-AG | 0 | 1.000000099473604e-05 | 2192 | rna-XM_040240968.1 25387363 | 41 | 92870329 | 92872520 | Oryx dammah 59534 | TAG|GTGAGAAAAC...AAATGCTTAAAA/AAAAACCTGACC...TGTAG|TAT | 2 | 1 | 87.392 |
| 140014757 | GT-AG | 0 | 1.1549355007298556e-05 | 368 | rna-XM_040240968.1 25387363 | 42 | 92869819 | 92870186 | Oryx dammah 59534 | AAG|GTACTGTGGG...GCTCTTTTATCA/TCTTTTATCACT...CTTAG|GTT | 0 | 1 | 89.476 |
| 140014758 | GT-AG | 0 | 1.000000099473604e-05 | 1675 | rna-XM_040240968.1 25387363 | 43 | 92868009 | 92869683 | Oryx dammah 59534 | CTG|GTGAGTGTGA...CTATCTTTGATT/TGATTTTTTACC...TATAG|GAT | 0 | 1 | 91.458 |
| 140014759 | GT-AG | 0 | 1.000000099473604e-05 | 603 | rna-XM_040240968.1 25387363 | 44 | 92867302 | 92867904 | Oryx dammah 59534 | CAG|GTCTGTAGTG...TAAACCTCATTC/CTAAACCTCATT...CACAG|TAT | 2 | 1 | 92.984 |
| 140014760 | GT-AG | 0 | 1.000000099473604e-05 | 653 | rna-XM_040240968.1 25387363 | 45 | 92866556 | 92867208 | Oryx dammah 59534 | TAG|GTAACAAAGA...TTTTCTTTTTCC/TTCAATTTGATT...TTCAG|GTT | 2 | 1 | 94.349 |
| 140014761 | GT-AG | 0 | 5.6016398234411885e-05 | 627 | rna-XM_040240968.1 25387363 | 46 | 92865685 | 92866311 | Oryx dammah 59534 | CAA|GTAAGCGCTG...CCGTCTTTATTT/TCCGTCTTTATT...GACAG|GTA | 0 | 1 | 97.93 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);