introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
47 rows where transcript_id = 25387356
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140014443 | GT-AG | 0 | 1.000000099473604e-05 | 33523 | rna-XM_040239945.1 25387356 | 1 | 98260517 | 98294039 | Oryx dammah 59534 | GCG|GTAAGGTGCC...CTAACTTTCTCT/CTCATACTAACT...TACAG|CAA | 0 | 1 | 5.01 |
| 140014444 | GT-AG | 0 | 1.000000099473604e-05 | 3639 | rna-XM_040239945.1 25387356 | 2 | 98256622 | 98260260 | Oryx dammah 59534 | AAG|GTAATAGAAA...CATTCCTTTTTT/TTTTTTTTTATG...TGTAG|ATC | 1 | 1 | 7.398 |
| 140014445 | GT-AG | 0 | 1.000000099473604e-05 | 33027 | rna-XM_040239945.1 25387356 | 3 | 98223418 | 98256444 | Oryx dammah 59534 | CAG|GTGAGTGTCT...ACTTCCCTGGCT/CCCTGGCTTACC...CACAG|CTT | 1 | 1 | 9.049 |
| 140014446 | GT-AG | 0 | 1.000000099473604e-05 | 949 | rna-XM_040239945.1 25387356 | 4 | 98222310 | 98223258 | Oryx dammah 59534 | AAG|GTAAGGATTC...TGCTTTTTAATT/TTTAATTTTATT...GTTAG|TTG | 1 | 1 | 10.533 |
| 140014447 | GT-AG | 0 | 0.0034674456541423 | 5255 | rna-XM_040239945.1 25387356 | 5 | 98216875 | 98222129 | Oryx dammah 59534 | GAG|GTATGCAGAC...TTGTCTTTATTG/TTTGTCTTTATT...TATAG|TGA | 1 | 1 | 12.212 |
| 140014448 | GT-AG | 0 | 2.52892232049926e-05 | 7629 | rna-XM_040239945.1 25387356 | 6 | 98209066 | 98216694 | Oryx dammah 59534 | TGG|GTAAGTTCTT...ATTTTCTTGCTC/CATTTTGTAATT...AATAG|AAC | 1 | 1 | 13.891 |
| 140014449 | GT-AG | 0 | 1.000000099473604e-05 | 2789 | rna-XM_040239945.1 25387356 | 7 | 98206079 | 98208867 | Oryx dammah 59534 | AAG|GTAGGTCCCC...AGTATGTTAACT/AGTATGTTAACT...ATTAG|ACG | 1 | 1 | 15.738 |
| 140014450 | GT-AG | 0 | 1.000000099473604e-05 | 2070 | rna-XM_040239945.1 25387356 | 8 | 98203890 | 98205959 | Oryx dammah 59534 | AAG|GTAAGGTCTG...TTAACTTTAACT/TTAATGTTAACT...AAAAG|GTA | 0 | 1 | 16.849 |
| 140014451 | GT-AG | 0 | 1.254254011425682e-05 | 5381 | rna-XM_040239945.1 25387356 | 9 | 98198379 | 98203759 | Oryx dammah 59534 | TTG|GTAAGTCTTC...TCAGTTTTGAAC/TCTGGTTTCATG...TTTAG|ATG | 1 | 1 | 18.061 |
| 140014452 | GT-AG | 0 | 1.000000099473604e-05 | 742 | rna-XM_040239945.1 25387356 | 10 | 98197529 | 98198270 | Oryx dammah 59534 | CAG|GTGAGAATGC...TTCTCCTTCTCT/CACACAATAACC...TTTAG|GGG | 1 | 1 | 19.069 |
| 140014453 | GT-AG | 0 | 0.2062210717658346 | 609 | rna-XM_040239945.1 25387356 | 11 | 98196788 | 98197396 | Oryx dammah 59534 | AAG|GTATACTCTG...TATATTTTAATA/TATATTTTAATA...CACAG|GTT | 1 | 1 | 20.3 |
| 140014454 | GT-AG | 0 | 0.0001392177596341 | 1194 | rna-XM_040239945.1 25387356 | 12 | 98195399 | 98196592 | Oryx dammah 59534 | CTA|GTAAGTTCTA...TGTGTTTTGAAA/TGTGTTTTGAAA...TCTAG|TAA | 1 | 1 | 22.12 |
| 140014455 | GT-AG | 0 | 9.537286098522004e-05 | 2128 | rna-XM_040239945.1 25387356 | 13 | 98193149 | 98195276 | Oryx dammah 59534 | ATG|GTATGTCAGT...TCCTCTTTCATT/TCATTTTTTATT...TACAG|GTC | 0 | 1 | 23.258 |
| 140014456 | GT-AG | 0 | 1.000000099473604e-05 | 4642 | rna-XM_040239945.1 25387356 | 14 | 98188398 | 98193039 | Oryx dammah 59534 | TCG|GTAATGAAAT...AATCTTTTATTT/CAATCTTTTATT...TTCAG|GAC | 1 | 1 | 24.275 |
| 140014457 | GT-AG | 0 | 1.000000099473604e-05 | 1032 | rna-XM_040239945.1 25387356 | 15 | 98187201 | 98188232 | Oryx dammah 59534 | CAG|GTAAGAATAG...TTTTCTTTTGTT/GCTTGCTGCAAC...CTCAG|CTA | 1 | 1 | 25.814 |
| 140014458 | GT-AG | 0 | 1.000000099473604e-05 | 1846 | rna-XM_040239945.1 25387356 | 16 | 98185121 | 98186966 | Oryx dammah 59534 | GTG|GTAAGATTTC...CGTTCCATACAA/ACAATATTCAGC...TTCAG|TCA | 1 | 1 | 27.997 |
| 140014459 | GT-AG | 0 | 1.000000099473604e-05 | 1269 | rna-XM_040239945.1 25387356 | 17 | 98183690 | 98184958 | Oryx dammah 59534 | AAG|GTAGAGGCTT...TTATTATTAACC/TTATTATTAACC...TTTAG|CTC | 1 | 1 | 29.508 |
| 140014460 | GT-AG | 0 | 1.000000099473604e-05 | 10318 | rna-XM_040239945.1 25387356 | 18 | 98173210 | 98183527 | Oryx dammah 59534 | GAG|GTTAAGTCTT...ATTGCTTTATAA/TATTGCTTTATA...TGCAG|TGC | 1 | 1 | 31.02 |
| 140014461 | GT-AG | 0 | 2.1904593874022213e-05 | 723 | rna-XM_040239945.1 25387356 | 19 | 98172325 | 98173047 | Oryx dammah 59534 | CCA|GTAAGATGAT...TTCCCTTTAATT/TTCCCTTTAATT...ATTAG|GCT | 1 | 1 | 32.531 |
| 140014462 | GC-AG | 0 | 1.000000099473604e-05 | 1085 | rna-XM_040239945.1 25387356 | 20 | 98171148 | 98172232 | Oryx dammah 59534 | CAG|GCAAGTCTAC...ATTGCTTTGAAA/TTGTCTCTGATA...CACAG|GTT | 0 | 1 | 33.389 |
| 140014463 | GT-AG | 0 | 1.000000099473604e-05 | 1706 | rna-XM_040239945.1 25387356 | 21 | 98169345 | 98171050 | Oryx dammah 59534 | CAG|GTAACAGAGA...CTTTGCTTGACC/TTTCTGTTCATC...TTTAG|GCT | 1 | 1 | 34.294 |
| 140014464 | GT-AG | 0 | 1.000000099473604e-05 | 835 | rna-XM_040239945.1 25387356 | 22 | 98168396 | 98169230 | Oryx dammah 59534 | CAG|GTAAGGAAAG...TATTCTGTGATT/AGTTTTCTCAAG...TACAG|GTC | 1 | 1 | 35.358 |
| 140014465 | GT-AG | 0 | 1.000000099473604e-05 | 777 | rna-XM_040239945.1 25387356 | 23 | 98167505 | 98168281 | Oryx dammah 59534 | TGG|GTAAGATGTT...TCATCCTCATCC/CTCATCCTCATC...CCCAG|GCC | 1 | 1 | 36.421 |
| 140014466 | GT-AG | 0 | 1.000000099473604e-05 | 3602 | rna-XM_040239945.1 25387356 | 24 | 98163701 | 98167302 | Oryx dammah 59534 | CAG|GTAATAGACT...TAATTGTTAAAA/ACTCTTCTAATT...TTCAG|GTG | 2 | 1 | 38.306 |
| 140014467 | GT-AG | 0 | 0.000392999140002 | 4312 | rna-XM_040239945.1 25387356 | 25 | 98159231 | 98163542 | Oryx dammah 59534 | CAG|GTATATGTGA...CTTTTCTTGTTT/TGTCTTCTCATA...TGCAG|AAC | 1 | 1 | 39.78 |
| 140014468 | GT-AG | 0 | 4.738986900488543e-05 | 1883 | rna-XM_040239945.1 25387356 | 26 | 98157146 | 98159028 | Oryx dammah 59534 | CGG|GTAAGCAGAG...TTTCCCTTGAAT/ATAGGACTGATG...TGCAG|CTG | 2 | 1 | 41.664 |
| 140014469 | GT-AG | 0 | 0.0001271232701973 | 3323 | rna-XM_040239945.1 25387356 | 27 | 98153647 | 98156969 | Oryx dammah 59534 | CTG|GTACGTCTTT...TTTTTCTTTCCT/TTAGGAATAACA...TCTAG|GTG | 1 | 1 | 43.306 |
| 140014470 | GT-AG | 0 | 1.000000099473604e-05 | 1181 | rna-XM_040239945.1 25387356 | 28 | 98152341 | 98153521 | Oryx dammah 59534 | CAG|GTCAATTCCA...TTTGTCTAAAAG/GTTTGTCTAAAA...TTCAG|GTG | 0 | 1 | 44.472 |
| 140014471 | GT-AG | 0 | 1.000000099473604e-05 | 820 | rna-XM_040239945.1 25387356 | 29 | 98151397 | 98152216 | Oryx dammah 59534 | CAG|GTTATAGCTT...GTGTTCTTTGCC/GTGTTTCTGACA...CTCAG|ATT | 1 | 1 | 45.629 |
| 140014472 | GT-AG | 0 | 1.000000099473604e-05 | 1371 | rna-XM_040239945.1 25387356 | 30 | 98149855 | 98151225 | Oryx dammah 59534 | AAC|GTGAGCAGAG...TTGCTCCTAAAG/AAGATGTTCATT...TCCAG|GCA | 1 | 1 | 47.225 |
| 140014473 | GT-AG | 0 | 1.000000099473604e-05 | 427 | rna-XM_040239945.1 25387356 | 31 | 98149254 | 98149680 | Oryx dammah 59534 | TTG|GTGAGATGAA...TCAGCCTTATGG/ATCAGCCTTATG...TTTAG|ATG | 1 | 1 | 48.848 |
| 140014474 | GT-AG | 0 | 8.879813467018382e-05 | 2037 | rna-XM_040239945.1 25387356 | 32 | 98147097 | 98149133 | Oryx dammah 59534 | CAG|GTAACTAGGC...ACCCTGTTGACC/ACCCTGTTGACC...TCAAG|AAC | 1 | 1 | 49.967 |
| 140014475 | GT-AG | 0 | 0.0001165056381357 | 414 | rna-XM_040239945.1 25387356 | 33 | 98146506 | 98146919 | Oryx dammah 59534 | AAG|GTATGTTGAG...TTGGTTTTCCTC/TTTTTACACATA...TTAAG|CTG | 1 | 1 | 51.619 |
| 140014476 | GT-AG | 0 | 1.000000099473604e-05 | 1079 | rna-XM_040239945.1 25387356 | 34 | 98145342 | 98146420 | Oryx dammah 59534 | TAG|GTGAGTGTCT...TTTCTTTTATCT/TTTTCTTTTATC...TTTAG|GTG | 2 | 1 | 52.412 |
| 140014477 | GT-AG | 0 | 1.000000099473604e-05 | 992 | rna-XM_040239945.1 25387356 | 35 | 98144158 | 98145149 | Oryx dammah 59534 | CAG|GTACAGTCCT...GGCACCTTTCTT/CAGTAAGTCAAT...TGTAG|TTT | 2 | 1 | 54.203 |
| 140014478 | GT-AG | 0 | 1.000000099473604e-05 | 10866 | rna-XM_040239945.1 25387356 | 36 | 98133124 | 98143989 | Oryx dammah 59534 | AGG|GTGAGTAAAC...CTTTTCTCAAAT/CCTTTTCTCAAA...AACAG|CTA | 2 | 1 | 55.77 |
| 140014479 | GT-AG | 0 | 1.000000099473604e-05 | 1778 | rna-XM_040239945.1 25387356 | 37 | 98130673 | 98132450 | Oryx dammah 59534 | GAG|GTAAGAGACT...CTGTGTTTAATT/CTGTGTTTAATT...TCCAG|CAT | 0 | 1 | 62.049 |
| 140014480 | GT-AG | 0 | 1.000000099473604e-05 | 1913 | rna-XM_040239945.1 25387356 | 38 | 98125968 | 98127880 | Oryx dammah 59534 | CAG|GTAAGACCAT...CTTTTCTTTTCC/CTCCATTTAAGG...TGCAG|GTG | 2 | 1 | 88.096 |
| 140014481 | GT-AG | 0 | 1.000000099473604e-05 | 2674 | rna-XM_040239945.1 25387356 | 39 | 98123068 | 98125741 | Oryx dammah 59534 | CAG|GTAAGACTTT...TATAATTTGATA/TATAATTTGATA...TTTAG|CTT | 0 | 1 | 90.204 |
| 140014482 | GT-AG | 0 | 1.000000099473604e-05 | 4726 | rna-XM_040239945.1 25387356 | 40 | 98118186 | 98122911 | Oryx dammah 59534 | GAG|GTAAGCAAAC...ACTGTCTTATCA/TTTTTATTGATC...CACAG|GGG | 0 | 1 | 91.66 |
| 140014483 | GT-AG | 0 | 1.000000099473604e-05 | 1546 | rna-XM_040239945.1 25387356 | 41 | 98116576 | 98118121 | Oryx dammah 59534 | AAA|GTAAGTCAAA...TTTGCTTTTTCT/GGGAGAATCATT...CTTAG|AGA | 1 | 1 | 92.257 |
| 140014484 | GT-AG | 0 | 5.022941271253279e-05 | 1176 | rna-XM_040239945.1 25387356 | 42 | 98115226 | 98116401 | Oryx dammah 59534 | AAC|GTGTGTGTTG...ATTCTCTTTTCT/AATGAACTAAAA...AATAG|CAA | 1 | 1 | 93.88 |
| 140014485 | GT-AG | 0 | 1.000000099473604e-05 | 4913 | rna-XM_040239945.1 25387356 | 43 | 98110136 | 98115048 | Oryx dammah 59534 | AAA|GTAAGTGGAC...TTTTGTTTATTT/TTTTTGTTTATT...TATAG|AAA | 1 | 1 | 95.531 |
| 140014486 | GT-AG | 0 | 1.000000099473604e-05 | 1194 | rna-XM_040239945.1 25387356 | 44 | 98108771 | 98109964 | Oryx dammah 59534 | GAG|GTAAGGAGAT...CCTTCCTGAATT/CCTAGACTTATG...TCCAG|CTG | 1 | 1 | 97.127 |
| 140014487 | GT-AG | 0 | 1.000000099473604e-05 | 1763 | rna-XM_040239945.1 25387356 | 45 | 98106912 | 98108674 | Oryx dammah 59534 | AGC|GTGAGTCTGT...ACCTTTTTGATG/TGGGTTTTGATC...TCTAG|CGA | 1 | 1 | 98.022 |
| 140014488 | GT-AG | 0 | 2.3353171288159476e-05 | 6203 | rna-XM_040239945.1 25387356 | 46 | 98100613 | 98106815 | Oryx dammah 59534 | CAG|GTAGGCGTCC...AATGCCTGAATT/TTAGTATTCATT...CCCAG|CTG | 1 | 1 | 98.918 |
| 140014489 | GT-AG | 0 | 1.000000099473604e-05 | 5070 | rna-XM_040239945.1 25387356 | 47 | 98095449 | 98100518 | Oryx dammah 59534 | CAG|GTAAGACAAC...CTTTCTTTACTG/CCTTTCTTTACT...TCTAG|GAA | 2 | 1 | 99.795 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);