introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
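In the rows listed on this page, `start`, `end`, and `length` are related by inclusive coordinates, so that `length = end - start + 1`. The following is a minimal Python sketch of that check on two rows copied from the table; the helper name `intron_length` is hypothetical and not part of the database, and the inclusive-coordinate convention is an inference from the displayed values rather than a documented guarantee.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, assuming inclusive start/end coordinates."""
    return end - start + 1


# (start, end, length) copied from introns 17675637 and 17675638
sample_rows = [
    (175525322, 175528529, 3208),
    (175528617, 175528699, 83),
]

# Collect any row whose stored length disagrees with the computed one.
mismatches = [
    (start, end, length)
    for start, end, length in sample_rows
    if intron_length(start, end) != length
]
print(mismatches)  # → []
```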
31 rows where transcript_id = 3555651
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17675637 | GT-AG | 0 | 1.000000099473604e-05 | 3208 | rna-XM_038314361.1 3555651 | 2 | 175525322 | 175528529 | Arvicola amphibius 1047088 | GGG\|GTAAGTACAA...ATATTTTTAATT/ATATTTTTAATT...TTTAG\|GCA | 0 | 1 | 4.337 |
| 17675638 | GT-AG | 0 | 2.167117310062721e-05 | 83 | rna-XM_038314361.1 3555651 | 3 | 175528617 | 175528699 | Arvicola amphibius 1047088 | GAG\|GTTTGTGTTG...CTACTCTTATTA/TTTGCTTTCATT...TCTAG\|GGC | 0 | 1 | 5.942 |
| 17675639 | GT-AG | 0 | 1.000000099473604e-05 | 779 | rna-XM_038314361.1 3555651 | 4 | 175528817 | 175529595 | Arvicola amphibius 1047088 | AGG\|GTAAGTGAGT...ATTTTCTTAAAA/CAATTTTTAACA...CTTAG\|GTT | 0 | 1 | 8.101 |
| 17675640 | GT-AG | 0 | 1.2037405800063262e-05 | 1909 | rna-XM_038314361.1 3555651 | 5 | 175529735 | 175531643 | Arvicola amphibius 1047088 | AAG\|GTAAATATTG...TGCTTTTTAAAT/TGCTTTTTAAAT...TATAG\|GCA | 1 | 1 | 10.666 |
| 17675641 | GT-AG | 0 | 0.0007791478900373 | 3759 | rna-XM_038314361.1 3555651 | 6 | 175531829 | 175535587 | Arvicola amphibius 1047088 | GAG\|GTACACCAGG...TTAACTTTAACA/AGTTATTTAACT...TCTAG\|GTT | 0 | 1 | 14.08 |
| 17675642 | GT-AG | 0 | 1.000000099473604e-05 | 454 | rna-XM_038314361.1 3555651 | 7 | 175535625 | 175536078 | Arvicola amphibius 1047088 | GTG\|GTAAGGATTC...AGGGTCTTAATT/ATATTTTTAAGT...AATAG\|GAA | 1 | 1 | 14.763 |
| 17675643 | GT-AG | 0 | 0.0001249875172896 | 512 | rna-XM_038314361.1 3555651 | 8 | 175536304 | 175536815 | Arvicola amphibius 1047088 | AAG\|GTACATCAGC...TCTTCTTTATTT/ATCTTCTTTATT...TAAAG\|ATC | 1 | 1 | 18.915 |
| 17675644 | GT-AG | 0 | 1.000000099473604e-05 | 976 | rna-XM_038314361.1 3555651 | 9 | 175536928 | 175537903 | Arvicola amphibius 1047088 | AAG\|GTGAGTAGTG...TTGGTCTTTGCT/AATGGAGTGATA...TTTAG\|CCA | 2 | 1 | 20.982 |
| 17675645 | GT-AG | 0 | 5.75111162265e-05 | 84 | rna-XM_038314361.1 3555651 | 10 | 175537976 | 175538059 | Arvicola amphibius 1047088 | TGA\|GTAAGTTGAA...ATCATTTTGAAA/TTTTGAATCATT...TTTAG\|ATT | 2 | 1 | 22.31 |
| 17675646 | GT-AG | 0 | 1.000000099473604e-05 | 801 | rna-XM_038314361.1 3555651 | 11 | 175538207 | 175539007 | Arvicola amphibius 1047088 | AAA\|GTAAGTGTCA...TTTATTCTAACT/TTTATTCTAACT...CATAG\|GTT | 2 | 1 | 25.023 |
| 17675647 | GT-AG | 0 | 1.000000099473604e-05 | 1074 | rna-XM_038314361.1 3555651 | 12 | 175539171 | 175540244 | Arvicola amphibius 1047088 | AAG\|GTAAATGTAT...AAGTTCTTGTCT/CCCCTTGTAAAC...AACAG\|GTG | 0 | 1 | 28.031 |
| 17675648 | GT-AG | 0 | 0.0405866473186716 | 735 | rna-XM_038314361.1 3555651 | 13 | 175540530 | 175541264 | Arvicola amphibius 1047088 | GAG\|GTATACGTTA...TATTCTTTACTG/TTATTCTTTACT...TTTAG\|AAA | 0 | 1 | 33.29 |
| 17675649 | GT-AG | 0 | 0.0004561551284412 | 164 | rna-XM_038314361.1 3555651 | 14 | 175541405 | 175541568 | Arvicola amphibius 1047088 | TAA\|GTAAGTTATT...GTTCTCTTAATA/TGTTCTCTTAAT...TAAAG\|GGA | 2 | 1 | 35.874 |
| 17675650 | GT-AG | 0 | 0.0026886562720681 | 1113 | rna-XM_038314361.1 3555651 | 15 | 175541708 | 175542820 | Arvicola amphibius 1047088 | GAA\|GTATGTTAAT...TTACTGTTAATG/AGTGAATTTACT...TTTAG\|GAA | 0 | 1 | 38.439 |
| 17675651 | GT-AG | 0 | 1.000000099473604e-05 | 1381 | rna-XM_038314361.1 3555651 | 16 | 175543050 | 175544430 | Arvicola amphibius 1047088 | ATG\|GTAAGTGTAT...ATTTTCTTATTA/AATTTTCTTATT...CACAG\|AAT | 1 | 1 | 42.665 |
| 17675652 | GT-AG | 0 | 1.000000099473604e-05 | 1071 | rna-XM_038314361.1 3555651 | 17 | 175544472 | 175545542 | Arvicola amphibius 1047088 | CAG\|GTAATGTAAG...TCTACTTTAGAG/TAGAGTCTGATC...TTTAG\|GTT | 0 | 1 | 43.421 |
| 17675653 | GT-AG | 0 | 1.000000099473604e-05 | 2413 | rna-XM_038314361.1 3555651 | 18 | 175545682 | 175548094 | Arvicola amphibius 1047088 | AAG\|GTAATAGTGA...TATGTTTTGCTA/GCTAGATTAATA...TTAAG\|GGC | 1 | 1 | 45.986 |
| 17675654 | GT-AG | 0 | 1.000000099473604e-05 | 549 | rna-XM_038314361.1 3555651 | 19 | 175549332 | 175549880 | Arvicola amphibius 1047088 | AAA\|GTGAGCTGAG...TGATTCTTATTC/TGTTTCCTCACC...TTTAG\|ATT | 2 | 1 | 68.813 |
| 17675655 | GT-AG | 0 | 0.0001331797393233 | 1203 | rna-XM_038314361.1 3555651 | 20 | 175550020 | 175551222 | Arvicola amphibius 1047088 | GAG\|GTAGGTTTTA...TTTGCTTTATAA/ATTTGCTTTATA...TGTAG\|GTA | 0 | 1 | 71.378 |
| 17675656 | GT-AG | 0 | 0.000563569357141 | 750 | rna-XM_038314361.1 3555651 | 21 | 175551343 | 175552092 | Arvicola amphibius 1047088 | AAG\|GTAGCAGTTA...GTATTCTTAATT/TAAGTATTGATT...TTTAG\|GAG | 0 | 1 | 73.593 |
| 17675657 | GT-AG | 0 | 1.000000099473604e-05 | 1217 | rna-XM_038314361.1 3555651 | 22 | 175552154 | 175553370 | Arvicola amphibius 1047088 | GAG\|GTCAGAACCG...GATATTTTGACT/GATATTTTGACT...AACAG\|ATC | 1 | 1 | 74.719 |
| 17675658 | GT-AG | 0 | 1.000000099473604e-05 | 1325 | rna-XM_038314361.1 3555651 | 23 | 175553497 | 175554821 | Arvicola amphibius 1047088 | AAG\|GTAAAGTGAG...CACACCATAACA/TTGCTACACACC...ATTAG\|AAT | 1 | 1 | 77.044 |
| 17675659 | GT-AG | 0 | 1.000000099473604e-05 | 3076 | rna-XM_038314361.1 3555651 | 24 | 175554950 | 175558025 | Arvicola amphibius 1047088 | CAG\|GTAGTAACAT...TAGCTTTTATCA/TTGAAACTTACT...CTTAG\|GAG | 0 | 1 | 79.406 |
| 17675660 | GT-AG | 0 | 1.000000099473604e-05 | 5364 | rna-XM_038314361.1 3555651 | 25 | 175558182 | 175563545 | Arvicola amphibius 1047088 | GTG\|GTTGGTAGCA...ATTTTCTTGCCT/TTATACTTCACT...ACTAG\|GAA | 0 | 1 | 82.285 |
| 17675661 | GT-AG | 0 | 1.000000099473604e-05 | 2405 | rna-XM_038314361.1 3555651 | 26 | 175563723 | 175566127 | Arvicola amphibius 1047088 | CCA\|GTCAGTGCAC...ATTTTCATATTA/TTTATTTTCATA...TTCAG\|GAA | 0 | 1 | 85.551 |
| 17675662 | GT-AG | 0 | 1.000000099473604e-05 | 3457 | rna-XM_038314361.1 3555651 | 27 | 175566239 | 175569695 | Arvicola amphibius 1047088 | GAG\|GTTGGTAATC...ATTTTTTTTTCT/AAATAAATGACC...GGTAG\|GGT | 0 | 1 | 87.599 |
| 17675663 | GT-AG | 0 | 0.0003551895206848 | 1570 | rna-XM_038314361.1 3555651 | 28 | 175569894 | 175571463 | Arvicola amphibius 1047088 | GAG\|GTATAGTTAC...AAATCATTATCA/CAAAAATTTACT...TCTAG\|GAC | 0 | 1 | 91.253 |
| 17675664 | GT-AG | 0 | 1.000000099473604e-05 | 351 | rna-XM_038314361.1 3555651 | 29 | 175571566 | 175571916 | Arvicola amphibius 1047088 | AAG\|GTTTGTCCTT...TTTTTTATGATT/TTTTTTATGATT...CTTAG\|GTT | 0 | 1 | 93.135 |
| 17675665 | GT-AG | 0 | 1.000000099473604e-05 | 1506 | rna-XM_038314361.1 3555651 | 30 | 175572068 | 175573573 | Arvicola amphibius 1047088 | AAG\|GTTTGTGGAA...CAAACCTTTGCA/AACATTTTTACT...CACAG\|CAA | 1 | 1 | 95.922 |
| 17675666 | GT-AG | 0 | 0.0002051602440987 | 6102 | rna-XM_038314361.1 3555651 | 31 | 175573717 | 175579818 | Arvicola amphibius 1047088 | GTG\|GTAAGCTAGT...TTATTTTTACTT/ATTATTTTTACT...TTCAG\|ATT | 0 | 1 | 98.561 |
| 17692013 | GT-AG | 0 | 1.000000099473604e-05 | 2353 | rna-XM_038314361.1 3555651 | 1 | 175522821 | 175525173 | Arvicola amphibius 1047088 | CAG\|GTGAGGAGCT...ACTTGTTTAAAG/CATAGACTGATG...TTCAG\|AAT |  | 0 | 1.624 |
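Because each row carries genomic `start` and `end` coordinates plus an `ordinal_index`, the size of the exon lying between two consecutive introns can be derived from the table. The sketch below does this for the first four introns of transcript 3555651, using coordinates copied from the rows above. It assumes inclusive coordinates and that ordinal order follows ascending genomic position, as it does in the rows shown; the function name `flanked_exon_lengths` is hypothetical.

```python
# (ordinal_index, start, end) for introns 1-4 of transcript 3555651,
# copied from the table above
introns = [
    (1, 175522821, 175525173),
    (2, 175525322, 175528529),
    (3, 175528617, 175528699),
    (4, 175528817, 175529595),
]


def flanked_exon_lengths(rows):
    """Map (i, i+1) to the length of the exon between introns i and i+1.

    Assumes inclusive coordinates and ascending position with ordinal_index.
    """
    ordered = sorted(rows)  # tuples sort by ordinal_index first
    return {
        (a[0], b[0]): b[1] - a[2] - 1  # next start minus previous end, exclusive
        for a, b in zip(ordered, ordered[1:])
    }


exon_lengths = flanked_exon_lengths(introns)
print(exon_lengths)  # → {(1, 2): 148, (2, 3): 87, (3, 4): 117}
```

For a transcript annotated on the reverse strand the ordering assumption would need to be revisited, since ordinal position and genomic coordinate may then run in opposite directions.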
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
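The index on `transcript_id` is what lets a per-transcript lookup like the one on this page avoid a full table scan. The following Python sketch illustrates this with the standard-library `sqlite3` module and an in-memory database. It uses a reduced subset of the columns and three rows copied from the table above, so it is an illustration under those assumptions rather than a copy of the real database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced schema: only the columns needed for this illustration.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_phase ON introns (phase);
    """
)
conn.executemany(
    "INSERT INTO introns (id, transcript_id, ordinal_index, phase) "
    "VALUES (?, ?, ?, ?)",
    [
        (17675640, 3555651, 5, 1),
        (17675637, 3555651, 2, 0),
        (17675638, 3555651, 3, 0),
    ],
)

query = (
    "SELECT id, ordinal_index FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)

# The last column of each EXPLAIN QUERY PLAN row is the human-readable detail.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (3555651,)).fetchall()
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)

rows = conn.execute(query, (3555651,)).fetchall()
print(rows)
```

The exact wording of the plan varies between SQLite versions, but it should name `idx_introns_transcript_id`. Ordering by `ordinal_index` rather than `id` returns the introns in transcript order, which matters here because the first intron of this transcript has a higher `id` than the others.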