introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
34 rows where transcript_id = 3555677
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17676267 | GT-AG | 0 | 1.000000099473604e-05 | 4093 | rna-XM_038338305.1 3555677 | 2 | 167750825 | 167754917 | Arvicola amphibius 1047088 | AAG|GTAAGATTTT...CTTGCCTTATGG/TCTTGCCTTATG...TTCAG|CCC | 0 | 1 | 7.061 |
| 17676268 | GT-AG | 0 | 1.000000099473604e-05 | 5923 | rna-XM_038338305.1 3555677 | 3 | 167755048 | 167760970 | Arvicola amphibius 1047088 | AAG|GTTTGTGTCC...CAGCCTGTGATC/CTCGGGGTCATC...TTTAG|GTT | 1 | 1 | 9.714 |
| 17676269 | GT-AG | 0 | 1.000000099473604e-05 | 8839 | rna-XM_038338305.1 3555677 | 4 | 167761400 | 167770238 | Arvicola amphibius 1047088 | CTG|GTAGGTTAAT...TCCCTCTTGTCT/GTCCTGCCCATG...CACAG|GCC | 1 | 1 | 18.469 |
| 17676270 | GT-AG | 0 | 1.5876712567355983e-05 | 6624 | rna-XM_038338305.1 3555677 | 5 | 167770501 | 167777124 | Arvicola amphibius 1047088 | CAG|GTACTGCTGG...CCCCCTTTATTT/TAAATTGTAATG...CCTAG|ACT | 2 | 1 | 23.816 |
| 17676271 | GT-AG | 0 | 1.000000099473604e-05 | 1845 | rna-XM_038338305.1 3555677 | 6 | 167777252 | 167779096 | Arvicola amphibius 1047088 | CAG|GTAATACCTT...CTGCCCTTCTCC/CGGGGATTCACT...CATAG|CTG | 0 | 1 | 26.408 |
| 17676272 | GT-AG | 0 | 1.000000099473604e-05 | 1380 | rna-XM_038338305.1 3555677 | 7 | 167779271 | 167780650 | Arvicola amphibius 1047088 | CAG|GTCCTTAGAC...CATCCCTTTCTC/CCTCTTGTGAAC...TGTAG|GAA | 0 | 1 | 29.959 |
| 17676273 | GT-AG | 0 | 1.000000099473604e-05 | 1939 | rna-XM_038338305.1 3555677 | 8 | 167780825 | 167782763 | Arvicola amphibius 1047088 | ATG|GTAAGGGCCC...TCATCCTTCTCT/CATGCTCTCATC...AACAG|GCT | 0 | 1 | 33.51 |
| 17676274 | GT-AG | 0 | 1.000000099473604e-05 | 9649 | rna-XM_038338305.1 3555677 | 9 | 167782935 | 167792583 | Arvicola amphibius 1047088 | AAG|GTGAGTTCTT...GTGTTCTTGATG/GTGTTCTTGATG...TGCAG|AAG | 0 | 1 | 37.0 |
| 17676275 | GT-AG | 0 | 1.000000099473604e-05 | 3876 | rna-XM_038338305.1 3555677 | 10 | 167792638 | 167796513 | Arvicola amphibius 1047088 | GAG|GTAAGGGAAG...GCTGACTTGATT/GCTGACTTGATT...TCTAG|CCG | 0 | 1 | 38.102 |
| 17676276 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_038338305.1 3555677 | 11 | 167796650 | 167796750 | Arvicola amphibius 1047088 | ATG|GTGGCAGAAA...GGGGCCTTTGCT/CCTTTGCTGAGA...TGTAG|AA | 1 | 1 | 40.878 |
| 17676277 | GT-AG | 0 | 5.485831127880965 | 120 | rna-XM_038338305.1 3555677 | 12 | 167796753 | 167796872 | Arvicola amphibius 1047088 | AA|GTTTTGCATC...TTGTCCCTAGTT/ATTACTCTGAGA...TGAAG|TTG | 0 | 1 | 40.918 |
| 17676278 | GT-AG | 0 | 2.692476510601259e-05 | 1922 | rna-XM_038338305.1 3555677 | 13 | 167796924 | 167798845 | Arvicola amphibius 1047088 | GAG|GTATGTGGAC...TACTCGTTAAAA/TACTCGTTAAAA...TCCAG|GAT | 0 | 1 | 41.959 |
| 17676279 | GT-AG | 0 | 9.37341080060108e-05 | 1155 | rna-XM_038338305.1 3555677 | 16 | 167798947 | 167800101 | Arvicola amphibius 1047088 | TGA|GTACGTGCCC...CCTACTTTGTCT/GGCTTTCCTACT...CTCAG|GAC | 2 | 1 | 43.959 |
| 17676280 | GT-AG | 0 | 0.000899542457202 | 3529 | rna-XM_038338305.1 3555677 | 18 | 167800261 | 167803789 | Arvicola amphibius 1047088 | AGG|GTAACCAGCG...TATTAATTACCT/GTATTAATTACC...TTCAG|CTC | 0 | 1 | 47.163 |
| 17676281 | GT-AG | 0 | 1.000000099473604e-05 | 1955 | rna-XM_038338305.1 3555677 | 21 | 167803956 | 167805910 | Arvicola amphibius 1047088 | CTC|GTAAGTACTG...GTGTGTTTAGTG/TGTGTGTTTAGT...TGTAG|GAC | 2 | 1 | 50.51 |
| 17676282 | GT-AG | 0 | 1.000000099473604e-05 | 665 | rna-XM_038338305.1 3555677 | 22 | 167805978 | 167806642 | Arvicola amphibius 1047088 | AAG|GTTTGGAGCT...GTATTTTTCTCT/CAGCGGGTAATC...TTCAG|GGT | 0 | 1 | 51.878 |
| 17676283 | GT-AG | 0 | 4.004452951854461e-05 | 3038 | rna-XM_038338305.1 3555677 | 23 | 167806754 | 167809791 | Arvicola amphibius 1047088 | AAG|GTACGATTTT...TTTCTGTTGACC/TTTCTGTTGACC...TGCAG|ATT | 0 | 1 | 54.143 |
| 17676284 | GT-AG | 0 | 1.000000099473604e-05 | 1343 | rna-XM_038338305.1 3555677 | 24 | 167810035 | 167811377 | Arvicola amphibius 1047088 | AGG|GTACTGGTCA...ACATCCTTCGCG/CGCGAATTCAAC...TGTAG|GTG | 0 | 1 | 59.102 |
| 17676285 | GT-AG | 0 | 1.000000099473604e-05 | 7386 | rna-XM_038338305.1 3555677 | 25 | 167811492 | 167818877 | Arvicola amphibius 1047088 | AAG|GTCAGTTGCA...CTGTGCTTATCT/CCTGTGCTTATC...CCCAG|GTT | 0 | 1 | 61.429 |
| 17676286 | GT-AG | 0 | 0.0003453030026841 | 1107 | rna-XM_038338305.1 3555677 | 26 | 167818986 | 167820092 | Arvicola amphibius 1047088 | AAG|GTATGTTGCA...AATACTTGATCA/CAATGAATAATT...CTCAG|GGG | 0 | 1 | 63.633 |
| 17676287 | GT-AG | 0 | 1.000000099473604e-05 | 5457 | rna-XM_038338305.1 3555677 | 27 | 167820180 | 167825636 | Arvicola amphibius 1047088 | GAG|GTGAGTCTGG...AATTTCTTCTCT/GAGTTAATGATT...TCAAG|GAA | 0 | 1 | 65.408 |
| 17676288 | GT-AG | 0 | 3.3762425363568146e-05 | 2076 | rna-XM_038338305.1 3555677 | 28 | 167825684 | 167827759 | Arvicola amphibius 1047088 | TGG|GTAAATCTTT...GCCTTCTTGTTT/ACAGCAGTAATA...TTCAG|GGC | 2 | 1 | 66.367 |
| 17676289 | GT-AG | 0 | 1.000000099473604e-05 | 5925 | rna-XM_038338305.1 3555677 | 29 | 167827927 | 167833851 | Arvicola amphibius 1047088 | ATG|GTAAGTGCAG...TTTTTCTTATCC/ATTTTTCTTATC...CTCAG|GCA | 1 | 1 | 69.776 |
| 17676290 | GT-AG | 0 | 1.000000099473604e-05 | 5146 | rna-XM_038338305.1 3555677 | 30 | 167834016 | 167839161 | Arvicola amphibius 1047088 | CAG|GTGCGCATGT...GGTCCCTGGACC/ATTGTTCTCAAA...AACAG|GAT | 0 | 1 | 73.122 |
| 17676291 | GT-AG | 0 | 0.0013034107875466 | 4667 | rna-XM_038338305.1 3555677 | 31 | 167839390 | 167844056 | Arvicola amphibius 1047088 | GAG|GTATAATAGA...TGGTCCTTAAAG/AGTATGTTCAAT...TCCAG|GAA | 0 | 1 | 77.776 |
| 17676292 | GT-AG | 0 | 1.000000099473604e-05 | 4677 | rna-XM_038338305.1 3555677 | 32 | 167844135 | 167848811 | Arvicola amphibius 1047088 | ATG|GTAATGTTGC...GTCCTCTTCCCC/TTCTGTCCCACA...CCCAG|CGC | 0 | 1 | 79.367 |
| 17676293 | GT-AG | 0 | 1.4191194446187342e-05 | 31677 | rna-XM_038338305.1 3555677 | 33 | 167849031 | 167880707 | Arvicola amphibius 1047088 | CGG|GTAGGTGTGG...ATTTCCTTACTT/TATTTCCTTACT...CACAG|GCC | 0 | 1 | 83.837 |
| 17676294 | GT-AG | 0 | 1.000000099473604e-05 | 7584 | rna-XM_038338305.1 3555677 | 34 | 167880926 | 167888509 | Arvicola amphibius 1047088 | CAG|GTGAGTGTGT...TTCTTTTTACTC/GTTCTTTTTACT...CGCAG|GTG | 2 | 1 | 88.286 |
| 17676295 | GT-AG | 0 | 1.000000099473604e-05 | 10198 | rna-XM_038338305.1 3555677 | 35 | 167888564 | 167898761 | Arvicola amphibius 1047088 | CAG|GTCAGGACCT...AGTGCCTCATCT/GAGTGCCTCATC...TGCAG|TTC | 2 | 1 | 89.388 |
| 17676296 | GT-AG | 0 | 1.000000099473604e-05 | 451 | rna-XM_038338305.1 3555677 | 36 | 167898868 | 167899318 | Arvicola amphibius 1047088 | AAG|GTAGATATTC...TCTGTTTTGTTC/GTTTTGTTCAAA...ATCAG|GAA | 0 | 1 | 91.551 |
| 17676297 | GT-AG | 0 | 0.0018466486764837 | 2211 | rna-XM_038338305.1 3555677 | 37 | 167899421 | 167901631 | Arvicola amphibius 1047088 | CAG|GTATGTGTCA...TGGTTTTTAACA/TGGTTTTTAACA...TTTAG|ATC | 0 | 1 | 93.633 |
| 17676298 | GT-AG | 0 | 1.3098929114132067e-05 | 1690 | rna-XM_038338305.1 3555677 | 38 | 167901765 | 167903454 | Arvicola amphibius 1047088 | AGG|GTAAGCCCTC...CTGATTTTGAAT/ATTTTACTGATT...TTTAG|CAA | 1 | 1 | 96.347 |
| 17676299 | GT-AG | 0 | 1.000000099473604e-05 | 1071 | rna-XM_038338305.1 3555677 | 39 | 167903598 | 167904668 | Arvicola amphibius 1047088 | AAC|GTGAGTGCGC...GCGCCCTCAACT/GGCGCCCTCAAC...CTCAG|GAG | 0 | 1 | 99.265 |
| 17692038 | GT-AG | 0 | 1.000000099473604e-05 | 15094 | rna-XM_038338305.1 3555677 | 1 | 167735465 | 167750558 | Arvicola amphibius 1047088 | AAG|GTAAATATAA...CTTTCCTTTTCC/TCATGTTTAAAA...ACCAG|CTT | 0 | 2.469 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);