introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
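The column descriptions above can be exercised with a minimal sketch using Python's standard `sqlite3` module. It assumes an in-memory table holding only a subset of the documented columns, loaded with three rows copied from this page; the variable names (`rows`, `result`, `phase_counts`) are illustrative and not part of the database. Because `score` is documented as a percentage, the query divides by 100 to get a probability.

```python
import sqlite3

# Assumption: a reduced, in-memory copy of the introns table (subset of columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE introns (
           id INTEGER PRIMARY KEY,
           is_minor INTEGER,
           score REAL,
           phase INTEGER,
           in_cds INTEGER
       )"""
)

# Three rows taken from the table on this page (id, is_minor, score, phase, in_cds).
rows = [
    (48954097, 0, 1.000000099473604e-05, 0, 1),
    (48954098, 0, 1.000000099473604e-05, 1, 1),
    (48954100, 0, 2.3061094167774044e-05, 2, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)

# score is a percentage (0-100), so score / 100 is a probability in [0, 1].
# phase is NULL outside the coding sequence, hence the IS NOT NULL guard.
result = conn.execute(
    """SELECT id, score / 100.0 AS probability, phase
       FROM introns
       WHERE in_cds = 1 AND phase IS NOT NULL
       ORDER BY id"""
).fetchall()

# Count coding introns per phase.
phase_counts = dict(
    conn.execute(
        "SELECT phase, COUNT(*) FROM introns WHERE in_cds = 1 GROUP BY phase"
    )
)
```

Filtering on `is_minor = 1` or on a `score` threshold would follow the same pattern; which threshold is appropriate is not stated on this page.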
42 rows where transcript_id = 9059407
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48954097 | GT-AG | 0 | 1.000000099473604e-05 | 17850 | rna-XM_036591720.1 9059407 | 2 | 20736385 | 20754234 | Colossoma macropomum 42526 | AAG|GTAAGAACAC...TAATCTTTACTC/TCTTTACTCATT...CCTAG|TTG | 0 | 1 | 6.151 |
| 48954098 | GT-AG | 0 | 1.000000099473604e-05 | 155 | rna-XM_036591720.1 9059407 | 3 | 20736151 | 20736305 | Colossoma macropomum 42526 | CAG|GTAGTCAAAT...GCTATTTTGTCT/ATGTGACTAACT...TGCAG|TTT | 1 | 1 | 7.755 |
| 48954099 | GT-AG | 0 | 7.305222029280307e-05 | 360 | rna-XM_036591720.1 9059407 | 4 | 20735687 | 20736046 | Colossoma macropomum 42526 | CAG|GTAGCTAAAA...TTTGTCTGATTG/ATTTGTCTGATT...TGCAG|CAT | 0 | 1 | 9.866 |
| 48954100 | GT-AG | 0 | 2.3061094167774044e-05 | 278 | rna-XM_036591720.1 9059407 | 5 | 20735317 | 20735594 | Colossoma macropomum 42526 | CAC|GTAAGTCTCT...CACACATTAATG/CCTTGATTTACT...TTCAG|ATA | 2 | 1 | 11.734 |
| 48954101 | GT-AG | 0 | 0.0009665386878753 | 888 | rna-XM_036591720.1 9059407 | 6 | 20734353 | 20735240 | Colossoma macropomum 42526 | CAG|GTATGTTATA...AATGTTTTAGAA/AAATGTTTTAGA...TCCAG|GTG | 0 | 1 | 13.276 |
| 48954102 | GT-AG | 0 | 1.000000099473604e-05 | 8844 | rna-XM_036591720.1 9059407 | 7 | 20725411 | 20734254 | Colossoma macropomum 42526 | ACG|GTGAGATAGG...TCTCCCTTTGTT/GTGTTATTGAAG...TTCAG|ATT | 2 | 1 | 15.266 |
| 48954103 | GT-AG | 0 | 1.000000099473604e-05 | 16460 | rna-XM_036591720.1 9059407 | 8 | 20708880 | 20725339 | Colossoma macropomum 42526 | CAG|GTAAGACAGG...TGAACCTTTTCT/ACATTGCTTATA...TGCAG|ATA | 1 | 1 | 16.707 |
| 48954104 | GT-AG | 0 | 1.000000099473604e-05 | 2491 | rna-XM_036591720.1 9059407 | 9 | 20706245 | 20708735 | Colossoma macropomum 42526 | CAG|GTAATTCACT...ACCATTTTAGCC/TATATGCTGATT...TACAG|GGA | 1 | 1 | 19.631 |
| 48954105 | GT-AG | 0 | 1.000000099473604e-05 | 156 | rna-XM_036591720.1 9059407 | 10 | 20706012 | 20706167 | Colossoma macropomum 42526 | CAG|GTAAGGCACA...ATTATCTAAACC/TATTATCTAAAC...GACAG|ATC | 0 | 1 | 21.194 |
| 48954106 | GT-AG | 0 | 7.305259816886381e-05 | 146 | rna-XM_036591720.1 9059407 | 11 | 20705773 | 20705918 | Colossoma macropomum 42526 | GCT|GTAAGTGATG...CTTGCTTTAAAT/CTTGCTTTAAAT...CCCAG|CTG | 0 | 1 | 23.082 |
| 48954107 | GT-AG | 0 | 0.0044991755108329 | 510 | rna-XM_036591720.1 9059407 | 12 | 20705117 | 20705626 | Colossoma macropomum 42526 | TGG|GTACGCATGA...TATGTTTTGACT/TATGTTTTGACT...CTCAG|GCA | 2 | 1 | 26.045 |
| 48954108 | GT-AG | 0 | 1.000000099473604e-05 | 551 | rna-XM_036591720.1 9059407 | 13 | 20704430 | 20704980 | Colossoma macropomum 42526 | CGA|GTGAGTGTCT...TTGTCCTCATAT/CGTTTTCTTATT...CTCAG|CAT | 0 | 1 | 28.806 |
| 48954109 | GT-AG | 0 | 1.000000099473604e-05 | 273 | rna-XM_036591720.1 9059407 | 14 | 20704086 | 20704358 | Colossoma macropomum 42526 | AAG|GTAAGACCAC...TTTTTCTTTTCT/AAAATGTTGAAT...TTCAG|ACG | 2 | 1 | 30.248 |
| 48954110 | GT-AG | 0 | 1.000000099473604e-05 | 466 | rna-XM_036591720.1 9059407 | 15 | 20703560 | 20704025 | Colossoma macropomum 42526 | GAG|GTGAGTCCAG...GTTTCCATATTA/GCTGGTTTGAGT...CCTAG|GCA | 2 | 1 | 31.466 |
| 48954111 | GT-AG | 0 | 1.000000099473604e-05 | 172 | rna-XM_036591720.1 9059407 | 16 | 20703310 | 20703481 | Colossoma macropomum 42526 | AAA|GTAAGTTTGT...TGATATTTGTCT/GAGGAGTTGATA...TGCAG|GTG | 2 | 1 | 33.049 |
| 48954112 | GT-AG | 0 | 1.000000099473604e-05 | 1602 | rna-XM_036591720.1 9059407 | 17 | 20701540 | 20703141 | Colossoma macropomum 42526 | CAA|GTGAGACACA...CAAATCTTATTT/TCTTATTTAACT...TTTAG|TCG | 2 | 1 | 36.46 |
| 48954113 | GT-AG | 0 | 0.0007406006199336 | 117 | rna-XM_036591720.1 9059407 | 18 | 20701376 | 20701492 | Colossoma macropomum 42526 | GAA|GTAAGTTTCT...TGTCCTTTGAAA/TGTCCTTTGAAA...ACCAG|GTA | 1 | 1 | 37.414 |
| 48954114 | GT-AG | 0 | 1.000000099473604e-05 | 1805 | rna-XM_036591720.1 9059407 | 19 | 20699400 | 20701204 | Colossoma macropomum 42526 | TGG|GTAAGCCCCC...TGGTCCTTTCTG/AGAACTCTAACA...GACAG|GTC | 1 | 1 | 40.885 |
| 48954115 | GT-AG | 0 | 1.000000099473604e-05 | 388 | rna-XM_036591720.1 9059407 | 20 | 20698904 | 20699291 | Colossoma macropomum 42526 | AGC|GTAAGTGCTC...TCTTTTATAACT/CTCCTTTTCACC...ACAAG|GAT | 1 | 1 | 43.078 |
| 48954116 | GT-AG | 0 | 1.000000099473604e-05 | 807 | rna-XM_036591720.1 9059407 | 21 | 20698070 | 20698876 | Colossoma macropomum 42526 | GTG|GTAAGCCCCC...TAATCCTCACTG/CTAATCCTCACT...GGCAG|CTG | 1 | 1 | 43.626 |
| 48954117 | GT-AG | 0 | 1.000000099473604e-05 | 1017 | rna-XM_036591720.1 9059407 | 22 | 20696897 | 20697913 | Colossoma macropomum 42526 | TAG|GTAAGGGCAC...AACGCCCTAACG/TCTGGCCTCACT...AACAG|CCA | 1 | 1 | 46.793 |
| 48954118 | GT-AG | 0 | 1.000000099473604e-05 | 1752 | rna-XM_036591720.1 9059407 | 23 | 20694983 | 20696734 | Colossoma macropomum 42526 | TGG|GTGAGTAAAC...GTTCTCTTATTT/TGTTCTCTTATT...CTCAG|ATC | 1 | 1 | 50.081 |
| 48954119 | GT-AG | 0 | 1.000000099473604e-05 | 2473 | rna-XM_036591720.1 9059407 | 24 | 20692406 | 20694878 | Colossoma macropomum 42526 | CTG|GTAAGATTAT...CCTTTCTTGTTT/TGTCTGGTAATA...TTCAG|CTG | 0 | 1 | 52.192 |
| 48954120 | GT-AG | 0 | 0.0002987385408276 | 798 | rna-XM_036591720.1 9059407 | 25 | 20691584 | 20692381 | Colossoma macropomum 42526 | AAG|GTAGTTTTTG...TAGATTTTATTT/ATAGATTTTATT...CCCAG|CGC | 0 | 1 | 52.68 |
| 48954121 | GT-AG | 0 | 1.000000099473604e-05 | 211 | rna-XM_036591720.1 9059407 | 26 | 20691128 | 20691338 | Colossoma macropomum 42526 | TAG|GTAAGCCTGG...ACAGCACTAATT/ATAGTACTAATG...TCCAG|CCG | 2 | 1 | 57.653 |
| 48954122 | GT-AG | 0 | 0.0260411898573671 | 277 | rna-XM_036591720.1 9059407 | 27 | 20690787 | 20691063 | Colossoma macropomum 42526 | AAG|GTAACCTACA...TTTTTGTTATTT/GTTTTTGTTATT...ATCAG|AGA | 0 | 1 | 58.952 |
| 48954123 | GT-AG | 0 | 1.000000099473604e-05 | 209 | rna-XM_036591720.1 9059407 | 28 | 20690575 | 20690783 | Colossoma macropomum 42526 | AGA|GTAAGTGAAG...TTGTCCATGATG/TAAACTTTAAGT...CTCAG|GTG | 0 | 1 | 59.013 |
| 48954124 | GT-AG | 0 | 1.000000099473604e-05 | 312 | rna-XM_036591720.1 9059407 | 29 | 20690099 | 20690410 | Colossoma macropomum 42526 | TCG|GTTAGAAGAT...GCTACTTTGACT/TGCATTTTAATC...TGCAG|TGA | 2 | 1 | 62.343 |
| 48954125 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_036591720.1 9059407 | 30 | 20689916 | 20690019 | Colossoma macropomum 42526 | AAG|GTAAAAGCTA...ATTGTTTTGGCT/ATAATTATAAAT...GACAG|GTG | 0 | 1 | 63.946 |
| 48954126 | GT-AG | 0 | 0.0004160910263264 | 215 | rna-XM_036591720.1 9059407 | 31 | 20689557 | 20689771 | Colossoma macropomum 42526 | AAG|GTATGTCACC...ATTGTCTTATAC/CATTGTCTTATA...CCTAG|ACC | 0 | 1 | 66.87 |
| 48954127 | GT-AG | 0 | 1.000000099473604e-05 | 611 | rna-XM_036591720.1 9059407 | 32 | 20688850 | 20689460 | Colossoma macropomum 42526 | CAG|GTAGTGCCCT...AGATCTTTGATT/AGATCTTTGATT...TGTAG|GCT | 0 | 1 | 68.819 |
| 48954128 | GT-AG | 0 | 1.000000099473604e-05 | 729 | rna-XM_036591720.1 9059407 | 33 | 20687974 | 20688702 | Colossoma macropomum 42526 | GTG|GTAAATGTCT...TTACTCTTGTTT/TTTCTTCTAACA...CACAG|AGT | 0 | 1 | 71.803 |
| 48954129 | GT-AG | 0 | 0.0004529671177475 | 557 | rna-XM_036591720.1 9059407 | 34 | 20687310 | 20687866 | Colossoma macropomum 42526 | CAG|GTATAATGTC...GTTCTCTTCCTC/CCTGTTCTCTTC...CACAG|TCG | 2 | 1 | 73.975 |
| 48954130 | GT-AG | 0 | 0.000219271715375 | 569 | rna-XM_036591720.1 9059407 | 35 | 20686618 | 20687186 | Colossoma macropomum 42526 | CAG|GTAACTCAGC...CTTCCCTTCTCT/GTGATACTTATA...GACAG|TAT | 2 | 1 | 76.472 |
| 48954131 | GT-AG | 0 | 1.685614447933736e-05 | 1050 | rna-XM_036591720.1 9059407 | 36 | 20685420 | 20686469 | Colossoma macropomum 42526 | GCG|GTAGGTCCTC...TCTTTCTCAACT/TTCTTTCTCAAC...CCCAG|GTT | 0 | 1 | 79.476 |
| 48954132 | GT-AG | 0 | 0.0004230716629076 | 1134 | rna-XM_036591720.1 9059407 | 37 | 20684045 | 20685178 | Colossoma macropomum 42526 | ATT|GTAAGTTTTT...CTCTCCTTCCCC/TTTGTCATCACA...TACAG|GCC | 1 | 1 | 84.369 |
| 48954133 | GT-AG | 0 | 1.000000099473604e-05 | 541 | rna-XM_036591720.1 9059407 | 38 | 20683447 | 20683987 | Colossoma macropomum 42526 | CTG|GTAAAGGCCT...TATGCTTTACTC/TTTTACCTGATT...CTTAG|ATC | 1 | 1 | 85.526 |
| 48954134 | GT-AG | 0 | 1.000000099473604e-05 | 137 | rna-XM_036591720.1 9059407 | 39 | 20683137 | 20683273 | Colossoma macropomum 42526 | GAT|GTGAGTTTTA...TGCGTTTTCTCT/CTGTGTGTCACT...TACAG|CAC | 0 | 1 | 89.038 |
| 48954135 | GT-AG | 0 | 1.000000099473604e-05 | 2297 | rna-XM_036591720.1 9059407 | 40 | 20680711 | 20683007 | Colossoma macropomum 42526 | GAG|GTTAAAAACA...ATTCTCTTTTTT/TTCTGCTTTACA...CACAG|GTT | 0 | 1 | 91.657 |
| 48954136 | GT-AG | 0 | 1.000000099473604e-05 | 561 | rna-XM_036591720.1 9059407 | 41 | 20679943 | 20680503 | Colossoma macropomum 42526 | CAA|GTGGGTGCCT...GAGTGCTTCTCT/CTGTCAGTCACA...CGCAG|GGC | 0 | 1 | 95.859 |
| 48954137 | GT-AG | 0 | 1.000000099473604e-05 | 2710 | rna-XM_036591720.1 9059407 | 42 | 20677116 | 20679825 | Colossoma macropomum 42526 | AAG|GTACAGTCTC...TGTTTCACAATA/TCATGTTTCACA...CACAG|ATG | 0 | 1 | 98.234 |
| 48961117 | GT-AG | 0 | 1.000000099473604e-05 | 1397 | rna-XM_036591720.1 9059407 | 1 | 20754848 | 20756244 | Colossoma macropomum 42526 | AGG|GTGAGTCAAC...TAATCTTTCTTT/CAGAGGTTAATC...CATAG|GCA |  | 0 | 2.192 |
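The filter behind this table (`transcript_id = 9059407`) and the relationship between `length`, `start`, and `end` can be sketched as follows. This is an illustration under stated assumptions: an in-memory table with a subset of columns, populated with three rows copied from the table above. The column names `start` and `end` are quoted defensively because `END` is an SQL keyword. In the rows shown, `length` equals `end - start + 1`, which suggests inclusive coordinates; the sketch checks that rather than assuming it holds for the whole database.

```python
import sqlite3

# Assumption: reduced in-memory copy of the introns table.
conn = sqlite3.connect(":memory:")
conn.execute(
    '''CREATE TABLE introns (
           id INTEGER PRIMARY KEY,
           length INTEGER,
           transcript_id INTEGER,
           ordinal_index INTEGER,
           "start" INTEGER,
           "end" INTEGER
       )'''
)

# Rows copied from the table above:
# (id, length, transcript_id, ordinal_index, start, end)
sample = [
    (48954097, 17850, 9059407, 2, 20736385, 20754234),
    (48954098, 155, 9059407, 3, 20736151, 20736305),
    (48954099, 360, 9059407, 4, 20735687, 20736046),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample)

# Same filter as the page, ordered by position within the transcript.
rows = conn.execute(
    '''SELECT id, ordinal_index, length, "end" - "start" + 1 AS span
       FROM introns
       WHERE transcript_id = ?
       ORDER BY ordinal_index''',
    (9059407,),
).fetchall()

# Rows whose stored length disagrees with the inclusive coordinate span.
mismatches = [r for r in rows if r[2] != r[3]]
```

Ordering by `ordinal_index` rather than `id` also places the first intron (id 48961117) at the top, whereas the default sort by `id` lists it last.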
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
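To see how one of these indexes serves a lookup like the one on this page, the following sketch asks SQLite for its query plan. It assumes a simplified in-memory schema (three columns, no foreign keys) with only `idx_introns_transcript_id` created; the variable names are illustrative. The exact wording of the plan text differs between SQLite versions, so the sketch only checks whether the index name appears in it.

```python
import sqlite3

# Assumption: simplified schema, enough to exercise one index.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# EXPLAIN QUERY PLAN returns rows whose last field is a human-readable detail.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (9059407,),
).fetchall()
details = [row[-1] for row in plan]

# True when the planner chose the transcript_id index for the equality filter.
uses_index = any("idx_introns_transcript_id" in d for d in details)
```

The same check could be repeated for filters on `taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, or `in_cds`, each of which has its own index in the schema above.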