introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
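To illustrate how `phase` and `in_cds` combine in a query, here is a minimal sketch using Python's standard `sqlite3` module. It assumes a trimmed, in-memory copy of the table holding only the columns the query needs; the helper name `phase_counts` is hypothetical, and the five sample rows are taken from the listing for transcript 9059377.

```python
import sqlite3

# Trimmed, illustrative copy of the introns table (assumption: only the
# columns used by the query are kept; the real table has 14 columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# First five rows listed for transcript 9059377:
# (id, transcript_id, ordinal_index, phase, in_cds)
sample_rows = [
    (48953189, 9059377, 1, 2, 1),
    (48953190, 9059377, 2, 1, 1),
    (48953191, 9059377, 3, 1, 1),
    (48953192, 9059377, 4, 1, 1),
    (48953193, 9059377, 5, 1, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", sample_rows)


def phase_counts(conn, transcript_id):
    """Count coding-sequence introns per phase for one transcript."""
    cur = conn.execute(
        """
        SELECT phase, COUNT(*)
        FROM introns
        WHERE transcript_id = ? AND in_cds = 1
        GROUP BY phase
        ORDER BY phase
        """,
        (transcript_id,),
    )
    return cur.fetchall()


print(phase_counts(conn, 9059377))  # [(1, 4), (2, 1)]
```

Filtering on `in_cds = 1` keeps introns with a null `phase` (those outside the coding sequence) out of the grouping.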
43 rows where transcript_id = 9059377
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48953189 | GT-AG | 0 | 1.000000099473604e-05 | 1054 | rna-XM_036577711.1 9059377 | 1 | 15924323 | 15925376 | Colossoma macropomum 42526 | GCG\|GTAAGGTGAA...AGTCTCTTATGT/CAGTCTCTTATG...CACAG\|GGG | 2 | 1 | 0.687 |
| 48953190 | GT-AG | 0 | 1.000000099473604e-05 | 658 | rna-XM_036577711.1 9059377 | 2 | 15923474 | 15924131 | Colossoma macropomum 42526 | AAG\|GTGACTGATG...TGCCTTTTATTC/AAATCTCTAATT...TTTAG\|AAA | 1 | 1 | 3.667 |
| 48953191 | GT-AG | 0 | 1.3688394881691206e-05 | 200 | rna-XM_036577711.1 9059377 | 3 | 15923235 | 15923434 | Colossoma macropomum 42526 | TTG\|GTAAGTCTGG...GTTTTCTTGCTT/CTGAGATTCAGT...CCCAG\|ATG | 1 | 1 | 4.276 |
| 48953192 | GT-AG | 0 | 6.174562676710896e-05 | 2457 | rna-XM_036577711.1 9059377 | 4 | 15920619 | 15923075 | Colossoma macropomum 42526 | CAG\|GTATGACTGA...AAAGCTTTAAAT/GCTTTGTTCATG...CACAG\|TGG | 1 | 1 | 6.757 |
| 48953193 | GT-AG | 0 | 1.000000099473604e-05 | 510 | rna-XM_036577711.1 9059377 | 5 | 15920097 | 15920606 | Colossoma macropomum 42526 | AAG\|GTGAGGTCCT...TTGCCCTTGACT/CCTTGACTTATA...TCTAG\|AGG | 1 | 1 | 6.944 |
| 48953194 | GT-AG | 0 | 6.54324847674938e-05 | 1983 | rna-XM_036577711.1 9059377 | 6 | 15918004 | 15919986 | Colossoma macropomum 42526 | CAG\|GTATGGATTT...TTTTTCTAAACC/CTTTTTCTAAAC...CACAG\|GAC | 0 | 1 | 8.661 |
| 48953195 | GT-AG | 0 | 1.000000099473604e-05 | 6459 | rna-XM_036577711.1 9059377 | 7 | 15911373 | 15917831 | Colossoma macropomum 42526 | CAG\|GTGAGTGTTG...CTGTCTCTGTCT/CTCTCTCTCTCT...TGTAG\|GTG | 1 | 1 | 11.345 |
| 48953196 | GT-AG | 0 | 1.000000099473604e-05 | 448 | rna-XM_036577711.1 9059377 | 8 | 15910717 | 15911164 | Colossoma macropomum 42526 | AAG\|GTGAACGGGG...TTTCTCTTCATG/TCTTTTTTCAAA...CATAG\|AAA | 2 | 1 | 14.591 |
| 48953197 | GT-AG | 0 | 1.000000099473604e-05 | 207 | rna-XM_036577711.1 9059377 | 9 | 15910483 | 15910689 | Colossoma macropomum 42526 | GAA\|GTGAGTCTCT...ACATTCGTAATC/ATCTTGCTCATT...TGCAG\|TTT | 2 | 1 | 15.012 |
| 48953198 | GT-AG | 0 | 1.2816872562861348e-05 | 108 | rna-XM_036577711.1 9059377 | 10 | 15910285 | 15910392 | Colossoma macropomum 42526 | GAG\|GTAGGTTTGT...ATGGTCATATCT/TATATGGTCATA...TATAG\|GTC | 2 | 1 | 16.417 |
| 48953199 | GT-AG | 0 | 1.000000099473604e-05 | 1560 | rna-XM_036577711.1 9059377 | 11 | 15908589 | 15910148 | Colossoma macropomum 42526 | CTG\|GTAAAAATTC...CTGCTTTTATCT/TTGTTTCTTATG...TGTAG\|GAT | 0 | 1 | 18.539 |
| 48953200 | GT-AG | 0 | 1.000000099473604e-05 | 174 | rna-XM_036577711.1 9059377 | 12 | 15908257 | 15908430 | Colossoma macropomum 42526 | TAG\|GTAAGGCTGT...TTCATTTTATCA/TTGTACTTCATT...CACAG\|GGC | 2 | 1 | 21.005 |
| 48953201 | GT-AG | 0 | 1.000000099473604e-05 | 120 | rna-XM_036577711.1 9059377 | 13 | 15908025 | 15908144 | Colossoma macropomum 42526 | CAG\|GTGAGCTGTG...CAATCATTATTT/GTAGTTGTCACA...TGTAG\|AGC | 0 | 1 | 22.753 |
| 48953202 | GT-AG | 0 | 1.000000099473604e-05 | 1103 | rna-XM_036577711.1 9059377 | 14 | 15906774 | 15907876 | Colossoma macropomum 42526 | CAA\|GTAAGAAAAC...ATTTTTCTACCT/TATTTTTCTACC...TTCAG\|ATT | 1 | 1 | 25.062 |
| 48953203 | GT-AG | 0 | 1.000000099473604e-05 | 437 | rna-XM_036577711.1 9059377 | 15 | 15906118 | 15906554 | Colossoma macropomum 42526 | CAA\|GTGAGAAGAG...TTCACCTTCTCT/AAACGTTTCACC...TGTAG\|CCG | 1 | 1 | 28.48 |
| 48953204 | GT-AG | 0 | 2.0506688811196408e-05 | 427 | rna-XM_036577711.1 9059377 | 16 | 15905628 | 15906054 | Colossoma macropomum 42526 | AAG\|GTAGGTATTT...CTTGTCTTACCT/ACTTGTCTTACC...TGTAG\|AGT | 1 | 1 | 29.463 |
| 48953205 | GT-AG | 0 | 0.0002887396652135 | 1153 | rna-XM_036577711.1 9059377 | 17 | 15904345 | 15905497 | Colossoma macropomum 42526 | TCT\|GTAAGTCTTT...TACTTTTAAATG/TATAGTTTGAAG...CTCAG\|ATC | 2 | 1 | 31.492 |
| 48953206 | GT-AG | 0 | 1.000000099473604e-05 | 110 | rna-XM_036577711.1 9059377 | 18 | 15904054 | 15904163 | Colossoma macropomum 42526 | AAG\|GTGTGTGAAT...ACTTTGTTAATT/ACTTTGTTAATT...TGTAG\|ATA | 0 | 1 | 34.316 |
| 48953207 | GT-AG | 0 | 3.66508391405064e-05 | 95 | rna-XM_036577711.1 9059377 | 19 | 15903884 | 15903978 | Colossoma macropomum 42526 | AAG\|GTGTGCATAA...TTTTACTTACCA/ATTTTACTTACC...CACAG\|ATG | 0 | 1 | 35.487 |
| 48953208 | GT-AG | 0 | 0.0315133429562426 | 2028 | rna-XM_036577711.1 9059377 | 20 | 15901748 | 15903775 | Colossoma macropomum 42526 | CAG\|GTATCTCTCT...ACTGCCAAAATT/AAATGATTGAAT...TTTAG\|GAA | 0 | 1 | 37.172 |
| 48953209 | GT-AG | 0 | 1.000000099473604e-05 | 176 | rna-XM_036577711.1 9059377 | 21 | 15901431 | 15901606 | Colossoma macropomum 42526 | CCT\|GTTAGTGCAC...TTTTTCTTGTCT/TTTGGACTCAGA...TGTAG\|CAA | 0 | 1 | 39.373 |
| 48953210 | GT-AG | 0 | 1.000000099473604e-05 | 2737 | rna-XM_036577711.1 9059377 | 22 | 15898567 | 15901303 | Colossoma macropomum 42526 | ACA\|GTGAGTGTTT...GCTTTCTTACTG/TTCTTACTGATT...GCCAG\|ATG | 1 | 1 | 41.355 |
| 48953211 | GT-AG | 0 | 1.000000099473604e-05 | 1226 | rna-XM_036577711.1 9059377 | 23 | 15896109 | 15897334 | Colossoma macropomum 42526 | AAG\|GTGAGTAGTT...TCTGTTTTACTC/TTCTGTTTTACT...CACAG\|CTG | 0 | 1 | 60.581 |
| 48953212 | GT-AG | 0 | 1.6860822627673552e-05 | 397 | rna-XM_036577711.1 9059377 | 24 | 15895611 | 15896007 | Colossoma macropomum 42526 | TTG\|GTAAGCACGT...TTATCTTTACAT/ATTGTGTTTATC...TTTAG\|GAT | 2 | 1 | 62.157 |
| 48953213 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_036577711.1 9059377 | 25 | 15895457 | 15895538 | Colossoma macropomum 42526 | CAG\|GTAATAAAAA...GGATTCTGAATG/GGGATTCTGAAT...TGCAG\|GCC | 2 | 1 | 63.28 |
| 48953214 | GT-AG | 0 | 1.000000099473604e-05 | 1008 | rna-XM_036577711.1 9059377 | 26 | 15894277 | 15895284 | Colossoma macropomum 42526 | CAG\|GTGACATGAT...TGTATCTTACAG/TTGTATCTTACA...GGCAG\|GTG | 0 | 1 | 65.964 |
| 48953215 | GT-AG | 0 | 1.000000099473604e-05 | 3412 | rna-XM_036577711.1 9059377 | 27 | 15890686 | 15894097 | Colossoma macropomum 42526 | CAG\|GTGAACATGT...AATCCCTAAGTT/CTTGTTCTTATC...TGCAG\|CTA | 2 | 1 | 68.758 |
| 48953216 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_036577711.1 9059377 | 28 | 15890483 | 15890571 | Colossoma macropomum 42526 | GAA\|GTAAGACTCT...TTCTCCTGATTA/GTTCTCCTGATT...TTCAG\|GGT | 2 | 1 | 70.537 |
| 48953217 | GT-AG | 0 | 2.1833493905875497e-05 | 146 | rna-XM_036577711.1 9059377 | 29 | 15890228 | 15890373 | Colossoma macropomum 42526 | GAT\|GTAAGTATGT...GGTGTTTAAATT/AGGTGTTTAAAT...GTCAG\|ATG | 0 | 1 | 72.238 |
| 48953218 | GT-AG | 0 | 1.000000099473604e-05 | 2533 | rna-XM_036577711.1 9059377 | 30 | 15887540 | 15890072 | Colossoma macropomum 42526 | CAA\|GTAAGTACTG...GAAGCCATAGTT/ATAGTTATGAAA...GTCAG\|ATT | 2 | 1 | 74.657 |
| 48953219 | GT-AG | 0 | 1.000000099473604e-05 | 208 | rna-XM_036577711.1 9059377 | 31 | 15887235 | 15887442 | Colossoma macropomum 42526 | AAG\|GTTAGCAGCT...TGGATTTTACAA/GTTCTGTTCATT...GATAG\|ACG | 0 | 1 | 76.17 |
| 48953220 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_036577711.1 9059377 | 32 | 15887090 | 15887173 | Colossoma macropomum 42526 | AAG\|GTGAGTTGGC...TGAATTTTGATT/TGAATTTTGATT...CACAG\|TGC | 1 | 1 | 77.122 |
| 48953221 | GT-AG | 0 | 4.109328137773489e-05 | 665 | rna-XM_036577711.1 9059377 | 33 | 15886278 | 15886942 | Colossoma macropomum 42526 | AAG\|GTCCACACAC...AATATCTTGATC/AATATCTTGATC...CTTAG\|ATG | 1 | 1 | 79.416 |
| 48953222 | GT-AG | 0 | 0.0001956642414225 | 93 | rna-XM_036577711.1 9059377 | 34 | 15886064 | 15886156 | Colossoma macropomum 42526 | TTT\|GTAAGTATCA...CTGATCTTATTG/TCTGATCTTATT...ATTAG\|TGA | 2 | 1 | 81.305 |
| 48953223 | GT-AG | 0 | 0.0001278472609565 | 93 | rna-XM_036577711.1 9059377 | 35 | 15885875 | 15885967 | Colossoma macropomum 42526 | CAG\|GTATGGTGTC...TTTTTTTTGCCT/CATTGTTTAATG...TCCAG\|CTG | 2 | 1 | 82.803 |
| 48953224 | GT-AG | 0 | 0.0001040566042191 | 119 | rna-XM_036577711.1 9059377 | 36 | 15885672 | 15885790 | Colossoma macropomum 42526 | CAG\|GTCTGCATAT...GTGTGTTTAATC/GTGTGTTTAATC...TGTAG\|TTC | 2 | 1 | 84.114 |
| 48953225 | GT-AG | 0 | 1.000000099473604e-05 | 158 | rna-XM_036577711.1 9059377 | 37 | 15885419 | 15885576 | Colossoma macropomum 42526 | TCA\|GTGAGTCACT...GTACCCTTTTTT/CCTTTTTTCATC...CACAG\|AGG | 1 | 1 | 85.596 |
| 48953226 | GT-AG | 0 | 1.000000099473604e-05 | 2190 | rna-XM_036577711.1 9059377 | 38 | 15883169 | 15885358 | Colossoma macropomum 42526 | TGC\|GTGAGTATTA...ATTGACTGAGCT/GTGAAATTGACT...TGCAG\|AGT | 1 | 1 | 86.532 |
| 48953227 | GT-AG | 0 | 1.000000099473604e-05 | 1000 | rna-XM_036577711.1 9059377 | 39 | 15881992 | 15882991 | Colossoma macropomum 42526 | AAG\|GTGAATTGTG...AATTTATTAACT/AATTTATTAACT...CTCAG\|ATG | 1 | 1 | 89.295 |
| 48953228 | GT-AG | 0 | 3.0107262932218022e-05 | 93 | rna-XM_036577711.1 9059377 | 40 | 15881795 | 15881887 | Colossoma macropomum 42526 | AAG\|GTAGACTGAC...CACTCCTATACA/ATTAGAATCACT...CCCAG\|AGG | 0 | 1 | 90.918 |
| 48953229 | GT-AG | 0 | 1.000000099473604e-05 | 1115 | rna-XM_036577711.1 9059377 | 41 | 15880551 | 15881665 | Colossoma macropomum 42526 | CAG\|GTGTGTGAGG...TCTTCTGTGATT/TCTTCTGTGATT...TGCAG\|GTT | 0 | 1 | 92.931 |
| 48953230 | GT-AG | 0 | 0.0001183998740348 | 98 | rna-XM_036577711.1 9059377 | 42 | 15880191 | 15880288 | Colossoma macropomum 42526 | TCG\|GTAAGCAGTT...TTTTTTTTATTT/TTTTTTTTTATT...TTCAG\|ATT | 1 | 1 | 97.019 |
| 48953231 | GT-AG | 0 | 1.5567719472339963e-05 | 718 | rna-XM_036577711.1 9059377 | 43 | 15879398 | 15880115 | Colossoma macropomum 42526 | AAG\|GTAGTTTTAG...GATGTGTTACTC/GGTAGATTTACA...TGCAG\|GTA | 1 | 1 | 98.19 |
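The relationship between `start`, `end`, and `length` can be checked directly against the rows above. The sketch below assumes inclusive coordinates, so that `length = end - start + 1`; the source does not state this, but the first three rows are consistent with it. The name `span_length` is hypothetical.

```python
# (ordinal_index, start, end, length) copied from the first three rows of the table.
rows = [
    (1, 15924323, 15925376, 1054),
    (2, 15923474, 15924131, 658),
    (3, 15923235, 15923434, 200),
]


def span_length(start, end):
    """Length of an interval whose start and end are both inclusive (assumption)."""
    return end - start + 1


# Does the stored length match the coordinates for every sampled row?
lengths_match = all(span_length(s, e) == n for _, s, e, n in rows)

# Do genomic start positions decrease as ordinal_index increases?
starts = [s for _, s, _, _ in rows]
descending = all(a > b for a, b in zip(starts, starts[1:]))

print(lengths_match, descending)  # True True
```

Coordinates that fall as `ordinal_index` rises would be consistent with a transcript annotated on the reverse strand. That is an inference from the numbers, not something the table states.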
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
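As a rough illustration of how the two foreign keys behave, the sketch below recreates the schema in an in-memory SQLite database. The parent tables `transcripts` and `genomes` are hypothetical one-column stubs (their real definitions are not shown here), and `try_insert` is an illustrative helper. By default SQLite only enforces foreign keys once `PRAGMA foreign_keys = ON` has been issued on the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Foreign keys are not enforced by default; enable them for this connection.
conn.execute("PRAGMA foreign_keys = ON")

conn.executescript(
    """
    -- Hypothetical stubs: the real parent tables carry more columns.
    CREATE TABLE transcripts (id INTEGER PRIMARY KEY);
    CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);

    CREATE TABLE "introns" (
        "id" INTEGER,
        "dinucleotide_pair" TEXT,
        "is_minor" INTEGER,
        "score" REAL,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        "taxonomy_id" INTEGER,
        "scored_motifs" TEXT,
        "phase" INTEGER,
        "in_cds" INTEGER,
        "relative_position" REAL,
        PRIMARY KEY ([id]),
        FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
        FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );

    INSERT INTO transcripts VALUES (9059377);
    INSERT INTO genomes VALUES (42526);
    """
)


def try_insert(conn, row):
    """Insert one intron row; return False if a constraint rejects it."""
    try:
        conn.execute(
            "INSERT INTO introns VALUES (?,?,?,?,?,?,?,?,?,?,?,?,?,?)", row
        )
        return True
    except sqlite3.IntegrityError:
        return False


# First row of the listing for transcript 9059377.
first_row = (
    48953189, "GT-AG", 0, 1.000000099473604e-05, 1054, 9059377, 1,
    15924323, 15925376, 42526,
    "GCG|GTAAGGTGAA...AGTCTCTTATGT/CAGTCTCTTATG...CACAG|GGG", 2, 1, 0.687,
)
# Same values, but a made-up id and a transcript_id with no parent row.
orphan_row = (1,) + first_row[1:5] + (1,) + first_row[6:]

print(try_insert(conn, first_row))   # True
print(try_insert(conn, orphan_row))  # False
```

With the pragma left off, the second insert would succeed, so enforcement depends on how the database is opened rather than on the schema alone.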
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
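To see whether a filter such as `transcript_id = 9059377` can use one of these indexes, SQLite's `EXPLAIN QUERY PLAN` can be run against a small stand-in table. The sketch below assumes a trimmed three-column table and a hypothetical helper `plan_details`; the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Trimmed stand-in for the introns table (assumption: three columns only).
    CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, score REAL);
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)


def plan_details(conn, sql, params=()):
    """Return the human-readable detail column of each EXPLAIN QUERY PLAN row."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]


details = plan_details(
    conn,
    "SELECT id, score FROM introns WHERE transcript_id = ?",
    (9059377,),
)
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details)
print(uses_index)
```

An equality filter on an indexed column is the typical case where the planner searches the index instead of scanning the whole table.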