introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
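As a rough illustration of how some of these columns relate to one another, the sketch below checks two relationships against the row with id 48953818 listed for transcript 9059398. Both helper functions are hypothetical and rest on assumptions that the column descriptions do not state outright: that `start` and `end` are 1-based inclusive coordinates, and that the `|` characters in `scored_motifs` mark the exon/intron boundaries.

```python
def intron_length(start: int, end: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive start/end coordinates."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a pair such as 'GT-AG', ASSUMING '|' marks the exon/intron boundaries."""
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    five_prime = scored_motifs[first_bar + 1:first_bar + 3]  # first two intronic bases
    three_prime = scored_motifs[last_bar - 2:last_bar]       # last two intronic bases
    return f"{five_prime}-{three_prime}"


# Values copied from the row with id 48953818.
row = {
    "dinucleotide_pair": "GT-AG",
    "length": 1418,
    "start": 31791053,
    "end": 31792470,
    "scored_motifs": "AAG|GTAAGATATT...TATAACTTAAAA/TGGTTGTTTATG...TGCAG|TTT",
}

print(intron_length(row["start"], row["end"]) == row["length"])
print(terminal_dinucleotides(row["scored_motifs"]) == row["dinucleotide_pair"])
```

Both assumptions hold for this row, but they should be confirmed against the database documentation before being relied on more widely.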
32 rows where transcript_id = 9059398
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48953818 | GT-AG | 0 | 1.000000099473604e-05 | 1418 | rna-XM_036574665.1 9059398 | 2 | 31791053 | 31792470 | Colossoma macropomum 42526 | AAG\|GTAAGATATT...TATAACTTAAAA/TGGTTGTTTATG...TGCAG\|TTT | 0 | 1 | 3.34 |
| 48953819 | GT-AG | 0 | 1.000000099473604e-05 | 222 | rna-XM_036574665.1 9059398 | 3 | 31790652 | 31790873 | Colossoma macropomum 42526 | CAG\|GTCAGTCTCA...TATTGTTTAATT/TATTGTTTAATT...TTCAG\|ACC | 2 | 1 | 6.589 |
| 48953820 | GT-AG | 0 | 0.0002878366473313 | 210 | rna-XM_036574665.1 9059398 | 4 | 31790339 | 31790548 | Colossoma macropomum 42526 | AAG\|GTGTACTGTT...GTGTCCTTCCTC/CATGAGTTAATA...TGTAG\|TGT | 0 | 1 | 8.459 |
| 48953821 | GT-AG | 0 | 1.000000099473604e-05 | 156 | rna-XM_036574665.1 9059398 | 5 | 31790016 | 31790171 | Colossoma macropomum 42526 | TAG\|GTATGGAGAC...TGTATTTTACCT/ATGTATTTTACC...CCTAG\|TAA | 2 | 1 | 11.49 |
| 48953822 | GT-AG | 0 | 1.000000099473604e-05 | 436 | rna-XM_036574665.1 9059398 | 6 | 31789380 | 31789815 | Colossoma macropomum 42526 | TGT\|GTGAGTTCAT...GACATTTTAATA/ATATTCTTCAAT...TTTAG\|CCA | 1 | 1 | 15.121 |
| 48953823 | GT-AG | 0 | 1.000000099473604e-05 | 565 | rna-XM_036574665.1 9059398 | 7 | 31788690 | 31789254 | Colossoma macropomum 42526 | TTA\|GTGAGTCACT...GGGGTCTTATCC/CTTTGGCTTATT...GGCAG\|GTT | 0 | 1 | 17.39 |
| 48953824 | GT-AG | 0 | 1.000000099473604e-05 | 198 | rna-XM_036574665.1 9059398 | 8 | 31788269 | 31788466 | Colossoma macropomum 42526 | TAG\|GTAAGATATA...ATTTCCCTGATG/ACATAACTTACT...TTCAG\|GTC | 1 | 1 | 21.438 |
| 48953825 | GT-AG | 0 | 0.0016997638905633 | 469 | rna-XM_036574665.1 9059398 | 9 | 31787531 | 31787999 | Colossoma macropomum 42526 | AGG\|GTATGTCTGA...TGTGTCTTACTT/ATGTGTCTTACT...TGAAG\|GAG | 0 | 1 | 26.321 |
| 48953826 | GT-AG | 0 | 1.000000099473604e-05 | 140 | rna-XM_036574665.1 9059398 | 10 | 31787193 | 31787332 | Colossoma macropomum 42526 | GAG\|GTGTGAGTGT...ATGTTTTTGAAA/ATGTTTTTGAAA...TATAG\|GAC | 0 | 1 | 29.915 |
| 48953827 | GT-AG | 0 | 1.000000099473604e-05 | 574 | rna-XM_036574665.1 9059398 | 11 | 31786229 | 31786802 | Colossoma macropomum 42526 | AAG\|GTAAAGTCCA...AAATCTTTGTCG/CATGTAATAAAT...TGCAG\|GTT | 0 | 1 | 36.994 |
| 48953828 | GT-AG | 0 | 1.000000099473604e-05 | 1044 | rna-XM_036574665.1 9059398 | 12 | 31785047 | 31786090 | Colossoma macropomum 42526 | GTG\|GTAAGTCATG...TTTTTGTTATTC/TTGTTATTCATG...TTCAG\|ATG | 0 | 1 | 39.499 |
| 48953829 | GT-AG | 0 | 0.0038327287746456 | 121 | rna-XM_036574665.1 9059398 | 13 | 31784686 | 31784806 | Colossoma macropomum 42526 | CAG\|GTATATTCAG...TTTCCCTGAGTT/GTTTCCCTGAGT...TGTAG\|AAA | 0 | 1 | 43.856 |
| 48953830 | GT-AG | 0 | 1.000000099473604e-05 | 114 | rna-XM_036574665.1 9059398 | 14 | 31784347 | 31784460 | Colossoma macropomum 42526 | CAA\|GTGAGTGAGG...GTGTTTTCAATA/TGTGTTTTCAAT...TTTAG\|GTT | 0 | 1 | 47.94 |
| 48953831 | GT-AG | 0 | 1.000000099473604e-05 | 396 | rna-XM_036574665.1 9059398 | 15 | 31783804 | 31784199 | Colossoma macropomum 42526 | CAG\|GTGCGTATTA...AACTCTTCAGTC/TCTTCAGTCATT...TCCAG\|GCA | 0 | 1 | 50.608 |
| 48953832 | GT-AG | 0 | 1.000000099473604e-05 | 164 | rna-XM_036574665.1 9059398 | 16 | 31783474 | 31783637 | Colossoma macropomum 42526 | TTG\|GTAGGAGACC...CCTGTCTTCTCC/ATGGTGCTGATA...TGCAG\|CTC | 1 | 1 | 53.621 |
| 48953833 | GT-AG | 0 | 1.000000099473604e-05 | 461 | rna-XM_036574665.1 9059398 | 17 | 31782884 | 31783344 | Colossoma macropomum 42526 | CTG\|GTACTACATA...TCTTTCTTTGTG/TTTGTGATCATT...CCTAG\|GCA | 1 | 1 | 55.963 |
| 48953834 | GT-AG | 0 | 1.000000099473604e-05 | 141 | rna-XM_036574665.1 9059398 | 18 | 31782529 | 31782669 | Colossoma macropomum 42526 | CAG\|GTAATGTTAT...GCCTTTTTACCT/TTTCTCTTCATG...TACAG\|CCT | 2 | 1 | 59.848 |
| 48953835 | GT-AG | 0 | 1.000000099473604e-05 | 176 | rna-XM_036574665.1 9059398 | 19 | 31782224 | 31782399 | Colossoma macropomum 42526 | GTG\|GTGGGTACTG...ACATCGTTAATC/AATAATTTGATC...CTTAG\|GTC | 2 | 1 | 62.189 |
| 48953836 | GT-AG | 0 | 1.000000099473604e-05 | 118 | rna-XM_036574665.1 9059398 | 20 | 31781929 | 31782046 | Colossoma macropomum 42526 | TAG\|GTTAGACAGC...GCATTCTTCTCT/CTGAGACTAAGC...CCCAG\|GCA | 2 | 1 | 65.402 |
| 48953837 | GT-AG | 0 | 1.000000099473604e-05 | 127 | rna-XM_036574665.1 9059398 | 21 | 31781735 | 31781861 | Colossoma macropomum 42526 | TTG\|GTAAGTTGAG...TGTTCCTCATCT/GTGTTCCTCATC...TCCAG\|TTG | 0 | 1 | 66.618 |
| 48953838 | GT-AG | 0 | 1.000000099473604e-05 | 259 | rna-XM_036574665.1 9059398 | 22 | 31781365 | 31781623 | Colossoma macropomum 42526 | CAG\|GTTTGGGTCA...GTTTCTTTCATT/GTTTCTTTCATT...TGTAG\|GTG | 0 | 1 | 68.633 |
| 48953839 | GT-AG | 0 | 3.806866642298988e-05 | 250 | rna-XM_036574665.1 9059398 | 23 | 31780923 | 31781172 | Colossoma macropomum 42526 | AGT\|GTTAGTTCCT...CTTTCTTTAACT/CTTTAACTCACT...TTTAG\|GTG | 0 | 1 | 72.118 |
| 48953840 | GT-AG | 0 | 1.000000099473604e-05 | 240 | rna-XM_036574665.1 9059398 | 24 | 31780533 | 31780772 | Colossoma macropomum 42526 | CAG\|GTGAAAGAGA...TACTTCTTAGGC/ATTTTTCCCACT...TATAG\|GTG | 0 | 1 | 74.841 |
| 48953841 | GT-AG | 0 | 1.000000099473604e-05 | 572 | rna-XM_036574665.1 9059398 | 25 | 31779817 | 31780388 | Colossoma macropomum 42526 | CAG\|GTGTGTAATA...ATTTCTATAGTG/TGTATTTCTATA...TCTAG\|GCA | 0 | 1 | 77.455 |
| 48953842 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_036574665.1 9059398 | 26 | 31779519 | 31779699 | Colossoma macropomum 42526 | GAA\|GTGAGTATCA...AAAGCTTTGTTT/GTGATGCTAATA...GGCAG\|ATT | 0 | 1 | 79.579 |
| 48953843 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_036574665.1 9059398 | 27 | 31779276 | 31779380 | Colossoma macropomum 42526 | CAG\|GTCCTGCTTT...TGATTATTGAAA/GGACTTTTCATA...CTCAG\|GTT | 0 | 1 | 82.084 |
| 48953844 | GT-AG | 0 | 6.615730590293108e-05 | 176 | rna-XM_036574665.1 9059398 | 28 | 31779052 | 31779227 | Colossoma macropomum 42526 | AAG\|GTATAATACA...ATGTTTGTATTT/CAAATATTTACA...TTTAG\|ACG | 0 | 1 | 82.955 |
| 48953845 | GT-AG | 0 | 0.0029652678141383 | 225 | rna-XM_036574665.1 9059398 | 29 | 31778606 | 31778830 | Colossoma macropomum 42526 | CAG\|GTATGCACTC...AATGCCTGAACT/TGCATGTTCATT...TCCAG\|GTC | 2 | 1 | 86.967 |
| 48953846 | GT-AG | 0 | 1.9465029591170376e-05 | 90 | rna-XM_036574665.1 9059398 | 30 | 31778390 | 31778479 | Colossoma macropomum 42526 | CAG\|GTACACACAT...AAATGATTAACA/AAATGATTAACA...CTTAG\|GAG | 2 | 1 | 89.254 |
| 48953847 | GT-AG | 0 | 6.721514232226273e-05 | 1077 | rna-XM_036574665.1 9059398 | 31 | 31777022 | 31778098 | Colossoma macropomum 42526 | CAG\|GTAAGCTGCT...GTCTCTTTCTCT/CTCTGTCTCTTT...TGCAG\|AAG | 2 | 1 | 94.536 |
| 48953848 | GT-AG | 0 | 1.000000099473604e-05 | 157 | rna-XM_036574665.1 9059398 | 32 | 31776730 | 31776886 | Colossoma macropomum 42526 | CAG\|GTCACAAACT...AATGCCATACTC/GTGTGACTGAAT...TATAG\|GAG | 2 | 1 | 96.987 |
| 48961109 | GT-AG | 0 | 1.000000099473604e-05 | 801 | rna-XM_036574665.1 9059398 | 1 | 31792552 | 31793352 | Colossoma macropomum 42526 | GAG\|GTAAGACTCG...AACATTTTGTCC/CTCTGACTGAGA...AACAG\|GTT | | 0 | 1.979 |
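The listing above is sorted by id, which is why the first intron (id 48961109) appears last. A minimal sketch of the same filter, ordered by `ordinal_index` instead, is shown below using Python's standard `sqlite3` module. The in-memory table is a cut-down stand-in holding three rows and four columns copied from the listing; it is an assumption for illustration, not the real database file.

```python
import sqlite3

# Cut-down stand-in for the introns table (illustrative only).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    " id INTEGER PRIMARY KEY,"
    " transcript_id INTEGER,"
    " ordinal_index INTEGER,"
    " length INTEGER)"
)

# Three rows copied from the listing: (id, transcript_id, ordinal_index, length).
sample_rows = [
    (48953818, 9059398, 2, 1418),
    (48953819, 9059398, 3, 222),
    (48961109, 9059398, 1, 801),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", sample_rows)

# Same filter as the page (transcript_id = 9059398), but in transcript order.
ordered = conn.execute(
    "SELECT ordinal_index, id, length FROM introns"
    " WHERE transcript_id = ? ORDER BY ordinal_index",
    (9059398,),
).fetchall()

for ordinal_index, intron_id, length in ordered:
    print(ordinal_index, intron_id, length)
```

Passing the transcript id as a bound parameter rather than formatting it into the SQL string keeps the query reusable for other transcripts.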
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
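To see how one of these indexes would serve a per-transcript lookup, the sketch below recreates the table and the `idx_introns_transcript_id` index in an empty in-memory SQLite database and asks the planner for its strategy. The in-memory database and the `plan_details` variable are assumptions for illustration; the exact wording of the plan text varies between SQLite versions, so the code only prints it.

```python
import sqlite3

# Table definition and one index, copied from the schema above.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would execute the per-transcript filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (9059398,),
).fetchall()

# The human-readable description is the fourth column of each plan row.
plan_details = [row[3] for row in plan]
for detail in plan_details:
    print(detail)
```

The printed plan should name `idx_introns_transcript_id`, indicating an index search rather than a full table scan. The referenced `transcripts` and `genomes` tables are not created here; SQLite accepts the foreign-key clauses regardless and does not enforce them unless `PRAGMA foreign_keys` is turned on.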