introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 9059391
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48953614 | GT-AG | 0 | 9.197130461542797e-05 | 24501 | rna-XM_036567584.1 9059391 | 2 | 29044646 | 29069146 | Colossoma macropomum 42526 | CAG|GTATGTGCAG...ACATTTTTATCT/TACATTTTTATC...TCCAG|TTG | 1 | 1 | 2.79 |
| 48953615 | GT-AG | 0 | 0.0017716345086302 | 20679 | rna-XM_036567584.1 9059391 | 3 | 29069348 | 29090026 | Colossoma macropomum 42526 | CAG|GTATGTTCCC...TTGTTCTTCACC/TTGTTCTTCACC...AGCAG|GAC | 1 | 1 | 6.317 |
| 48953616 | GT-AG | 0 | 1.000000099473604e-05 | 17712 | rna-XM_036567584.1 9059391 | 4 | 29090120 | 29107831 | Colossoma macropomum 42526 | ATG|GTGAGTGTGT...CATTTCTCACTC/CCATTTCTCACT...CTCAG|AAC | 1 | 1 | 7.949 |
| 48953617 | GT-AG | 0 | 0.0006030137179804 | 1835 | rna-XM_036567584.1 9059391 | 5 | 29107954 | 29109788 | Colossoma macropomum 42526 | AAG|GTATTTAATT...TGTGCATTGATT/TGTGCATTGATT...TGTAG|TGC | 0 | 1 | 10.089 |
| 48953618 | GT-AG | 0 | 1.000000099473604e-05 | 6838 | rna-XM_036567584.1 9059391 | 6 | 29109976 | 29116813 | Colossoma macropomum 42526 | TAG|GTACTACAAG...CTCTCTCTCTCT/CTCTCTCTCTCT...TCTAG|CCA | 1 | 1 | 13.371 |
| 48953619 | GT-AG | 0 | 1.000000099473604e-05 | 15704 | rna-XM_036567584.1 9059391 | 7 | 29117114 | 29132817 | Colossoma macropomum 42526 | CAG|GTAAGAGACA...CTCACTTTATCT/CTTTCTCTCACC...TTTAG|CTA | 1 | 1 | 18.635 |
| 48953620 | GT-AG | 0 | 0.000351513348141 | 2937 | rna-XM_036567584.1 9059391 | 8 | 29132938 | 29135874 | Colossoma macropomum 42526 | AAG|GTACTTTTAC...CTTGCCTGACCT/ACTTGCCTGACC...CTTAG|CAC | 1 | 1 | 20.74 |
| 48953621 | GT-AG | 0 | 1.3566961773887418e-05 | 862 | rna-XM_036567584.1 9059391 | 9 | 29136157 | 29137018 | Colossoma macropomum 42526 | AAA|GTAAGCAAAG...GATTTGTTAATG/GATTTGTTAATG...TGCAG|GTG | 1 | 1 | 25.689 |
| 48953622 | GT-AG | 0 | 1.000000099473604e-05 | 7697 | rna-XM_036567584.1 9059391 | 10 | 29137171 | 29144867 | Colossoma macropomum 42526 | AAG|GTAAAGTACC...CTCTCTTTTTCC/TCTTTTTCCACT...TTTAG|GTG | 0 | 1 | 28.356 |
| 48953623 | GT-AG | 0 | 0.0004737795394804 | 3880 | rna-XM_036567584.1 9059391 | 11 | 29145100 | 29148979 | Colossoma macropomum 42526 | AAG|GTACCAGTCT...CTTCTCTGAAAC/CACTGATTTATT...ACCAG|AGA | 1 | 1 | 32.427 |
| 48953624 | GT-AG | 0 | 1.000000099473604e-05 | 8008 | rna-XM_036567584.1 9059391 | 12 | 29149147 | 29157154 | Colossoma macropomum 42526 | CAG|GTGAGTAGCA...CATTCTTTGATT/CATTCTTTGATT...TGAAG|GAG | 0 | 1 | 35.357 |
| 48953625 | GT-AG | 0 | 1.000000099473604e-05 | 3409 | rna-XM_036567584.1 9059391 | 13 | 29157763 | 29161171 | Colossoma macropomum 42526 | CAG|GTGAGCTATG...GCTCTCTTGATT/ATTAGTCTGATT...TTTAG|AAG | 2 | 1 | 46.026 |
| 48953626 | GT-AG | 0 | 1.000000099473604e-05 | 1147 | rna-XM_036567584.1 9059391 | 14 | 29161281 | 29162427 | Colossoma macropomum 42526 | AAG|GTAAACACAA...TGTGGCTTAATT/TGTGGCTTAATT...CACAG|GGT | 0 | 1 | 47.938 |
| 48953627 | GT-AG | 0 | 1.000000099473604e-05 | 2399 | rna-XM_036567584.1 9059391 | 15 | 29162551 | 29164949 | Colossoma macropomum 42526 | AAG|GTCAGTAGAC...AGGTTTTTATCC/AAGGTTTTTATC...TGCAG|GTG | 0 | 1 | 50.097 |
| 48953628 | GC-AG | 0 | 1.000000099473604e-05 | 320 | rna-XM_036567584.1 9059391 | 16 | 29165187 | 29165506 | Colossoma macropomum 42526 | AAG|GCAAGTTACC...TATTTTTGGATG/TATTTTTGGATG...CTCAG|GTG | 0 | 1 | 54.255 |
| 48953629 | GT-AG | 0 | 1.000000099473604e-05 | 3547 | rna-XM_036567584.1 9059391 | 17 | 29165693 | 29169239 | Colossoma macropomum 42526 | CAG|GTCAGCTAAC...TTATATTTAATG/TTTATATTTAAT...GGCAG|GTG | 0 | 1 | 57.519 |
| 48953630 | GT-AG | 0 | 0.0001594097904723 | 6912 | rna-XM_036567584.1 9059391 | 18 | 29169319 | 29176230 | Colossoma macropomum 42526 | CAG|GTACACTGCC...GTATGTTTGGTT/TTATTGTTCAGT...TCTAG|CTC | 1 | 1 | 58.905 |
| 48953631 | GT-AG | 0 | 1.000000099473604e-05 | 557 | rna-XM_036567584.1 9059391 | 19 | 29176365 | 29176921 | Colossoma macropomum 42526 | CAG|GTGAGGCATA...CTTTTTTTCTCT/TTATTGTTCACT...GTCAG|ATC | 0 | 1 | 61.256 |
| 48953632 | GT-AG | 0 | 1.000000099473604e-05 | 311 | rna-XM_036567584.1 9059391 | 20 | 29177046 | 29177356 | Colossoma macropomum 42526 | AAG|GTAAAAGGCT...TTATTTCTATTG/GTTGTAGTTATT...CCCAG|GTG | 1 | 1 | 63.432 |
| 48953633 | GT-AG | 0 | 1.000000099473604e-05 | 409 | rna-XM_036567584.1 9059391 | 21 | 29177533 | 29177941 | Colossoma macropomum 42526 | ATT|GTGAGTTATC...GAGTGTTTGAAA/GAGTGTTTGAAA...TGTAG|GTG | 0 | 1 | 66.52 |
| 48953634 | GT-AG | 0 | 1.000000099473604e-05 | 861 | rna-XM_036567584.1 9059391 | 22 | 29178075 | 29178935 | Colossoma macropomum 42526 | TAG|GTGAATATAA...TCTCTCTTAACT/CACTGTTTCACT...ATAAG|GCT | 1 | 1 | 68.854 |
| 48953635 | GT-AG | 0 | 4.209663360327989e-05 | 101 | rna-XM_036567584.1 9059391 | 23 | 29178966 | 29179066 | Colossoma macropomum 42526 | CAG|GTACAGTTCT...CTCACCTTCTCT/GGACGACTCACC...CCTAG|GTA | 1 | 1 | 69.381 |
| 48953636 | GT-AG | 0 | 1.000000099473604e-05 | 790 | rna-XM_036567584.1 9059391 | 24 | 29179159 | 29179948 | Colossoma macropomum 42526 | AAG|GTAAAGACAA...TTGTTCTTAATG/CTTGTTCTTAAT...TTCAG|AGA | 0 | 1 | 70.995 |
| 48953637 | GT-AG | 0 | 1.000000099473604e-05 | 1118 | rna-XM_036567584.1 9059391 | 25 | 29180096 | 29181213 | Colossoma macropomum 42526 | AAT|GTGAGCATCA...TGTATGTTAACC/TGTATGTTAACC...CCTAG|GAC | 0 | 1 | 73.574 |
| 48953638 | GT-AG | 0 | 1.000000099473604e-05 | 199 | rna-XM_036567584.1 9059391 | 26 | 29181315 | 29181513 | Colossoma macropomum 42526 | CAG|GTACAACCAA...TCGCTTTTACCT/ACCTATTTAATT...TACAG|GCA | 2 | 1 | 75.347 |
| 48961106 | GT-AG | 0 | 0.0026422268396515 | 12170 | rna-XM_036567584.1 9059391 | 1 | 29032368 | 29044537 | Colossoma macropomum 42526 | CAG|GTATTTCTTC...TCATTCTGAACT/TACATTTTCATT...CACAG|GGT | 0 | 1.72 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);