introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
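To make the column semantics concrete, here is a minimal sketch that loads two rows copied from the table on this page into an in-memory SQLite database. The trimmed-down table definition and the helper names (`load`, `length_mismatches`, `minor_ids`) are illustrative assumptions, not part of the database. The relation `length = end - start + 1` is an observation that holds for the listed rows, not a documented guarantee.

```python
import sqlite3

# Two rows copied from this page:
# (id, dinucleotide_pair, is_minor, score, length, start, end, phase)
ROWS = [
    (95730299, "GT-AG", 0, 0.0002717540085994, 87, 22224536, 22224622, 1),
    (95730328, "AT-AC", 1, 99.9999963244002, 690, 22161390, 22162079, 2),
]


def load(rows):
    """Create a reduced, hypothetical copy of the introns table in memory."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        'CREATE TABLE introns ('
        'id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, is_minor INTEGER, '
        'score REAL, length INTEGER, "start" INTEGER, "end" INTEGER, '
        'phase INTEGER)'
    )
    conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?)", rows)
    return conn


def length_mismatches(conn):
    """Return ids whose length differs from end - start + 1 (inclusive coordinates)."""
    cur = conn.execute(
        'SELECT id FROM introns WHERE length != "end" - "start" + 1 ORDER BY id'
    )
    return [row[0] for row in cur]


def minor_ids(conn):
    """Return ids of introns flagged as minor (is_minor = 1)."""
    cur = conn.execute("SELECT id FROM introns WHERE is_minor = 1 ORDER BY id")
    return [row[0] for row in cur]
```

The column names `start` and `end` are double-quoted in the SQL because `END` is an SQLite keyword; quoting both keeps the statements unambiguous.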
35 rows where transcript_id = 17888357
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 95730299 | GT-AG | 0 | 0.0002717540085994 | 87 | rna-XM_047272005.1 17888357 | 1 | 22224536 | 22224622 | Hydra vulgaris 6087 | AAA\|GTTGTTTTTT...GTTTTTTTAAAT/GTTTTTTTAAAT...TACAG\|TTA | 1 | 1 | 6.536 |
| 95730300 | GT-AG | 0 | 1.000000099473604e-05 | 5458 | rna-XM_047272005.1 17888357 | 2 | 22218934 | 22224391 | Hydra vulgaris 6087 | TAG\|GTTAGTTATT...TATTTATTGAAT/ATTGTATTTATT...TTTAG\|CAC | 1 | 1 | 10.301 |
| 95730301 | GT-AG | 0 | 1.000000099473604e-05 | 3184 | rna-XM_047272005.1 17888357 | 3 | 22215656 | 22218839 | Hydra vulgaris 6087 | CAG\|GTAAATATTT...ATTTTTTTACAT/AGTTTTCTTATG...TTTAG\|GTG | 2 | 1 | 12.758 |
| 95730302 | GT-AG | 0 | 1.5877404325337822e-05 | 489 | rna-XM_047272005.1 17888357 | 4 | 22215119 | 22215607 | Hydra vulgaris 6087 | CAA\|GTAAGTTACA...AAGTATTTAATA/AAGTATTTAATA...ATTAG\|AAT | 2 | 1 | 14.013 |
| 95730303 | GT-AG | 0 | 0.0108093347772433 | 142 | rna-XM_047272005.1 17888357 | 5 | 22214895 | 22215036 | Hydra vulgaris 6087 | CAG\|GTATTTTGTT...AATATTTTAATT/AATATTTTAATT...TACAG\|TGT | 0 | 1 | 16.157 |
| 95730304 | GT-AG | 0 | 1.000000099473604e-05 | 5528 | rna-XM_047272005.1 17888357 | 6 | 22209289 | 22214816 | Hydra vulgaris 6087 | TCA\|GTAAGTAAAA...AAAATTTCAACT/AAAAAACTCATA...TTTAG\|ATA | 0 | 1 | 18.196 |
| 95730305 | GT-AG | 0 | 1.000000099473604e-05 | 184 | rna-XM_047272005.1 17888357 | 7 | 22209061 | 22209244 | Hydra vulgaris 6087 | AGG\|GTCAGTTGTA...TATATTGTAATG/AATGTATTTATA...TTTAG\|TTT | 2 | 1 | 19.346 |
| 95730306 | GT-AG | 0 | 1.000000099473604e-05 | 143 | rna-XM_047272005.1 17888357 | 8 | 22208866 | 22209008 | Hydra vulgaris 6087 | AAG\|GTTATGAACT...TGAATTTTAATT/TTTTAATTTATT...TACAG\|GAT | 0 | 1 | 20.706 |
| 95730307 | GT-AG | 0 | 1.000000099473604e-05 | 1872 | rna-XM_047272005.1 17888357 | 9 | 22206892 | 22208763 | Hydra vulgaris 6087 | CAG\|GTGAAGTTTA...GTTGTTTTATAC/CAATTATTTATA...TTTAG\|TTT | 0 | 1 | 23.373 |
| 95730308 | GT-AG | 0 | 0.6526661799058291 | 10282 | rna-XM_047272005.1 17888357 | 10 | 22196494 | 22206775 | Hydra vulgaris 6087 | TAC\|GTATGTTTGT...TTTATTTTAATT/TTTATTTTAATT...TTTAG\|TTA | 2 | 1 | 26.405 |
| 95730309 | GT-AG | 0 | 1.000000099473604e-05 | 3784 | rna-XM_047272005.1 17888357 | 11 | 22192624 | 22196407 | Hydra vulgaris 6087 | AAT\|GTAAGCCAAA...ATTTACATAATA/GTAGTATTTACA...AACAG\|TGG | 1 | 1 | 28.654 |
| 95730310 | GT-AG | 0 | 0.0004881134039676 | 98 | rna-XM_047272005.1 17888357 | 12 | 22192408 | 22192505 | Hydra vulgaris 6087 | TGA\|GTAAGCACAT...TCTATTTTAACA/TCTATTTTAACA...TGTAG\|ATA | 2 | 1 | 31.739 |
| 95730311 | GT-AG | 0 | 7.518428873316645e-05 | 85 | rna-XM_047272005.1 17888357 | 13 | 22192237 | 22192321 | Hydra vulgaris 6087 | GAA\|GTAAGTATTT...AAAACTTTAGCA/TTTGCACTTACA...TTCAG\|AAT | 1 | 1 | 33.987 |
| 95730312 | GT-AG | 0 | 5.086493626708184e-05 | 4564 | rna-XM_047272005.1 17888357 | 14 | 22187590 | 22192153 | Hydra vulgaris 6087 | AAG\|GTTTCAAAAT...GTTTCCATAGAT/TATGAAATCATT...TTTAG\|AAC | 0 | 1 | 36.157 |
| 95730313 | GT-AG | 0 | 4.496077299103612e-05 | 10977 | rna-XM_047272005.1 17888357 | 15 | 22176558 | 22187534 | Hydra vulgaris 6087 | CAT\|GTAAGATTTT...TATTTTTTATAT/TTATTTTTTATA...TTTAG\|TGT | 1 | 1 | 37.595 |
| 95730314 | GT-AG | 0 | 0.0015047157115461 | 1847 | rna-XM_047272005.1 17888357 | 16 | 22174643 | 22176489 | Hydra vulgaris 6087 | AAA\|GTATGTATAT...ATTACTCTGATT/ATTACTCTGATT...TTTAG\|AAA | 0 | 1 | 39.373 |
| 95730315 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_047272005.1 17888357 | 17 | 22174441 | 22174522 | Hydra vulgaris 6087 | GCT\|GTTAGTGATT...TAATGTTTAACT/TAATGTTTAACT...TTCAG\|GGC | 0 | 1 | 42.51 |
| 95730316 | GT-AG | 0 | 1.000000099473604e-05 | 228 | rna-XM_047272005.1 17888357 | 18 | 22174181 | 22174408 | Hydra vulgaris 6087 | AAA\|GTTAGTATTT...ACTCTTTTAATA/ACTCTTTTAATA...TCTAG\|GTG | 2 | 1 | 43.346 |
| 95730317 | GT-AG | 0 | 1.000000099473604e-05 | 334 | rna-XM_047272005.1 17888357 | 19 | 22173721 | 22174054 | Hydra vulgaris 6087 | TTG\|GTGAGCTTTA...TCTACCATAATC/TTATTTGTTACC...TGCAG\|GCG | 2 | 1 | 46.641 |
| 95730318 | GT-AG | 0 | 0.0001002585197504 | 114 | rna-XM_047272005.1 17888357 | 20 | 22173426 | 22173539 | Hydra vulgaris 6087 | CAG\|GTAAATTTTT...AGATTTTTATTT/TTTTTATTTATT...TTAAG\|CCT | 0 | 1 | 51.373 |
| 95730319 | GT-AG | 0 | 0.1587787945502494 | 92 | rna-XM_047272005.1 17888357 | 21 | 22173311 | 22173402 | Hydra vulgaris 6087 | AGA\|GTATGTTTTA...TTATTTTCAATT/ATTATTTTCAAT...TGCAG\|TTG | 2 | 1 | 51.974 |
| 95730320 | GT-AG | 0 | 1.000000099473604e-05 | 103 | rna-XM_047272005.1 17888357 | 22 | 22173079 | 22173181 | Hydra vulgaris 6087 | TCG\|GTTAATTCAA...ATCTTTTTGAAA/ATTTTGTTCATC...TTTAG\|AAC | 2 | 1 | 55.346 |
| 95730321 | GT-AG | 0 | 0.0021219412253455 | 8142 | rna-XM_047272005.1 17888357 | 23 | 22164759 | 22172900 | Hydra vulgaris 6087 | ATG\|GTTTGTTTTC...GTTTTTTTATTT/TTTTATTTCATT...TCCAG\|AAT | 0 | 1 | 60.0 |
| 95730322 | GT-AG | 0 | 0.0024018574035391 | 108 | rna-XM_047272005.1 17888357 | 24 | 22164582 | 22164689 | Hydra vulgaris 6087 | AAG\|GTATACAAAT...TGAATCTTATAG/ATGAATCTTATA...TTCAG\|CAA | 0 | 1 | 61.804 |
| 95730323 | GT-AG | 0 | 1.000000099473604e-05 | 799 | rna-XM_047272005.1 17888357 | 25 | 22163679 | 22164477 | Hydra vulgaris 6087 | CTG\|GTAAAAATGT...TTTCCTCTAATT/TTTCCTCTAATT...TTTAG\|GTG | 2 | 1 | 64.523 |
| 95730324 | GT-AG | 0 | 0.0004199663641795 | 117 | rna-XM_047272005.1 17888357 | 26 | 22163474 | 22163590 | Hydra vulgaris 6087 | GAG\|GTTTTTACTT...TATTTCTTAATT/TTAATTTTTATT...CAAAG\|GAG | 0 | 1 | 66.824 |
| 95730325 | GT-AG | 0 | 0.0009782704888851 | 310 | rna-XM_047272005.1 17888357 | 27 | 22163088 | 22163397 | Hydra vulgaris 6087 | TAC\|GTAAGTTTAA...ATTGTTTTGACA/TATTGTTTAATT...TTTAG\|AAA | 1 | 1 | 68.81 |
| 95730326 | GT-AG | 0 | 1.000000099473604e-05 | 132 | rna-XM_047272005.1 17888357 | 28 | 22162867 | 22162998 | Hydra vulgaris 6087 | CAG\|GTATTGCAAA...ATATTTTTTCCT/TATATACACACA...TAAAG\|GAA | 0 | 1 | 71.137 |
| 95730327 | GT-AG | 0 | 0.0002081587541643 | 536 | rna-XM_047272005.1 17888357 | 29 | 22162177 | 22162712 | Hydra vulgaris 6087 | CCA\|GTAATATTTA...TATATTTTAAAA/TATATTTTAAAA...TATAG\|ATT | 1 | 1 | 75.163 |
| 95730328 | AT-AC | 1 | 99.9999963244002 | 690 | rna-XM_047272005.1 17888357 | 30 | 22161390 | 22162079 | Hydra vulgaris 6087 | AAA\|ATATCTTTTC...TATTTCTTAATT/TATTTCTTAATT...CTCAC\|ATA | 2 | 1 | 77.699 |
| 95730329 | GT-AG | 0 | 6.295210887586633e-05 | 14851 | rna-XM_047272005.1 17888357 | 31 | 22146368 | 22161218 | Hydra vulgaris 6087 | AAG\|GTAAACTTAT...AAATTATTATTG/AAAATTATTATT...TACAG\|GTG | 2 | 1 | 82.17 |
| 95730330 | GT-AG | 0 | 0.016821128411422 | 5708 | rna-XM_047272005.1 17888357 | 32 | 22140557 | 22146264 | Hydra vulgaris 6087 | ACG\|GTATATTGAA...ACTTTTTTGAAT/ACTTTTTTGAAT...TTTAG\|ATT | 0 | 1 | 84.863 |
| 95730331 | GT-AG | 0 | 3.334960554130257e-05 | 2399 | rna-XM_047272005.1 17888357 | 33 | 22138043 | 22140441 | Hydra vulgaris 6087 | AAT\|GTAAGTAATT...TTTTTTTTAATT/TTTTTTTTAATT...TATAG\|CGG | 1 | 1 | 87.869 |
| 95730332 | GT-AG | 0 | 1.000000099473604e-05 | 131 | rna-XM_047272005.1 17888357 | 34 | 22137664 | 22137794 | Hydra vulgaris 6087 | CAG\|GTTTAATTAC...GCTGTATTAATT/GCTGTATTAATT...CTTAG\|GAC | 0 | 1 | 94.353 |
| 95730333 | GT-AG | 0 | 0.0002672154172272 | 115 | rna-XM_047272005.1 17888357 | 35 | 22137385 | 22137499 | Hydra vulgaris 6087 | TAG\|GTAAACATTT...ATGTTTTTATTT/CATGTTTTTATT...TTCAG\|GCA | 2 | 1 | 98.641 |
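The filter behind this listing (`transcript_id = 17888357`) can be reproduced with a parameterized query. The sketch below uses a handful of the rows above in an in-memory database; the reduced table layout and the function names are assumptions made for illustration. It also checks that start coordinates fall as `ordinal_index` rises, which is consistent with a transcript annotated on the reverse strand. That reading is an inference, since this table does not record strand.

```python
import sqlite3

TRANSCRIPT_ID = 17888357

# Subset of the rows listed above: (ordinal_index, start, end, is_minor, score)
SAMPLE = [
    (1, 22224536, 22224622, 0, 0.0002717540085994),
    (2, 22218934, 22224391, 0, 1.000000099473604e-05),
    (29, 22162177, 22162712, 0, 0.0002081587541643),
    (30, 22161390, 22162079, 1, 99.9999963244002),
    (31, 22146368, 22161218, 0, 6.295210887586633e-05),
    (35, 22137385, 22137499, 0, 0.0002672154172272),
]


def build_db(sample):
    """Load the sample into a reduced, hypothetical introns table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        'CREATE TABLE introns (transcript_id INTEGER, ordinal_index INTEGER, '
        '"start" INTEGER, "end" INTEGER, is_minor INTEGER, score REAL)'
    )
    conn.executemany(
        "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
        [(TRANSCRIPT_ID, *row) for row in sample],
    )
    return conn


def minor_ordinals(conn, transcript_id):
    """Ordinal positions of introns flagged as minor in one transcript."""
    cur = conn.execute(
        "SELECT ordinal_index FROM introns "
        "WHERE transcript_id = ? AND is_minor = 1 ORDER BY ordinal_index",
        (transcript_id,),
    )
    return [row[0] for row in cur]


def coordinates_descend(conn, transcript_id):
    """True if start positions strictly decrease as ordinal_index increases."""
    cur = conn.execute(
        'SELECT "start" FROM introns WHERE transcript_id = ? ORDER BY ordinal_index',
        (transcript_id,),
    )
    starts = [row[0] for row in cur]
    return all(a > b for a, b in zip(starts, starts[1:]))
```

In the sample, the only intron flagged as minor is the AT-AC intron at ordinal position 30, matching the single `is_minor = 1` row in the table.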
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
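As a rough check of what these indexes are for, the following sketch recreates the table and two of its indexes in an empty in-memory database and asks SQLite for its query plan on a `transcript_id` lookup. The helper `query_plan` is a hypothetical name introduced here. The exact wording of the plan text differs between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# Schema copied from above; only two of the seven indexes are recreated here.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
"""


def query_plan(conn, sql, params=()):
    """Return the human-readable detail strings from EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]  # the detail text is the last column


def uses_index(plan, index_name):
    """True if any step of the plan mentions the given index."""
    return any(index_name in detail for detail in plan)


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)  # the referenced tables need not exist to create this one
plan = query_plan(
    conn, "SELECT * FROM introns WHERE transcript_id = ?", (17888357,)
)
```

Calling `uses_index(plan, "idx_introns_transcript_id")` then indicates whether the lookup is served by the index rather than a full table scan.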