introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
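The column descriptions above map naturally onto a typed record. The following is a minimal sketch, not part of the database itself: the `IntronRow` class and its helper methods are hypothetical names introduced here, and the phase labels follow the conventional reading of intron phase (0 = between codons, 1 = after the first codon base, 2 = after the second), which this page does not spell out. The example values are taken from the first row of the table for transcript 17888373.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class IntronRow:
    """Hypothetical typed view of one row of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: int
    score: float
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]  # None (SQL NULL) for introns outside the coding sequence
    in_cds: int
    relative_position: float

    @property
    def is_minor_intron(self) -> bool:
        # is_minor is stored as an INTEGER flag: 1 = minor, 0 = not minor
        return self.is_minor == 1

    def terminal_dinucleotides(self) -> Tuple[str, str]:
        # "GT-AG" -> ("GT", "AG"): first and last two bases of the intron
        first, last = self.dinucleotide_pair.split("-")
        return first, last

    def phase_label(self) -> str:
        # Assumed (conventional) interpretation of the phase column
        if self.phase is None:
            return "outside coding sequence"
        labels = {
            0: "between codons",
            1: "after codon position 1",
            2: "after codon position 2",
        }
        return labels[self.phase]


# Values copied from the first intron of transcript 17888373
first = IntronRow(
    95730529, "GT-AG", 0, 0.0006879506415602, 162, 17888373, 1,
    21555635, 21555796, 6087,
    "AAG|GTTTTTTTTT...TTTTTTTTCATA/TTTTTTTTCATA...TCAAG|GTG",
    0, 1, 2.124,
)
print(first.terminal_dinucleotides(), first.is_minor_intron, first.phase_label())
```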
25 rows where transcript_id = 17888373
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 95730529 | GT-AG | 0 | 0.0006879506415602 | 162 | rna-XM_047271874.1 17888373 | 1 | 21555635 | 21555796 | Hydra vulgaris 6087 | AAG\|GTTTTTTTTT...TTTTTTTTCATA/TTTTTTTTCATA...TCAAG\|GTG | 0 | 1 | 2.124 |
| 95730530 | GT-AG | 0 | 1.000000099473604e-05 | 8099 | rna-XM_047271874.1 17888373 | 2 | 21547413 | 21555511 | Hydra vulgaris 6087 | CAG\|GTTAGTCATT...AAATTTTTAAAC/AAATTTTTAAAC...ATTAG\|ATT | 0 | 1 | 5.752 |
| 95730531 | GT-AG | 0 | 0.0001606231811354 | 92 | rna-XM_047271874.1 17888373 | 3 | 21547201 | 21547292 | Hydra vulgaris 6087 | GAG\|GTAATTTTTC...CAAGTTTTAAAA/CTGTTTGTAACT...TTTAG\|ATA | 0 | 1 | 9.292 |
| 95730532 | GT-AG | 0 | 8.257693084804687e-05 | 133 | rna-XM_047271874.1 17888373 | 4 | 21546885 | 21547017 | Hydra vulgaris 6087 | ATG\|GTAATTTTTT...TTTTTTTTCACC/TTTTTTTTCACC...TTTAG\|GAT | 0 | 1 | 14.69 |
| 95730533 | GT-AG | 0 | 1.000000099473604e-05 | 1198 | rna-XM_047271874.1 17888373 | 5 | 21545521 | 21546718 | Hydra vulgaris 6087 | AAG\|GTTGTTTGTT...AATCTTTTATTT/TGTTATTTAATT...TTTAG\|GAA | 1 | 1 | 19.587 |
| 95730534 | GT-AG | 0 | 0.0024385672078799 | 8670 | rna-XM_047271874.1 17888373 | 6 | 21536710 | 21545379 | Hydra vulgaris 6087 | CTG\|GTATATACAA...ATTTTTTTATTT/TTTTTATTTATT...ATTAG\|CAA | 1 | 1 | 23.746 |
| 95730535 | GT-AG | 0 | 9.024429830587954e-05 | 102 | rna-XM_047271874.1 17888373 | 7 | 21536456 | 21536557 | Hydra vulgaris 6087 | AAG\|GTTTTTATTT...ATATTTTTACTA/TATATTTTTACT...TTTAG\|GGT | 0 | 1 | 28.23 |
| 95730536 | GT-AG | 0 | 1.000000099473604e-05 | 1463 | rna-XM_047271874.1 17888373 | 8 | 21534872 | 21536334 | Hydra vulgaris 6087 | TAG\|GTAAGTCCTA...TAAAACTTAGAA/CAGTTTTCTATT...TTTAG\|CAA | 1 | 1 | 31.799 |
| 95730537 | GT-AG | 0 | 0.0001005892483042 | 2452 | rna-XM_047271874.1 17888373 | 9 | 21532370 | 21534821 | Hydra vulgaris 6087 | AAT\|GTAAGTATTT...TGAATTTTAACG/TGAATTTTAACG...TTTAG\|AAG | 0 | 1 | 33.274 |
| 95730538 | GT-AG | 0 | 1.000000099473604e-05 | 9355 | rna-XM_047271874.1 17888373 | 10 | 21522913 | 21532267 | Hydra vulgaris 6087 | ATG\|GTAATTTTAT...AGATAATTATTT/AAGATAATTATT...TTAAG\|ATT | 0 | 1 | 36.283 |
| 95730539 | GT-AG | 0 | 1.000000099473604e-05 | 997 | rna-XM_047271874.1 17888373 | 11 | 21521800 | 21522796 | Hydra vulgaris 6087 | TCG\|GTTATTTTTT...TTTTTTTTCATT/TTTTTTTTCATT...AATAG\|GCT | 2 | 1 | 39.705 |
| 95730540 | GT-AG | 0 | 0.000136170647529 | 7188 | rna-XM_047271874.1 17888373 | 12 | 21514431 | 21521618 | Hydra vulgaris 6087 | CAG\|GTGTGTTTGA...TCTTTTTTATTT/ATTTTATTTATA...ATCAG\|TGG | 0 | 1 | 45.044 |
| 95730541 | GT-AG | 0 | 0.0002678007827397 | 4505 | rna-XM_047271874.1 17888373 | 13 | 21509803 | 21514307 | Hydra vulgaris 6087 | GAG\|GTTTGTTTAA...TAATTTTTGAAT/TAGAATTTAATT...AATAG\|GCA | 0 | 1 | 48.673 |
| 95730542 | GT-AG | 0 | 0.0001142659120695 | 111 | rna-XM_047271874.1 17888373 | 14 | 21509543 | 21509653 | Hydra vulgaris 6087 | GTT\|GTGAGTTTTT...TTATTCTTAATC/TTTATTCTTAAT...TTTAG\|GCA | 2 | 1 | 53.068 |
| 95730543 | GT-AG | 0 | 0.0337857276407985 | 1840 | rna-XM_047271874.1 17888373 | 15 | 21507462 | 21509301 | Hydra vulgaris 6087 | GAG\|GTATTTTTAT...AGAGTCTTAATT/AATTTTTTTATA...TTTAG\|GTA | 0 | 1 | 60.177 |
| 95730544 | GT-AG | 0 | 0.0114953632430512 | 693 | rna-XM_047271874.1 17888373 | 16 | 21506666 | 21507358 | Hydra vulgaris 6087 | ATG\|GTATTTCTTT...TTAATCTTATAA/TTTAATCTTATA...TATAG\|TTG | 1 | 1 | 63.215 |
| 95730545 | GT-AG | 0 | 1.000000099473604e-05 | 5053 | rna-XM_047271874.1 17888373 | 17 | 21501458 | 21506510 | Hydra vulgaris 6087 | AAG\|GTAATCAAAA...ATGCTTTTAGTA/TTGTGTTTAATT...TTTAG\|AAT | 0 | 1 | 67.788 |
| 95730546 | GT-AG | 0 | 1.2463334763268188e-05 | 1839 | rna-XM_047271874.1 17888373 | 18 | 21499455 | 21501293 | Hydra vulgaris 6087 | AAG\|GTCTAAATTA...TTTTCATTAATT/TAATTTTTCATT...TATAG\|TTT | 2 | 1 | 72.625 |
| 95730547 | GT-AG | 0 | 0.0003864983137899 | 147 | rna-XM_047271874.1 17888373 | 19 | 21499155 | 21499301 | Hydra vulgaris 6087 | AAG\|GTACGTTTTA...ATTATTATAATG/ATTATAATGATA...ACTAG\|ATG | 2 | 1 | 77.139 |
| 95730548 | GT-AG | 0 | 0.0002333001265459 | 72 | rna-XM_047271874.1 17888373 | 20 | 21498985 | 21499056 | Hydra vulgaris 6087 | CAA\|GTAAGTTTTT...CTGCTTTTATAT/ATAATAGTAATA...TTTAG\|ATG | 1 | 1 | 80.029 |
| 95730549 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_047271874.1 17888373 | 21 | 21498781 | 21498862 | Hydra vulgaris 6087 | CAG\|GTTAGGTAAA...TTTTTTTTAACT/TTTTTTTTAACT...TTAAG\|ATT | 0 | 1 | 83.628 |
| 95730550 | GT-AG | 0 | 0.0059377756096143 | 156 | rna-XM_047271874.1 17888373 | 22 | 21498538 | 21498693 | Hydra vulgaris 6087 | GAA\|GTAAGCTTTC...CTTTTTTTATTG/ACTTTTTTTATT...TTTAG\|CCT | 0 | 1 | 86.195 |
| 95730551 | GT-AG | 0 | 2.6237878737215195e-05 | 353 | rna-XM_047271874.1 17888373 | 23 | 21497993 | 21498345 | Hydra vulgaris 6087 | CGC\|GTAAGAATTA...TTTTCGTTAACT/TAATATTTAACT...ATTAG\|AGT | 0 | 1 | 91.858 |
| 95730552 | GT-AG | 0 | 1.000000099473604e-05 | 155 | rna-XM_047271874.1 17888373 | 24 | 21497736 | 21497890 | Hydra vulgaris 6087 | AAG\|GTGATAAAAA...TTACTTTTATTA/TAATTATTTATT...TTTAG\|ATT | 0 | 1 | 94.867 |
| 95730553 | GT-AG | 0 | 0.0032056233636776 | 94 | rna-XM_047271874.1 17888373 | 25 | 21497582 | 21497675 | Hydra vulgaris 6087 | AAG\|GTCTTTTTTT...TGCGTTTTGACC/TTATAATTAATT...AATAG\|ATC | 0 | 1 | 96.637 |
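The coordinate columns in the rows above can be sanity-checked with a little arithmetic. The sketch below uses four rows copied from the table (introns 1, 2, 3, and 25) and checks two things: that `length` equals `end - start + 1` (i.e. the coordinates behave as inclusive on both ends), and that `start` decreases as `ordinal_index` increases. The second observation would be consistent with a transcript annotated on the reverse strand, but that is an inference; the table itself does not state the strand. Variable names are illustrative only.

```python
# (ordinal_index, start, end, length) copied from the table above
rows = [
    (1, 21555635, 21555796, 162),
    (2, 21547413, 21555511, 8099),
    (3, 21547201, 21547292, 92),
    (25, 21497582, 21497675, 94),
]

# Inclusive coordinates: an intron spanning start..end covers end - start + 1 bases
lengths_consistent = all(end - start + 1 == length for _, start, end, length in rows)

# Order by position within the transcript, then compare genomic start positions
starts_by_ordinal = [start for _, start, _, _ in sorted(rows)]
starts_descending = all(a > b for a, b in zip(starts_by_ordinal, starts_by_ordinal[1:]))

print("length == end - start + 1 for all rows:", lengths_consistent)
print("start decreases with ordinal_index:", starts_descending)
```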
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
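As a rough illustration of how the schema and the `idx_introns_transcript_id` index support the "rows where transcript_id = 17888373" view, the sketch below rebuilds a trimmed copy of the table in an in-memory SQLite database using Python's standard `sqlite3` module. It is an assumption-laden stand-in, not the real database: the foreign key clauses are omitted because the `transcripts` and `genomes` tables are not shown here, only one index is created, and only two rows (copied from the table above) are loaded.

```python
import sqlite3

# Trimmed copy of the schema: foreign keys omitted (referenced tables not shown),
# and only the index exercised by the query below is created.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

# Two rows copied from the table above, deliberately inserted out of order
ROWS = [
    (95730530, "GT-AG", 0, 1.000000099473604e-05, 8099, 17888373, 2,
     21547413, 21555511, 6087,
     "CAG|GTTAGTCATT...AAATTTTTAAAC/AAATTTTTAAAC...ATTAG|ATT", 0, 1, 5.752),
    (95730529, "GT-AG", 0, 0.0006879506415602, 162, 17888373, 1,
     21555635, 21555796, 6087,
     "AAG|GTTTTTTTTT...TTTTTTTTCATA/TTTTTTTTCATA...TCAAG|GTG", 0, 1, 2.124),
]

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany("INSERT INTO introns VALUES (" + ",".join("?" * 14) + ")", ROWS)

QUERY = """
SELECT ordinal_index, start, "end", length
FROM introns
WHERE transcript_id = ?
ORDER BY ordinal_index
"""

# Fetch the introns of one transcript in transcript order
ordinals = [row[0] for row in conn.execute(QUERY, (17888373,))]

# Ask SQLite how it would run the query; the detail text is the 4th column
plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (17888373,)).fetchall()
plan_text = " ".join(row[3] for row in plan)

print(ordinals)
print(plan_text)
conn.close()
```

The exact wording of the query plan varies between SQLite versions, so the sketch only prints it; the point of interest is whether the index name appears in the plan rather than a full table scan.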