introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotides of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is classified as a minor intron, 0 otherwise
- score: REAL, probability that the intron is minor, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron within the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, 1 if the intron lies within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript, as a percentage of coding length
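As a rough illustration of the column semantics listed above, the sketch below maps one row onto a typed Python record. The `Intron` class and the `intron_from_row` helper are hypothetical names introduced here, not part of the database or any client library. Treating `relative_position` as nullable is an assumption; only `phase` is documented as possibly NULL.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical in-memory record mirroring the introns columns."""
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 0/1
    score: float              # percentage, 0-100
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # None for introns outside the coding sequence
    in_cds: bool              # stored as INTEGER 0/1
    relative_position: Optional[float]  # assumed nullable (not stated in the source)


def intron_from_row(row: Mapping[str, object]) -> Intron:
    """Convert a column-name mapping (e.g. a dict or sqlite3.Row) to an Intron."""
    phase = row["phase"]
    if phase not in (None, 0, 1, 2):
        raise ValueError(f"unexpected phase: {phase!r}")
    rel = row["relative_position"]
    return Intron(
        id=int(row["id"]),
        dinucleotide_pair=str(row["dinucleotide_pair"]),
        is_minor=bool(row["is_minor"]),
        score=float(row["score"]),
        length=int(row["length"]),
        transcript_id=int(row["transcript_id"]),
        ordinal_index=int(row["ordinal_index"]),
        start=int(row["start"]),
        end=int(row["end"]),
        taxonomy_id=int(row["taxonomy_id"]),
        scored_motifs=str(row["scored_motifs"]),
        phase=None if phase is None else int(phase),
        in_cds=bool(row["in_cds"]),
        relative_position=None if rel is None else float(rel),
    )


# Values copied from intron 108814339 on this page.
first_row = {
    "id": 108814339, "dinucleotide_pair": "GT-AG", "is_minor": 0,
    "score": 1.000000099473604e-05, "length": 25928,
    "transcript_id": 20309383, "ordinal_index": 1,
    "start": 20257184, "end": 20283111, "taxonomy_id": 7654,
    "scored_motifs": "CGG|GTAAGTAACG...ATTGTCTTATTT/CATTGTCTTATT...TCCAG|GTT",
    "phase": 1, "in_cds": 1, "relative_position": 1.295,
}
example = intron_from_row(first_row)
```

Converting the 0/1 flags to `bool` and keeping NULL as `None` keeps downstream code from confusing "phase 0" with "no phase".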
24 rows where transcript_id = 20309383
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 108814339 | GT-AG | 0 | 1.000000099473604e-05 | 25928 | rna-XM_041600161.1 20309383 | 1 | 20257184 | 20283111 | Lytechinus variegatus 7654 | CGG\|GTAAGTAACG...ATTGTCTTATTT/CATTGTCTTATT...TCCAG\|GTT | 1 | 1 | 1.295 |
| 108814340 | GT-AG | 0 | 1.000000099473604e-05 | 716 | rna-XM_041600161.1 20309383 | 2 | 20256144 | 20256859 | Lytechinus variegatus 7654 | CAG\|GTAAGTAGCA...GGATCCTTATCG/TTTTTTGTCAAT...AATAG\|AGC | 1 | 1 | 7.288 |
| 108814341 | GT-AG | 0 | 1.000000099473604e-05 | 8050 | rna-XM_041600161.1 20309383 | 3 | 20247821 | 20255870 | Lytechinus variegatus 7654 | AAG\|GTAAGGAGTG...TCTCTCTCAATC/CTCTCTCTCAAT...TACAG\|AGC | 1 | 1 | 12.338 |
| 108814342 | GT-AG | 0 | 1.000000099473604e-05 | 2757 | rna-XM_041600161.1 20309383 | 4 | 20244918 | 20247674 | Lytechinus variegatus 7654 | ATG\|GTAAGAATAC...AGAGTTTTATAA/GAGAGTTTTATA...TTCAG\|ATC | 0 | 1 | 15.039 |
| 108814343 | GT-AG | 0 | 1.000000099473604e-05 | 529 | rna-XM_041600161.1 20309383 | 5 | 20244187 | 20244715 | Lytechinus variegatus 7654 | TAG\|GTAAGATGAG...TCACCTTTACCT/CTGTATTTCATT...GTCAG\|GGG | 1 | 1 | 18.775 |
| 108814344 | GT-AG | 0 | 1.000000099473604e-05 | 349 | rna-XM_041600161.1 20309383 | 6 | 20243511 | 20243859 | Lytechinus variegatus 7654 | AAA\|GTGAGTGTTC...ATTTCTTTATCC/TATTTCTTTATC...TTCAG\|ATG | 1 | 1 | 24.824 |
| 108814345 | GT-AG | 0 | 1.000000099473604e-05 | 308 | rna-XM_041600161.1 20309383 | 7 | 20242879 | 20243186 | Lytechinus variegatus 7654 | AAG\|GTAAGCAACC...GTCCCCTTTTCA/CCCCTTTTCAAT...GACAG\|TTG | 1 | 1 | 30.818 |
| 108814346 | GT-AG | 0 | 1.0646985805262914e-05 | 1334 | rna-XM_041600161.1 20309383 | 8 | 20241266 | 20242599 | Lytechinus variegatus 7654 | AAC\|GTTAGTATCT...AATTCTTTAACA/TCTTTCTTTATA...TTCAG\|CCA | 1 | 1 | 35.979 |
| 108814347 | GT-AG | 0 | 3.718809721999741e-05 | 561 | rna-XM_041600161.1 20309383 | 9 | 20240568 | 20241128 | Lytechinus variegatus 7654 | CAT\|GTAAGTTAGA...ATGCTTTTATTC/CTTTTATTCATC...AACAG\|CAT | 0 | 1 | 38.513 |
| 108814348 | GT-AG | 0 | 3.0972762422252645e-05 | 304 | rna-XM_041600161.1 20309383 | 10 | 20240056 | 20240359 | Lytechinus variegatus 7654 | TAG\|GTATGGAAAT...AAATCTTTAATC/TATATACTCATG...TTTAG\|ATC | 1 | 1 | 42.36 |
| 108814349 | GT-AG | 0 | 1.000000099473604e-05 | 403 | rna-XM_041600161.1 20309383 | 11 | 20239344 | 20239746 | Lytechinus variegatus 7654 | ATG\|GTAAGCCAAT...AGTATCTTCACC/TTATTGTTTATT...ATTAG\|ATG | 1 | 1 | 48.076 |
| 108814350 | GT-AG | 0 | 1.000000099473604e-05 | 408 | rna-XM_041600161.1 20309383 | 12 | 20238829 | 20239236 | Lytechinus variegatus 7654 | AAG\|GTAAGACCAA...TATATCTTATAC/CTTATACACATT...TGCAG\|TGG | 0 | 1 | 50.055 |
| 108814351 | GT-AG | 0 | 0.0003313294464829 | 462 | rna-XM_041600161.1 20309383 | 13 | 20238144 | 20238605 | Lytechinus variegatus 7654 | AAC\|GTAAGTTTTT...CTTTCATTGAAC/GATGCTTTCATT...TACAG\|CGA | 1 | 1 | 54.181 |
| 108814352 | GT-AG | 0 | 0.0017900732510728 | 311 | rna-XM_041600161.1 20309383 | 14 | 20237542 | 20237852 | Lytechinus variegatus 7654 | TTG\|GTAACTCAAA...CTTATTTTAATT/CTTATTTTAATT...ATCAG\|AGG | 1 | 1 | 59.563 |
| 108814353 | GT-AG | 0 | 1.000000099473604e-05 | 651 | rna-XM_041600161.1 20309383 | 15 | 20236609 | 20237259 | Lytechinus variegatus 7654 | TAG\|GTTAGTTGGA...ACATCTTTAGAA/ACTTTGTTTATA...TACAG\|ATA | 1 | 1 | 64.78 |
| 108814354 | GT-AG | 0 | 1.000000099473604e-05 | 404 | rna-XM_041600161.1 20309383 | 16 | 20236077 | 20236480 | Lytechinus variegatus 7654 | AAG\|GTGAATATCA...ATTTTCTTATAT/TATTTTCTTATA...TACAG\|CTA | 0 | 1 | 67.148 |
| 108814355 | GT-AG | 0 | 1.000000099473604e-05 | 340 | rna-XM_041600161.1 20309383 | 17 | 20235598 | 20235937 | Lytechinus variegatus 7654 | TAG\|GTCAGTACAA...TATTTGTTAATT/TATTTGTTAATT...ATCAG\|GTC | 1 | 1 | 69.719 |
| 108814356 | GT-AG | 0 | 1.000000099473604e-05 | 1171 | rna-XM_041600161.1 20309383 | 18 | 20234322 | 20235492 | Lytechinus variegatus 7654 | AAG\|GTGAGACCTA...GTGTTCTTTATT/GTGTTCTTTATT...TGCAG\|AGG | 1 | 1 | 71.661 |
| 108814357 | GT-AG | 0 | 0.000188272965309 | 302 | rna-XM_041600161.1 20309383 | 19 | 20233909 | 20234210 | Lytechinus variegatus 7654 | AAG\|GTAAGCAGTG...TCTTCCTTAATT/TCTTCCTTAATT...TTCAG\|ATC | 1 | 1 | 73.714 |
| 108814358 | GT-AG | 0 | 1.000000099473604e-05 | 489 | rna-XM_041600161.1 20309383 | 20 | 20233112 | 20233600 | Lytechinus variegatus 7654 | AAG\|GTGAGAGGCC...TTCTTTTTATCT/ATCTTTTTCACC...TGCAG\|TGT | 0 | 1 | 79.412 |
| 108814359 | GT-AG | 0 | 6.937088181151229e-05 | 346 | rna-XM_041600161.1 20309383 | 21 | 20232643 | 20232988 | Lytechinus variegatus 7654 | GGG\|GTAAGTTTGA...GTCTTTTTGATG/GTCTTTTTGATG...TACAG\|GGA | 0 | 1 | 81.687 |
| 108814360 | GT-AG | 0 | 1.000000099473604e-05 | 458 | rna-XM_041600161.1 20309383 | 22 | 20232114 | 20232571 | Lytechinus variegatus 7654 | TGT\|GTGAGTATCC...TTATTTATGATT/TTATTATTTATG...TTCAG\|ATG | 2 | 1 | 83.0 |
| 108814361 | GT-AG | 0 | 0.0007625079898178 | 409 | rna-XM_041600161.1 20309383 | 23 | 20231564 | 20231972 | Lytechinus variegatus 7654 | AAT\|GTAAGCATTA...TTGTTCTTTGTC/ATCATGATCATT...AACAG\|TTA | 2 | 1 | 85.609 |
| 108814362 | GT-AG | 0 | 1.000000099473604e-05 | 255 | rna-XM_041600161.1 20309383 | 24 | 20231203 | 20231457 | Lytechinus variegatus 7654 | CAG\|GTGAGTCTTA...TCATCTTTGAAA/AATTTGTTCATA...TCCAG\|GAG | 0 | 1 | 87.569 |
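In the rows above, `length` appears to equal `end - start + 1`, which suggests inclusive coordinates. `start` also decreases as `ordinal_index` increases, which would be consistent with a transcript annotated on the reverse strand. Both readings are inferred from these rows and are not stated by the page. The sketch below checks them on the first four introns; the `exon_gaps` name and the idea of reading the gap between adjacent introns as the intervening exon length are assumptions made for illustration.

```python
# (ordinal_index, start, end, length) copied from the first four rows above.
rows = [
    (1, 20257184, 20283111, 25928),
    (2, 20256144, 20256859, 716),
    (3, 20247821, 20255870, 8050),
    (4, 20244918, 20247674, 2757),
]

# Inclusive coordinates: an intron covering start..end spans end - start + 1 bases.
lengths_consistent = all(end - start + 1 == length
                         for _, start, end, length in rows)

# Coordinates fall as ordinal_index rises in these rows.
descending = all(prev[1] > nxt[2] for prev, nxt in zip(rows, rows[1:]))

# Bases strictly between consecutive introns (assumed here to be the exon
# separating them, given the descending coordinate order).
exon_gaps = [prev[1] - nxt[2] - 1 for prev, nxt in zip(rows, rows[1:])]

print(lengths_consistent, descending, exon_gaps)
```

A check like this is a cheap sanity test before joining intron coordinates against other annotation sources that may use a different coordinate convention.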
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
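The table declares two foreign keys, but SQLite only enforces them on connections where `PRAGMA foreign_keys = ON` has been issued. The sketch below demonstrates this with Python's standard `sqlite3` module. The one-column `transcripts` and `genomes` tables and the trimmed `introns` table are stand-ins invented for the example; the real tables have more columns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Enforcement is per connection and off by default in SQLite.
conn.execute("PRAGMA foreign_keys = ON")

# Minimal stand-in schema (hypothetical: only the key columns are modelled).
conn.executescript("""
CREATE TABLE transcripts (id INTEGER PRIMARY KEY);
CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER REFERENCES transcripts(id),
    taxonomy_id INTEGER REFERENCES genomes(taxonomy_id)
);
""")

# Parent rows using identifiers that appear on this page.
conn.execute("INSERT INTO transcripts (id) VALUES (20309383)")
conn.execute("INSERT INTO genomes (taxonomy_id) VALUES (7654)")
conn.execute("INSERT INTO introns VALUES (108814339, 20309383, 7654)")

# An intron pointing at a transcript that does not exist is rejected.
try:
    conn.execute("INSERT INTO introns VALUES (1, 999, 7654)")  # 999 is made up
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print("orphan intron rejected:", rejected)
```

Without the pragma, the second insert would succeed silently, so loaders that rely on referential integrity should set it explicitly.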
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
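The index on `transcript_id` is what lets a filter such as `transcript_id = 20309383` avoid a full table scan. The sketch below rebuilds a trimmed copy of the table in memory and asks SQLite for its query plan. The reduced column set and the extra row with `transcript_id` 999 are simplifications made for the example, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    ordinal_index INTEGER,
    length INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (108814339, 20309383, 1, 25928),  # rows copied from this page
        (108814340, 20309383, 2, 716),
        (108814341, 20309383, 3, 8050),
        (1, 999, 1, 100),                 # hypothetical row from another transcript
    ],
)

query = ("SELECT id, ordinal_index, length FROM introns "
         "WHERE transcript_id = ? ORDER BY ordinal_index")

# Each plan row is (id, parent, notused, detail); detail is human-readable text.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (20309383,)).fetchall()
plan_text = " / ".join(step[3] for step in plan)

rows = conn.execute(query, (20309383,)).fetchall()
print(plan_text)
print(rows)
```

The plan text should mention `idx_introns_transcript_id`, indicating an index search rather than a scan of every intron.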