introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 3395361
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 16699032 | GT-AG | 0 | 0.0005609359806895 | 8997 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 1 | 99762 | 108758 | Arenaria interpres 54971 | TTG|GTATGTGGCT...TGTTTTTTAACT/TGTTTTTTAACT...TGTAG|GAT | 0 | 1 | 2.832 |
| 16699033 | GT-AG | 0 | 1.000000099473604e-05 | 7745 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 2 | 108838 | 116582 | Arenaria interpres 54971 | CAG|GTAAGATAGT...ATAACCTTTGCT/TTGCTGTTTACT...TGCAG|GTG | 1 | 1 | 5.404 |
| 16699034 | GT-AG | 0 | 1.000000099473604e-05 | 3586 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 3 | 116654 | 120239 | Arenaria interpres 54971 | GAA|GTGAGTAAGA...TTGCATTTATTC/CTTGCATTTATT...TGCAG|GAA | 0 | 1 | 7.715 |
| 16699035 | GT-AG | 0 | 1.000000099473604e-05 | 2196 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 4 | 120441 | 122636 | Arenaria interpres 54971 | GAG|GTGAGATATT...GTTGGCTTACTT/GGTTGGCTTACT...TCTAG|ATG | 0 | 1 | 14.258 |
| 16699036 | GT-AG | 0 | 1.000000099473604e-05 | 2730 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 5 | 122725 | 125454 | Arenaria interpres 54971 | TGG|GTAAGGCTCG...GACTCATTGATT/CTATCTCTCATC...TGAAG|GTC | 1 | 1 | 17.122 |
| 16699037 | GT-AG | 0 | 0.0003635855041655 | 1399 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 6 | 125598 | 126996 | Arenaria interpres 54971 | GAG|GTATGTGGTA...GTAATTTTAACT/GTAATTTTAACT...TTTAG|ATT | 0 | 1 | 21.777 |
| 16699038 | GT-AG | 0 | 1.000000099473604e-05 | 1455 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 7 | 127210 | 128664 | Arenaria interpres 54971 | CAG|GTAAAGCTTG...TATGTTTTGTTT/TATGTGCTAAAT...CTTAG|AGT | 0 | 1 | 28.711 |
| 16699039 | GT-AG | 0 | 1.000000099473604e-05 | 3212 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 8 | 128744 | 131955 | Arenaria interpres 54971 | ACG|GTGAGTTGCT...TCATGCTTAACC/CACTGATTCATG...TGCAG|GAG | 1 | 1 | 31.283 |
| 16699040 | GT-AG | 0 | 1.000000099473604e-05 | 3508 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 9 | 132071 | 135578 | Arenaria interpres 54971 | CAA|GTAGGTGGCT...GCAACCTGGATT/TGGATTTTCACA...TTCAG|GTG | 2 | 1 | 35.026 |
| 16699041 | GT-AG | 0 | 1.000000099473604e-05 | 1295 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 10 | 135662 | 136956 | Arenaria interpres 54971 | CGG|GTGAGTCAAT...TTTTCCTAACTA/TCCTAACTAACT...ACTAG|GAC | 1 | 1 | 37.728 |
| 16699042 | GT-AG | 0 | 1.000000099473604e-05 | 2407 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 11 | 137075 | 139481 | Arenaria interpres 54971 | AGG|GTAAGTAGCT...TTTTCTTTACTT/GTTTTCTTTACT...TCCAG|TAA | 2 | 1 | 41.569 |
| 16699043 | GT-AG | 0 | 1.000000099473604e-05 | 743 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 12 | 139551 | 140293 | Arenaria interpres 54971 | GAG|GTAGGTAAAT...TGTCACTTATGT/TTGTCACTTATG...TCCAG|GAC | 2 | 1 | 43.815 |
| 16699044 | GT-AG | 0 | 1.000000099473604e-05 | 3458 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 13 | 140435 | 143892 | Arenaria interpres 54971 | AAA|GTAAGAAAGC...ATTATTTTGAAA/ATTATTTTGAAA...AGAAG|AAA | 2 | 1 | 48.405 |
| 16699045 | GT-AG | 0 | 0.0002276943979134 | 1006 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 14 | 143974 | 144979 | Arenaria interpres 54971 | ATT|GTAAGTTTGG...TGTCTGTTAAAT/TGTGAGCTCACA...CACAG|GTG | 2 | 1 | 51.042 |
| 16699046 | GT-AG | 0 | 1.000000099473604e-05 | 12130 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 15 | 145092 | 157221 | Arenaria interpres 54971 | CTG|GTGAGTGCTC...TTTTTCTTTTCG/TTTTTTTTCTTT...TTCAG|CTT | 0 | 1 | 54.688 |
| 16699047 | GT-AG | 0 | 0.0001541865999551 | 2999 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 16 | 157294 | 160292 | Arenaria interpres 54971 | GCT|GTAAGTATGA...AAATCTTTGAAA/GTGTTCTTTACA...TCTAG|GCC | 0 | 1 | 57.031 |
| 16699048 | GT-AG | 0 | 1.000000099473604e-05 | 4371 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 17 | 160495 | 164865 | Arenaria interpres 54971 | CAG|GTGAGTGTGT...CATATTTTAATT/CATATTTTAATT...TGTAG|TCA | 1 | 1 | 63.607 |
| 16699049 | GT-AG | 0 | 0.0006190817203661 | 1032 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 18 | 164945 | 165976 | Arenaria interpres 54971 | ACA|GTAAGTTTTC...CATTTCATAGCT/TGTCATTTCATA...TCTAG|AAC | 2 | 1 | 66.178 |
| 16699050 | GT-AG | 0 | 6.10270283589432e-05 | 1038 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 19 | 166095 | 167132 | Arenaria interpres 54971 | GGG|GTGACTTAAT...ACTTCTTTACTC/TCTTTACTCATT...GTCAG|ATT | 0 | 1 | 70.02 |
| 16699051 | GT-AG | 0 | 1.000000099473604e-05 | 707 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 20 | 167350 | 168056 | Arenaria interpres 54971 | ACG|GTAAGGTTCT...AATATCTCATCT/AAATATCTCATC...TTTAG|GTC | 1 | 1 | 77.083 |
| 16699052 | GT-AG | 0 | 1.409144594183029e-05 | 1945 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 21 | 168145 | 170089 | Arenaria interpres 54971 | CGT|GTAAGTAGAA...GACTTTTTACCT/AGACTTTTTACC...GGCAG|AAT | 2 | 1 | 79.948 |
| 16699053 | GT-AG | 0 | 1.000000099473604e-05 | 1274 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 22 | 170174 | 171447 | Arenaria interpres 54971 | AAG|GTAATAGTTT...CATTCTTTAGCG/AATATTGTCATT...CACAG|AAC | 2 | 1 | 82.682 |
| 16699054 | GT-AG | 0 | 1.000000099473604e-05 | 856 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 23 | 171594 | 172449 | Arenaria interpres 54971 | TAG|GTAAGTACAT...TGTTCGTTGATT/TGTTCGTTGATT...TTCAG|ACT | 1 | 1 | 87.435 |
| 16699055 | GT-AG | 0 | 1.000000099473604e-05 | 1188 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 24 | 172533 | 173720 | Arenaria interpres 54971 | AAA|GTAAGTTGAA...AACTCCTCAGCA/GTATTTGTTATT...TACAG|GGT | 0 | 1 | 90.137 |
| 16699056 | GT-AG | 0 | 2.369062989698508e-05 | 1119 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 25 | 173804 | 174922 | Arenaria interpres 54971 | GAC|GTAAGTATAG...TCTATTTTAAAC/CTTGAATTTACT...TGTAG|ACC | 2 | 1 | 92.839 |
| 16699057 | GT-AG | 0 | 1.000000099473604e-05 | 2157 | rna-gnl|WGS:VXAK|AREINT_R00561_mrna 3395361 | 26 | 175015 | 177171 | Arenaria interpres 54971 | ATG|GTAAGTAAAT...GTACCCTTAGTA/TTAGTACTAACC...CCCAG|GTC | 1 | 1 | 95.833 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);