introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
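Two relationships between these columns can be checked against the rows listed for transcript 32656414: `length` appears to equal `end - start + 1` (both coordinates inclusive), and `dinucleotide_pair` appears to match the first and last two bases of the intron portion of `scored_motifs`. The sketch below assumes that `|` in `scored_motifs` marks the exon/intron boundaries, which is inferred from the sample rows rather than stated in the column descriptions. Both helper names are hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return a pair such as 'GT-AG' from a scored_motifs string.

    Assumption: the text between the first and last '|' is the intron side
    of the motif string, as the sample rows suggest.
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    intron_part = scored_motifs[first + 1:last]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


# Values copied from the row with id 182390203 (ordinal_index 19).
motifs = "CAG|GCAAGTGCTG...GTCTCTTTGAAC/GTCTCTTTGAAC...TTTAG|CAA"
print(terminal_dinucleotides(motifs))    # GC-AG
print(inclusive_length(161765, 161886))  # 122
```

Both results agree with the stored `dinucleotide_pair` and `length` values for that row.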
20 rows where transcript_id = 32656414
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182390185 | GT-AG | 0 | 1.6372688854140408e-05 | 1162 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 1 | 134397 | 135558 | Serilophus lunatus 239386 | CAG\|GTAATCGCCT...TTTTTCTTCTTT/TTGAGTGTTACA...CTCAG\|ATT | 1 | 1 | 0.999 |
| 182390186 | GT-AG | 0 | 0.0001113938417239 | 957 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 2 | 135677 | 136633 | Serilophus lunatus 239386 | CCT\|GTAAGTACCT...TTTTCCTTTGTT/AAAAGGCAAACT...TTCAG\|CTG | 2 | 1 | 4.468 |
| 182390187 | GT-AG | 0 | 1.000000099473604e-05 | 2578 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 3 | 136767 | 139344 | Serilophus lunatus 239386 | CAG\|GTGAGTTGCA...CTGTCTGTAGCT/CTGCTACTAAAG...TGCAG\|TTA | 0 | 1 | 8.377 |
| 182390188 | GT-AG | 0 | 1.000000099473604e-05 | 787 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 4 | 139517 | 140303 | Serilophus lunatus 239386 | TTG\|GTGAGTCATT...TATGCCTCAACT/TTTGATTTTATG...CTTAG\|CTC | 1 | 1 | 13.433 |
| 182390189 | GT-AG | 0 | 1.000000099473604e-05 | 1026 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 5 | 140376 | 141401 | Serilophus lunatus 239386 | ATG\|GTGAGTCTGG...CCCTCCATGACA/CCCTCCATGACA...CTTAG\|GTC | 1 | 1 | 15.55 |
| 182390190 | GT-AG | 0 | 1.000000099473604e-05 | 335 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 6 | 141553 | 141887 | Serilophus lunatus 239386 | AAG\|GTGAGACTAA...TGTTTCTGTTCT/CTGTGTGTAATT...TGTAG\|TCG | 2 | 1 | 19.988 |
| 182390191 | GT-AG | 0 | 1.000000099473604e-05 | 1954 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 7 | 142024 | 143977 | Serilophus lunatus 239386 | AAG\|GTTAGTTCTA...ACCATCTAAAAA/CAACTACTAATC...TGCAG\|AGG | 0 | 1 | 23.986 |
| 182390192 | GT-AG | 0 | 4.161661466541098e-05 | 1081 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 8 | 144141 | 145221 | Serilophus lunatus 239386 | TCA\|GTAAGTGTGC...TTTTCCTTTGTT/TCCTTTGTTACT...TCTAG\|TGT | 1 | 1 | 28.777 |
| 182390193 | GT-AG | 0 | 1.000000099473604e-05 | 1975 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 9 | 145379 | 147353 | Serilophus lunatus 239386 | GTA\|GTGAGTGTTC...TGTTTTCTAATT/TGTTTTCTAATT...TGTAG\|CAT | 2 | 1 | 33.392 |
| 182390194 | GT-AG | 0 | 1.000000099473604e-05 | 319 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 10 | 147425 | 147743 | Serilophus lunatus 239386 | ATG\|GTAAGGACAC...CTTTCCTTACAC/GCTTTCCTTACA...TCCAG\|CAA | 1 | 1 | 35.479 |
| 182390195 | GT-AG | 0 | 1.000000099473604e-05 | 633 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 11 | 147805 | 148437 | Serilophus lunatus 239386 | CAG\|GTGGGTGGCA...GTGTAATTAATG/GTGTAATTAATG...ACCAG\|GAA | 2 | 1 | 37.272 |
| 182390196 | GT-AG | 0 | 1.000000099473604e-05 | 222 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 12 | 148558 | 148779 | Serilophus lunatus 239386 | AAG\|GTAATTGCAG...GCTTTTTCAATT/CGCTTTTTCAAT...TTCAG\|GTT | 2 | 1 | 40.8 |
| 182390197 | GT-AG | 0 | 0.0001043396438345 | 3528 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 13 | 149216 | 152743 | Serilophus lunatus 239386 | CAG\|GTAAGCCTTT...TTCTTCTTACTT/TTTCTTCTTACT...ATCAG\|GTG | 0 | 1 | 53.616 |
| 182390198 | GT-AG | 0 | 1.000000099473604e-05 | 1181 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 14 | 152889 | 154069 | Serilophus lunatus 239386 | CAG\|GTAATTTGCA...ATTTCCATCTTT/TCTGTCTGCACT...TCCAG\|GAT | 1 | 1 | 57.878 |
| 182390199 | GT-AG | 0 | 7.530642789368362e-05 | 731 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 15 | 154462 | 155192 | Serilophus lunatus 239386 | CAG\|GTAAACTTGT...GTAACATCAACT/TTCCTCCCCACT...CTCAG\|TAT | 0 | 1 | 69.4 |
| 182390200 | GT-AG | 0 | 1.000000099473604e-05 | 3952 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 16 | 155559 | 159510 | Serilophus lunatus 239386 | GAG\|GTAAGGCCCA...TTTCCCTTTTTG/TGAACACTAATT...TGAAG\|AAG | 0 | 1 | 80.159 |
| 182390201 | GT-AG | 0 | 2.544475075257926e-05 | 577 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 17 | 159592 | 160168 | Serilophus lunatus 239386 | CAG\|GTATGTGAGA...AACACCTAAACT/CAACACCTAAAC...TTCAG\|GAA | 0 | 1 | 82.54 |
| 182390202 | GT-AG | 0 | 3.41592553355324e-05 | 1265 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 18 | 160329 | 161593 | Serilophus lunatus 239386 | CAG\|GTAATCCTGC...GTTTTCCTATTG/TCTCTACTGACC...ACTAG\|ATC | 1 | 1 | 87.243 |
| 182390203 | GC-AG | 0 | 1.000000099473604e-05 | 122 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 19 | 161765 | 161886 | Serilophus lunatus 239386 | CAG\|GCAAGTGCTG...GTCTCTTTGAAC/GTCTCTTTGAAC...TTTAG\|CAA | 1 | 1 | 92.269 |
| 182390204 | GT-AG | 0 | 1.000000099473604e-05 | 472 | rna-gnl\|WGS:VXBA\|SERLUN_R02655_mrna 32656414 | 20 | 162039 | 162510 | Serilophus lunatus 239386 | AAG\|GTAGGTTTGG...CTAGTGTTGGCT/GGCTGTCTAATG...TCTAG\|TCC | 0 | 1 | 96.737 |
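The listing above corresponds to filtering `introns` on `transcript_id` and ordering by `id`. As a minimal sketch of that query, the example below builds a reduced in-memory copy of the table with Python's standard `sqlite3` module. The reduced column set, the function name `introns_for_transcript`, and the extra row for transcript 999 are illustrative assumptions and are not part of the database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)
# Three rows copied from the listing, plus one made-up row for a different
# (hypothetical) transcript so the filter has something to exclude.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (182390185, "GT-AG", 1162, 32656414, 1),
        (182390186, "GT-AG", 957, 32656414, 2),
        (182390203, "GC-AG", 122, 32656414, 19),
        (1, "GT-AG", 100, 999, 1),  # hypothetical row, not from the source
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript, ordered by id."""
    return conn.execute(
        "SELECT id, dinucleotide_pair, length, ordinal_index "
        "FROM introns WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    ).fetchall()


rows = introns_for_transcript(conn, 32656414)
for row in rows:
    print(row)
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with arbitrary input.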
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
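The `CREATE INDEX` statements above suggest that filters on columns such as `transcript_id` are meant to be served by an index rather than a full table scan. One way to check this, sketched below under the assumption of a reduced three-column copy of the table, is to ask SQLite for its query plan with `EXPLAIN QUERY PLAN`. The exact wording of the plan varies between SQLite versions, so the example only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced schema for illustration; only the indexed column matters here.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        length INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32656414,),
).fetchall()

# The last column of each plan row is a human-readable description.
plan_text = " ".join(row[-1] for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
print(uses_index)
```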