introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
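
The column descriptions above can be mirrored as a typed record. The following is a minimal sketch, not part of the database itself: the `Intron` class and its `minor` property are hypothetical names introduced here for illustration, and only `phase` is typed as optional because it is the one column the descriptions document as nullable. The sample values are copied from the first row of the table on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical record type mirroring the columns of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: int            # 1 = minor intron, 0 = not
    score: float             # probability of being minor, 0-100 (%)
    length: int              # base pairs
    transcript_id: int       # references transcripts(id)
    ordinal_index: int       # 1-based position within the transcript
    start: int
    end: int
    taxonomy_id: int         # references genomes(taxonomy_id)
    scored_motifs: str
    phase: Optional[int]     # 0, 1, or 2; None outside the coding sequence
    in_cds: int              # 1 = within the coding sequence, 0 = not
    relative_position: float

    @property
    def minor(self) -> bool:
        # Convenience flag (an assumption of this sketch, not a database column).
        return self.is_minor == 1


# Values copied from the row with id 181746220.
first = Intron(
    id=181746220,
    dinucleotide_pair="GT-AG",
    is_minor=0,
    score=0.0314552559970415,
    length=44,
    transcript_id=32499364,
    ordinal_index=1,
    start=65843,
    end=65886,
    taxonomy_id=104778,
    scored_motifs="AAA|GTTTTCTTTT...TTCTTCTAATTC/ATTCTTCTAATT...TTCAG|GGG",
    phase=2,
    in_cds=1,
    relative_position=2.118,
)
print(first.minor, first.phase)
```

Keeping `is_minor` and `in_cds` as integers matches the stored representation; the boolean view is layered on top so that round-tripping a row does not change its values.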
16 rows where transcript_id = 32499364
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 181746220 | GT-AG | 0 | 0.0314552559970415 | 44 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 1 | 65843 | 65886 | Seison nebaliae 104778 | AAA\|GTTTTCTTTT...TTCTTCTAATTC/ATTCTTCTAATT...TTCAG\|GGG | 2 | 1 | 2.118 |
| 181746221 | GT-AG | 0 | 0.0006745778267506 | 53 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 2 | 66042 | 66094 | Seison nebaliae 104778 | TTG\|GTTACTCATC...TTTCTCTTAATT/TTTCTCTTAATT...TTCAG\|TTC | 1 | 1 | 6.947 |
| 181746222 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 3 | 66150 | 66205 | Seison nebaliae 104778 | TTT\|GTTAGTGATT...TCTATCTTCATT/CTCATTTTCATT...TCTAG\|AGT | 2 | 1 | 8.66 |
| 181746223 | GT-AG | 0 | 0.0142704598785244 | 50 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 4 | 66272 | 66321 | Seison nebaliae 104778 | CAG\|GTTTCTATTT...TAATTCTCAATA/TTAATTCTCAAT...TTCAG\|TCG | 2 | 1 | 10.717 |
| 181746224 | GT-AG | 0 | 0.000628288666102 | 45 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 5 | 66395 | 66439 | Seison nebaliae 104778 | AAA\|GTTTCTAAAT...AATATATTCATA/AATATATTCATA...TTTAG\|CCT | 0 | 1 | 12.991 |
| 181746225 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 6 | 66627 | 66673 | Seison nebaliae 104778 | GTG\|GTTAGTGAAT...AAATTGTTAATA/TTGTTAATAATT...TTTAG\|GAG | 1 | 1 | 18.816 |
| 181746226 | GT-AG | 0 | 0.0480723616087194 | 53 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 7 | 66892 | 66944 | Seison nebaliae 104778 | TGC\|GTTCACTTTC...TATATTTTATTT/TTATATTTTATT...CATAG\|CAA | 0 | 1 | 25.607 |
| 181746227 | GT-AG | 0 | 1.0966917178084196e-05 | 2211 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 8 | 67072 | 69282 | Seison nebaliae 104778 | CGA\|GTAAGTATTT...CAACTCTCATTT/TCAACTCTCATT...TATAG\|AGA | 1 | 1 | 29.564 |
| 181746228 | GT-AG | 0 | 0.009168263985938 | 661 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 9 | 69342 | 70002 | Seison nebaliae 104778 | TCT\|GTTCCATCTT...GTTTTGTTGATC/GTTTTGTTGATC...TAAAG\|GAC | 0 | 1 | 31.402 |
| 181746229 | GT-AG | 0 | 1.000000099473604e-05 | 1239 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 10 | 71348 | 72586 | Seison nebaliae 104778 | CAG\|GTGCTAATTC...CATCATTTAATC/CATCATTTAATC...TATAG\|TTA | 1 | 1 | 73.302 |
| 181746230 | GT-AG | 0 | 0.0222125956935399 | 687 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 11 | 72662 | 73348 | Seison nebaliae 104778 | CAG\|GTATTCCTTT...TTCTTTTTCATT/TTCTTTTTCATT...CATAG\|CAA | 1 | 1 | 75.639 |
| 181746231 | GT-AG | 0 | 0.002079421853116 | 44 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 12 | 73459 | 73502 | Seison nebaliae 104778 | ACA\|GTTTTCAATT...CTAATTTTCATT/CTAATTTTCATT...TTTAG\|TCT | 0 | 1 | 79.065 |
| 181746232 | GT-AG | 0 | 1.753964069646543e-05 | 48 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 13 | 73711 | 73758 | Seison nebaliae 104778 | GAG\|GTAATCGTTT...TATTATTTATTC/TATTTATTCATC...TTCAG\|CTG | 1 | 1 | 85.545 |
| 181746233 | GT-AG | 0 | 0.0001540422632118 | 46 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 14 | 73838 | 73883 | Seison nebaliae 104778 | TCA\|GTTCAATTTA...ATTTTCATATTT/CATTTTCTCATT...ATTAG\|ACG | 2 | 1 | 88.006 |
| 181746234 | GT-AG | 0 | 1.000000099473604e-05 | 275 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 15 | 73979 | 74253 | Seison nebaliae 104778 | ACA\|GTTAGAATCT...TTAATTTTAATT/TTAATTTTAATT...CATAG\|CAA | 1 | 1 | 90.966 |
| 181746235 | GT-AG | 0 | 3.511639148057521e-05 | 55 | rna-gnl\|WGS:JALJEE\|g5399.t1 32499364 | 16 | 74417 | 74471 | Seison nebaliae 104778 | TAA\|GTTGATTTTT...TTCCCTTTTTTT/TTTTTTTTCAGA...TTTAG\|TAA | 2 | 1 | 96.044 |
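
In the 16 rows above, `length` equals `end - start + 1` in every case, which suggests that both coordinates are inclusive. This is an observation about these rows rather than a documented guarantee. The sketch below loads the relevant columns into an in-memory SQLite table (a reduced stand-in for the real `introns` table, created here only for illustration), checks that relationship, and tallies introns by phase.

```python
import sqlite3

# (ordinal_index, length, start, end, phase) copied from the 16 rows shown above.
ROWS = [
    (1, 44, 65843, 65886, 2),
    (2, 53, 66042, 66094, 1),
    (3, 56, 66150, 66205, 2),
    (4, 50, 66272, 66321, 2),
    (5, 45, 66395, 66439, 0),
    (6, 47, 66627, 66673, 1),
    (7, 53, 66892, 66944, 0),
    (8, 2211, 67072, 69282, 1),
    (9, 661, 69342, 70002, 0),
    (10, 1239, 71348, 72586, 1),
    (11, 687, 72662, 73348, 1),
    (12, 44, 73459, 73502, 0),
    (13, 48, 73711, 73758, 1),
    (14, 46, 73838, 73883, 2),
    (15, 275, 73979, 74253, 1),
    (16, 55, 74417, 74471, 2),
]
TRANSCRIPT_ID = 32499364

conn = sqlite3.connect(":memory:")
# Reduced stand-in table: only the columns needed for this check.
# "end" is an SQL keyword, so the column names are quoted.
conn.execute(
    'CREATE TABLE introns ("ordinal_index" INTEGER, "length" INTEGER, '
    '"start" INTEGER, "end" INTEGER, "phase" INTEGER, "transcript_id" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [row + (TRANSCRIPT_ID,) for row in ROWS],
)

# Count rows where the stored length disagrees with inclusive coordinates.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns '
    'WHERE transcript_id = ? AND "end" - "start" + 1 <> "length"',
    (TRANSCRIPT_ID,),
).fetchone()[0]

# Tally introns of this transcript by phase.
phase_counts = dict(conn.execute(
    'SELECT "phase", COUNT(*) FROM introns '
    'WHERE transcript_id = ? GROUP BY "phase"',
    (TRANSCRIPT_ID,),
))

print(mismatches)    # 0
print(phase_counts)  # {0: 4, 1: 7, 2: 5}
```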
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
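
The indexes above exist so that filters such as `transcript_id = 32499364` avoid a full table scan. One way to check this, sketched here under the assumption that the schema is recreated in an empty in-memory SQLite database (only two of the seven indexes are included to keep the example short), is to ask SQLite for its query plan. The exact wording of the plan text varies between SQLite versions, so the example only looks for the index name.

```python
import sqlite3

# Table definition as shown above, plus two of its indexes.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Each plan row is (id, parent, notused, detail); the detail column
# holds the human-readable description of the step.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32499364,),
).fetchall()
details = [row[3] for row in plan]

uses_index = any("idx_introns_transcript_id" in d for d in details)
for d in details:
    print(d)
print("index used:", uses_index)
```

The referenced `transcripts` and `genomes` tables are not created in this sketch; SQLite accepts the foreign key declarations regardless, and does not enforce them unless `PRAGMA foreign_keys` is turned on.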