introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
13 rows where transcript_id = 9114894
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389799 | GT-AG | 0 | 0.0002882437880441 | 575 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 1 | 4884440 | 4885014 | Columbina picui 115618 | TTA|GTAAGTTCTT...CAGTCCTTCATT/CAGTCCTTCATT...TCCAG|TGG | 2 | 1 | 5.637 |
49389800 | GT-AG | 0 | 1.000000099473604e-05 | 1294 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 2 | 4885075 | 4886368 | Columbina picui 115618 | TTC|GTGAGTACTT...CAGCTCTTATGG/CAAAACCTAATG...TAAAG|TCT | 2 | 1 | 8.889 |
49389801 | GT-AG | 0 | 1.000000099473604e-05 | 2329 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 3 | 4886484 | 4888812 | Columbina picui 115618 | CAG|GTGAGTAACT...CCATTTGTACTG/AGGTAGCTGAGA...TTCAG|GAT | 0 | 1 | 15.122 |
49389802 | GT-AG | 0 | 1.000000099473604e-05 | 755 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 4 | 4888966 | 4889720 | Columbina picui 115618 | AAG|GTGAGCAATG...TAGGTCTTATTC/CTAGGTCTTATT...TGTAG|GTA | 0 | 1 | 23.415 |
49389803 | GT-AG | 0 | 1.3020177244076784e-05 | 1203 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 5 | 4889879 | 4891081 | Columbina picui 115618 | GAA|GTAAGTCTCT...CCTTTCTGAAAT/GTGTATTTGAGA...TTCAG|GCT | 2 | 1 | 31.978 |
49389804 | GT-AG | 0 | 1.000000099473604e-05 | 1204 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 6 | 4891234 | 4892437 | Columbina picui 115618 | TTG|GTAAGTCTTC...CTTTCTTTTTTT/TACAATATAAAC...TTAAG|GAT | 1 | 1 | 40.217 |
49389805 | GT-AG | 0 | 1.000000099473604e-05 | 460 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 7 | 4892555 | 4893014 | Columbina picui 115618 | AAG|GTAAATTGTG...TTGTCTTTCTCA/GTCTTTCTCACT...GGCAG|AAG | 1 | 1 | 46.558 |
49389806 | GT-AG | 0 | 2.3037397234647855e-05 | 328 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 8 | 4893254 | 4893581 | Columbina picui 115618 | CAA|GTAGGTAACA...TTCCCTTTATTC/TTTCCCTTTATT...TCTAG|ATC | 0 | 1 | 59.512 |
49389807 | GT-AG | 0 | 5.780736438329513e-05 | 382 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 9 | 4893665 | 4894046 | Columbina picui 115618 | TCG|GTAAGTTGGA...CACTTTTTGACT/CCTTTCCTCACT...TCCAG|AGT | 2 | 1 | 64.011 |
49389808 | GT-AG | 0 | 7.036025844236558e-05 | 1336 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 10 | 4894137 | 4895472 | Columbina picui 115618 | TGT|GTAAGTAGTG...GCTTCCTAGATA/TAACATCTCAAC...CGCAG|CCA | 2 | 1 | 68.889 |
49389809 | GT-AG | 0 | 1.000000099473604e-05 | 364 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 11 | 4895598 | 4895961 | Columbina picui 115618 | TTG|GTAAAGAATC...CTTTCCTTTCTG/TGGTTGTTTACT...GACAG|GAT | 1 | 1 | 75.664 |
49389810 | GT-AG | 0 | 1.000000099473604e-05 | 1412 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 12 | 4896117 | 4897528 | Columbina picui 115618 | CTT|GTAAGTATAT...AGAGTTTGAGTC/CAGAGTTTGAGT...TTCAG|ATC | 0 | 1 | 84.065 |
49389811 | GT-AG | 0 | 0.003851223672432 | 1953 | rna-gnl|WGS:VYZG|COLPIC_R04124_mrna 9114894 | 13 | 4897630 | 4899582 | Columbina picui 115618 | AAA|GTAGCATTGG...ATGTTTGTGAAA/ATAAAAGTAACT...TGAAG|ACT | 2 | 1 | 89.539 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);