introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
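As a rough illustration of how the columns described above can be queried, the sketch below loads a few rows into an in-memory SQLite table and counts coding-sequence introns per phase. The table here is a reduced, hypothetical subset of the real schema (only `id`, `ordinal_index`, `phase`, and `in_cds`), and the six rows are the first six introns listed on this page for transcript 3485118; the helper name `phase_counts` is an assumption made for the example.

```python
import sqlite3

# Reduced stand-in for the introns table: only the columns this example needs.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, ordinal_index INTEGER, phase INTEGER, in_cds INTEGER)"
)

# (id, ordinal_index, phase, in_cds) for the first six introns of transcript 3485118.
ROWS = [
    (17203132, 1, 0, 1),
    (17203133, 2, 2, 1),
    (17203134, 3, 0, 1),
    (17203135, 4, 2, 1),
    (17203136, 5, 0, 1),
    (17203137, 6, 1, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", ROWS)


def phase_counts(connection):
    """Count introns inside the coding sequence, grouped by phase."""
    cursor = connection.execute(
        "SELECT phase, COUNT(*) FROM introns "
        "WHERE in_cds = 1 GROUP BY phase ORDER BY phase"
    )
    return dict(cursor.fetchall())


print(phase_counts(conn))
```

Filtering on `in_cds = 1` keeps the null-phase introns (those outside the coding sequence) out of the grouping, which matches how `phase` is described.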
18 rows where transcript_id = 3485118
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
17203132 | GT-AG | 0 | 1.2771771328564096e-05 | 166 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 1 | 117233 | 117398 | Armadillidium vulgare 13347 | AAG\|GTAATTTTAT...ACTAGTTTAATT/ACTAGTTTAATT...TTTAG\|ATA | 0 | 1 | 4.291 |
17203133 | GT-AG | 0 | 1.6284347291202722e-05 | 172 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 2 | 117488 | 117659 | Armadillidium vulgare 13347 | AAA\|GTAAGTCTGT...GTATACTTAATA/TGTATACTTAAT...TTAAG\|GAT | 2 | 1 | 7.252 |
17203134 | GT-AG | 0 | 1.000000099473604e-05 | 141 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 3 | 117832 | 117972 | Armadillidium vulgare 13347 | CAG\|GTAATTTAAA...TTATTTTTATTT/TTTATTTTTATT...TTCAG\|GAT | 0 | 1 | 12.974 |
17203135 | GT-AG | 0 | 1.000000099473604e-05 | 428 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 4 | 118161 | 118588 | Armadillidium vulgare 13347 | AAG\|GTTAGTACGG...AAATTTTTAATG/TTTTTAATGATT...TAAAG\|GTT | 2 | 1 | 19.228 |
17203136 | GT-AG | 0 | 1.000000099473604e-05 | 337 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 5 | 118812 | 119148 | Armadillidium vulgare 13347 | CAG\|GTAAGTACAA...AAGTTCTTATAT/TTATTACTGATT...AAAAG\|GCA | 0 | 1 | 26.647 |
17203137 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 6 | 119354 | 119466 | Armadillidium vulgare 13347 | AAG\|GTAAGTTAGT...TTATTATTATTA/ATTATTATTATT...ATTAG\|GCT | 1 | 1 | 33.466 |
17203138 | GT-AG | 0 | 0.0029406673232846 | 175 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 7 | 119580 | 119754 | Armadillidium vulgare 13347 | CAG\|GTATTCAGGT...TTTATTTTACTT/TTTTATTTTACT...TACAG\|TAT | 0 | 1 | 37.226 |
17203139 | GT-AG | 0 | 1.000000099473604e-05 | 485 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 8 | 119892 | 120376 | Armadillidium vulgare 13347 | TGG\|GTATGTGAAA...ATATATATATAT/TATATATATATC...AAAAG\|GTA | 2 | 1 | 41.783 |
17203140 | GT-AG | 0 | 0.0001910558110384 | 85 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 9 | 120478 | 120562 | Armadillidium vulgare 13347 | ATA\|GTAGGTATTA...TTTATTTTATTT/ATTTATTTTATT...TAAAG\|GCC | 1 | 1 | 45.143 |
17203141 | GT-AG | 0 | 1.000000099473604e-05 | 884 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 10 | 120729 | 121612 | Armadillidium vulgare 13347 | CAG\|GTGAGATTTT...TATTTTTTATTA/TTATTTTTTATT...TTTAG\|GGA | 2 | 1 | 50.665 |
17203142 | GT-AG | 0 | 1.000000099473604e-05 | 479 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 11 | 121815 | 122293 | Armadillidium vulgare 13347 | CAG\|GTGAATTGCT...CCCATCTTAGAT/TTAATAATAATT...TACAG\|GTT | 0 | 1 | 57.385 |
17203143 | GT-AG | 0 | 0.0005856184999949 | 1186 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 12 | 122468 | 123653 | Armadillidium vulgare 13347 | CAG\|GTAACTATAT...TTTTTTTTCATT/TTTTTTTTCATT...ACTAG\|TCA | 0 | 1 | 63.174 |
17203144 | GT-AG | 0 | 1.6015830195843283e-05 | 615 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 13 | 123808 | 124422 | Armadillidium vulgare 13347 | TAG\|GTTTGTGTTT...TTTCCTTTGGCG/GTAGATTTTATA...AACAG\|AAC | 1 | 1 | 68.297 |
17203145 | GT-AG | 0 | 1.000000099473604e-05 | 383 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 14 | 124475 | 124857 | Armadillidium vulgare 13347 | AAG\|GTTAGTAAAT...TTTTTTTTATTT/GTTTTTTTTATT...TTTAG\|TTC | 2 | 1 | 70.027 |
17203146 | GT-AG | 0 | 0.0001297392221881 | 703 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 15 | 125100 | 125802 | Armadillidium vulgare 13347 | AAG\|GTATTTATAT...ATATATATAATG/TATATAATGATT...CAAAG\|AAA | 1 | 1 | 78.077 |
17203147 | GT-AG | 0 | 4.625745831168613e-05 | 761 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 16 | 125976 | 126736 | Armadillidium vulgare 13347 | TCG\|GTAAGTTGGT...AATTTTTTAAAT/AATTTTTTAAAT...TTCAG\|TTA | 0 | 1 | 83.832 |
17203148 | GT-AG | 0 | 0.000647449616187 | 109 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 17 | 126983 | 127091 | Armadillidium vulgare 13347 | TAT\|GTAAGTTATC...GAAGTCTTAATT/GAAGTCTTAATT...TTCAG\|GAA | 0 | 1 | 92.016 |
17203149 | GT-AG | 0 | 1.000000099473604e-05 | 132 | rna-gnl\|WGS:SAUD\|Avbf_00320-RA_mrna 3485118 | 18 | 127214 | 127345 | Armadillidium vulgare 13347 | AAG\|GTGGGTTGAT...ATTCCCCTAACA/ATAATTCTAAAA...TACAG\|AAA | 2 | 1 | 96.075 |
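The rows above suggest that `start` and `end` are inclusive coordinates, so that `length = end - start + 1`. The sketch below checks this for five rows copied from the table and, as a further illustration, computes the gap between introns 1 and 2. Reading that gap as the intervening exon is an assumption: it holds only if coordinates ascend with `ordinal_index`, as they do for this transcript. The names `span_length` and `gap_between` are hypothetical helpers, not part of the database.

```python
# (ordinal_index, start, end, length) copied from rows of the table.
INTRONS = [
    (1, 117233, 117398, 166),
    (2, 117488, 117659, 172),
    (3, 117832, 117972, 141),
    (12, 122468, 123653, 1186),
    (18, 127214, 127345, 132),
]


def span_length(start, end):
    """Length of a span whose start and end positions are both inclusive."""
    return end - start + 1


def gap_between(upstream_end, downstream_start):
    """Number of bases strictly between two consecutive introns."""
    return downstream_start - upstream_end - 1


# Rows whose stored length disagrees with the inclusive-coordinate formula.
mismatches = [
    ordinal
    for ordinal, start, end, length in INTRONS
    if span_length(start, end) != length
]
print("rows with inconsistent length:", mismatches)

# Gap between intron 1 (ends at 117398) and intron 2 (starts at 117488).
print("bases between introns 1 and 2:", gap_between(117398, 117488))
```

For these rows the stored `length` agrees with the inclusive formula, and the region between the first two introns spans 89 bases.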
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
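To see how the `transcript_id` index supports a lookup such as the one on this page (`transcript_id = 3485118`), the sketch below recreates the table and that one index in an in-memory SQLite database and asks for the query plan. It assumes only Python's standard `sqlite3` module; the referenced `transcripts` and `genomes` tables are not created, which SQLite tolerates because foreign keys are not enforced by default. The exact wording of the plan text varies between SQLite versions, so the example only looks for the index name.

```python
import sqlite3

# Table definition from the schema, with only the transcript_id index;
# the other indexes are omitted to keep the sketch short.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
  "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would run the per-transcript lookup.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (3485118,),
).fetchall()

# The human-readable description is the last column of each plan row.
plan_details = [row[-1] for row in plan]
for detail in plan_details:
    print(detail)
```

If the printed plan names `idx_introns_transcript_id`, the lookup is served by an index search rather than a full table scan.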