introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
15 rows where transcript_id = 9114885
This data as json, CSV (advanced)
Suggested facets: phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389671 | GT-AG | 0 | 1.000000099473604e-05 | 5059 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 1 | 133011 | 138069 | Columbina picui 115618 | ACG|GTAAAGTTTT...TCATTTTCAACC/CCTAATTTAATT...CCTAG|GAG | 0 | 1 | 10.68 |
49389672 | GT-AG | 0 | 1.000000099473604e-05 | 1805 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 2 | 131063 | 132867 | Columbina picui 115618 | GGA|GTCCAAGTAA...CACTGCTTGACT/CACTGCTTGACT...TTCAG|TAT | 2 | 1 | 16.464 |
49389673 | GT-AG | 0 | 1.000000099473604e-05 | 2105 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 3 | 128801 | 130905 | Columbina picui 115618 | CAG|GTCTGTACAG...AAACTTTTATCT/AAAACTTTTATC...TACAG|CTT | 0 | 1 | 22.816 |
49389674 | GT-AG | 0 | 1.000000099473604e-05 | 2266 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 4 | 126340 | 128605 | Columbina picui 115618 | AAG|GTAGGGTACA...TGTTCTTTAATC/TGTTCTTTAATC...AACAG|AAC | 0 | 1 | 30.704 |
49389675 | GT-AG | 0 | 1.000000099473604e-05 | 2182 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 5 | 124020 | 126201 | Columbina picui 115618 | AAG|GTGAGTACCA...CTTTCCTTGTTG/CTTGTTGCTATT...TGTAG|ATC | 0 | 1 | 36.286 |
49389676 | GT-AG | 0 | 1.000000099473604e-05 | 960 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 6 | 122900 | 123859 | Columbina picui 115618 | GAG|GTAGGTGCCT...ATTGTCTTTTCT/CCAGGATTTATT...TCTAG|AGT | 1 | 1 | 42.759 |
49389677 | GT-AG | 0 | 1.000000099473604e-05 | 1137 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 7 | 121680 | 122816 | Columbina picui 115618 | GAG|GTGAGCTTGC...GTTGCCTTTCTA/AATTATTTTATG...TGTAG|AAC | 0 | 1 | 46.117 |
49389678 | GT-AG | 0 | 1.000000099473604e-05 | 4392 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 8 | 117096 | 121487 | Columbina picui 115618 | AAG|GTAGGAATAG...GATGACTTAATC/TGATGACTTAAT...TCCAG|GTC | 0 | 1 | 53.883 |
49389679 | GT-AG | 0 | 1.000000099473604e-05 | 3251 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 9 | 113683 | 116933 | Columbina picui 115618 | AAG|GTAGAAGGGT...TCTTTTTTGGTC/CTCTTTGTGATC...TTTAG|GAT | 0 | 1 | 60.437 |
49389680 | GT-AG | 0 | 1.000000099473604e-05 | 3436 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 10 | 110100 | 113535 | Columbina picui 115618 | AAA|GTAAGAGCCT...CTATATTTGACT/CTATATTTGACT...ATCAG|GCT | 0 | 1 | 66.383 |
49389681 | GT-AG | 0 | 1.000000099473604e-05 | 354 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 11 | 109605 | 109958 | Columbina picui 115618 | CAG|GTAGGAATCT...ATTGCTTTGCCC/ATTGTACTCAGA...CTTAG|GTT | 0 | 1 | 72.087 |
49389682 | GT-AG | 0 | 1.000000099473604e-05 | 1871 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 12 | 107510 | 109380 | Columbina picui 115618 | CAG|GTAGAAAAAC...TTTATGCTGATC/TTTATGCTGATC...TTAAG|GGA | 2 | 1 | 81.149 |
49389683 | GT-AG | 0 | 1.000000099473604e-05 | 1936 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 13 | 105462 | 107397 | Columbina picui 115618 | GAG|GTGATGTGTC...TAATTTTTGAGA/TTTGTTGTAATA...TCTAG|GCC | 0 | 1 | 85.68 |
49389684 | GT-AG | 0 | 1.000000099473604e-05 | 15269 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 14 | 90088 | 105356 | Columbina picui 115618 | CAG|GTACTGCAGC...ACTATTTTAGTT/TTTAGTTTTATA...CTTAG|GAA | 0 | 1 | 89.927 |
49389685 | GT-AG | 0 | 1.000000099473604e-05 | 1517 | rna-gnl|WGS:VYZG|COLPIC_R00072_mrna 9114885 | 15 | 88451 | 89967 | Columbina picui 115618 | AAA|GTAAGAGATG...TGCTTTTTGCCT/AGTGAAATTAGT...GTTAG|GTT | 0 | 1 | 94.782 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);