introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
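The column list does not say whether `start` and `end` are inclusive. The rows shown on this page are consistent with 1-based inclusive coordinates, where `length = end - start + 1`. The sketch below checks that against the first listed intron. The `Intron` class and its `span` method are hypothetical names used only for illustration, and the inclusive convention is an inference from the data, not something the schema states.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical record holding a subset of the introns columns."""
    id: int
    start: int            # genomic start (assumed 1-based, inclusive)
    end: int              # genomic end (assumed inclusive)
    length: int           # length in base pairs, as stored in the table
    phase: Optional[int]  # 0, 1, 2, or None outside the coding sequence
    in_cds: int           # 1 if within the coding sequence, else 0

    def span(self) -> int:
        # Length implied by inclusive coordinates.
        return self.end - self.start + 1


# Values copied from the first row listed for transcript_id = 9114874.
first = Intron(id=49389478, start=1485238, end=1486235,
               length=998, phase=0, in_cds=1)

# True when the stored length matches the inclusive-coordinate span.
print(first.span() == first.length)
```

If a row failed this check, it would suggest a different coordinate convention (for example, half-open intervals) rather than a data error.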
16 rows where transcript_id = 9114874
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389478 | GT-AG | 0 | 1.000000099473604e-05 | 998 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 1 | 1485238 | 1486235 | Columbina picui 115618 | CAG\|GTGTGTGTAA...TTAACTTTATCC/TCTGTCTTTATT...TTCAG\|GGA | 0 | 1 | 20.815 |
49389479 | GT-AG | 0 | 1.000000099473604e-05 | 883 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 2 | 1486294 | 1487176 | Columbina picui 115618 | CAG\|GTAGAAGCAG...TTTTCTTTAACA/TTTTCTTTAACA...CACAG\|ATG | 1 | 1 | 22.492 |
49389480 | GT-AG | 0 | 3.968692925046714e-05 | 2900 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 3 | 1488340 | 1491239 | Columbina picui 115618 | CGT\|GTAAGTGGCT...TTTTTTTTAATT/TTTTTTTTAATT...TACAG\|CTT | 0 | 1 | 56.114 |
49389481 | GT-AG | 0 | 1.000000099473604e-05 | 947 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 4 | 1491323 | 1492269 | Columbina picui 115618 | AAG\|GTTTGTTTAT...TAAGTGGTAATT/TAAGTACTAAGT...TTCAG\|GGT | 2 | 1 | 58.514 |
49389482 | GT-AG | 0 | 1.000000099473604e-05 | 254 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 5 | 1492387 | 1492640 | Columbina picui 115618 | CAG\|GTCCTTCCAA...GAAATTTTGAAT/CTGTGTTTAAAT...TACAG\|GGC | 2 | 1 | 61.897 |
49389483 | GT-AG | 0 | 1.000000099473604e-05 | 1786 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 6 | 1492809 | 1494594 | Columbina picui 115618 | TGG\|GTGAGAATAT...CCATCTTTGTCT/CACTAATTGATT...TGCAG\|TTC | 2 | 1 | 66.753 |
49389484 | GT-AG | 0 | 1.000000099473604e-05 | 1440 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 7 | 1494698 | 1496137 | Columbina picui 115618 | CAG\|GTAAGACTGT...TAGTGTTTGACA/TAGTGTTTGACA...CACAG\|ATA | 0 | 1 | 69.731 |
49389485 | GT-AG | 0 | 1.000000099473604e-05 | 1717 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 8 | 1496214 | 1497930 | Columbina picui 115618 | ATG\|GTGAGTATTT...ATGTCTTTTACT/ATGTCTTTTACT...TGTAG\|CTT | 1 | 1 | 71.928 |
49389486 | GT-AG | 0 | 5.800581241575801e-05 | 447 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 9 | 1498073 | 1498519 | Columbina picui 115618 | TTT\|GTAAGTGTCT...TGGGCTTTGAAA/TTGTGATTTATT...ATTAG\|AAC | 2 | 1 | 76.034 |
49389487 | GT-AG | 0 | 1.000000099473604e-05 | 1177 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 10 | 1498629 | 1499805 | Columbina picui 115618 | AAG\|GTGAGAAAAC...ATGTTTTTATTT/AATGTTTTTATT...TAAAG\|TTT | 0 | 1 | 79.185 |
49389488 | GT-AG | 0 | 0.0007619753487938 | 566 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 11 | 1499983 | 1500548 | Columbina picui 115618 | AAG\|GTAACTTAAA...TTTTTGTTATTG/TTTTTTGTTATT...TAAAG\|ATG | 0 | 1 | 84.302 |
49389489 | GT-AG | 0 | 1.000000099473604e-05 | 687 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 12 | 1500647 | 1501333 | Columbina picui 115618 | ACG\|GTAATGACAC...TTTTTTTTAAAA/TTTTTTTTAAAA...TTCAG\|TGA | 2 | 1 | 87.135 |
49389490 | GT-AG | 0 | 1.000000099473604e-05 | 501 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 13 | 1501422 | 1501922 | Columbina picui 115618 | CAG\|GTACTGCATA...TTTTCCATGGCC/CAGTCTGTTACT...TGTAG\|ATG | 0 | 1 | 89.679 |
49389491 | GT-AG | 0 | 2.444657372425644e-05 | 811 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 14 | 1502046 | 1502856 | Columbina picui 115618 | GAG\|GTGCACAGCT...TGTGCTTTGACT/TGTGCTTTGACT...TGCAG\|GCA | 0 | 1 | 93.235 |
49389492 | GT-AG | 0 | 1.000000099473604e-05 | 1422 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 15 | 1502935 | 1504356 | Columbina picui 115618 | AGG\|GTAAGTAATT...TTAGTTTTGACA/TTAGTTTTGACA...TTTAG\|CAA | 0 | 1 | 95.49 |
49389493 | GT-AG | 0 | 1.000000099473604e-05 | 1436 | rna-gnl\|WGS:VYZG\|COLPIC_R04105_mrna 9114874 | 16 | 1504444 | 1505879 | Columbina picui 115618 | AAG\|GTAAGACACC...TTTTTTTTAATT/TTTTTTTTAATT...TGTAG\|GTA | 0 | 1 | 98.005 |
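The `phase` values in the table above can be cross-checked against the coordinates. Under the usual convention, an intron's phase is the number of coding bases upstream of it, modulo 3. The phase of each intron should therefore equal the previous intron's phase plus the length of the exon between them, modulo 3. The sketch below applies that rule to the 16 rows. It assumes inclusive coordinates, a forward-strand transcript, and that every exon between these introns is entirely coding (all rows have `in_cds = 1`). None of these assumptions is stated on the page, and `propagate_phases` is a hypothetical helper name.

```python
# (start, end, phase) for introns 1..16, copied from the table above.
INTRONS = [
    (1485238, 1486235, 0), (1486294, 1487176, 1),
    (1488340, 1491239, 0), (1491323, 1492269, 2),
    (1492387, 1492640, 2), (1492809, 1494594, 2),
    (1494698, 1496137, 0), (1496214, 1497930, 1),
    (1498073, 1498519, 2), (1498629, 1499805, 0),
    (1499983, 1500548, 0), (1500647, 1501333, 2),
    (1501422, 1501922, 0), (1502046, 1502856, 0),
    (1502935, 1504356, 0), (1504444, 1505879, 0),
]


def propagate_phases(introns):
    """Predict each intron's phase from the first intron's listed phase
    and the lengths of the exons separating consecutive introns."""
    phases = [introns[0][2]]
    for (_, prev_end, _), (next_start, _, _) in zip(introns, introns[1:]):
        # Exon between two introns, assuming inclusive coordinates.
        exon_len = next_start - prev_end - 1
        phases.append((phases[-1] + exon_len) % 3)
    return phases


predicted = propagate_phases(INTRONS)
listed = [phase for _, _, phase in INTRONS]

# True when the coordinates and the listed phases agree.
print(predicted == listed)
```

For a reverse-strand transcript the same idea would apply with the intron order and the exon-length calculation mirrored; that case is not covered here.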
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
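As an illustration of how this schema could be queried locally, the sketch below recreates the table in an in-memory SQLite database with Python's standard `sqlite3` module, loads two of the rows listed on this page, and runs the same `transcript_id = 9114874` filter. It is a minimal sketch, not the database's own tooling. The `transcripts` and `genomes` tables are not created, so foreign-key enforcement is switched off explicitly, and `scored_motifs` is left as NULL to keep the example short.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
    "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
    "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
    "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
    "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
# The referenced parent tables do not exist in this sketch.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)

# Two rows copied from this page; scored_motifs omitted (NULL) for brevity.
rows = [
    (49389478, "GT-AG", 0, 1.000000099473604e-05, 998, 9114874,
     1, 1485238, 1486235, 115618, None, 0, 1, 20.815),
    (49389479, "GT-AG", 0, 1.000000099473604e-05, 883, 9114874,
     2, 1486294, 1487176, 115618, None, 1, 1, 22.492),
]
placeholders = ", ".join("?" * 14)
conn.executemany(f"INSERT INTO introns VALUES ({placeholders})", rows)

# Same filter as the page, ordered by position within the transcript.
# "start" and "end" are quoted because END is an SQL keyword.
result = conn.execute(
    'SELECT id, ordinal_index, "start", "end", length '
    "FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (9114874,),
).fetchall()

for row in result:
    print(row)
```

The parameterized `?` placeholder keeps the transcript id out of the SQL string, and the `idx_introns_transcript_id` index is the one such a filter would be expected to use on the full table.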