introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotides of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is classified as a minor intron, 0 otherwise
- score: REAL, probability that the intron is minor, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2); null for introns outside the coding sequence
- in_cds: INTEGER, 1 if the intron lies within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
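The column descriptions above can be mirrored as a typed record when rows are loaded into a script. The following is a minimal sketch, not part of the database itself: the names `IntronRow` and `check_row` are hypothetical, and the checks only encode constraints stated in the column descriptions (0/1 flags, a 0-100 score, phase of 0, 1, 2 or null, and a null phase for introns outside the coding sequence).

```python
from dataclasses import dataclass, replace
from typing import List, Optional


@dataclass(frozen=True)
class IntronRow:
    """One row of the introns table (hypothetical helper type)."""
    id: int
    dinucleotide_pair: str
    is_minor: int
    score: float
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]  # None for introns outside the coding sequence
    in_cds: int
    relative_position: float


def check_row(row: IntronRow) -> List[str]:
    """Return a list of violations of the documented column constraints."""
    problems = []
    if row.is_minor not in (0, 1):
        problems.append("is_minor must be 0 or 1")
    if row.in_cds not in (0, 1):
        problems.append("in_cds must be 0 or 1")
    if not 0.0 <= row.score <= 100.0:
        problems.append("score must lie in 0-100")
    if row.phase not in (None, 0, 1, 2):
        problems.append("phase must be 0, 1, 2 or null")
    if row.in_cds == 0 and row.phase is not None:
        problems.append("phase should be null outside the coding sequence")
    if row.ordinal_index < 1:
        problems.append("ordinal_index starts at 1")
    return problems


# Values copied from the first intron of transcript 9114817.
example = IntronRow(
    id=49388905, dinucleotide_pair="GT-AG", is_minor=0,
    score=1.000000099473604e-05, length=1856, transcript_id=9114817,
    ordinal_index=1, start=2888724, end=2890579, taxonomy_id=115618,
    scored_motifs="AAG|GTAAAAAATG...TTCTCTTTACTT/TGTTTGTTCACT...AACAG|TAG",
    phase=1, in_cds=1, relative_position=4.066,
)

problems_ok = check_row(example)
# A deliberately inconsistent copy: non-CDS intron that still carries a phase.
problems_bad = check_row(replace(example, in_cds=0))
```

Here `problems_ok` is empty, while `problems_bad` reports the phase/in_cds mismatch.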
14 rows where transcript_id = 9114817
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49388905 | GT-AG | 0 | 1.000000099473604e-05 | 1856 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 1 | 2888724 | 2890579 | Columbina picui 115618 | AAG\|GTAAAAAATG...TTCTCTTTACTT/TGTTTGTTCACT...AACAG\|TAG | 1 | 1 | 4.066 |
49388906 | GT-AG | 0 | 1.000000099473604e-05 | 836 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 2 | 2886657 | 2887492 | Columbina picui 115618 | AAG\|GTAGAACATT...TACTTCTTGATC/ATGGTATTTACT...CACAG\|GAC | 2 | 1 | 27.131 |
49388907 | GT-AG | 0 | 1.000000099473604e-05 | 1165 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 3 | 2885367 | 2886531 | Columbina picui 115618 | GAG\|GTGAGAAATA...CATAATTTAACT/CATAATTTAACT...TTCAG\|CCA | 1 | 1 | 29.473 |
49388908 | GT-AG | 0 | 1.000000099473604e-05 | 1686 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 4 | 2883592 | 2885277 | Columbina picui 115618 | TTG\|GTTAGTATTA...TGTTTTGTATCT/AAGTGGGTGATT...TTCAG\|GCT | 0 | 1 | 31.141 |
49388909 | GT-AG | 0 | 1.000000099473604e-05 | 1875 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 5 | 2881640 | 2883514 | Columbina picui 115618 | AAG\|GTAAGAATGT...TCATTGTAATAT/CATTGTAATATA...CATAG\|GCG | 2 | 1 | 32.584 |
49388910 | GT-AG | 0 | 1.000000099473604e-05 | 1137 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 6 | 2880347 | 2881483 | Columbina picui 115618 | GAG\|GTTCGATTCA...TCTTTCTTTTTT/AATGTGTTTATC...TTCAG\|AAA | 2 | 1 | 35.507 |
49388911 | GT-AG | 0 | 1.000000099473604e-05 | 1347 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 7 | 2876670 | 2878016 | Columbina picui 115618 | CAG\|GTATGGGCAT...AATGCTTTAATG/AATGCTTTAATG...ATAAG\|GAT | 1 | 1 | 79.164 |
49388912 | GT-AG | 0 | 1.000000099473604e-05 | 727 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 8 | 2875857 | 2876583 | Columbina picui 115618 | CTG\|GTTAGTAGAT...AAATTTATAGCT/AGCTTGCTTATA...TTTAG\|AAG | 0 | 1 | 80.776 |
49388913 | GT-AG | 0 | 1.1278384599281006e-05 | 1025 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 9 | 2874789 | 2875813 | Columbina picui 115618 | AAG\|GTACGTCATG...TCATTTTTATTG/CTATTTCTCACC...TGAAG\|GTC | 1 | 1 | 81.581 |
49388914 | GT-AG | 0 | 7.323241537102422e-05 | 1446 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 10 | 2873098 | 2874543 | Columbina picui 115618 | CTG\|GTATGGATAA...TTTTCCTTTGTC/TTTGTTCTAATC...GCTAG\|AAT | 0 | 1 | 86.172 |
49388915 | GT-AG | 0 | 1.000000099473604e-05 | 3139 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 11 | 2869840 | 2872978 | Columbina picui 115618 | CAA\|GTAAGAAATC...CAGATCTTACTT/ATCTTACTTATC...TACAG\|CTA | 2 | 1 | 88.402 |
49388916 | GT-AG | 0 | 1.001922371502261 | 105 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 12 | 2869596 | 2869700 | Columbina picui 115618 | AAG\|GTATCTTTAT...CTAGCTTCAACA/CCTAGCTTCAAC...TACAG\|GCA | 0 | 1 | 91.006 |
49388917 | GT-AG | 0 | 1.000000099473604e-05 | 2205 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 13 | 2867205 | 2869409 | Columbina picui 115618 | CAG\|GTAAACATAA...CTGTGCTTATGA/GCTGTGCTTATG...TAAAG\|GGT | 0 | 1 | 94.491 |
49388918 | GT-AG | 0 | 1.000000099473604e-05 | 1786 | rna-gnl\|WGS:VYZG\|COLPIC_R05515_mrna 9114817 | 14 | 2865250 | 2867035 | Columbina picui 115618 | AAG\|GTAGGAATAA...ACGATTTTAACC/ACGATTTTAACC...TACAG\|AGT | 1 | 1 | 97.658 |
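The coordinate columns in the rows above can be cross-checked against `length`. The sketch below uses a handful of (ordinal_index, start, end, length) values copied from the table and assumes, as the numbers suggest but the page does not state, that `start` and `end` are 1-based inclusive. It also checks that `start` decreases as `ordinal_index` increases, which would be consistent with a transcript on the reverse strand; strand is not a column in this table, so that reading is an inference.

```python
# (ordinal_index, start, end, length), copied from the result rows above
introns = [
    (1, 2888724, 2890579, 1856),
    (2, 2886657, 2887492, 836),
    (3, 2885367, 2886531, 1165),
    (12, 2869596, 2869700, 105),
    (14, 2865250, 2867035, 1786),
]


def span(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based inclusive)."""
    return end - start + 1


# True if every listed length equals end - start + 1.
lengths_match = all(span(s, e) == n for _, s, e, n in introns)

# True if genomic start strictly decreases with increasing ordinal_index.
starts = [s for _, s, _, _ in introns]
descending = all(a > b for a, b in zip(starts, starts[1:]))

print(lengths_match, descending)
```

For the sampled rows both checks hold.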
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
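To experiment with this schema locally, the table can be recreated in an in-memory SQLite database with Python's standard `sqlite3` module. This is a sketch under stated assumptions: the foreign-key clauses and all indexes except `idx_introns_transcript_id` are omitted because the `transcripts` and `genomes` tables are not shown here, and only three of the 14 rows are loaded, with the `score`, `scored_motifs` and `relative_position` columns left null for brevity.

```python
import sqlite3

# Reduced copy of the schema: no foreign keys, one index (see lead-in).
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
  "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Three rows copied from the table for transcript 9114817.
rows = [
    (49388905, "GT-AG", 0, 1856, 9114817, 1, 2888724, 2890579, 115618, 1, 1),
    (49388906, "GT-AG", 0, 836, 9114817, 2, 2886657, 2887492, 115618, 2, 1),
    (49388916, "GT-AG", 0, 105, 9114817, 12, 2869596, 2869700, 115618, 0, 1),
]
conn.executemany(
    'INSERT INTO introns (id, dinucleotide_pair, is_minor, length, '
    'transcript_id, ordinal_index, "start", "end", taxonomy_id, phase, in_cds) '
    "VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    rows,
)

# The same filter the page applies: all introns of one transcript, in order.
query = (
    'SELECT ordinal_index, "start", "end", length FROM introns '
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
result = conn.execute(query, (9114817,)).fetchall()

# Inspect how SQLite plans the lookup; the exact wording varies by version.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (9114817,)).fetchall()

for row in result:
    print(row)
```

`end` is an SQL keyword, so it is quoted in the statements above; `start` is quoted only for symmetry.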