introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
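The column descriptions above imply that introns outside the coding sequence carry a null phase, so any grouping by phase has to account for a NULL bucket. The following is a minimal sketch using Python's standard sqlite3 module. It assumes an in-memory table with only a subset of the columns, and the four inserted rows are hypothetical values made up for illustration, not records from the database.

```python
import sqlite3

# In-memory stand-in for the introns table (subset of columns only).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        is_minor INTEGER,
        score REAL,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# Hypothetical rows for illustration only; not taken from the database.
conn.executemany(
    "INSERT INTO introns (id, is_minor, score, phase, in_cds) VALUES (?, ?, ?, ?, ?)",
    [
        (1, 0, 0.01, 0, 1),
        (2, 0, 0.02, 1, 1),
        (3, 1, 99.5, 1, 1),
        (4, 0, 0.03, None, 0),  # intron outside the CDS: phase is NULL
    ],
)

# Count introns and minor introns per phase; SQLite sorts NULL first.
phase_counts = conn.execute(
    """
    SELECT phase, COUNT(*) AS n, SUM(is_minor) AS n_minor
    FROM introns
    GROUP BY phase
    ORDER BY phase
    """
).fetchall()

for phase, n, n_minor in phase_counts:
    print(phase, n, n_minor)
```

Because is_minor is stored as 0 or 1, SUM(is_minor) doubles as a count of minor introns in each group.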
28 rows where transcript_id = 19079896
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 101756152 | GT-AG | 0 | 1.000000099473604e-05 | 1119 | rna-XM_042867781.1 19079896 | 1 | 20690658 | 20691776 | Lagopus leucura 30410 | AGA\|GTAAGTAACT...TGCTCCTTTCTT/ACCATACTTATG...TCCAG\|CGG | 1 | 1 | 2.77 |
| 101756153 | GT-AG | 0 | 0.0001564534783875 | 923 | rna-XM_042867781.1 19079896 | 2 | 20691911 | 20692833 | Lagopus leucura 30410 | AGT\|GTAAGTATTT...AAATGCTTAATA/AATGTGTTAAAT...TACAG\|AAA | 0 | 1 | 5.997 |
| 101756154 | GT-AG | 0 | 1.000000099473604e-05 | 328 | rna-XM_042867781.1 19079896 | 3 | 20692921 | 20693248 | Lagopus leucura 30410 | AAG\|GTAAACCAAA...AAGTTTATATTT/TAGAAGTTTATA...TTTAG\|GAA | 0 | 1 | 8.092 |
| 101756155 | GT-AG | 0 | 1.000000099473604e-05 | 1239 | rna-XM_042867781.1 19079896 | 4 | 20694402 | 20695640 | Lagopus leucura 30410 | CAG\|GTTGGTATGA...AATCCTTTGATA/TTTCCTTTCACA...TGCAG\|GAG | 1 | 1 | 35.862 |
| 101756156 | GT-AG | 0 | 0.0006108242257541 | 132 | rna-XM_042867781.1 19079896 | 5 | 20695857 | 20695988 | Lagopus leucura 30410 | ATG\|GTATGTGCAA...TCTTCCTTACAA/TTCTTCCTTACA...ACCAG\|ATG | 1 | 1 | 41.065 |
| 101756157 | GT-AG | 0 | 1.000000099473604e-05 | 2397 | rna-XM_042867781.1 19079896 | 6 | 20696090 | 20698486 | Lagopus leucura 30410 | GAG\|GTAAGAGTTG...TCAATTTTATTT/AAAATATTCACC...TTCAG\|GCT | 0 | 1 | 43.497 |
| 101756158 | GT-AG | 0 | 3.821789844144493e-05 | 1502 | rna-XM_042867781.1 19079896 | 7 | 20698619 | 20700120 | Lagopus leucura 30410 | TCA\|GTAAGTGTTC...TTTTCTTTTTCT/GAGTAATTAATC...TTCAG\|GTT | 0 | 1 | 46.676 |
| 101756159 | GT-AG | 0 | 0.0001095963395125 | 103 | rna-XM_042867781.1 19079896 | 8 | 20700143 | 20700245 | Lagopus leucura 30410 | TAG\|GTAAGCTTAT...TACTTTTTATTG/CTACTTTTTATT...TTTAG\|GAA | 1 | 1 | 47.206 |
| 101756160 | GT-AG | 0 | 0.0194604180368123 | 202 | rna-XM_042867781.1 19079896 | 9 | 20700335 | 20700536 | Lagopus leucura 30410 | GAG\|GTAACCTAAC...AATTTTTTATAT/CAATTTTTTATA...TTCAG\|TTG | 0 | 1 | 49.35 |
| 101756161 | GT-AG | 0 | 1.000000099473604e-05 | 570 | rna-XM_042867781.1 19079896 | 10 | 20700606 | 20701175 | Lagopus leucura 30410 | GAG\|GTAAAAAAAA...AGTGTGTTAACT/AGTGTGTTAACT...TTTAG\|GAA | 0 | 1 | 51.012 |
| 101756162 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_042867781.1 19079896 | 11 | 20701287 | 20701467 | Lagopus leucura 30410 | TTG\|GTAAGGATAG...AGTCTATTGAAG/AAAATAATTATT...TTCAG\|GTG | 0 | 1 | 53.685 |
| 101756163 | GT-AG | 0 | 0.0007637267729739 | 414 | rna-XM_042867781.1 19079896 | 12 | 20701540 | 20701953 | Lagopus leucura 30410 | GAG\|GTATGTCTTA...TATATCTTCACA/TATATCTTCACA...TGTAG\|GCC | 0 | 1 | 55.419 |
| 101756164 | GT-AG | 0 | 1.000000099473604e-05 | 2527 | rna-XM_042867781.1 19079896 | 13 | 20702059 | 20704585 | Lagopus leucura 30410 | CAG\|GTAAGTTCCT...ATCACCCTAACC/TTGGTTCTGAGA...TTCAG\|CTT | 0 | 1 | 57.948 |
| 101756165 | GT-AG | 0 | 1.744056401492157e-05 | 536 | rna-XM_042867781.1 19079896 | 14 | 20704709 | 20705244 | Lagopus leucura 30410 | AAG\|GTAAATTATG...TTGTACTTATTT/TTTGTACTTATT...CTTAG\|TCA | 0 | 1 | 60.91 |
| 101756166 | GT-AG | 0 | 6.194556836678117e-05 | 687 | rna-XM_042867781.1 19079896 | 15 | 20705369 | 20706055 | Lagopus leucura 30410 | TAG\|GTATGACATG...TGTGCTTTATAT/CTGTGCTTTATA...ATCAG\|ATG | 1 | 1 | 63.897 |
| 101756167 | GT-AG | 0 | 4.1307307773063047e-05 | 204 | rna-XM_042867781.1 19079896 | 16 | 20706103 | 20706306 | Lagopus leucura 30410 | AAG\|GTATGTGATT...GTTTTTTTCATT/GTTTTTTTCATT...CGTAG\|TTA | 0 | 1 | 65.029 |
| 101756168 | GT-AG | 0 | 1.000000099473604e-05 | 1217 | rna-XM_042867781.1 19079896 | 17 | 20706398 | 20707614 | Lagopus leucura 30410 | CAG\|GTAATTTAAT...AATTTTATGACC/CTGTATCTAATT...GACAG\|AGC | 1 | 1 | 67.221 |
| 101756169 | GT-AG | 0 | 1.000000099473604e-05 | 913 | rna-XM_042867781.1 19079896 | 18 | 20707754 | 20708666 | Lagopus leucura 30410 | CAG\|GTGATGGTTC...TTAACCTTAGTT/GAATTGTTGACT...TAAAG\|AAA | 2 | 1 | 70.568 |
| 101756170 | GT-AG | 0 | 1.000000099473604e-05 | 1074 | rna-XM_042867781.1 19079896 | 19 | 20708769 | 20709842 | Lagopus leucura 30410 | CAG\|GTAATGACAA...TTTTTCTCAGCA/TTTTTTCTCAGC...TACAG\|ACA | 2 | 1 | 73.025 |
| 101756171 | GT-AG | 0 | 5.064114425481704e-05 | 320 | rna-XM_042867781.1 19079896 | 20 | 20709904 | 20710223 | Lagopus leucura 30410 | CAG\|GTATGTGCTT...TGCTCTTCAGTA/ATGCTCTTCAGT...CACAG\|ATT | 0 | 1 | 74.494 |
| 101756172 | GT-AG | 0 | 0.0001267839313335 | 806 | rna-XM_042867781.1 19079896 | 21 | 20710260 | 20711065 | Lagopus leucura 30410 | TGG\|GTAAGCTTAA...AAGGATTTAATT/ATTTAATTAATA...GGAAG\|CTG | 0 | 1 | 75.361 |
| 101756173 | GT-AG | 0 | 1.000000099473604e-05 | 658 | rna-XM_042867781.1 19079896 | 22 | 20711134 | 20711791 | Lagopus leucura 30410 | AAA\|GTAAGTGCAG...CTTTCTTTGTTT/TCTTTGTTTATT...TACAG\|ATT | 2 | 1 | 76.999 |
| 101756174 | GT-AG | 0 | 0.0001541268263402 | 346 | rna-XM_042867781.1 19079896 | 23 | 20711872 | 20712217 | Lagopus leucura 30410 | GAG\|GTACAGTATT...TTTGTTTTAACT/TTTGTTTTAACT...TCCAG\|AGC | 1 | 1 | 78.926 |
| 101756175 | GT-AG | 0 | 2.032389547825335e-05 | 1847 | rna-XM_042867781.1 19079896 | 24 | 20712350 | 20714196 | Lagopus leucura 30410 | GAG\|GTACAATTCT...GCTATCTGATTT/AGCTATCTGATT...ATTAG\|GAT | 1 | 1 | 82.105 |
| 101756176 | GT-AG | 0 | 1.000000099473604e-05 | 654 | rna-XM_042867781.1 19079896 | 25 | 20714356 | 20715009 | Lagopus leucura 30410 | CAG\|GTACAGAAAT...TTTTTTTTTTCC/TTTTTTTCCACC...AATAG\|GTG | 1 | 1 | 85.934 |
| 101756177 | GT-AG | 0 | 1.000000099473604e-05 | 1025 | rna-XM_042867781.1 19079896 | 26 | 20715142 | 20716166 | Lagopus leucura 30410 | CAG\|GTGGGAATTA...ATTTGTTTAACT/ATTTGTTTAACT...TACAG\|ATG | 1 | 1 | 89.114 |
| 101756178 | GT-AG | 0 | 3.156215651354511e-05 | 382 | rna-XM_042867781.1 19079896 | 27 | 20716244 | 20716625 | Lagopus leucura 30410 | AGC\|GTAAGTAGCT...CCTGTTTTAATG/TGTTCTCTGACC...TACAG\|AAC | 0 | 1 | 90.968 |
| 101756179 | GT-AG | 0 | 0.0025397189346959 | 796 | rna-XM_042867781.1 19079896 | 28 | 20716849 | 20717644 | Lagopus leucura 30410 | CAA\|GTATGTAACT...CTTTCCTTGTCT/TTGGTATTAACT...CACAG\|TGA | 1 | 1 | 96.339 |
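In the rows listed here, length appears to equal end - start + 1, which suggests the coordinates are inclusive on both ends. That is an observation from the displayed values rather than a documented guarantee. The sketch below checks it for the first three rows of this transcript using Python's standard sqlite3 module and a reduced in-memory table (an assumption for illustration). Note that end is an SQL keyword and has to be quoted in queries.

```python
import sqlite3

# Reduced in-memory copy of the introns table (four columns only).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    'id INTEGER PRIMARY KEY, length INTEGER, start INTEGER, "end" INTEGER)'
)

# First three introns of transcript 19079896, copied from the listing.
rows = [
    (101756152, 1119, 20690658, 20691776),
    (101756153, 923, 20691911, 20692833),
    (101756154, 328, 20692921, 20693248),
]
conn.executemany(
    'INSERT INTO introns (id, length, start, "end") VALUES (?, ?, ?, ?)', rows
)

# Any row returned here would contradict the inclusive-coordinate reading.
mismatches = conn.execute(
    'SELECT id FROM introns WHERE length != "end" - start + 1'
).fetchall()

print(mismatches)
```

An empty result means every checked row is consistent with inclusive start and end positions.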
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
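The index on transcript_id is what makes a filter such as transcript_id = 19079896 cheap. As a rough sketch, the snippet below builds a reduced version of the table (three columns, no foreign keys, an assumption for illustration) in an in-memory SQLite database and asks the planner how it would run that filter. The exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# Reduced schema: a few columns plus the transcript_id index from above.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "transcript_id" INTEGER,
        "score" REAL,
        PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Ask SQLite for its query plan; the last column of each row is the detail text.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (19079896,),
).fetchall()
plan_text = " ".join(str(row[-1]) for row in plan)

uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
```

If uses_index is true, the planner searches the index instead of scanning the whole table.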