introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
30 rows where transcript_id = 14614494
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 78406268 | GT-AG | 0 | 1.000000099473604e-05 | 1231 | rna-XM_016302366.1 14614494 | 1 | 92772290 | 92773520 | Ficedula albicollis 59894 | CAG|GTGGGTCCGG...CTCTCCGTGTTT/CGCCGGCTCACG...CGCAG|CTT | 1 | 1 | 2.189 |
| 78406269 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_016302366.1 14614494 | 2 | 92772138 | 92772213 | Ficedula albicollis 59894 | GCA|GTGAGTACGG...ACCTCCTTACGC/TCTTTGCTCACC...CGCAG|CCA | 2 | 1 | 4.016 |
| 78406270 | GT-AG | 0 | 1.000000099473604e-05 | 458 | rna-XM_016302366.1 14614494 | 3 | 92771621 | 92772078 | Ficedula albicollis 59894 | GCG|GTGGGTACTC...GTGCTCATGACT/GGAGTGCTCATG...CTCAG|TGG | 1 | 1 | 5.435 |
| 78406271 | GT-AG | 0 | 1.000000099473604e-05 | 539 | rna-XM_016302366.1 14614494 | 4 | 92770903 | 92771441 | Ficedula albicollis 59894 | AAG|GTAGGTGAGG...CACTGTTTAATC/ATGTGTCTCACT...TCTAG|AAC | 0 | 1 | 9.74 |
| 78406272 | GT-AG | 0 | 1.4542319420698746e-05 | 293 | rna-XM_016302366.1 14614494 | 5 | 92770470 | 92770762 | Ficedula albicollis 59894 | CAG|GTATGAGTTG...TCTGTCTTCTCT/CAAGTTGTGATA...TTTAG|CTT | 2 | 1 | 13.107 |
| 78406273 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_016302366.1 14614494 | 6 | 92770258 | 92770341 | Ficedula albicollis 59894 | TCA|GTGAGTTCCT...CTTTCCCTGACC/CTTTCCCTGACC...GAAAG|GTG | 1 | 1 | 16.186 |
| 78406274 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_016302366.1 14614494 | 7 | 92770053 | 92770133 | Ficedula albicollis 59894 | AAG|GTGAGAGAAA...TTCATCTTTCTG/CCTCGTTTCATC...TACAG|GGA | 2 | 1 | 19.168 |
| 78406275 | GT-AG | 0 | 0.0004444336170147 | 912 | rna-XM_016302366.1 14614494 | 8 | 92768993 | 92769904 | Ficedula albicollis 59894 | GAG|GTACGTTTTA...TATCTTTTGGTC/ACCTAGTTCAAC...GGCAG|AAT | 0 | 1 | 22.727 |
| 78406276 | GC-AG | 0 | 1.000000099473604e-05 | 720 | rna-XM_016302366.1 14614494 | 9 | 92768075 | 92768794 | Ficedula albicollis 59894 | AAG|GCAAGTTCGC...GCCTTCTTGATA/TTCTTGATAACT...TCCAG|GCC | 0 | 1 | 27.489 |
| 78406277 | GT-AG | 0 | 1.000000099473604e-05 | 540 | rna-XM_016302366.1 14614494 | 10 | 92767400 | 92767939 | Ficedula albicollis 59894 | AAG|GTAATAGCGA...ATTCTGGTATTT/ATATTTCTTATC...TTCAG|CTA | 0 | 1 | 30.736 |
| 78406278 | GT-AG | 0 | 1.000000099473604e-05 | 608 | rna-XM_016302366.1 14614494 | 11 | 92766707 | 92767314 | Ficedula albicollis 59894 | CAG|GTAATTTCCT...TTCCCTCTGATT/GTGTTGTTCATT...TTAAG|TAG | 1 | 1 | 32.78 |
| 78406279 | GT-AG | 0 | 1.000000099473604e-05 | 196 | rna-XM_016302366.1 14614494 | 12 | 92766333 | 92766528 | Ficedula albicollis 59894 | CAA|GTAAGTAAAC...TCTGCTCTGATG/GTGTTGCTGATA...TCCAG|GAG | 2 | 1 | 37.061 |
| 78406280 | GT-AG | 0 | 5.230544180262696e-05 | 947 | rna-XM_016302366.1 14614494 | 13 | 92765264 | 92766210 | Ficedula albicollis 59894 | CAG|GTAACTGTAT...CTCTCTTCAGTA/GTACTGCTCACA...TGTAG|GTT | 1 | 1 | 39.995 |
| 78406281 | GT-AG | 0 | 1.000000099473604e-05 | 1933 | rna-XM_016302366.1 14614494 | 14 | 92763121 | 92765053 | Ficedula albicollis 59894 | CAG|GTATGTGACA...AGTATCACAACT/TGCAGTATCACA...TGCAG|TGG | 1 | 1 | 45.046 |
| 78406282 | GT-AG | 0 | 0.0002115849927165 | 1510 | rna-XM_016302366.1 14614494 | 15 | 92761436 | 92762945 | Ficedula albicollis 59894 | GAG|GTATGGATCT...TTTTCTTTACCT/AATTTTTTTATT...CAAAG|GGC | 2 | 1 | 49.254 |
| 78406283 | GT-AG | 0 | 1.000000099473604e-05 | 130 | rna-XM_016302366.1 14614494 | 16 | 92761221 | 92761350 | Ficedula albicollis 59894 | ATA|GTGAGTATTT...AATTTTTTAATT/AATTTTTTAATT...TGCAG|ATC | 0 | 1 | 51.299 |
| 78406284 | GT-AG | 0 | 1.000000099473604e-05 | 994 | rna-XM_016302366.1 14614494 | 17 | 92760093 | 92761086 | Ficedula albicollis 59894 | ACG|GTGAGCCTGG...GTCCCCATACCC/CAGCATCTCATG...CACAG|AGG | 2 | 1 | 54.521 |
| 78406285 | GT-AG | 0 | 1.000000099473604e-05 | 1531 | rna-XM_016302366.1 14614494 | 18 | 92758429 | 92759959 | Ficedula albicollis 59894 | AAG|GTAAGGCTTG...CACCCGTTGTCT/GAGTAACTCACC...TGCAG|CCA | 0 | 1 | 57.72 |
| 78406286 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_016302366.1 14614494 | 19 | 92758240 | 92758343 | Ficedula albicollis 59894 | AAG|GTGAGCCCAG...CCTGTGTTATCA/TGTGTTATCATT...ATCAG|GCT | 1 | 1 | 59.764 |
| 78406287 | GT-AG | 0 | 1.000000099473604e-05 | 238 | rna-XM_016302366.1 14614494 | 20 | 92757840 | 92758077 | Ficedula albicollis 59894 | CTG|GTGAGGAAAA...GTGGTCTAGATC/ACTGTGGTCATT...TTTAG|ATG | 1 | 1 | 63.66 |
| 78406288 | GT-AG | 0 | 1.000000099473604e-05 | 1459 | rna-XM_016302366.1 14614494 | 21 | 92756166 | 92757624 | Ficedula albicollis 59894 | CAG|GTGGGAACCA...TCTTTCTTACAG/GTCTTTCTTACA...CACAG|AGC | 0 | 1 | 68.831 |
| 78406289 | GT-AG | 0 | 1.000000099473604e-05 | 302 | rna-XM_016302366.1 14614494 | 22 | 92755758 | 92756059 | Ficedula albicollis 59894 | ATG|GTAAGTGTGC...CTGTCTTTATCC/CTCATGCTCACT...TCTAG|GGA | 1 | 1 | 71.38 |
| 78406290 | GT-AG | 0 | 1.146690514137187e-05 | 219 | rna-XM_016302366.1 14614494 | 23 | 92755415 | 92755633 | Ficedula albicollis 59894 | CAG|GTACTGCTGA...TATCCCTTCTTT/TTGCTATCTATC...GTTAG|CTC | 2 | 1 | 74.363 |
| 78406291 | GT-AG | 0 | 1.000000099473604e-05 | 206 | rna-XM_016302366.1 14614494 | 24 | 92755053 | 92755258 | Ficedula albicollis 59894 | CAG|GTAATTCTAC...ACTCCCTTGCTG/TCTGTTGTAACT...GTCAG|GTT | 2 | 1 | 78.114 |
| 78406292 | GC-AG | 0 | 1.000000099473604e-05 | 1010 | rna-XM_016302366.1 14614494 | 25 | 92753865 | 92754874 | Ficedula albicollis 59894 | AAG|GCAAGTGTAT...TATCCTTTATAT/TTATATCTCATG...CACAG|GGT | 0 | 1 | 82.395 |
| 78406293 | GT-AG | 0 | 0.0168866158591919 | 2034 | rna-XM_016302366.1 14614494 | 26 | 92751736 | 92753769 | Ficedula albicollis 59894 | GAG|GTATTCTGAG...TGGGCTCTAATC/TGGGCTCTAATC...TGCAG|ACA | 2 | 1 | 84.68 |
| 78406294 | GT-AG | 0 | 1.000000099473604e-05 | 2074 | rna-XM_016302366.1 14614494 | 27 | 92749581 | 92751654 | Ficedula albicollis 59894 | CAG|GTGGGTTGTA...TGATCTTTGTCT/TAGGAGCTAATG...CCCAG|TAC | 2 | 1 | 86.628 |
| 78406295 | GT-AG | 0 | 1.000000099473604e-05 | 232 | rna-XM_016302366.1 14614494 | 28 | 92749162 | 92749393 | Ficedula albicollis 59894 | AAA|GTAAGGGGGT...CCTTCTTTGTTT/TGCTGTTTTACA...ATCAG|GCC | 0 | 1 | 91.126 |
| 78406296 | GT-AG | 0 | 0.0192838416539431 | 131 | rna-XM_016302366.1 14614494 | 29 | 92748904 | 92749034 | Ficedula albicollis 59894 | GAA|GTATGTTCCT...TTTCCCTTTTTC/CCTTTGTCTATT...GACAG|GTT | 1 | 1 | 94.18 |
| 78406297 | GT-AG | 0 | 1.000000099473604e-05 | 164 | rna-XM_016302366.1 14614494 | 30 | 92748581 | 92748744 | Ficedula albicollis 59894 | ATG|GTAAGGCACT...TCATTATTGACA/TCATTATTGACA...TTTAG|AGC | 1 | 1 | 98.004 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);