introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
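The column descriptions above map directly onto simple queries. The following is a minimal sketch, assuming a reduced in-memory copy of the table (only seven of its columns) populated with three rows from this page; the helper name `phase_counts` is hypothetical and not part of the database.

```python
import sqlite3

# Assumption: a reduced, in-memory stand-in for the introns table so the
# sketch runs without the real database file.
conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE introns (
           id INTEGER PRIMARY KEY,
           is_minor INTEGER,
           score REAL,
           transcript_id INTEGER,
           ordinal_index INTEGER,
           phase INTEGER,
           in_cds INTEGER
       )"""
)

# Three rows copied from this page (introns 1, 4 and 5 of the transcript).
rows = [
    (167376042, 0, 0.000753519375129, 29894885, 1, 0, 1),
    (167376045, 0, 1.000000099473604e-05, 29894885, 4, 2, 1),
    (167376046, 0, 1.000000099473604e-05, 29894885, 5, 1, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", rows)


def phase_counts(conn, transcript_id):
    """Count coding-sequence introns per phase for one transcript."""
    query = """
        SELECT phase, COUNT(*)
        FROM introns
        WHERE transcript_id = ? AND in_cds = 1
        GROUP BY phase
        ORDER BY phase
    """
    return dict(conn.execute(query, (transcript_id,)).fetchall())


counts = phase_counts(conn, 29894885)
print(counts)
```

Filtering on `in_cds = 1` keeps null-phase introns (those outside the coding sequence) out of the tally.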
21 rows where transcript_id = 29894885
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 167376042 | GT-AG | 0 | 0.000753519375129 | 72 | rna-XM_009334033.1 29894885 | 1 | 1032399 | 1032470 | Pygoscelis adeliae 9238 | GAG\|GTTTGCTCCA...GAAGCCTTCACT/GAAGCCTTCACT...CACAG\|GAT | 0 | 1 | 1.042 |
| 167376043 | GT-AG | 0 | 0.0005240712744611 | 238 | rna-XM_009334033.1 29894885 | 2 | 1032537 | 1032774 | Pygoscelis adeliae 9238 | CCG\|GTAAGCTTTG...GCCACCTCGATA/CCTCGATAAATT...GGCAG\|GAT | 0 | 1 | 3.588 |
| 167376044 | GT-AG | 0 | 1.000000099473604e-05 | 277 | rna-XM_009334033.1 29894885 | 3 | 1032892 | 1033168 | Pygoscelis adeliae 9238 | AAG\|GTGAGGGGCT...CAGCCCTTCTCT/ATCGTTCTAACA...ACTAG\|AAA | 0 | 1 | 8.102 |
| 167376045 | GT-AG | 0 | 1.000000099473604e-05 | 305 | rna-XM_009334033.1 29894885 | 4 | 1033306 | 1033610 | Pygoscelis adeliae 9238 | CAG\|GTGGGTGCTG...CAGCCTTTAACC/CAGCCTTTAACC...CTCAG\|GAT | 2 | 1 | 13.387 |
| 167376046 | GT-AG | 0 | 1.000000099473604e-05 | 589 | rna-XM_009334033.1 29894885 | 5 | 1033661 | 1034249 | Pygoscelis adeliae 9238 | GCG\|GTAAGGGAAC...GCTTCACTAATC/CAAAGCTTCACT...ACCAG\|AGA | 1 | 1 | 15.316 |
| 167376047 | GT-AG | 0 | 1.000000099473604e-05 | 210 | rna-XM_009334033.1 29894885 | 6 | 1034309 | 1034518 | Pygoscelis adeliae 9238 | GAG\|GTGAGGCTGA...GTTGCCCTAATA/GTTGCCCTAATA...CCCAG\|AAA | 0 | 1 | 17.593 |
| 167376048 | GT-AG | 0 | 1.000000099473604e-05 | 1808 | rna-XM_009334033.1 29894885 | 7 | 1034602 | 1036409 | Pygoscelis adeliae 9238 | CAG\|GTAACACAGG...CACATCTCATCT/GCACATCTCATC...TCCAG\|GTG | 2 | 1 | 20.795 |
| 167376049 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_009334033.1 29894885 | 8 | 1036545 | 1036655 | Pygoscelis adeliae 9238 | GAG\|GTAAGGCAGT...TTCCCCTTTGCT/GTGGCTTTGAGA...TCTAG\|CAA | 2 | 1 | 26.003 |
| 167376050 | GT-AG | 0 | 1.000000099473604e-05 | 1091 | rna-XM_009334033.1 29894885 | 10 | 1037244 | 1038334 | Pygoscelis adeliae 9238 | GGG\|GTAGGTGTCA...TTTCTCTCATTT/CTTTCTCTCATT...CACAG\|CCA | 1 | 1 | 32.099 |
| 167376051 | GT-AG | 0 | 1.000000099473604e-05 | 431 | rna-XM_009334033.1 29894885 | 11 | 1038418 | 1038848 | Pygoscelis adeliae 9238 | GAG\|GTGAGGATGC...GGATCCTTTCTT/TTCTGCTGGATC...TAAAG\|GAG | 0 | 1 | 35.301 |
| 167376052 | GT-AG | 0 | 1.000000099473604e-05 | 547 | rna-XM_009334033.1 29894885 | 12 | 1038954 | 1039500 | Pygoscelis adeliae 9238 | ATG\|GTACGTGCCG...AGCGTCTCACCA/CAGCGTCTCACC...CACAG\|ATT | 0 | 1 | 39.352 |
| 167376053 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_009334033.1 29894885 | 13 | 1039666 | 1039756 | Pygoscelis adeliae 9238 | AAG\|GTGCGGAGTG...CTCATTTTATAC/CCCCCTCTCATT...TGCAG\|GTC | 0 | 1 | 45.718 |
| 167376054 | GT-AG | 0 | 1.000000099473604e-05 | 573 | rna-XM_009334033.1 29894885 | 14 | 1039794 | 1040366 | Pygoscelis adeliae 9238 | TAG\|GTAAGGGAAA...TGGGCCTTGCTC/CGCTCTGTGACC...CCCAG\|AGA | 1 | 1 | 47.145 |
| 167376055 | GT-AG | 0 | 1.000000099473604e-05 | 307 | rna-XM_009334033.1 29894885 | 15 | 1040479 | 1040785 | Pygoscelis adeliae 9238 | AAG\|GTGAGTGGTG...TTGTTCTTGCTT/GTGGGGCTCAAC...CTCAG\|GAT | 2 | 1 | 51.466 |
| 167376056 | GT-AG | 0 | 1.000000099473604e-05 | 346 | rna-XM_009334033.1 29894885 | 16 | 1040844 | 1041189 | Pygoscelis adeliae 9238 | GAG\|GTGAGGGACT...CTCCTCTCATCC/TCTCCTCTCATC...CACAG\|TGC | 0 | 1 | 53.704 |
| 167376057 | GT-AG | 0 | 0.0042421180928432 | 204 | rna-XM_009334033.1 29894885 | 17 | 1041288 | 1041491 | Pygoscelis adeliae 9238 | CCC\|GTAAGCTTTC...TCAGCCCTACTG/GCCGTACAAATT...TGCAG\|CTG | 2 | 1 | 57.485 |
| 167376058 | GT-AG | 0 | 7.604455922335265e-05 | 532 | rna-XM_009334033.1 29894885 | 18 | 1041659 | 1042190 | Pygoscelis adeliae 9238 | TGG\|GTATGTAGGG...GGTGGCTTCACA/GGTGGCTTCACA...CGCAG\|TAA | 1 | 1 | 63.927 |
| 167376059 | GT-AG | 0 | 1.000000099473604e-05 | 329 | rna-XM_009334033.1 29894885 | 19 | 1042344 | 1042672 | Pygoscelis adeliae 9238 | TCG\|GTGAGATGGG...GGGGCCTCACTG/AGGGGCCTCACT...CGCAG\|GCA | 1 | 1 | 69.83 |
| 167376060 | GT-AG | 0 | 1.000000099473604e-05 | 678 | rna-XM_009334033.1 29894885 | 20 | 1042821 | 1043498 | Pygoscelis adeliae 9238 | GGA\|GTGAGTGCTA...CGATCTTTCCCC/CCCCACTTCACC...TGCAG\|CTG | 2 | 1 | 75.54 |
| 167376061 | GT-AG | 0 | 1.000000099473604e-05 | 229 | rna-XM_009334033.1 29894885 | 21 | 1043619 | 1043847 | Pygoscelis adeliae 9238 | CAG\|GTAACAGCAG...GGAACCTGGGCT/CTGGCACCGACA...TGCAG\|GTA | 2 | 1 | 80.17 |
| 167376062 | GT-AG | 0 | 1.000000099473604e-05 | 943 | rna-XM_009334033.1 29894885 | 23 | 1044210 | 1045152 | Pygoscelis adeliae 9238 | GAG\|GTGGGTGATG...CTCTCCTTTCTC/CCGATTCGGACA...CCCAG\|CAT | 2 | 1 | 94.059 |
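Two properties of the rows above can be checked mechanically. In every row shown, length equals end - start + 1, which suggests 1-based inclusive coordinates; this is an inference from these 21 rows, not something the column descriptions state. The ordinal_index values also skip two positions, and the page does not say why. The sketch below uses the (ordinal_index, start, end, length) values copied from the table; the helper names are hypothetical.

```python
# (ordinal_index, start, end, length) for the 21 rows of this transcript.
rows = [
    (1, 1032399, 1032470, 72),
    (2, 1032537, 1032774, 238),
    (3, 1032892, 1033168, 277),
    (4, 1033306, 1033610, 305),
    (5, 1033661, 1034249, 589),
    (6, 1034309, 1034518, 210),
    (7, 1034602, 1036409, 1808),
    (8, 1036545, 1036655, 111),
    (10, 1037244, 1038334, 1091),
    (11, 1038418, 1038848, 431),
    (12, 1038954, 1039500, 547),
    (13, 1039666, 1039756, 91),
    (14, 1039794, 1040366, 573),
    (15, 1040479, 1040785, 307),
    (16, 1040844, 1041189, 346),
    (17, 1041288, 1041491, 204),
    (18, 1041659, 1042190, 532),
    (19, 1042344, 1042672, 329),
    (20, 1042821, 1043498, 678),
    (21, 1043619, 1043847, 229),
    (23, 1044210, 1045152, 943),
]


def length_matches(start, end, length):
    """True if length is consistent with inclusive start/end coordinates."""
    return end - start + 1 == length


def missing_ordinals(ordinals):
    """Ordinal positions absent between the smallest and largest seen."""
    present = set(ordinals)
    return [i for i in range(min(present), max(present) + 1) if i not in present]


all_consistent = all(length_matches(s, e, n) for _, s, e, n in rows)
gaps = missing_ordinals(o for o, _, _, _ in rows)
print(all_consistent, gaps)
```

For these rows the check passes for every intron, and the missing ordinal positions are 9 and 22.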
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
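The indexes above are what make a filter such as transcript_id = 29894885 cheap. One way to confirm that SQLite would use an index for a given filter is EXPLAIN QUERY PLAN. The sketch below is an illustration under stated assumptions: it recreates a reduced version of the schema (three columns, one index) in memory, and the exact wording of the plan text varies between SQLite versions, so only the index name is inspected.

```python
import sqlite3

# Assumption: a reduced, empty in-memory copy of the table and one of its
# indexes; the real database is not needed to inspect the query plan.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "transcript_id" INTEGER,
        "score" REAL,
        PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)


def plan_details(conn, sql, params=()):
    """Return the human-readable detail strings from EXPLAIN QUERY PLAN."""
    cursor = conn.execute("EXPLAIN QUERY PLAN " + sql, params)
    # The detail text is the last column of each plan row.
    return [row[-1] for row in cursor.fetchall()]


details = plan_details(
    conn,
    "SELECT id FROM introns WHERE transcript_id = ?",
    (29894885,),
)
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details, uses_index)
```

A filter on a column without an index, such as length in the full schema, would instead be reported as a full table scan.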